Name : DIP2012 (DIP2012) 
      Accession : NP_940343.1  
PAI name :  Not named 
      PAI accession : NC_002935_P5 
      Strain : Corynebacterium diphtheriae 241 
      Virulence or Resistance: Not determined 
      Product : fimbrial associated sortase-like protein 
      Function : - 
      Note : Similar to Actinomyces viscosus sortase-like protein SWALL:Q9AJ92 (EMBL:AF106034) (387 aa) fasta scores: E(): 2.7e-39, 43.47% id in 276 aa, and to Bifidobacterium longum NCC2705 sortase-like protein BL0676 SWALL:AAN24497 (EMBL:AE014690) (328 aa) fasta sco 
      Homologs in the searched genomes :    147 hits    ( 147 protein-level )   
Publication : 
-Cerdeno-Tarraga,A.M., "Direct Submission", Submitted (03-OCT-2003) Cerdeno-Tarraga A.M., submitted on behalf of the Pathogen Sequencing Unit, Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA E-mail: amct@sanger.ac.uk.
  -Cerdeno-Tarraga,A.M., "Direct Submission", Submitted (08-APR-2002) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.
  -Cerdeno-Tarraga,A.M., Efstratiou,A., Dover,L.G., Holden,M.T., Pallen,M., Bentley,S.D., Besra,G.S., Churcher,C., James,K.D., De Zoysa,A., Chillingworth,T., Cronin,A., Dowd,L., Feltwell,T., Hamlin,N., Holroyd,S., Jagels,K., Moule,S., Quail,M.A., Rabbinowits, "The complete genome sequence and analysis of Corynebacterium diphtheriae NCTC13129", Nucleic Acids Res. 31 (22), 6516-6523 (2003) PUBMED 14602910.
 
   
       
      
      | DNA sequence :  |  |  
      
          ATGAGGCACCGGGCAGGTGAACATCGAAACGTTTTTGCCATCCTCGCATTCGTCATCGCGATCGTCTCAGTGGGATTTTT
GCTCTACCCCGTGGCCGCAACCGCATGGAACAACGCGCGCCAAGCACGCGTCGCACAGTCCTACGAGAACAGCTACGAGG
TAGACAGCCCAGCGGTACGAGACAGCGTTCTTGAGGCGGCCAGACAGTACAACACGTCGGTAGTAGGCTTCCCGATCCTC
GATCCGTGGCTGAACAGGGCGTCGAAAAACAGCGGGCCATACCTCGACTACCTGCAACAACTCAACCCGCAGCGCGCCGA
ACGTCCCGTCATAGCGTCGATAAGCATCCCTACTATCGACGCCCACCTGCCCATCTACCACGGCACCGACACCGCCACCC
TCGAGCACGGACTCGGCCACCTATACGGCTCCGCGCTACCCGTTGGCGGCACCGGCACCCACCCCGTGATCACCGGACAC
AGCGGCCTTGCCAACGCCACCCTCTTTGACAACCTCGAAGACGTCAAAGAACACGACCCCATCTACATCACCGTCCAAGG
CGAAACCCTCAAATACGAAGTAGACGCCATCAACGTAGTCCTACCCGAAGACACCAAACTCCTCGCCCCAGACCCCAACA
AAGACCAAATAACACTCATCACCTGCACCCCCTACGCCGTCAACTCCCACCGACTCCTCGTACGAGCCCACCGCGTAGAC
CTCGACCCCAACGACCCCAACCTCACACAAACCGGCACCAAAATCTGGCAACCCTGGATGCTGTGGACCGCAGCCCTAGC
ACTCACCGCCATCGCCATCATCATCACCCTCGTGCTTCGCAGGAAGAGGACAACCACCCATGAAAAATAA
  
           | 
       
      | Protein sequence :  |  |  
      
          MRHRAGEHRNVFAILAFVIAIVSVGFLLYPVAATAWNNARQARVAQSYENSYEVDSPAVRDSVLEAARQYNTSVVGFPIL
DPWLNRASKNSGPYLDYLQQLNPQRAERPVIASISIPTIDAHLPIYHGTDTATLEHGLGHLYGSALPVGGTGTHPVITGH
SGLANATLFDNLEDVKEHDPIYITVQGETLKYEVDAINVVLPEDTKLLAPDPNKDQITLITCTPYAVNSHRLLVRAHRVD
LDPNDPNLTQTGTKIWQPWMLWTAALALTAIAIIITLVLRRKRTTTHEK
  
         | 
       
       
      |