PAI Gene Information


Name : DIP2010 (DIP2010)
Accession : NP_940341.1
PAI name : Not named
PAI accession : NC_002935_P5
Strain : Corynebacterium diphtheriae 241
Virulence or Resistance: Not determined
Product : surface-anchored membrane protein
Function : -
Note : Similar to Actinomyces viscosus usher-like protein precursor TR:Q9AJ93 (EMBL:AF106034) (1411 aa) fasta scores: E(): 0.034, 22.657% id in 843 aa, and to Actinomyces naeslundii fimbrial associated protein TR:O05995 (EMBL:U85708) (375 aa) fasta scores: E():
Homologs in the searched genomes :   11 hits    ( 10 protein-level,   1 DNA-level )  
Publication :
    -Cerdeno-Tarraga,A.M., "Direct Submission", Submitted (03-OCT-2003) Cerdeno-Tarraga A.M., submitted on behalf of the Pathogen Sequencing Unit, Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA E-mail: amct@sanger.ac.uk.

    -Cerdeno-Tarraga,A.M., "Direct Submission", Submitted (08-APR-2002) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Cerdeno-Tarraga,A.M., Efstratiou,A., Dover,L.G., Holden,M.T., Pallen,M., Bentley,S.D., Besra,G.S., Churcher,C., James,K.D., De Zoysa,A., Chillingworth,T., Cronin,A., Dowd,L., Feltwell,T., Hamlin,N., Holroyd,S., Jagels,K., Moule,S., Quail,M.A., Rabbinowits, "The complete genome sequence and analysis of Corynebacterium diphtheriae NCTC13129", Nucleic Acids Res. 31 (22), 6516-6523 (2003) PUBMED 14602910.


DNA sequence :
GTGAGTAACCTACGCACCATCAAGAAACGAGCGTCTATCCCCGCAGCACTCGTCGCCATCCTCGCAATGGTCATGAGCAT
TGTTCTTGTGCCGTTAATTGCAGCGCCATCAGCGAATGCGGAGCCACTGCCGAAAAAAGAGTTTGAAACCTGTGGCGGTT
CTGTTGCGATTTCCTTTGACTTGTCCAATTCCCTGAGCGCTTCGGACGTGGAAAAATCTAAGCAAGCAGCGTTGGAGCTG
GTCAAGAGCTTGAAAGGATCTCCCTATCGTTTTGGTATTTATACCTTTGCTTCACACTCGCCTGCTGCTGGAAACAAAAA
TTTCACGCCAGTAAGTCTTGCTAATGATGACGGATACAACAAAGTTGTTGCCGCTATCAATGACATCCAGATGCCAGCGA
TCCGAGAGAACAAAAAGGGTTCTCCCAACGGTGGTACCAACTGGGAGGGCGGGCTCCAAGCAATCGCGAATGACATAGAC
AGAGGCATCAAGTATGACGCCGTTTACTTCATCACTGATGGTCAACCAACTTGGGATAACAATGGGAGAAATTGGTTGGG
AACCACCACCGAGGTTGTGGAATTAGAAAATGCCGTTACCCAAGCCAAACTTATTTCTGATAAAGGCGCAAAACTTATTC
CGGTGGGTATTGGCCAGCTTTCTGATGATAAGCCGTTTGATCTCTATAAACCGATTCTTCCTTCCGAAGACGATTACTAT
TGGTCGCGTTATCCATGGAAAATAGATCGTTCCCTGACCGGCAAACAAATGCTGGAGAAGATAACCTCACCGGGCCTAGA
GCCAATTATTTTGCCCGACTATTCCACATTGCCGCAGCGAATGGGACAACAGATTTTTACCGGATGTTTCCAAATCGCTA
AAAACATTATTGATGCAGACGGAAACGTGATAGAAAATCCAGCTGGCTGGAATTTCGATATTACAGCGGCTGGTGTGCAA
GGCATTCCTCCGTCGATCGAGACAGATAAAAATGGCCAAGACACCTTTGCTATGAAGTCGATTAATAAGGAATCCTTTAA
GATCACTATCACTGAGCGACCTACCGGAGATCAAAAACAAAACTTCCGGTTTAAAAACGCACGCTGCCAGCGCTACTCCT
ATGGGCAAGCACCTACTGATATTCCGATCAAAACTAGCGATACTTCAATTACTTTGACTGCAGACACCAAAAGCTTGATT
TCTTGCGGTTTTAACAATCTGCCAGTTGTACCAGTCTCGGTTTCGAAAAAAGTAAACGTAAATACGCCACAACTTTTAGA
AGAGCTCAACAATCAAACCTTTGATTTCACCTATAGCTGTGAAAAAGGAGCTAATGAAAAAGAAATCAAGGGAGAAATTA
AGGGCGTCCACAACGGAGAATCCAAAGAAATCGGAAAAGTTGCTGTTGGAACTCAGTGCGAAATCAAGGAAGTTACCCCC
AAAGTCGACGATTCTCGGATGAAGCTTTCCACCACTTGGAGCAGTGAAAACACCACTGCAGATGCTAATCAGGATAACGG
TACATACCGCTTTAAAGCCGACACCGATGCGTTTAAAAACAAGAAAACAGTTCTAGCTACAGCAGAAAATAACTATGAGG
CTCAAACAGCCACTATTAAGCTGACCAAGACCATCATTAACCGTGACAAAATTCCAGCAACAAAACTGCCCGAGAAGTTT
CCTGTCACTTACACCTGTCGTTACGTACCACATCCTAATGCTCGCCCCGAACATGGTGGGCTCCCAGAAACCAATCCGTA
TTTTGTAGACTCTAAAACCGTTGTCGTTCCTCGTGATGGAAGTATAGAAATCGGACCTTTTCCAGTGGGAACACAGTGCA
GTTTTGAAGAAACTGCACGACTCGATCCGAATGTTCAAGCAGACGCTAAAATTCCTGGTTTTAGTTTGAAAACTGAGTGG
AATTCCAACATCTGTTTTGGAAACACTATCGATAATAATTCTCAAGATTGTTCTACTAACTCAGTATGGATTCCCAAACC
AGGTCAATATTCGCTCAACGTAGAAAATACATACACGCGTGAGCTTGCGAGCGTGGAGATCGAAAAGACGGTGAGCGGCG
ATGCCTCTGATCTCACGAATTCACACGAGTTTTCATTCAATCTTCGATGTGAAGATTCCGGAGTAGAAGTCTATTCGCAA
GACAATATCGTGGTGAAGAAGGACGGACGGCAAGTCATCGAAAACATTCCTGTCGATGCCAATTGTACGTTGAGCGAAAA
ACAGCCTGAACAAAAAGGCGTGGATTTTGTGGTCCCCGCGCCGTTCCATCTTCGTGCTTCAACTGCCGGCGACATTGTCA
AAGTGGTTGTAGATAACACCGCAAAACGTCAGGTAGCTCCTATTTCAATACAGAAAAAAGTTCATAAAAAAGACACATTT
TCTCCTGAAATTTCTGCATCAATCGATGCATTAACATACAGTGTGGTGGCAGAATGTACGGTTCCTGGTGTAGAAACGCC
TCGAAAAGTTCTAAAAACAGTAAGTGATAATCAAACTGTTGAATTTGGAAACTTTCCAGTGGGAACTACTTGTAGCTTTA
GCGAGCTCACCGAAGCCCCTGCCGGAACCGAAATGAGTTATAAATTCGCGGATGGTCCAGAGGTGACAATTGAGGACTCC
ACTCCTATAAATAAGGTGCTGACGAATACGTTTGAAAATGCACGTGGCGAGCTAAAAGTAACCAAAAAAGTACTCGATGG
TGATATGCCTCAAGCATTAGTAGACCAGATTCCATCGAGTTTTACAGTCAACGTCGCATGCTCAATCACCGGTAATCATT
CCATCACTTTGCAAAAAGATGAGCAGAAAGCCGTACCTGGGGTTGTTGCAGGTGAAAGCTGCACATTAAGTGAGGAAGTA
ACTCCTATAACTGGGGCTACCCATCACAAGCACTGGATTAAAGGCGAGCTGCTTGAAGTTGCAGATTCTACGGACATCAC
GATTAACCCTAATGGTAGTAACGCAATTCGATTGGAAAACCATTACGAAACCGATGCTGTATCTTTGGAACTTACCAAAC
GTGTTCGGGTCATTGACCAAGTTGGAAATGACGTTAACTCGGAACTAAAAAATGCAGTTGTCCGTCCAGAACAACCCTTC
CTATTCCGATACCGTTGTGAAATCAATGGTCAAGTAGTTGCAGAAAATACCTTAAGCGCCGATGCGATTAACACTGGTGC
CACTAAGGTGCCACGGGGATCTATTTGTACGGTTGAAGAAGATTCCTCTTCAGTGGAGTTGTCTAATGCAACGTTATCTC
ACGTTGAGTTCTTCGTTCACGGAACAAAAACGAATGATAAGGCATCGGTAGCGATAAACTCGGATCATAACCGACTAGAT
GCTACTAATACTTTCACGTTGAAGACTGGCTCATTTAACCTTAAAAAGAAAGTCGATGGTGAAGGAGTATCTACCATCCA
TGAGGATCGACGCTTTGAAATTTCGTATCGTTGTACCTTAGGCGACTGGAAGAAAAACGGCACCATTACGCTGGGACGTT
TTGATAGTGCCGAATCGCATTCTGTTAAAGACATTCCCGTGGGTGCATCATGTGAGATTATTGAGGACTCTGAGAAAGCC
CAAGAGCCAAACGCACAAGTGACAGCTCGTTGGACTCATACAGACAGCACGAATGGCTGGGGCGATACCGAAGCAGCATG
CGAAAATCATGCAGCGTGCGAAGTGGATCCAAAAAATGAGTTTGCAACCACAGTGGTTATTGCTGGAAATGAGAAAGAGA
ATTTCCAAGGAACCTTTATCGTATGGAACACCTACACTTACGATAAAACAAAGGTAGAGATCAACAAGGTGTTGACGAAT
GATGGTCCAGAACTTGCTGGTAAAGATAACTTTGCCTTCACCTTGAAATGTACTGATCCTCGTTTTGCAGGAAGTGATTT
GGCAGATAAGCATTCCATTCCAGACCCCACAATTACAGTTGCATTAAATGCTAAAGGCCAAAGCCGAGCGTCGTACCAAG
TTGCAGACGAACGGCACGATAGCGTTGAGGTTCCTGTTGGGTATAACTGCACTGTGACCGAAAACCCGATTGCACTTTAT
GATGCCAAAGCGACGACCCAATTCAGTGGTCCGGCAGTGGTGGAAAATACGGCTGTGCAACGCACATCATCAAACTCCGC
CTCGGCTCGTTTTGTCACGGAGAAACAAGAAAATAATGGCACTCAAAAAATTCAGGTAACTAATGATTACATTCGTCCGC
GCGCCGATGTCATGGTGCATAAGACAATCGCAAAACCAGAACACTCGGTAGATCCTTGGTTGCTTAACACTACATACAGC
ATCACTTATAAGTGCGACGATCCATACATCAAGGATCGTTCCTATTCAAACGACGTAGATATACAAGCTGATGCAGAAAA
ACCAACGCCAATTTTCGCTGATCCTACGGCTCACGTAAAAATTCCTGCGTCGGCAGTATGTACTTTCAGTGAAAACACCG
AAGGGCATTTACCAGGAGAGGTAAAAGGCGTAGTGGATGAAACGAATAAAGTTGCTGAATTCGCTGGGGAACATGAAAAG
CGCTCCTATTTCACCCCAGAAATTAAAGATGTTGTTTTGTCGGAATCTGAACCAACACGAATTGAATTCACCAATTCATA
CGTGATGCCTCAACGAATTTTGAGCCTACAAAAATATGTTGAGGGCGACCCCGGCCATGCTGTGATTGCTCCAGAAGAAA
CATTTGAATTCTCCTACACCTGCACCATGCCGCATCTATTCCCAAATCAACCCAATCCTATGTCGCAAGAAGTAGGAAAC
AAGGTTGCACGTGGCGTCATTAAGATTCGAGAAGGTGAGACATGGCGATCTCCTGAAGTCCCTATTGGTACGTCCTGCAC
GATCAAGGAAGAAGACGACCCCGCCTTGCGCACCAAGTTGGAAAACAATGCGCTGCGCATGGTGCCTACCTACTTGTTCC
CCACGGAGCGTGCAGGAGCTGCTAGTGCGCCAGTGATTCCGCCGTTGACAGACCGTCCGATTTATAACGGCACGGAGCCT
CGCCTCCAGATGCCAGAATCAGGCATTGAGCTTAACGACGCCCACTCGCACACCGTGGTGATCAACAACGTGTACACCAC
TGACGCTGAGATCAACATTGCCAAGGTGAACGCCGATAACTCTCCGCTGCCCGGCGCGCACTTCGCCATCTATGGGATAG
GGGAGAATGGCCAGCGTAAAGAGTTGCCTGAGGTTGCGGATGCGCCGGCGAAGTCGGCGAAGTCGGTGGAGCAGGCGTTG
TTTGCAGTGCGCTTGCGCCCTGGTAGTTACGAGTTGGTGGAGACTCAGGCTCCTCAGGGTGGGCAGTTGCTGCCTAAGCC
GTGGCGTTTTGATGTCAAGGCTGCGAATGCGGGTGCGATAGGTGATCTTGAGGTGACCTTGGATAACTATGATGCTGATT
CGGGGTTGATCACGGTGGAGCACCCGCAGGGTAAGCCGTGGTTGATCAAGGTGGCTAATGTGTCGGCATCCACACTGCCG
TTGACTGGTTCGAATGGTTACTTGCGGTGGCTGTTGGCCGGTGCTGCGGGCCTGTTGGTGGCTGCAGCATTGTGGTTAGT
GGCGCGTCGTAAGCGTTAG

Protein sequence :
MSNLRTIKKRASIPAALVAILAMVMSIVLVPLIAAPSANAEPLPKKEFETCGGSVAISFDLSNSLSASDVEKSKQAALEL
VKSLKGSPYRFGIYTFASHSPAAGNKNFTPVSLANDDGYNKVVAAINDIQMPAIRENKKGSPNGGTNWEGGLQAIANDID
RGIKYDAVYFITDGQPTWDNNGRNWLGTTTEVVELENAVTQAKLISDKGAKLIPVGIGQLSDDKPFDLYKPILPSEDDYY
WSRYPWKIDRSLTGKQMLEKITSPGLEPIILPDYSTLPQRMGQQIFTGCFQIAKNIIDADGNVIENPAGWNFDITAAGVQ
GIPPSIETDKNGQDTFAMKSINKESFKITITERPTGDQKQNFRFKNARCQRYSYGQAPTDIPIKTSDTSITLTADTKSLI
SCGFNNLPVVPVSVSKKVNVNTPQLLEELNNQTFDFTYSCEKGANEKEIKGEIKGVHNGESKEIGKVAVGTQCEIKEVTP
KVDDSRMKLSTTWSSENTTADANQDNGTYRFKADTDAFKNKKTVLATAENNYEAQTATIKLTKTIINRDKIPATKLPEKF
PVTYTCRYVPHPNARPEHGGLPETNPYFVDSKTVVVPRDGSIEIGPFPVGTQCSFEETARLDPNVQADAKIPGFSLKTEW
NSNICFGNTIDNNSQDCSTNSVWIPKPGQYSLNVENTYTRELASVEIEKTVSGDASDLTNSHEFSFNLRCEDSGVEVYSQ
DNIVVKKDGRQVIENIPVDANCTLSEKQPEQKGVDFVVPAPFHLRASTAGDIVKVVVDNTAKRQVAPISIQKKVHKKDTF
SPEISASIDALTYSVVAECTVPGVETPRKVLKTVSDNQTVEFGNFPVGTTCSFSELTEAPAGTEMSYKFADGPEVTIEDS
TPINKVLTNTFENARGELKVTKKVLDGDMPQALVDQIPSSFTVNVACSITGNHSITLQKDEQKAVPGVVAGESCTLSEEV
TPITGATHHKHWIKGELLEVADSTDITINPNGSNAIRLENHYETDAVSLELTKRVRVIDQVGNDVNSELKNAVVRPEQPF
LFRYRCEINGQVVAENTLSADAINTGATKVPRGSICTVEEDSSSVELSNATLSHVEFFVHGTKTNDKASVAINSDHNRLD
ATNTFTLKTGSFNLKKKVDGEGVSTIHEDRRFEISYRCTLGDWKKNGTITLGRFDSAESHSVKDIPVGASCEIIEDSEKA
QEPNAQVTARWTHTDSTNGWGDTEAACENHAACEVDPKNEFATTVVIAGNEKENFQGTFIVWNTYTYDKTKVEINKVLTN
DGPELAGKDNFAFTLKCTDPRFAGSDLADKHSIPDPTITVALNAKGQSRASYQVADERHDSVEVPVGYNCTVTENPIALY
DAKATTQFSGPAVVENTAVQRTSSNSASARFVTEKQENNGTQKIQVTNDYIRPRADVMVHKTIAKPEHSVDPWLLNTTYS
ITYKCDDPYIKDRSYSNDVDIQADAEKPTPIFADPTAHVKIPASAVCTFSENTEGHLPGEVKGVVDETNKVAEFAGEHEK
RSYFTPEIKDVVLSESEPTRIEFTNSYVMPQRILSLQKYVEGDPGHAVIAPEETFEFSYTCTMPHLFPNQPNPMSQEVGN
KVARGVIKIREGETWRSPEVPIGTSCTIKEEDDPALRTKLENNALRMVPTYLFPTERAGAASAPVIPPLTDRPIYNGTEP
RLQMPESGIELNDAHSHTVVINNVYTTDAEINIAKVNADNSPLPGAHFAIYGIGENGQRKELPEVADAPAKSAKSVEQAL
FAVRLRPGSYELVETQAPQGGQLLPKPWRFDVKAANAGAIGDLEVTLDNYDADSGLITVEHPQGKPWLIKVANVSASTLP
LTGSNGYLRWLLAGAAGLLVAAALWLVARRKR