PAI Gene Information


Name : VC0395_A1369 (VC0395_A1369)
Accession : YP_001217312.1
PAI name : VPI-2
PAI accession : NC_009457_P3
Strain : Vibrio cholerae IEC224
Virulence or Resistance: Not determined
Product : hypothetical protein
Function : -
Note : -
Homologs in the searched genomes :   9 hits    ( 9 protein-level )  
Publication :
    -Feng,L., Reeves,P.R., Lan,R., Ren,Y., Gao,C., Zhou,Z., Ren,Y., Cheng,J., Wang,W., Wang,J., Qian,W., Li,D. and Wang,L., "A recalibrated molecular clock and independent origins for the cholera pandemic clones", PLoS ONE 3 (12), E4053 (2008) PUBMED 19115014.

    -Feng,L., Reeves,P.R., Lan,R., Ren,Y., Gao,C., Zhou,Z., Ren,Y., Cheng,J., Wang,W., Wang,J., Qian,W., Li,D. and Wang,L., "Direct Submission", Submitted (18-MAY-2007) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Heidelberg,J., "Direct Submission", Submitted (16-MAR-2007) The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA.


DNA sequence :
ATGTTCATGTATTCAGTATTTACCTGTCGCTATTGGCGTTACATTATGGTCCGATTAATAAAACATCAACAACTGCTGGG
GGAATATTGTATGAATGTATCGATAGAAGAGTTTACTCATTTTGATTTTCAGCTTGTTCCCGAGCCTTCGCCGCTTGATC
TGGTAATTACAGAGTCGCTCAAAAATCACATTGAAGTAAATGGCGTTAAATCTGGTGCTTTGCTCCCATTACCATTTCAG
ACTGGAATCGGGAAAACTTACACGGCACTAAATTTCTTACTCCAGCAGATGTTAGAGCAAGTACGAAGCGAGCTGAAAGA
AGAAAATACCGGCAAAAAATCCAAACGCTTGCTGTATTACGTTACTGATTCAGTAGATAACGTGGTAAGCGCGAAAGCAG
ACTTGTTGAAGTTAATTGAAAAGCAGACAGTCAAGGGTGAGCCTCGCTTCACTTTAGAACAACAAGAGTACCTTAAAGCG
CAAATAGTACACCTTCCCAACCAGTCAGAGCAGCTATTGCAATGTTCTGATGCGGTATTGAATGACGTGTTAATCGGATT
TAACTTGAACGCTGAGCGTGATGTTCAAGCAGAGTGGAGTGCCATATCAGGTCTAAGACGTCATGCGAGTAACCCTGAAG
TTAAAATCTCTTTAAATAGGCAAGCGGGTTACTTCTACCGTAATTTAATAGACCGTTTGCAAAAGAAACAAAAAGGGGCC
GATAGAGTATTGTTGAGCGGTTCGTTGCTAGCATCCGTTGAAACACTATTGCCGGGTGAAAAAATTCGTAATGGAAGTGC
TCATGTTGCTTTTTTAACGACAAGTAAGTTTCTGAAAGGCTTCCACAATACTCGTTCGCGTTACAGTCCATTGCGCGATC
TAAGTGGTGCTGTACTTATAATTGATGAAATTGACAAGCAGAATCAGGTCATCCTTTCTGAGTTGTGTAAGCAACAAGCG
CAGGACTTAATTTGGGCAATCAGAACTTTAAGAGCAAACTTTCGAGACCATCAACTTGAGAGCTCGCCTCGCTACGACAA
AATTGAAGATCTATTTGAACCGCTCCGTGAGCGACTTGAAGAGTTTGGCACCAATTGGAATCTAGCGTTCGCATTTAATA
CGGAAGGTGCTAACTTGAATGAGCGGCCTGTTCGGTTGTTTTCGGATAGAAGCTTCACGCATGTAAGTAGTGCCACTCAT
AAGCTGTCATTAAAGTCAGATTTTTTGAGACGGAAAAATCTCATATTCAGCGATGAGAAAGTAGAAGGCTCTCTCATCGA
AAAACACGGCCTTTTAACCCGCTTTGTCAATGAAGCAGATGTTATATATCAGTGGTTTCTTGGCACAATGCGTAAAGCCG
TGTTTCAGTACTGGGAGAACGTTCGTGGCTTAGAAATCGAAGTACGCGAAAATAGAAGTCTAGAAGGAACGTTTCAAGAA
GCTGTTCAATCCCTGCTTACTCACTTCAACTTACAAGAATTTGAGTCTGCAGTTTACGAGTCTTTCGATACGCGGGGGTT
ACGGCAATCTGCAGGTGGTAAAGCGAACAAGTTAAGTTCTAGCAAGAGTTACCATCATACAGGGCTAAAACTTGTAGAGG
TGGCTCATAATCAGGGGACTCGCGATACCGTAAATTGTAAAGCGTCATTCCTAAATACTTCACCATCGGGTGTTTTAGCG
GATATGGTTGATGCAGGTGCCGTTATTCTCGGCATAAGTGCAACAGCGAGAGCAGACACCGTAATACATAACTTTGATTT
TAAATACTTGAATGAGCGTTTGGGTAACAAGCTTCTATCTTTGTCGAGAGAGCAAAAACAGCGGGTAAATAATTATTACC
ATAGTAGACGCAACTATAAAGATAACGGCGTAGTTTTGACAGTCAAATATCTCAATAGCCGAGATGCGTTTCTTGATGCT
TTGTTGGAGGAATATAAGCCGGAAGCTCGATCAAGTCACTTTATCCTAAATCACTATCTCGGTATTGCGGAATCAGAGCA
GGCATTTGTTCGTAGTTGGTTGTCTAAGCTCTTAGCTAGCATTAAAGCGTTTATCTCATCCCCTGATAATCGTTATATGC
TGTCGTTGTTAAACCGCACCCTAGATACAACACGTCAGAACATTAACGATTTCATTCAGTTCTGCTGTGATAAATGGGCA
AAAGAATTTAATGTTAAAACCAAGACATTTTTTGGTGTGAATGCTGATTGGATGAGATTAGTTGGCTACGATGAAATTTC
CAAGCACCTAAATACTGAGCTAGGAAAAGTCGTTGTATTTAGCACATACGCTTCAATGGGAGCAGGCAAAAACCCTGATT
ATGCAGTTAATTTAGCTTTGGAAGGTGAAAGCTTAATATCTGTTGCCGATGTCACTTATAGCACGCAGTTGCGAAGCGAT
ATAGACAGTATTTATCTTGAAAAGCCTACTCAGCTACTCCTGTCAGATGATTACTCGCATACTGCCAACCAGCTTTGTCA
ATTCCACCAAATTTTATCTTTGCAAGAGAACGGCGAGTTGTCACCGAAAAGCGCTGAAAACTGGTGTCGCCAGCAACTTA
TGGGTATGAGCAGAGAGCGTTCTTTACAGCAATACCACCAAACAAGTGACTACCAAAGTGCAGTAAGGAAATACATTGAG
CAAGCGGTAGGAAGAGCAGGAAGAACGTCCCTGAAACGAAAACAAATTCTCTTGTTTGTTGATTCTGGTTTGAAAGAAAT
TCTGGCGGAAGAGAGTCGAGACCCAAGTTTGTTTTCGCACAAATATGTTGCTTTGGTTAATAAAGCGAAATCAGCTGGTA
AGTCTATTGTTGAGGACCGAGCTGTTCGACGTTTATTCAATCTTGCTCAAAGAAACAATAAAGACGGTATGCTATCAATC
AAAGCCCTAGTTCATCGTCTACATAATCAACCAGCATCTAAGAGTGATATTCAAGAGTGGCAAGACATTAGAACTCAGTT
GCTCCGATATCCGACGGTGGCCTTTCAACCGGAGCGATTCAATCGATTGTACTTACAGTCTATGACAAAAGGGTATTACC
GTTATCAAGGTAATTTAGATGGTGATCCGAATAGCTTCGAGTTCTTTGATAGGGTGCCTTATGGCGACATGGTGTCAGAG
GAAGATTGCAGCCTAGCGACGTTAGTACAGAACCAATATGTGAGGCCGTGGTTTGAACGCAAAGGCTTTGCCTGTTCGTG
GCAAAAAGAAGCGAATGTGATGACGCCAATTATGTTTACCAACATTTATAAGGGAGCCTTGGGCGAGCAAGCGGTAGAGG
CAGTGCTAACAGCATTCGATTTTACCTTTGAAGAAGTTCCAAATTCTATTTACGAGCGATTTGACAATAGAGTCATATTT
GCAGGGATTGAGCAACCGATCTGGCTAGACAGCAAATACTGGAAGCATGAAGGTAATGAAAGCAGCGAAGGCTACAGTTC
GAAAATTGCATTGGTAGAAGAAGAGTTTGGACCTTCGAAGTTTATTTATGTGAATGCGTTAGGGGATACTTCAAAACCCA
TTAGATACTTGAACTCGTGCTTCGTAGAAACCTCACCACAGTTAGCTAAAGTTATTGAGATTCCGGCACTAATTGATGAT
AGCAATGCCGACACCAACCGAACAGCAGTACAGGAGTTGATAAAATGGTTACACCACAGCTAG

Protein sequence :
MFMYSVFTCRYWRYIMVRLIKHQQLLGEYCMNVSIEEFTHFDFQLVPEPSPLDLVITESLKNHIEVNGVKSGALLPLPFQ
TGIGKTYTALNFLLQQMLEQVRSELKEENTGKKSKRLLYYVTDSVDNVVSAKADLLKLIEKQTVKGEPRFTLEQQEYLKA
QIVHLPNQSEQLLQCSDAVLNDVLIGFNLNAERDVQAEWSAISGLRRHASNPEVKISLNRQAGYFYRNLIDRLQKKQKGA
DRVLLSGSLLASVETLLPGEKIRNGSAHVAFLTTSKFLKGFHNTRSRYSPLRDLSGAVLIIDEIDKQNQVILSELCKQQA
QDLIWAIRTLRANFRDHQLESSPRYDKIEDLFEPLRERLEEFGTNWNLAFAFNTEGANLNERPVRLFSDRSFTHVSSATH
KLSLKSDFLRRKNLIFSDEKVEGSLIEKHGLLTRFVNEADVIYQWFLGTMRKAVFQYWENVRGLEIEVRENRSLEGTFQE
AVQSLLTHFNLQEFESAVYESFDTRGLRQSAGGKANKLSSSKSYHHTGLKLVEVAHNQGTRDTVNCKASFLNTSPSGVLA
DMVDAGAVILGISATARADTVIHNFDFKYLNERLGNKLLSLSREQKQRVNNYYHSRRNYKDNGVVLTVKYLNSRDAFLDA
LLEEYKPEARSSHFILNHYLGIAESEQAFVRSWLSKLLASIKAFISSPDNRYMLSLLNRTLDTTRQNINDFIQFCCDKWA
KEFNVKTKTFFGVNADWMRLVGYDEISKHLNTELGKVVVFSTYASMGAGKNPDYAVNLALEGESLISVADVTYSTQLRSD
IDSIYLEKPTQLLLSDDYSHTANQLCQFHQILSLQENGELSPKSAENWCRQQLMGMSRERSLQQYHQTSDYQSAVRKYIE
QAVGRAGRTSLKRKQILLFVDSGLKEILAEESRDPSLFSHKYVALVNKAKSAGKSIVEDRAVRRLFNLAQRNNKDGMLSI
KALVHRLHNQPASKSDIQEWQDIRTQLLRYPTVAFQPERFNRLYLQSMTKGYYRYQGNLDGDPNSFEFFDRVPYGDMVSE
EDCSLATLVQNQYVRPWFERKGFACSWQKEANVMTPIMFTNIYKGALGEQAVEAVLTAFDFTFEEVPNSIYERFDNRVIF
AGIEQPIWLDSKYWKHEGNESSEGYSSKIALVEEEFGPSKFIYVNALGDTSKPIRYLNSCFVETSPQLAKVIEIPALIDD
SNADTNRTAVQELIKWLHHS