PAI Gene Information


Name : VC0395_A0337 (VC0395_A0337)
Accession : YP_001216293.1
PAI name : VPI-1
PAI accession : NC_009457_P1
Strain : Vibrio cholerae IEC224
Virulence or Resistance : Not determined
Product : DEAD/DEAH box helicase
Function : -
Note : identified by match to protein family HMM PF00270; match to protein family HMM PF00271; match to protein family HMM PF04851
Homologs in the searched genomes : 23 hits (23 protein-level)
Publication :
    -Feng,L., Reeves,P.R., Lan,R., Ren,Y., Gao,C., Zhou,Z., Ren,Y., Cheng,J., Wang,W., Wang,J., Qian,W., Li,D. and Wang,L., "A recalibrated molecular clock and independent origins for the cholera pandemic clones", PLoS ONE 3 (12), E4053 (2008) PUBMED 19115014.

    -Feng,L., Reeves,P.R., Lan,R., Ren,Y., Gao,C., Zhou,Z., Ren,Y., Cheng,J., Wang,W., Wang,J., Qian,W., Li,D. and Wang,L., "Direct Submission", Submitted (18-MAY-2007) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Heidelberg,J., "Direct Submission", Submitted (16-MAR-2007) The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA.


DNA sequence :
GTGATGAGTGATAGACAAAGTTTTCTGTATTCAGGTCATAAGCTCTCTTCTGGTGGTGCAAACGATCCGCTCCTACCAAG
ACTTGTCCAAGCCATTAATCACGCGACTGAAATTGAAATTTCAGTCTCATTCATTCAGCCATCCGGTCTGGATTTACTGT
TTGATCCTCTGTTTGATGCTGTGCAAAGTGGTGCTCAGGTAAAACTCCTAACATCAGATTATCTCTCGATTACTCATCCT
GTTGCGCTGCGACGCCTAATGTTGCTAACAGAACGCGGCGCTCAGTGTCGTGTATTTGAATGTGGTCAGCACAGCTTTCA
TATGAAATCGTACATTTTTGTCCGATGTGAACAGGGAGAAATACTTGAAGGTTGCGCTTGGATTGGTTCCAACAACATCA
GCAAAACAGCGTTATTAGACAGCCATGAATGGGCTCTACGTCACGACTTTGAACCACCTGAAACCAGTGCTGCTGCCCTT
GAGTTTCTCCATATTCGTCAACAGTTTGCTTCCATTTTCAACCACACAAATAGTAAAGATTTAACGCATACTTGGATTGA
TCATTACCTTGAACGCTATCAACAAGCGAAAAAGCAGCACGGCATGCCCATACTGGCCGATAGCCAAGATGAACAGTCAG
AACCGCCAGCGCCGAATGCAGTTCAAGTAGAGGCGTTAACCGCATTGAACGCCACTAGAGCCCAAGGATTTTCTCGTGGA
TTAGTCGTGCTAGCTACGGGCATGGGCAAAACGTGGTTAGCAGCATTTGATGCTCTACAAACTCAGTCGACAAAAGTCTT
GTTCGTTGCGCATAGAGAAGAAATTCTTCTGCAAGCGGAAAAAACCTTCTGTCAGCTTATTCCCAATGCAAAAACGGGTC
TCTACAACGGTGTAACGCAAAATACACAAGCGATGCTGCTGTTTGCTTCCGTGGCAACCATAGGTAAACAAAATCACCTG
CAACGCTTTGCAGCGGATCACTTTGATTACATTGTGGTGGATGAATTTCATCACGCGGCAGCGAGAAGTTACCGAAACTT
ACTCACCTATTTTAAGCCTAAATTTCTCCTCGGCCTAACCGCAACACCGGAACGTTCAGATCAAGCGGATATTCTCTCGC
TTTGCGACAGTAATTTAGTCTTTGAACGTAACCTTGTTCATGGAATTGATGAGAAAATTCTGGTTCCTTTTGACTATCAC
GGTATCTACGACCAAGCCGTGAACTACCAAGAAATCCCATGGCGTAACGGTAAATTCGACCCAGACTCACTCGATAATGC
CCTAGCAACTCAACGCCGTGCCGAACACGTTTACCAGCATTGGCACCAGAAAAAACAAACCCGCACACTGGCTTTTTGTG
TTTCTAAAAAACACGCAGACTTCATGGCTGAGTTTTGTCTGAGTAAAGGCATAAAAGCGATTGCGGTTTACAGTGATTCA
AAGGTTCGACGTAACCAAGCCTTACAGTGGCTAGATTCAGGAAAAATAGACATCCTTTTCTCCGTCGATCTTTTTAATGA
AGGTACCGATCTGCCCGCTATTGATACCATTTTGATGCTTCGTCCAACGGAGTCTAAAATTCTCTTTTTACAGCAGTTGG
GACGGGGATTGCGTCGAAGCATTGAAACTCAAAAAAGTAAACTGGTTGTTATTGATTTTATTGGTAATCATGACTCGTTC
TTAAATCGTCCAACCACACTCTACAATGTGAGTCATTTAAAAGACGCCTTAGCCAAACACCAACAACAAGCATTACCTGA
CGGATGCCATGTCACTTTTGATATCACTTTGCTCAACTTTTGGCAGCAATTAACCCGGAAAATGCGTTTCTCAGTGCAAG
ATGAGTATCAGCAACTTGCACACCAGCTCGCACATAGACCAACTGCTAGTGAGTTTTTCTATCATGGAATTGAAATAAGT
AAAGTGCGTAAACAAGCGCAAAGCTGGTTTCATTTGGTTGCGAGCCAAGAAAATGATCCCGAGTTGGCTGAGATTGTTAC
ACGCTATGGTGATTTTCTTCTGCATGGCATTGAAAGCACCAGCATGAGTAAATCTTTCAAAGCAATTCTACTTGAGGCCT
TACTCGAACTGGATGGATTAAGAACTCCGCCGACCTTAGCCACATTGGCTGAATGCAGTTACACTGTTATCGCACGCAGA
CCAGACCTCATGGCTGAAGACCTAACTGAAAATGCCAAACAATTTAAAGCAGCAGATAAAGATTGGCTTAACTATTGGCG
GAACAACCCTATTAAGGCCTTTACGAACAAAGCAACGAAACAGGCCACTTGGTTTTCCATCGATAGCCAACAACGCTTCG
TCGCTAACTTTGATATCCGAGAGCAAGATCTAGAACGGTTACACGATTGTATACAAGAATTGGTTGATCTGCGTTTGGCG
GAATATGCTCAGCGACCCAAGCAACATAAAGAAAATCTTATTAAGGCTAAGAGCAAAGCTGCTGTCATTTCATTAATACC
TGAACAAAAAACGATAGGAACACCTATCCCATTCTATCCAGAGCTCAAAATTGCCTGTGGCCATTTTAAGCGCGGTTCAG
AGCGAGAAGTACAGACTTATTTTGTTCCTGATGGCTATGGATTACTTGATCCCACACGGCATTTTGTCTCGCCAGCCTCT
GGTAACTCAATGAATGGCGGTAAAAATCCTATTCAAGATGGTGACTTATTATTGCTTGAGTGGGTGACACCAAGCAGTGC
TGGCTCTATATCAAACCTCACCATGGCTATAGAAACTCAAGATGAAACGGGGGATAACCAGTACTTACTGCGGGTAGTAC
GTAAAATCGCACCAAACCAGTATGAACTACAAGCACAAAATCCAAGTTATCCAAATATGCCAGCAACCGATGCAATGAAA
ACGTTCGCACGGTTAAAAGCTGTGATAAAAAATAATAGTCAGCAATCACAGACTCGATGA

Protein sequence :
MMSDRQSFLYSGHKLSSGGANDPLLPRLVQAINHATEIEISVSFIQPSGLDLLFDPLFDAVQSGAQVKLLTSDYLSITHP
VALRRLMLLTERGAQCRVFECGQHSFHMKSYIFVRCEQGEILEGCAWIGSNNISKTALLDSHEWALRHDFEPPETSAAAL
EFLHIRQQFASIFNHTNSKDLTHTWIDHYLERYQQAKKQHGMPILADSQDEQSEPPAPNAVQVEALTALNATRAQGFSRG
LVVLATGMGKTWLAAFDALQTQSTKVLFVAHREEILLQAEKTFCQLIPNAKTGLYNGVTQNTQAMLLFASVATIGKQNHL
QRFAADHFDYIVVDEFHHAAARSYRNLLTYFKPKFLLGLTATPERSDQADILSLCDSNLVFERNLVHGIDEKILVPFDYH
GIYDQAVNYQEIPWRNGKFDPDSLDNALATQRRAEHVYQHWHQKKQTRTLAFCVSKKHADFMAEFCLSKGIKAIAVYSDS
KVRRNQALQWLDSGKIDILFSVDLFNEGTDLPAIDTILMLRPTESKILFLQQLGRGLRRSIETQKSKLVVIDFIGNHDSF
LNRPTTLYNVSHLKDALAKHQQQALPDGCHVTFDITLLNFWQQLTRKMRFSVQDEYQQLAHQLAHRPTASEFFYHGIEIS
KVRKQAQSWFHLVASQENDPELAEIVTRYGDFLLHGIESTSMSKSFKAILLEALLELDGLRTPPTLATLAECSYTVIARR
PDLMAEDLTENAKQFKAADKDWLNYWRNNPIKAFTNKATKQATWFSIDSQQRFVANFDIREQDLERLHDCIQELVDLRLA
EYAQRPKQHKENLIKAKSKAAVISLIPEQKTIGTPIPFYPELKIACGHFKRGSEREVQTYFVPDGYGLLDPTRHFVSPAS
GNSMNGGKNPIQDGDLLLLEWVTPSSAGSISNLTMAIETQDETGDNQYLLRVVRKIAPNQYELQAQNPSYPNMPATDAMK
TFARLKAVIKNNSQQSQTR