PAI Gene Information


Name : VC0820 (VC0820)
Accession : NP_230468.1
PAI name : VPI-1
PAI accession : NC_002505_P3
Strain : Vibrio cholerae IEC224
Virulence or Resistance : Virulence
Product : ToxR-activated gene A protein
Function : -
Note : similar to GP:3004926; identified by sequence similarity
Homologs in the searched genomes : 6 hits (6 protein-level)
Publication :
    -Heidelberg,J.F., Eisen,J.A., Nelson,W.C., Clayton,R.A., Gwinn,M.L., Dodson,R.J., Haft,D.H., Hickey,E.K., Peterson,J.D., Umayam,L.A., Gill,S.R., Nelson,K.E., Read,T.D., Tettelin,H., Richardson,D., Ermolaeva,M.D., Vamathevan,J., Bass,S., Qin,H., Dragoi,I., "DNA sequence of both chromosomes of the cholera pathogen Vibrio cholerae", Nature 406 (6795), 477-483 (2000) PUBMED 10952301.

    -Heidelberg,J.F., Eisen,J.A., Nelson,W.C., Clayton,R.A., Gwinn,M.L., Dodson,R.J., Haft,D.H., Hickey,E.K., Peterson,J.D., Umayam,L.A., Gill,S.R., Nelson,K.E., Read,T.D., Tettelin,H., Richardson,D., Ermolaeva,M.D., Vamathevan,J., Bass,S., Qin,H., Dragoi,I., "Direct Submission", Submitted (18-SEP-2001) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Heidelberg,J.F., Eisen,J.A., Nelson,W.C., Clayton,R.A., Gwinn,M.L., Dodson,R.J., Haft,D.H., Hickey,E.K., Peterson,J.D., Umayam,L.A., Gill,S.R., Nelson,K.E., Read,T.D., Tettelin,H., Richardson,D., Ermolaeva,M.D., Vamathevan,J., Bass,S., Qin,H., Dragoi,I., "Direct Submission", Submitted (14-JUN-2000) The Institute for Genomic Research, 9712 Medical Center Dr, Rockville, MD 20850, USA.


DNA sequence :
ATGATTTTTTCGAAAAATGTTTACGAGGAGTTAGTGGTGGTAAGATATTCACTCTTAATGAAAGTTTCTTTTGCAATACT
TATTTTCTTAGTTGGATGTAATGAGAATGCAACATCCAGTAACGACCAGTATTTGACTGATCCTGATATAAGTGAACAAA
CAAAGAAGCCATCAAGGCCTATAATTGATGAAAAAAATAAAGGTGTAACAGATACTTCGGTTACAATAGAATGGGATAAG
ATTGAGTGTGAAAAAAATTTTAGTCATTATAATGTAATTGTCTATAGAAAAGATCGAATAGAAGATGTAATAACTATCAG
AACTAGGAATAATAGTGTTTTTATCGATGATTTAAAACCTAACAGCCAGTATTCTATAGATGTCTCAAGTTGTTTACACT
CTGCTTGTTCAGAAAGCGCAAAAATAGAATTCATTACGTTGAACGAGATAGATTATTATCACACCACAGAAATAGAAAAA
AATGTGTATGGAAGCTTGGAAGGTGAAGTTAGATTTGTGCAAACGCATGTCATTTCTCCTGAGGGTAGAAAGAATGAGCC
TGAAATAATCACAGGAAGAGATGCATTAATATTGTTTAAGCCATCAATAAAAAACTCAAGTTCAATTTTGATGAAAATTT
ATTCAGAAGATGGACTCACTAGCAAAGTTGTAATGAAATCACCATCTATGTTGCCAAAAACTGATCAACCAATAGATATT
GATGAAAATAATAAAGTTGTAAGTTACTCTAACTCATATTGGAGTGCAGAGATACCATGGAATAAAATGAAAAGTGGTAT
GTCATTACATTTTGAAGACGAAAACGGCAACTTAGGTATTATTGAGTCAGAACGTATAAAATTTTCAGCACCTAGTGAAT
TGATTATTCAAAATATAGATCTTGGAATGTTATATAAACCAAGAGGAAGAAATATCGTAATAAAAGAATTGGAAAGAACA
GCCGTTGATTACTTTCAGAAAGTTCCTGTCTCAAAGTTGATTTTTTCAGATTATACCCCTATTCATTTTGAAAAAATAAC
ATTACCAAATGGTACAGTATATACTGAGAAAAGTGCTGATATAGGTGGATGGCATCAAGGTGATATGAGAGAAGCAGTAG
GGAAAGCACTAGTATCTACCGGAATAAATAATGCTAATTTAGGTATAGTTGCCTCGTCAGGATATTCTCAACAATACAAC
AGATTAACGAATCATATTACCGCACACACAAATATTGGATATTATAACAATGGAGTGGTTGTGCATGGAGGGAGTGGTGG
TGGTGGGATCGTTACACTAGAAAATACACTTCATAACGAATGGTCTCATGAATTGGGACATAACTATGGATTGGGACATT
ATGTTGCAGGTGGTACTAGTCATGGACCTGATACTTCATGGGGTTGGGATGGCTATTATAAAAGATTCATAGCTAACTTT
GATTGGAAACGTTCACCACAATCAAATATAAGGCCAGATAATCAAGAAGTTGTTAAGCCCTTCATGGACAAGTATACATA
TCTTTGGGATGCAATGTCTGGAGGATATGACCATCAAAATGGAATCATTAGTAGATATACACTTCATCATCCATATGTTG
CAAGAATTATCCAAGATTGGCTTAAAAATGGGGCTGTTGTAATAAATAATGATTATATGGTTTGGGATGAATTAAAAAAT
ATCTATGTGTATAAGGGAACGAACTTCAAAGTTCCAATAAAAAAAGGTGTACCTGTTGTGACGATATTAGGGGTTTATGA
TCCTGACAAAATTAATCCAAGTCAATTGTATCCTCCGACATACAGCAATTATGGAAATATATTCGATTTAGAAAAACCTC
GTTCAGAATCATCCTTAAAAGGGTGGCAATATGTTAAAGATGTCAACTATCTAGATAGAGTTAATACACATTGGCATACG
ATGCTCGTAAATAGAAAAGAAGAAAAAATATGTCGATTTTCTTATCTAAGCCCTAAAGGTAAAAAATTTGAATTTCTAGG
GTATGAAGACATTGAGAATAAAATATGCACAGGAGGTAGAAGTATTCACTATTTAGAAGACGGCAAGAAAAATCCAATAG
AATCCAAGTATAATGATTATTTTTTATTATCAATAGATGGTGATGGAGAAATAAGTTATGTTCCTGATTCTACTATTGGT
GAAAGTAAAATATGTTCACTAAAGATGTCTGGTACTGTATACGGTGCAGGTTTTATTAAAGGAAACTCTTGTCGGCAAAT
TGACGGTGTTTTTATGAACGGATTTCAATGGGCTTTTACATTAAATCAATCAGGAGTAAATAGTACCTATACATGGTCAA
ATGAATGTGTATTAAAAATTAAAGATAAAGATAATAATATTGAATCAATATCGATACCAAATTATAGAATAGAAAAAAAT
CAGAGTAATAAAATTCATCTTAATATAAGCAGAGAAAAGCCCATAATAGATATTAACGTGTATTGTGGAGAACATGAGTT
AACTAGCATAAAGGTTTCTGATAATCCTGATATAAAATTACTAAAAGGACCTATTATTGTTGGGCAAGAGCATGGTTACA
CAAGCTATGAGCCTAAGCTTCCTAGTGGTTGGTTCAAACATTATGACAATTTTGAACCCAAAAATGAAATCAACCATGAA
TTAGGAAAGATGCGTGTAAATGATAATGATGAATATATTTGTCGATTTAATTTTTCTGATTCAGATAGGGAAATGAAATT
TGTTGGTTATGTGAGTCAATTATCTGAAAGTAAGTACATTTGTACTGGTGGAAGTGAGATCTATTACAAGAAAAACGAAA
TTAATATTGAACTATCATCAAAAGAAAACGATTTTGAATGGTTATCAGTAAGAGATAAAAATTTGGTAGGATCAAAAATA
GAATTTGATAACAATAAGACATTGTGCGTATTAGATAATAGATCATTTTATGGTGCTGGTTACCTCGATGAAAACAATAG
ATGTACACAAGATAGACAAATTCATTGGTCTAATGGTAAACAATGGTTATTTAGTACTTATAAAACGATGACCTACCATT
AA

Protein sequence :
MIFSKNVYEELVVVRYSLLMKVSFAILIFLVGCNENATSSNDQYLTDPDISEQTKKPSRPIIDEKNKGVTDTSVTIEWDK
IECEKNFSHYNVIVYRKDRIEDVITIRTRNNSVFIDDLKPNSQYSIDVSSCLHSACSESAKIEFITLNEIDYYHTTEIEK
NVYGSLEGEVRFVQTHVISPEGRKNEPEIITGRDALILFKPSIKNSSSILMKIYSEDGLTSKVVMKSPSMLPKTDQPIDI
DENNKVVSYSNSYWSAEIPWNKMKSGMSLHFEDENGNLGIIESERIKFSAPSELIIQNIDLGMLYKPRGRNIVIKELERT
AVDYFQKVPVSKLIFSDYTPIHFEKITLPNGTVYTEKSADIGGWHQGDMREAVGKALVSTGINNANLGIVASSGYSQQYN
RLTNHITAHTNIGYYNNGVVVHGGSGGGGIVTLENTLHNEWSHELGHNYGLGHYVAGGTSHGPDTSWGWDGYYKRFIANF
DWKRSPQSNIRPDNQEVVKPFMDKYTYLWDAMSGGYDHQNGIISRYTLHHPYVARIIQDWLKNGAVVINNDYMVWDELKN
IYVYKGTNFKVPIKKGVPVVTILGVYDPDKINPSQLYPPTYSNYGNIFDLEKPRSESSLKGWQYVKDVNYLDRVNTHWHT
MLVNRKEEKICRFSYLSPKGKKFEFLGYEDIENKICTGGRSIHYLEDGKKNPIESKYNDYFLLSIDGDGEISYVPDSTIG
ESKICSLKMSGTVYGAGFIKGNSCRQIDGVFMNGFQWAFTLNQSGVNSTYTWSNECVLKIKDKDNNIESISIPNYRIEKN
QSNKIHLNISREKPIIDINVYCGEHELTSIKVSDNPDIKLLKGPIIVGQEHGYTSYEPKLPSGWFKHYDNFEPKNEINHE
LGKMRVNDNDEYICRFNFSDSDREMKFVGYVSQLSESKYICTGGSEIYYKKNEINIELSSKENDFEWLSVRDKNLVGSKI
EFDNNKTLCVLDNRSFYGAGYLDENNRCTQDRQIHWSNGKQWLFSTYKTMTYH
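
Consistency check (illustrative) :
The protein sequence above should correspond to a codon-by-codon translation of the DNA sequence. The following is a minimal sketch of such a check, assuming the standard genetic code and reading frame 1 from the first base. The helper name `translate` and the table-building approach are illustrative choices and not part of this record or of any database tooling.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# for each of the three positions ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (including line breaks from a wrapped sequence) is ignored.
    Codons containing characters other than A, C, G, T become "X".
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases of the DNA sequence in this record.
print(translate("ATGATTTTTTCGAAAAATGTTTAC"))  # MIFSKNVY
```

Passing the full DNA sequence (with its line breaks) to `translate` and comparing the result with the protein sequence, after joining its lines, would show whether the two fields of the record agree.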