PAI Gene Information


Name : VPI2_0013c (VPI2_0013c)
Accession : ACA01830.1
PAI name : VPI-2
PAI accession : EU272902
Strain : Vibrio cholerae IEC224
Virulence or Resistance: Not determined
Product : type I restriction enzyme HsdR
Function : -
Note : COG0610; similar to AA sequence (same species): RefSeq:NP_231400.1; protein motif: InterPro:IPR007409; AA sequence similar to TIGR locus VC1765 in Vibrio cholerae strain N16961
Homologs in the searched genomes : 81 hits (81 protein-level)
Publication :
    -Coelho,A., "Direct Submission", Submitted (09-NOV-2007) Genetica, Universidade Federal do Rio de Janeiro, Rua Prof. Rodolpho Rocco, CCS, Bl. A-Ilha do Fundao, Rio de Janeiro, RJ 21949-0, Brazil REMARK Sequence update by submitter.

    -Figueiredo,S.C., Neves-Borges,A.C. and Coelho,A., "The neuraminidase gene is present in the non-toxigenic Vibrio cholerae Amazonia strain: a different allele in comparison to the pandemic strains", Mem. Inst. Oswaldo Cruz 100 (6), 563-569 (2005) PUBMED 16302067.

    -Figueiredo,S.C.A. and Coelho,A., "Direct Submission", Submitted (14-NOV-2004) Genetica, Universidade Federal do Rio de Janeiro, Rua Prof. Rodolpho Rocco, CCS, Bl. A-Ilha do Fundao, Rio de Janeiro, RJ 21949-0, Brazil.

    -Figueiredo,S.C.A. and Coelho,A., "Direct Submission", Submitted (03-MAR-2005) Genetica, Universidade Federal do Rio de Janeiro, Rua Prof. Rodolpho Rocco, CCS, Bl. A-Ilha do Fundao, Rio de Janeiro, RJ 21949-0, Brazil REMARK Sequence update by submitter.

    -Figueiredo,S.C.A., Reis,R.C., Goncalves,M.S.M., Beltrao,P.J.M.S.I. and Coelho,A., "The VPI-2 pathogenicity island of Vibrio cholerae Amazonia", Unpublished.


DNA sequence :
GTGGTATTTATGGTCAGTAAAACTAATGAACAGGCGCTAGAGGCCGCAATCGAAAAAGGTTTAGCAGGTATTTGTAAAGA
AGAGTTAGCGCTGGGTGAAGCGCCTCTTAATTACAATAATGATCTTTATCTTATTGGCTCTCCAAGTGACTTTGACAAGC
AGTACGCTTTAGATACCCGTTTGTTTTGGCAATTCCTCGAAGATACTCAAGCGAGTGAGTTAGAAAAACTTAAGCGCACC
AGCCCTCACGATTGGCAGAGAAAAATCCTTGAGCGTTTCGATCGTATGATCATGCGCCATGGTGTTTTGCGACTATTGAA
AAAAGGGCTAGACGTTGACGATGCCTTTCTATCGTTGATGTACCCGGCACCACTCGCCAGTAGTTCTGAGAAGGTAAAAA
AAGATTTTTCTGCCAACCTGTTTAGTGTGACTCGCCAGGTTTGTTATTCCAATGCAAATCCATTGGAAGAAATTGACATG
GTGCTTTTTATCAATGGTATCCCTCTGATTACCCTTGAGCTTAAAAACCCATGGACGGGCCAGAACGCTGTTTATCATGG
TCAAAAGCAGTACCGCGATGATAGAGATGCAAATCAGCCATTGTTGAACTTTGCTCGTTGCTTGGTTCACATGGCGGTCG
ATACCGATGAAGTCTATATGACGACTAAGCTGGCAGGCAAAAATACCTTCTTCCTGCCGTTCAACAAAGGCTTCAACTTT
GGCAAAGGTAACCCGATCAATCCACATGGGCACAAGACGGCCTACTTGTGGCAAGAGGTATTCCGCAAAGAAAGCATCGC
CAATATTATTCAGCACTTTATTCGTCTTGATGGCAGCAGTAAAAAGCAGTTGGACAAACGAACTTTGTTCTTCCCTCGAT
ATCACCAAATGGATGTCGTGCGTCGCTTGGTTGATCACTGCTCAGTTAATGGTGTTGGGCAAACGTATTTGATACAGCAC
TCAGCGGGGTCAGGTAAATCTAACTCAATTACATGGGCGGCGTATCAGCTCATCGAAACTTACCCTATTAGCGATGATCT
ACCAGGAAGTAGAGGAAAAGAAGTGCCTCTATTCGATTCGGTCATCGTTGTTACTGACCGCCGATTGCTCGATAAGCAGT
TACGCGACAATATAAAAGAGTTCTCTGAAGTGAAGAACATTGTTGCGCCTGCGTTTAAATCGTCAGAACTAAAGTCAGCA
TTAGAGAATGGTAAGAAAATCATCATTACCACCATTCAAAAGTTTCCTTATATTGTCGATGGTATTGCAGACTTAAGTGA
TAGGCGCTTTGCTGTCATCATTGATGAAGCACACAGTTCGCAGGATGGGCATAACCAAGATAAGTTAAATGAAGCAATGG
GGTTTGTTTCGGAGGATGTTTTAGACAAAGCATTACAAAGTGCGAAAAATCGTAAGATGCGCTCAAATGCATCTTACTTT
GCATTCACCGCAACGCCCAAAAACACCACTCTAGAAAAGTTTGGTCAGCGACAAGCAGACGGAACTTATGTTCCTTTTCA
TTTGTACTCGATGAAACAAGCCATCGAAGAAGGGTTTATCCTCGATGTTATTGCCAATTACACGACCTATAAGAGCTACT
ACGAGATTGAAAAATCAATCCAAGATAACCCTGAGTTTGACAGCAAAAAAGCGCAGAAGCGCTTACGAGCGTATGTAGAG
GCAAGCCAAGAGACGATAGACACTAAAGCCGAGATCATGCTTGAGCATTTCATCAAGCATGTCGTTAACGGCAAGAAGCT
AAAAGGTAAAGGCAAAGGTATGGTGGTGACTCAAAACATTGAGTCAGCCATACGTTACTATCGGGCACTAACCATACAGC
TCAATAAAATGGGCAATCCGTTTAAAGTTGCCATTGCATTCTCTGGCTCAAAAGAAGTCGATGGGATTGAGTACACAGAA
GCTGACATTAATGGTTTCCCAGAAGGTGATACCAAAGATTACTTTGATGTGAACTACAAGCGTAAGGAGCCGGACTCTCC
AATCCCTAAGCACGTAGACCAAGATGCTTACCGGTTGCTGGTAGTGGCGAATAAGTATTTAACAGGCTTTGATCAGCCGA
AACTTTGTGCCATGTACGTGGATAAAAAGTTGGCTAGCGTTTTGTGTGTCCAAGCCCTGTCACGCTTAAACCGTTCAGCG
CCAAAGTACGGTAAGAAAACGGAAGATCTGTTTGTGTTGGATTTTTTCAACTCAGTGGATGATATCAAAACCGCATTCGA
CCCTTTCTATACATCCACAACGCTTTCTGAAGCGACAGATGTCAATGTTCTTCATGAGCTAAAAGATGATATGGACGACA
CAGGTGTTTATGAGTGGTTTGAAGTTGAGGAGTTCAACAAGCGTTTCTTTGAAGGACGAGAGGCTCAGGACCTAAGCCCA
ATCATTGACATCGCAGCTGCGCGTTTTAACCACGAGTTAGAACTAGAGAACGAGTTTAAAGTCGATTTCAAAGTGAAAGC
CAAGCAGTTTGTGAAAATCTACGGCCAGATTGCATCTATCATGCCTTATGAGGTCGTTCAGTGGGAAAAACTGTTCTGGT
TCTTGAAATTTTTGATTCCGAAGCTTTCCGTCGAAGACCCAGATAAAGAAGCACTAGACTCATTACTAGATTCAGTTGAT
TTGAGCTCTTATGGATTACAAAGGGTTAAGCTTAACCATTCCATCGAGCTAGATGACTCTGAAACTGAGTTAGATCCTCA
AAACCCGAACCCGCGCGGGGCTTATGGTCCTGAAGCTGAGAAAGATCCGTTAGATGAGATCATCAAAATCTTTAACGAAC
GCTGGTTCCAAGGCTGGAGCGCAACACCAGAAGAGCAGAGAGTTAAGTTTGTGAACATTGCTGAGAGCATCCGAAATCAC
CCAGATTTCGAAGCTAAATACCAAAACAACGCAGACCCTCATACGCGAGAGTTAGCGTTTGAAAAGATGTTGAAAGAAAT
CATGCTTCAACGTCGTAAAGACGAGTTAGAGCTCTACAAGCTGTTTGCCCAAGACCCAGCTTTTAAAGCGTCTTGGACGC
AGAGCATGCAGCGTATGGTTGGGATGTAA

Protein sequence :
MVFMVSKTNEQALEAAIEKGLAGICKEELALGEAPLNYNNDLYLIGSPSDFDKQYALDTRLFWQFLEDTQASELEKLKRT
SPHDWQRKILERFDRMIMRHGVLRLLKKGLDVDDAFLSLMYPAPLASSSEKVKKDFSANLFSVTRQVCYSNANPLEEIDM
VLFINGIPLITLELKNPWTGQNAVYHGQKQYRDDRDANQPLLNFARCLVHMAVDTDEVYMTTKLAGKNTFFLPFNKGFNF
GKGNPINPHGHKTAYLWQEVFRKESIANIIQHFIRLDGSSKKQLDKRTLFFPRYHQMDVVRRLVDHCSVNGVGQTYLIQH
SAGSGKSNSITWAAYQLIETYPISDDLPGSRGKEVPLFDSVIVVTDRRLLDKQLRDNIKEFSEVKNIVAPAFKSSELKSA
LENGKKIIITTIQKFPYIVDGIADLSDRRFAVIIDEAHSSQDGHNQDKLNEAMGFVSEDVLDKALQSAKNRKMRSNASYF
AFTATPKNTTLEKFGQRQADGTYVPFHLYSMKQAIEEGFILDVIANYTTYKSYYEIEKSIQDNPEFDSKKAQKRLRAYVE
ASQETIDTKAEIMLEHFIKHVVNGKKLKGKGKGMVVTQNIESAIRYYRALTIQLNKMGNPFKVAIAFSGSKEVDGIEYTE
ADINGFPEGDTKDYFDVNYKRKEPDSPIPKHVDQDAYRLLVVANKYLTGFDQPKLCAMYVDKKLASVLCVQALSRLNRSA
PKYGKKTEDLFVLDFFNSVDDIKTAFDPFYTSTTLSEATDVNVLHELKDDMDDTGVYEWFEVEEFNKRFFEGREAQDLSP
IIDIAAARFNHELELENEFKVDFKVKAKQFVKIYGQIASIMPYEVVQWEKLFWFLKFLIPKLSVEDPDKEALDSLLDSVD
LSSYGLQRVKLNHSIELDDSETELDPQNPNPRGAYGPEAEKDPLDEIIKIFNERWFQGWSATPEEQRVKFVNIAESIRNH
PDFEAKYQNNADPHTRELAFEKMLKEIMLQRRKDELELYKLFAQDPAFKASWTQSMQRMVGM
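
The DNA and protein sequences above can be cross-checked by translating the coding sequence. The following is a minimal sketch, assuming the bacterial genetic code (NCBI translation table 11), under which an alternative start codon such as GTG is read as methionine at the first position. The function name `translate_cds` and the reduced start-codon set are illustrative assumptions, not part of this record. The two example fragments are the first 30 and the last 27 nucleotides of the DNA sequence above.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 shares these with the standard
# code and differs only in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of bacterial start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        aa = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the DNA sequence above (starts with GTG).
print(translate_cds("GTGGTATTTATGGTCAGTAAAACTAATGAA"))  # MVFMVSKTNE
# Last 27 nt of the DNA sequence above (in frame, ends with the TAA stop).
print(translate_cds("AGCATGCAGCGTATGGTTGGGATGTAA"))     # SMQRMVGM
```

Both results agree with the start and end of the protein sequence listed above. Passing the full DNA sequence to the same function, with the line breaks left in, gives a translation that can be compared against the complete protein sequence.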