Name : VC1765 (VC1765)
Accession : NP_231400.1
PAI name : VPI-2
PAI accession : NC_002505_P5
Strain : Vibrio cholerae IEC224
Virulence or Resistance: Not determined
Product : type I restriction enzyme HsdR
Function : -
Note : similar to GB:M63891 PID:551948; identified by sequence similarity
Homologs in the searched genomes : 81 hits ( 81 protein-level )
Publication :
-Heidelberg,J.F., Eisen,J.A., Nelson,W.C., Clayton,R.A., Gwinn,M.L., Dodson,R.J., Haft,D.H., Hickey,E.K., Peterson,J.D., Umayam,L.A., Gill,S.R., Nelson,K.E., Read,T.D., Tettelin,H., Richardson,D., Ermolaeva,M.D., Vamathevan,J., Bass,S., Qin,H., Dragoi,I., , "DNA sequence of both chromosomes of the cholera pathogen Vibrio cholerae", Nature 406 (6795), 477-483 (2000) PUBMED 10952301.
-Heidelberg,J.F., Eisen,J.A., Nelson,W.C., Clayton,R.A., Gwinn,M.L., Dodson,R.J., Haft,D.H., Hickey,E.K., Peterson,J.D., Umayam,L.A., Gill,S.R., Nelson,K.E., Read,T.D., Tettelin,H., Richardson,D., Ermolaeva,M.D., Vamathevan,J., Bass,S., Qin,H., Dragoi,I., , "Direct Submission", Submitted (18-SEP-2001) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.
-Heidelberg,J.F., Eisen,J.A., Nelson,W.C., Clayton,R.A., Gwinn,M.L., Dodson,R.J., Haft,D.H., Hickey,E.K., Peterson,J.D., Umayam,L.A., Gill,S.R., Nelson,K.E., Read,T.D., Tettelin,H., Richardson,D., Ermolaeva,M.D., Vamathevan,J., Bass,S., Qin,H., Dragoi,I., , "Direct Submission", Submitted (14-JUN-2000) The Institute for Genomic Research, 9712 Medical Center Dr, Rockville, MD 20850, USA.
DNA sequence : | |
GTGGTATTTATGGTTAGTAAAACAAATGAACAGGCGCTAGAGGCCGCAATCGAAAAAGGTTTAGCAGGTATTTGTAAAGA
AGAGTTAGCGCTGGGTGAAGCGCCTCTTAATTACAATAATGATCTTTATCTTATTGGCTCTCCAAGTGACTTTGACAAGC
AGTACGCTTTAGATACCCGTTTGTTTTGGCAATTCCTCGAAGATACTCAAGCGAGTGAGTTAGAAAAACTTAAGCGCACC
AGCCCTCACGATTGGCAGAGAAAAATCCTTGAGCGTTTCGATCGTATGATCAAGCGCCATGGTGTTTTGCGACTATTGAA
AAAAGGGCTAGACGTTGACGATGCCTTTCTATCGTTGATGTACCCGGCACCACTCGCCAGTAGTTCTGAGAAGGTAAAAA
AAGATTTTTCTGCCAACCTGTTTAGTGTGACTCGCCAGGTTTGTTATTCCAATGCAAATCCATTGGAAGAAATTGACATG
GTGCTTTTTATCAATGGTATCCCTCTGATTACCCTTGAGCTTAAAAACCCATGGACGGGCCAGAACGCTGTTTATCATGG
TCAAAAGCAGTACCGCGATGATAGAGATGCAAATCAGCCATTGTTGAACTTTGCTCGTTGCTTGGTTCACATGGCGGTCG
ATACCGATGAAGTCTATATGACGACTAAGCTGGCAGGCAAAAATACCTTCTTCCTGCCGTTCAACAAAGGCTTCAACTTT
GGCAAAGGTAACCCGATTAATCCACATGGGCACAAGACGGCCTACTTGTGGCAAGAGGTATTCCGCAAAGAAAGCATCGC
CAATATTATTCAGCACTTTATTCGTCTTGATGGCAGCAGTAAAAAGCAGTTGGACAAACGAACTTTGTTCTTCCCTCGAT
ATCACCAAATGGATGTCGTACGTCGCTTGGTTGATCACTGCTCAGTTAATGGTGTTGGGCAAACGTATTTGATACAGCAC
TCAGCGGGGTCAGGTAAATCTAACTCAATTACATGGGCGGCGTATCAGCTCATCGAAACTTACCCTATTAGCGATGATCT
ACCAGGAAGTAGAGGAAAAGAAATGCCTCTATTCGATTCGGTCATCGTTGTTACTGACCGCCGATTGCTCGATAAGCAGT
TACGCGACAATATAAAAGAGTTCTCTGAAGTGAAGAACATTGTTGCGCCTGCGTTTAAATCGTCAGAACTAAAGTCAGCA
TTAGAGAATGGTAAGAAAATCATCATTACCACCATTCAAAAGTTTCCCTATATTGTCGATGGTATTGCAGACTTAAGTGA
TAGGCGCTTTGCTGTCATCATTGATGAAGCACACAGTTCGCAGGATGGGCATAACCAAGATAAGTTAAATGAAGCAATGG
GGTTTGTTTCGGAGGATGTTTTAGACAAAGCATTACAAAGTGCGAAAAATCGTAAGATGCGCTCAAATGCATCTTACTTT
GCATTCACCGCAACGCCCAAAAACACCACTCTAGAAAAGTTTGGTCAGCGACAAGCAGACGGAACTTATGTTCCTTTTCA
TTTGTACTCGATGAAACAAGCCATCGAAGAAGGGTTTATCCTCGATGTTATTGCCAATTACACGACCTATAAGAGCTACT
ACGAGATTGAAAAATCAATCCAAGATAACCCTGAGTTTGACAGCAAAAAAGCACAGAAGCGCTTACGAGCGTATGTAGAG
GCAAGCCAAGAGACGATAGACACTAAAGCCGAGATCATGCTTGAGCATTTCATCAAGCATGTCGTTAACGGCAAGAAGCT
AAAAGGTAAAGGCAAAGGTATGGTGGTGACTCAAAACATTGAGTCAGCCATACGTTACTATCGGGCACTAACCAGACAGC
TCAATAAAATGGGCAATCCGTTTAAAGTTGCCATTGCATTCTCTGGCTCAAAAGAAGTCGATGGGATTGAGTACACAGAA
GCTGACATTAATGGTTTCCCAGAAGGTGATACCAAAGATTACTTTGATGTGAACTACAAGCGTAAGGAGCCGGACTCTCC
AATCCCTAAGCACGTAGACCAAGATGCTTACCGGTTGCTGGTAGTGGCGAATAAGTATTTAACAGGCTTTGATCAGCCGA
AACTTTGTGCCATGTACGTGGATAAAAAGTTGGCTAGCGTTTTGTGTGTCCAAGCTCTGTCACGCTTAAACCGTTCAGCG
CCAAAGTACGGTAAGAAAACGGAAGATCTGTTTGTGTTGGATTTCTTCAACTCAGTGGATGATATCAAAACCGCATTCGA
CCCTTTCTATACATCCACAACGCTTTCTGAAGCGACAGATGTCAATGTTCTTCATGAGCTAAAAGATGATATGGACGACA
CAGATGTTTATGAGTGGTTTGAAGTTGAGGAGTTCAACAAGCGTTTCTTTGAAGGACGAGAGGCTCAGGACCTAAGCCCA
ATCATTGACATCGCAGCTGCGCGTTTTAACCACGAGTTAGAACTAGAGAACGAGTTTAAAGTCGATTTCAAAGTGAAAGC
CAAGCAGTTTGTGAAAATCTACGGCCAGATTGCATCTATCATGCCTTATGAGGTCGTTCAGTGGGAAAAACTGTTCTGGT
TCTTGAAATTTTTGATTCCGAAGCTTTCCGTCGAAGACCCAGATAAAGAAGCACTAGACTCATTACTAGATTCAGTTGAT
TTGAGCTCTTATGGATTACAAAGGGTTAAGCTTAACCATTCCATCGAGCTAGATGACTCTGAAACTGAGTTAGATCCTCA
AAACCCGAACCCGCGCGGGGCTTATGGTCCTGAAGCTGAGAAAGATCCGTTAGATGAGATCATCAAAATCTTTAACGAAC
GCTGGTTCCAAGGCTGGAGCGCAACACCAGAAGAGCAGAGAGTTAAGTTTGTGAACATTGCTGAGAGCATCCGAAATCAC
CCAGATTTCGAAGCTAAATACCAAAACAACGCAGACCCTCATACGCGAGAGTTAGCGTTTGAAAAGATGTTGAAAGAAAT
CATGCTTCAACGTCGTAAAGACGAGTTAGAGCTCTACAAGCTGTTTGCCCAAGACCCAGCTTTTAAAGCGTCTTGGACGC
AGAGCATGCAGCGTATGGTTGGGATGTAA
|
Protein sequence : | |
MVFMVSKTNEQALEAAIEKGLAGICKEELALGEAPLNYNNDLYLIGSPSDFDKQYALDTRLFWQFLEDTQASELEKLKRT
SPHDWQRKILERFDRMIKRHGVLRLLKKGLDVDDAFLSLMYPAPLASSSEKVKKDFSANLFSVTRQVCYSNANPLEEIDM
VLFINGIPLITLELKNPWTGQNAVYHGQKQYRDDRDANQPLLNFARCLVHMAVDTDEVYMTTKLAGKNTFFLPFNKGFNF
GKGNPINPHGHKTAYLWQEVFRKESIANIIQHFIRLDGSSKKQLDKRTLFFPRYHQMDVVRRLVDHCSVNGVGQTYLIQH
SAGSGKSNSITWAAYQLIETYPISDDLPGSRGKEMPLFDSVIVVTDRRLLDKQLRDNIKEFSEVKNIVAPAFKSSELKSA
LENGKKIIITTIQKFPYIVDGIADLSDRRFAVIIDEAHSSQDGHNQDKLNEAMGFVSEDVLDKALQSAKNRKMRSNASYF
AFTATPKNTTLEKFGQRQADGTYVPFHLYSMKQAIEEGFILDVIANYTTYKSYYEIEKSIQDNPEFDSKKAQKRLRAYVE
ASQETIDTKAEIMLEHFIKHVVNGKKLKGKGKGMVVTQNIESAIRYYRALTRQLNKMGNPFKVAIAFSGSKEVDGIEYTE
ADINGFPEGDTKDYFDVNYKRKEPDSPIPKHVDQDAYRLLVVANKYLTGFDQPKLCAMYVDKKLASVLCVQALSRLNRSA
PKYGKKTEDLFVLDFFNSVDDIKTAFDPFYTSTTLSEATDVNVLHELKDDMDDTDVYEWFEVEEFNKRFFEGREAQDLSP
IIDIAAARFNHELELENEFKVDFKVKAKQFVKIYGQIASIMPYEVVQWEKLFWFLKFLIPKLSVEDPDKEALDSLLDSVD
LSSYGLQRVKLNHSIELDDSETELDPQNPNPRGAYGPEAEKDPLDEIIKIFNERWFQGWSATPEEQRVKFVNIAESIRNH
PDFEAKYQNNADPHTRELAFEKMLKEIMLQRRKDELELYKLFAQDPAFKASWTQSMQRMVGM
|
|