Name : VC0395_A1363 (VC0395_A1363)
Accession : YP_001217306.1
PAI name : VPI-2
PAI accession : NC_009457_P3
Strain : Vibrio cholerae IEC224
Virulence or Resistance: Not determined
Product : type I restriction enzyme HsdR
Function : -
Note : identified by match to protein family HMM PF04313; match to protein family HMM PF04851
Homologs in the searched genomes : 81 hits (81 protein-level)
Publication :
-Feng,L., Reeves,P.R., Lan,R., Ren,Y., Gao,C., Zhou,Z., Ren,Y., Cheng,J., Wang,W., Wang,J., Qian,W., Li,D. and Wang,L., "A recalibrated molecular clock and independent origins for the cholera pandemic clones", PLoS ONE 3 (12), E4053 (2008) PUBMED 19115014.
-Feng,L., Reeves,P.R., Lan,R., Ren,Y., Gao,C., Zhou,Z., Ren,Y., Cheng,J., Wang,W., Wang,J., Qian,W., Li,D. and Wang,L., "Direct Submission", Submitted (18-MAY-2007) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.
-Heidelberg,J., "Direct Submission", Submitted (16-MAR-2007) The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA.
DNA sequence :
GTGGTATTTATGGTTAGTAAAACAAATGAACAGGCGCTAGAGGCCGCAATCGAAAAAGGTTTAGCAGGTATTTGTAAAGA
AGAGTTAGCGCTGGGTGAAGCGCCTCTTAATTACAATAATGATCTTTATCTTATTGGCTCTCCAAGTGACTTTGACAAGC
AGTACGCTTTAGATACCCGTTTGTTTTGGCAATTCCTCGAAGATACTCAAGCGAGTGAGTTAGAAAAACTTAAGCGCACC
AGCCCTCACGATTGGCAGAGAAAAATCCTTGAGCGTTTCGATCGTATGATCAAGCGCCATGGTGTTTTGCGACTATTGAA
AAAAGGGCTAGACGTTGACGATGCCTTTCTATCGTTGATGTACCCGGCACCACTCGCCAGTAGTTCTGAGAAGGTAAAAA
AAGATTTTTCTGCCAACCTGTTTAGTGTGACTCGCCAGGTTTGTTATTCCAATGCAAATCCATTGGAAGAAATTGACATG
GTGCTTTTTATCAATGGTATCCCTCTGATTACCCTTGAGCTTAAAAACCCATGGACGGGCCAGAACGCTGTTTATCATGG
TCAAAAGCAGTACCGCGATGATAGAGATGCAAATCAGCCATTGTTGAACTTTGCTCGTTGCTTGGTTCACATGGCGGTCG
ATACCGATGAAGTCTATATGACGACTAAGCTGGCAGGCAAAAATACCTTCTTCCTGCCGTTCAACAAAGGCTTCAACTTT
GGCAAAGGTAACCCGATTAATCCACATGGGCACAAGACGGCCTACTTGTGGCAAGAGGTATTCCGCAAAGAAAGCATCGC
CAATATTATTCAGCACTTTATTCGTCTTGATGGCAGCAGTAAAAAGCAGTTGGACAAACGAACTTTGTTCTTCCCTCGAT
ATCACCAAATGGATGTCGTACGTCGCTTGGTTGATCACTGCTCAGTTAATGGTGTTGGGCAAACGTATTTGATACAGCAC
TCAGCGGGGTCAGGTAAATCTAACTCAATTACATGGGCGGCGTATCAGCTCATCGAAACTTACCCTATTAGCGATGATCT
ACCAGGAAGTAGAGGAAAAGAAATGCCTCTATTCGATTCGGTCATCGTTGTTACTGACCGCCGATTGCTCGATAAGCAGT
TACGCGACAATATAAAAGAGTTCTCTGAAGTGAAGAACATTGTTGCGCCTGCGTTTAAATCGTCAGAACTAAAGTCAGCA
TTAGAGAATGGTAAGAAAATCATCATTACCACCATTCAAAAGTTTCCCTATATTGTCGATGGTATTGCAGACTTAAGTGA
TAGGCGCTTTGCTGTCATCATTGATGAAGCACACAGTTCGCAGGATGGGCATAACCAAGATAAGTTAAATGAAGCAATGG
GGTTTGTTTCGGAGGATGTTTTAGACAAAGCATTACAAAGTGCGAAAAATCGTAAGATGCGCTCAAATGCATCTTACTTT
GCATTCACCGCAACGCCCAAAAACACCACTCTAGAAAAGTTTGGTCAGCGACAAGCAGACGGAACTTATGTTCCTTTTCA
TTTGTACTCGATGAAACAAGCCATCGAAGAAGGGTTTATCCTCGATGTTATTGCCAATTACACGACCTATAAGAGCTACT
ACGAGATTGAAAAATCAATCCAAGATAACCCTGAGTTTGACAGCAAAAAAGCACAGAAGCGCTTACGAGCGTATGTAGAG
GCAAGCCAAGAGACGATAGACACTAAAGCCGAGATCATGCTTGAGCATTTCATCAAGCATGTCGTTAACGGCAAGAAGCT
AAAAGGTAAAGGCAAAGGTATGGTGGTGACTCAAAACATTGAGTCAGCCATACGTTACTATCGGGCACTAACCAGACAGC
TCAATAAAATGGGCAATCCGTTTAAAGTTGCCATTGCATTCTCTGGCTCAAAAGAAGTCGATGGGATTGAGTACACAGAA
GCTGACATTAATGGTTTCCCAGAAGGTGATACCAAAGATTACTTTGATGTGAACTACAAGCGTAAGGAGCCGGACTCTCC
AATCCCTAAGCACGTAGACCAAGATGCTTACCGGTTGCTGGTAGTGGCGAATAAGTATTTAACAGGCTTTGATCAGCCGA
AACTTTGTGCCATGTACGTGGATAAAAAGTTGGCTAGCGTTTTGTGTGTCCAAGCTCTGTCACGCTTAAACCGTTCAGCG
CCAAAGTACGGTAAGAAAACGGAAGATCTGTTTGTGTTGGATTTCTTCAACTCAGTGGATGATATCAAAACCGCATTCGA
CCCTTTCTATACATCCACAACGCTTTCTGAAGCGACAGATGTCAATGTTCTTCATGAGCTAAAAGATGATATGGACGACA
CAGATGTTTATGAGTGGTTTGAAGTTGAGGAGTTCAACAAGCGTTTCTTTGAAGGACGAGAGGCTCAGGACCTAAGCCCA
ATCATTGACATCGCAGCTGCGCGTTTTAACCACGAGTTAGAACTAGAGAACGAGTTTAAAGTCGATTTCAAAGTGAAAGC
CAAGCAGTTTGTGAAAATCTACGGCCAGATTGCATCTATCATGCCTTATGAGGTCGTTCAGTGGGAAAAACTGTTCTGGT
TCTTGAAATTTTTGATTCCGAAGCTTTCCGTCGAAGACCCAGATAAAGAAGCACTAGACTCATTACTAGATTCAGTTGAT
TTGAGCTCTTATGGATTACAAAGGGTTAAGCTTAACCATTCCATCGAGCTAGATGACTCTGAAACTGAGTTAGATCCTCA
AAACCCGAACCCGCGCGGGGCTTATGGTCCTGAAGCTGAGAAAGATCCGTTAGATGAGATCATCAAAATCTTTAACGAAC
GCTGGTTCCAAGGCTGGAGCGCAACACCAGAAGAGCAGAGAGTTAAGTTTGTGAACATTGCTGAGAGCATCCGAAATCAC
CCAGATTTCGAAGCTAAATACCAAAACAACGCAGACCCTCATACGCGAGAGTTAGCGTTTGAAAAGATGTTGAAAGAAAT
CATGCTTCAACGTCGTAAAGACGAGTTAGAGCTCTACAAGCTGTTTGCCCAAGACCCAGCTTTTAAAGCGTCTTGGACGC
AGAGCATGCAGCGTATGGTTGGGATGTAA
Protein sequence :
MVFMVSKTNEQALEAAIEKGLAGICKEELALGEAPLNYNNDLYLIGSPSDFDKQYALDTRLFWQFLEDTQASELEKLKRT
SPHDWQRKILERFDRMIKRHGVLRLLKKGLDVDDAFLSLMYPAPLASSSEKVKKDFSANLFSVTRQVCYSNANPLEEIDM
VLFINGIPLITLELKNPWTGQNAVYHGQKQYRDDRDANQPLLNFARCLVHMAVDTDEVYMTTKLAGKNTFFLPFNKGFNF
GKGNPINPHGHKTAYLWQEVFRKESIANIIQHFIRLDGSSKKQLDKRTLFFPRYHQMDVVRRLVDHCSVNGVGQTYLIQH
SAGSGKSNSITWAAYQLIETYPISDDLPGSRGKEMPLFDSVIVVTDRRLLDKQLRDNIKEFSEVKNIVAPAFKSSELKSA
LENGKKIIITTIQKFPYIVDGIADLSDRRFAVIIDEAHSSQDGHNQDKLNEAMGFVSEDVLDKALQSAKNRKMRSNASYF
AFTATPKNTTLEKFGQRQADGTYVPFHLYSMKQAIEEGFILDVIANYTTYKSYYEIEKSIQDNPEFDSKKAQKRLRAYVE
ASQETIDTKAEIMLEHFIKHVVNGKKLKGKGKGMVVTQNIESAIRYYRALTRQLNKMGNPFKVAIAFSGSKEVDGIEYTE
ADINGFPEGDTKDYFDVNYKRKEPDSPIPKHVDQDAYRLLVVANKYLTGFDQPKLCAMYVDKKLASVLCVQALSRLNRSA
PKYGKKTEDLFVLDFFNSVDDIKTAFDPFYTSTTLSEATDVNVLHELKDDMDDTDVYEWFEVEEFNKRFFEGREAQDLSP
IIDIAAARFNHELELENEFKVDFKVKAKQFVKIYGQIASIMPYEVVQWEKLFWFLKFLIPKLSVEDPDKEALDSLLDSVD
LSSYGLQRVKLNHSIELDDSETELDPQNPNPRGAYGPEAEKDPLDEIIKIFNERWFQGWSATPEEQRVKFVNIAESIRNH
PDFEAKYQNNADPHTRELAFEKMLKEIMLQRRKDELELYKLFAQDPAFKASWTQSMQRMVGM