Gene Information

Name : HMPREF0868_0494 (HMPREF0868_0494)
Accession : YP_003474824.1
Strain : Clostridiales genomosp. UPII9-5
Genome accession: NC_013895
Putative virulence/resistance : Unknown
Product : HsdR family type I site-specific deoxyribonuclease
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : 3.1.21.3
Position : 1318535 - 1321603 bp
Length : 3069 bp
Strand : -
Note : identified by match to protein family HMM PF04313; match to protein family HMM PF04851; match to protein family HMM TIGR00348

DNA sequence :
ATGCCAGGATTTTATACAGAAGCGGACTATGAGAGTTCGATAATAGAATTATTCCAAAATATGGGATACAGGTATGTCTA
TGCACCAGATTTGGAGCGTGACTTTCATAGCCCCTTATACGAAGAGGAATTACTGTCAGCGTTACATAGGCTAAATCCAA
AAATGCCGGAAGATGCTATAGCCGATGCACTGTTCAAATTGAAAAATTTTGAAAATGCTGAGCTTGTCCAGAAGAATGAA
CTATTTATGGACTATCTTCAACACGGAATTGAAGTTAGGTACTTCGTGGATGGCGAGGAACGCTTCGGCCTCGTATATAT
TGTCGACTATAAAAATCCTGATAATAACTCCTTTGTTGTAGCAAATCAGTGGACTTTTATTGAGAACAGCAATAAACGTC
CGGATGTGCTTCTGTTTTTAAATGGCATGCCAGTTGTACTTGTTGAGCTGAAATCGCCATCTCGTGAAGAGACTGATGCT
TCAGAGGGCTATCTGCAGATTAGAAACTATATGCAAGAAATCCCGTCGATGTTTATATATAACTGCATTTGTGTTATTAG
CGATCATCTGACTAGCAAAGCCGGCACCATCACTTCCGGTGAGGATCGCTTCATGGAATGGAAAACAAAAGACGGTAGTT
ATGAGAATACACAATACGCTCAGTTTGACACATTCTTTGAGGGAATGTTTGAAAAAGAGCGTTTGCTTGACATCATCAAA
AACTTTATTTGTTTCTCCAATGACGGGTTGAATAAGTTTAAGATTCTGGCGGGTTATCACCAGTATTTTGCAGTTCGAAA
GGCTATCGAATCTACAAAAAACGCAACAGTCACTGACGGTAAAGGTGGCGTATTCTGGCATACACAGGGCAGCGGGAAAT
CTCTATCTATGGTTTTTTATGCCCACCTTCTGCAGGAAGTATTGGATAGCCCAACTATCGTAGTAATTACAGACCGTAAC
GATCTTGATGATCAGCTTTACGGACAGTTTGCTAAGTGCAAAGATTTTCTACGTCAGAATCCGGTACATGCTACCTGTAG
GAAATTGACAGAGACTTCCGGTAAAAATGATATCGGATTGAAAGACTGGCTGGAAGGCAGGCAGGCAAACGGCATCATTT
TTACAACGATGCAGAAATTTGAAGAATCATCAGAGCCACTTTCTAAGCGCCGTAACATCATCGTGATGGCTGATGAAGCG
CATCGTAGCCAGTATGGGTTGAAAGAAAAAATTGACGCTAAAACCGGTGAGATAAAGACTGGAACGGCACGTATCATACG
TGACAGTCTGCCAAACGCTACATATATTGGCTTTACTGGAACACCTATTGCCGCAAAAGATAGAAATACTCGCGAAGTGT
TCGGTGACTACATCGATATTTACGATATGACGCAAGCTGTAGAAGACGGTGCTACAAGACCGGTCTATTACGAAAGCCGC
GTTATTAAGCTAAAATTTGATGAGCCTACGCTTCATCTGATCGATCAGGAATACGACATTATGGCAAATAATGCTGACCC
TGAAGTGATTGAGAAAAGCAAGAAAGAGCTTAGCCAGATGGATGCTGTTCTAGGCAATGACGCTACTATTGATTCTCTTA
CTAACGACATTATCAGCCACTACGAAAACTATCGAGAAAACCTGCTAACCGGAAAAGCAATGATAGTTGCTTATTCTCGC
GAAATCGCTATGAAAATTTACAAGCGTATTCTTGAGGTTCGCCCAAGCTGGCAGAAAAAGGTTAAGGTCGTAATGACCGA
GAGTAATAAAGATCCTGAAGAGTGGCGTGCTGTTATTGGAAATAAGCATCACAGGGATGAGCTTGCTAAAGAATTCAAAG
ATAATAATAGCGAGATGAAAATCGCCATAGTTGTTGATATGTGGCTTACAGGATTTGATGTTCCTTCTCTTGCAACAATG
TATGTCTATAAGCCAATGCAGGGATATAACTTGATGCAAGCCATTGCCCGCGTCAATCGTGTTTTTAAGGATAAAGAAGG
CGGTTTGATTGTTGACTACGTGGGCATAGCGTCAGCATTGAAACAAGCCATGAATGACTATACAGCTCGCGATAAAAAGA
ACTATGGGGATACCGATATTGCTAAAGTTGCATATCCAAAGTTCCTTGAGAAGCTTTCCATTTGCCGTGATCTATTCCAC
GGATATGATTATTCCAAGTTTACGAATGGTACTGATCTAGAACGCTCAAAGACTATCACCGGTGCAGTTAACTTTATTGT
GAGCATTGACAAGGAAAGAGAACGTGAAGACTTCATCAAAGAGGCACTGCTATTGCATCAGGCTTTGTCGCTCTGCTCAT
CACTTGTAGAGAGAGGCCTTCGAGTAGAAGCTGCATTCTTTGAATCTGTTCGCGTGCTTGTTATGCGCTTAATAAATCAG
GGTGAAGATAAAAAAATATCTCTGCCGGAAATGAACGCTCGCATCAATGAACTTCTGAAATCCAGCATCAAGAGTGATGG
CGTTATTAATTTGTTCTCTGATGTTAAAGAAGAATTCTCCCTATTTGATCCTAAATTTCTTGAGGAAATTTCAAAGATGA
AGGAGAAAAACCTTGCTGTTGAACTTTTGAAAAAATTGATTGCTGAGCAGATACAGATCTATAGACGATCAAATGTAGTC
AAATCCGAGAAGTTCAGCGAAATTATTCAAGGTGTTATGAATCGATATCTTAATGGGATGCTGACAAATGAAGAGGTTAT
TGAAGAGCTTTTGAAAATGGCACAGCAAATTCAAGATGCCCACAAGGCTGGAGATGAACTTGGCCTTTCTGAAGATGAGC
TAGCCTTTTATGACGCATTAACCAAACCACAGGCTATTAAGGATTTCTATGAAAATGATGAGCTGATTGCCATCACTAAA
GAACTTACTGAAACACTTCGTAAGAATCGTACGATTGATTGGCAAAAACGTGATTCCGCACGTGCTAAAATGCGCATGAT
GATTAAAAGGCTTCTCAAACAGCACCGCTATCCACCTGAGGGTATGGAGGATGCTGTCAAAACAGTAATGACTCAGTGTG
AGCTGTGGACGGATAGAACAGATCTATAG

Protein sequence :
MPGFYTEADYESSIIELFQNMGYRYVYAPDLERDFHSPLYEEELLSALHRLNPKMPEDAIADALFKLKNFENAELVQKNE
LFMDYLQHGIEVRYFVDGEERFGLVYIVDYKNPDNNSFVVANQWTFIENSNKRPDVLLFLNGMPVVLVELKSPSREETDA
SEGYLQIRNYMQEIPSMFIYNCICVISDHLTSKAGTITSGEDRFMEWKTKDGSYENTQYAQFDTFFEGMFEKERLLDIIK
NFICFSNDGLNKFKILAGYHQYFAVRKAIESTKNATVTDGKGGVFWHTQGSGKSLSMVFYAHLLQEVLDSPTIVVITDRN
DLDDQLYGQFAKCKDFLRQNPVHATCRKLTETSGKNDIGLKDWLEGRQANGIIFTTMQKFEESSEPLSKRRNIIVMADEA
HRSQYGLKEKIDAKTGEIKTGTARIIRDSLPNATYIGFTGTPIAAKDRNTREVFGDYIDIYDMTQAVEDGATRPVYYESR
VIKLKFDEPTLHLIDQEYDIMANNADPEVIEKSKKELSQMDAVLGNDATIDSLTNDIISHYENYRENLLTGKAMIVAYSR
EIAMKIYKRILEVRPSWQKKVKVVMTESNKDPEEWRAVIGNKHHRDELAKEFKDNNSEMKIAIVVDMWLTGFDVPSLATM
YVYKPMQGYNLMQAIARVNRVFKDKEGGLIVDYVGIASALKQAMNDYTARDKKNYGDTDIAKVAYPKFLEKLSICRDLFH
GYDYSKFTNGTDLERSKTITGAVNFIVSIDKEREREDFIKEALLLHQALSLCSSLVERGLRVEAAFFESVRVLVMRLINQ
GEDKKISLPEMNARINELLKSSIKSDGVINLFSDVKEEFSLFDPKFLEEISKMKEKNLAVELLKKLIAEQIQIYRRSNVV
KSEKFSEIIQGVMNRYLNGMLTNEEVIEELLKMAQQIQDAHKAGDELGLSEDELAFYDALTKPQAIKDFYENDELIAITK
ELTETLRKNRTIDWQKRDSARAKMRMMIKRLLKQHRYPPEGMEDAVKTVMTQCELWTDRTDL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 42
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 42
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 42