Gene Information

Name : Dgeo_2012 (Dgeo_2012)
Accession : YP_605475.1
Strain : Deinococcus geothermalis DSM 11300
Genome accession: NC_008025
Putative virulence/resistance : Unknown
Product : HsdR family type I site-specific deoxyribonuclease
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 2119917 - 2123057 bp
Length : 3141 bp
Strand : -
Note : KEGG: bte:BTH_I2740 type I restriction-modification system endonuclease, ev=0.0, 84% identity; TIGRFAM: type I site-specific deoxyribonuclease, HsdR family: (2.2e-229); PFAM: type III restriction enzyme, res subunit: (1.2e-10) protein of unknown function

DNA sequence :
ATGGCATTTCTGTCGGAAGCCGAGGTCGAAAACGCCCTTCTGGATCAGCTGCGCGCGCTCGGCTACAGCATCGAACGCGA
GGAGGACATCGGCCCCGACGGCCACCGGCCGGAGCGCGAGAGCCACGACGAGGTGGTGCTGCGCAAGCGGTTCGAGGACG
CCGTTGCCCGCCTGAACCCCGGGGTGCCACTGGAGGCGCGTCATGATGCCGTGCGGCGCGTGACGCAGTCCGAGCTGCCA
TCGTTGCTTGCAGAAAACCGCCGCCTGCACAAACTGCTGACCGAAGGCGTGGATGTGGAGTACTACGCGGACGATGGCGT
GCTCACCGCCGGCAAGGCGCGGCTGATCGACTTCGATGACCCGGCCAACAACGACTGGCTGGCGGTGAACCAGTTCGTCG
TCATCAACGGGCAGTACCAGCGGCGACCCGATGTCGTGGTGTTCGTGAACGGCTTGCCGCTTGCGGTGATCGAGCTCAAG
GCGCCGGGCAACGACCAAGCCACGCTCACGGGCGCGTTCAACCAGTTGCAGACCTACAAGGGGGAGATCACCCAGCTTTT
TCGCACCAACGCACTGCTGGTCACGTCGGACGGCATTTCCGCTCGGGTGGGGTCGCTGTCGGCCGACTTCGAGCGCTTCA
TGCCGTGGCGCACCACCGACGGTCGGGAGGTGGCGCCCAAGGGAGCGCCGGAGCTGGAGACGCTGATCGAAGGCGTGTTC
GAACATCGCCGCTTGCTCGATCTGTTGCGTCACTTCACGGTCTTCGGCGAAACAGGCGCTGGGCTCATCAAGATCATCGC
GGGCTACCATCAGTTCCACGCGGTACGACATGCGGTCGAGCGCACGGTGGCCGCATCCTCTGCCGGGGGAGACAGAAAGG
CCGGGGTGATCTGGCATACCCAGGGCTCGGGCAAGAGCCTGTTGATGGCGTTCTACGCAGGTCTTCTCGTTAGACACCCG
GCGCTGGAAAACCCGACCCTGGTCGTGCTGACCGATCGCAACGACCTGGACGATCAGCTCTTCGCCACCTTTTCGATGTG
CCGCGACCTGATCCGGCAGACGCCGGTGCAAGCAGAGGGACGCGAGCACTTGAAAACGCTGCTGGACCGGGCCTCGGGCG
GGGTGATCTTCACGACGCTGCAAAAGTTTGGCGAGATCGACGGGCCACTGACCACCCGGCGCAACGTGGTGGTCATCGCC
GACGAGGCGCACCGCAGCCAGTACGGCTTCAAGGCCAAGGTGGATGCCAAGACGGGCGAGATCTCCTACGGCTTCGCCAA
GTACCTGCGAGACGCGCTGCCGAACGCCTCGTTCATCGGCTTCACCGGTACGCCCATCGAGGCAGGCGACGTGAACACCC
CGGCGGTGTTCGGCCACTACATCGACATTTACGACATCAGCCGCGCGGTGGAAGACGGCGCGACGGTGCCCATCTACTAC
GAATCGCGGTTGGCGCGCATTGAACTCGACGAGGACGAAAAGCCGAAGATCGACGCCGAGATCGAGGAGATTCTGGAAGA
CGAGGAAGAACCCGCCCGCGAGCGCGCCAAGCAGAAGTGGGCGACGGTGGAGGCGCTCGTTGGCGCGGACAAGCGCCTGC
GACTGATCGCCCAGGATATCGTGCAGCACTTCGAGGCTCGCGTAGCCGCGCTGGACGGCAAGGCGATGATCGTCTGCATG
AGCCGGCGCATCTGCGTCAAGCTCTACAACGAGATCGTGAAGCTGCGTCCCGGATGGCACAGCGATGACGACAACGCCGG
GGCCGTCAAGATCGTGATGACCGGGGCGGCCTCCGATCCGCCCGAGTGGCAGAAGCATATCGGCAACAAGGCACGGCGCG
ATCTGTTGGCCCGCCGCGCCCGCGACCCCAAAGACCCCTTGAAGCTCGTCATCGTGCGCGACATGTGGCTGACGGGCTTC
GATGCGCCGTGCATGCACACCATGTATGTGGACAAGCCGATGCGCGGCCACGGGCTGATGCAGGCGATTGCGCGGGTGAA
CCGGGTGTTTCGCGACAAGCCCGCCGGGCTGATCGTGGACTACATCGGCATTGCGCAGAACCTCAAAAACGCGCTTGCGC
AGTACTCGCCGCGCGACCGCGAAAACACCGGCATCGACGAAGCCGAAGCCATCGCGGTAATGCTGGAAAAATACGAGGTC
GTGCGCGACATGTTCCACGGCTTTGACTACCGCTCGGGTCTCAACGGTTCGCCCCAGGAGCGGCTGGCAATGATGGCGGG
GGCCATCGAGTGGATCCTGGAGAGGCAGCAGCAGTGGGCGGCGCAGGAAACCACCCCGGAAGGCAAGAAGGCCGCGCACC
GGCGCTTTGGCGATGCGGTGCTGGCCTTGTCCAAGGCGTATGCCTTGGCTTCCGCCTCGGACCCGGCGCGTGCTATCCGC
GAAGAGGTGGGGTTTTTCCAGGCGATCCGTGCCGCGCTGATCAAGAGCAGCACGGGCTCCGACGCAAACCCGCAAGCGCG
CGAGTGGGCCATCCAGCAGATCGTCAGTCGCGCGGTGGTCTCGACCGAGATTGTCGATATCCTAACTGCCGCGGGCATCA
AGAGTCCGGACATCTCCATTCTGTCCGACGACTTCCTGGCCGAAGTGCAGCAGATGGAGAAAAAGAACCTGGCGCTGGAA
GCCCTGCGCAAGCTCATCAACGACGGCATCCGCTCACGCGCCAAGGCCAACGTCGTGCAGACCCGTGCGTTTTCGCAGCG
GCTGGAGGATGCCGTTGCACGCTACCACGCCAACGCCATCACCACCGCCGAGGTGCTGCAGGAGCTGATCCACTTGGCCA
AAGACATCCGCGCGGCGCGCCAGCGTGGCGAAGAGTCTGGATTGTCCGACGAGGAGATTGCCTTCTACGACGCCCTGGCC
GAGAACGAAAGCGCGGTTCAGGTCATGGGGGATGAGAAGCTGCGCGTGATTGCCCACGAGCTGCTGGTGAACCTGCGCGA
AAACGTCTCCGTGGACTGGGCCCACCGTGAATCGGCCCGCGCTCGCCTGCGCGTGCTGGTCAAGCGCATCCTGCGCAAGT
ACGGTTACCCGCCTGATTTGCAGGACGCGGCGGTGCAGACGGTGCTGCAGCAGGCCGAGGCACTGTCGGCGGTGTGGAGT
CTGGCTCGTAACTCTGGGTAG

Protein sequence :
MAFLSEAEVENALLDQLRALGYSIEREEDIGPDGHRPERESHDEVVLRKRFEDAVARLNPGVPLEARHDAVRRVTQSELP
SLLAENRRLHKLLTEGVDVEYYADDGVLTAGKARLIDFDDPANNDWLAVNQFVVINGQYQRRPDVVVFVNGLPLAVIELK
APGNDQATLTGAFNQLQTYKGEITQLFRTNALLVTSDGISARVGSLSADFERFMPWRTTDGREVAPKGAPELETLIEGVF
EHRRLLDLLRHFTVFGETGAGLIKIIAGYHQFHAVRHAVERTVAASSAGGDRKAGVIWHTQGSGKSLLMAFYAGLLVRHP
ALENPTLVVLTDRNDLDDQLFATFSMCRDLIRQTPVQAEGREHLKTLLDRASGGVIFTTLQKFGEIDGPLTTRRNVVVIA
DEAHRSQYGFKAKVDAKTGEISYGFAKYLRDALPNASFIGFTGTPIEAGDVNTPAVFGHYIDIYDISRAVEDGATVPIYY
ESRLARIELDEDEKPKIDAEIEEILEDEEEPARERAKQKWATVEALVGADKRLRLIAQDIVQHFEARVAALDGKAMIVCM
SRRICVKLYNEIVKLRPGWHSDDDNAGAVKIVMTGAASDPPEWQKHIGNKARRDLLARRARDPKDPLKLVIVRDMWLTGF
DAPCMHTMYVDKPMRGHGLMQAIARVNRVFRDKPAGLIVDYIGIAQNLKNALAQYSPRDRENTGIDEAEAIAVMLEKYEV
VRDMFHGFDYRSGLNGSPQERLAMMAGAIEWILERQQQWAAQETTPEGKKAAHRRFGDAVLALSKAYALASASDPARAIR
EEVGFFQAIRAALIKSSTGSDANPQAREWAIQQIVSRAVVSTEIVDILTAAGIKSPDISILSDDFLAEVQQMEKKNLALE
ALRKLINDGIRSRAKANVVQTRAFSQRLEDAVARYHANAITTAEVLQELIHLAKDIRAARQRGEESGLSDEEIAFYDALA
ENESAVQVMGDEKLRVIAHELLVNLRENVSVDWAHRESARARLRVLVKRILRKYGYPPDLQDAAVQTVLQQAEALSAVWS
LARNSG

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 49
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 48
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 48
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 48