Gene Information

Name : Thein_1571 (Thein_1571)
Accession : YP_004626395.1
Strain : Thermodesulfatator indicus DSM 15286
Genome accession: NC_015681
Putative virulence/resistance : Unknown
Product : type I site-specific deoxyribonuclease, HsdR family
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 1629591 - 1632719 bp
Length : 3129 bp
Strand : -
Note : COGs: COG0610 Type I site-specific restriction-modification system R (restriction) subunit and related helicase; InterProIPR004473:IPR014001:IPR014021:IPR007409:IPR 006935; KEGG: chl:Chy400_1964 type I site-specific deoxyribonuclease, HsdR family; PFAM: p

DNA sequence :
ATGGCTGAATCCGAAGTTGAATCGGCGGCGCTGGCCTGGCTCGAATCCCTAGGCTGGCAAATAAAGCACGGGCCGGAGAT
CGCTCCCGGGGAGCCTTTCGCCGAGCGGGACGATTATCAGGAGGTAATTCTCCCACAACGCCTGAAGGATGCTCTGGCCC
GGCTAAACCCGGAACTTCCTATCGAGGCCCTAGATGAGGCTTTCCGAAAGCTCGTAAATCCTCCCGGCGCCACCGTAGAG
GCTCGTAACCGGGCCTTCCACCGAATGCTCGTGGACGGAGTGACGGTAGAATATCGCCGGGAGGATGGCTCCATAGCCGG
TGCCCAGGCGCGCGTAATAGACTTTGACGATTCCGAAAATAACGACTTTCTGGCCGTCAACCAGTTTACCGTTACCGAAG
GTCGCCATACCCGCCGGCCGGACATCGTGCTTTTTGTAAATGGGCTTCCGCTAGTCATCATCGAATTGAAAAATCCGGCG
GACGAAGAAGCCACCATCTGGACCGCTTATCAGCAACTTCAGACTTACAAAGCGGAACTTCCCACTCTCTTTGCCTTCAA
TGAACTTTTAGTCATTTCGGACGGGCTGGAAGCCAGGATGGGTACCTTGACCGCCGGGCGGGAATGGTTCAAACCCTGGC
GCACGATCTCAGGCGAGCGAGTGGAGGACGAAGGCGTCCTCCAGCTTGAAGTCCTTCTTAAGGGTGTTTTCGACCTGGAG
CGGTTTTTGGAACTGATCCGAGATTTTATGGTGTACGAAGACGACGGCGGAAGGCTTTCCAAAAAAGTGGCTGGATACCA
CCAATTCCATGCCGTGCGAGTAGCGGTGCGGGAGACCCTGAGGGCAGCTGCGCTCGCCAAAGACGAAGGCTTGCGCGTGA
GAGAAGAAATAGGCCGTTATGAAGTGCGGGGCCAAGGTGGGCGGCCTGGCGACCGGCGTATCGGGGTAGTATGGCATACG
CAGGGCTCCGGCAAAAGCTTGACCATGGTCTTCTATGCTGGACGCATCATTCGTGAGCCGGCCATGCAAAATCCTACGGT
GGTGGTGCTCACCGACCGAAACGATCTGGATGATCAACTGTTTGGAGTTTTTTCCCGCTGTCAGGAACTTCTACGCCAGG
AGCCTGTTCAGGCCAAAAGCCGGGCTCATCTTCGGGAGCTTCTTTCACGAGAAGCCGGAGGTATCATCTTTACCACCATT
CAAAAGTTCTTTCCTGACGAAAAGGGCGACCAGCATCCGCTACTTTCTTCACGGCGCAACATTGTGGTAATTGCCGATGA
AGCTCATCGAAGCCAGTACGATTTCATTGACGGTTTTGCCCGCCATATTCGGGATGCCTTACCCAATGCCTCTTTTATTG
CTTTCACGGGAACACCGATTGAGCTTGAGGATCGCAACACGCGAGCCGTCTTTGGCGACTACATCTCCATTTACGATATC
CAGCGAGCCGTAGAAGACGGTGCGACGGTGCCCATCTACTACGAAAGCCGGCTGGCAAAGCTTGCGCTACCCAAAGAGCT
AAAGCCCAAAATCGACGAAGAATTCGAGGAAGTCACCGAAAGAGAAGAGGTCGAGCGTAAGGAGAAGCTCAAAACCAAAT
GGGCACAACTTGAAGCCATTGTGGGGGCGGAACCACGCCTCCGCATGATCGCTAAAGATATAGTGAAGCATTTTGAGCGT
CGTCTGGAAGCCCTCGATGGAAAGGGCATGATCGTTTGCATGAGCCGCCGGATATGCGTCGATCTTTATAACCAGATCAT
TCACCTGCGTCCTGATTGGCACCACGAAGATGACGATAAAGGCGTAATCAAAGTGGTGATGACCGGTTCGGCGTCGGATC
CTCCAGAATGGCAGCCCCACATCCGTAACAAAGAAAGACGGGAGTTCCTGGCCCGGCGTTTTCGTGATCCAAATGATCCC
TTGAAGCTGGTAATCGTGCGCGACATGTGGCTTACGGGCTTCGATTGTCCGAGCCTTCATACCATGTACATCGACAAACC
CATGCGCGGCCACGGGCTCATGCAGGCCATCGCCCGGGTGAACCGCGTCTTCCGCGATAAACCCGGAGGGCTGGTGGTGG
ACTATATTGGTCTGGCCCGCGAATTAAAACAGGCGCTTGCGGTTTATACCGAAAGCGGAGGTAAGGGCCGCACGGCCCTT
GATCAGGAAGAAGCGGTGGCGGTCATGCAGGAAAAATACGAAATTTGTTGTGATCTCTTTCACGGCTTCGACTGGTCGGC
CTGGAAAACCGGAACACCTGAAGAACGTCTGGCCCTCCTTCCCGCTGCACAGGAACATATTTTGGCCCAGCCGGATGGGA
AAGACCGTTTCGTCAAGGCGGTGCTCGAACTTTCCAAGGCCTTTGCCCTGGCCGTTCCTCATGAAGAGGCCCTACGCATC
CGTGACGACGTGGCCTTTTTTCAGGCCGTACGCTCCGCCCTGGTGAAACGCGCGCCTCTTGACGCCCGTCCCCAAGAAGA
ATTGGATTACGCCCTTCGCCAGTTGGTGGATCGGGCCGTGGACCCTGAAGGAGTAGTGGATATCTTTGCGGCTGCAGGTC
TTAAAAAACCAGATATTTCTATTCTTTCAGAAGAATTTCTAGCCGAAATCCAGGATATGCCTCAAAAGAATCTAGCGGTG
GAACTTTTGCGTAAACTTTTGCAGGGAGAAATCCGCACCCGGCGGCGCAAAAATGTGGTTCAGGCCCGGCGTTTTTCTGA
GATGCTTGAGCGCGCTCTCCGTCGCTATCAGAACCGCGCCATTGAAGCCGCCCAGGTCATTGAAGAGCTGATTGCGCTTG
CCCGCGAGATGCGTAAGTCCGACCGAAGAGGGGAAGAATTGGGTTTAAGCGAGGAAGAGGTGGCCTTTTACGATGCGCTT
GCGGCCAACGAGAGCGCGGTGGAAGTCTTGGGAGATAAAACTTTGCGAAAGATCGCCCAGGAGCTGGTGCGCCTGGTACG
AGAAAATGTCACCGTGGATTGGGCCCAGCGCGAAAATGTAAGGGCTTATTTGAGGGTTCTGGTGAAACGCACCCTTCGTA
AATATGGTTACCCGCCGGACAAACAGGAAGAGGCCACGCAGACGGTGCTCAAACAGGCCGAAGTTCTGGGCGGGGAGGTT
GTAGAGTAG

Protein sequence :
MAESEVESAALAWLESLGWQIKHGPEIAPGEPFAERDDYQEVILPQRLKDALARLNPELPIEALDEAFRKLVNPPGATVE
ARNRAFHRMLVDGVTVEYRREDGSIAGAQARVIDFDDSENNDFLAVNQFTVTEGRHTRRPDIVLFVNGLPLVIIELKNPA
DEEATIWTAYQQLQTYKAELPTLFAFNELLVISDGLEARMGTLTAGREWFKPWRTISGERVEDEGVLQLEVLLKGVFDLE
RFLELIRDFMVYEDDGGRLSKKVAGYHQFHAVRVAVRETLRAAALAKDEGLRVREEIGRYEVRGQGGRPGDRRIGVVWHT
QGSGKSLTMVFYAGRIIREPAMQNPTVVVLTDRNDLDDQLFGVFSRCQELLRQEPVQAKSRAHLRELLSREAGGIIFTTI
QKFFPDEKGDQHPLLSSRRNIVVIADEAHRSQYDFIDGFARHIRDALPNASFIAFTGTPIELEDRNTRAVFGDYISIYDI
QRAVEDGATVPIYYESRLAKLALPKELKPKIDEEFEEVTEREEVERKEKLKTKWAQLEAIVGAEPRLRMIAKDIVKHFER
RLEALDGKGMIVCMSRRICVDLYNQIIHLRPDWHHEDDDKGVIKVVMTGSASDPPEWQPHIRNKERREFLARRFRDPNDP
LKLVIVRDMWLTGFDCPSLHTMYIDKPMRGHGLMQAIARVNRVFRDKPGGLVVDYIGLARELKQALAVYTESGGKGRTAL
DQEEAVAVMQEKYEICCDLFHGFDWSAWKTGTPEERLALLPAAQEHILAQPDGKDRFVKAVLELSKAFALAVPHEEALRI
RDDVAFFQAVRSALVKRAPLDARPQEELDYALRQLVDRAVDPEGVVDIFAAAGLKKPDISILSEEFLAEIQDMPQKNLAV
ELLRKLLQGEIRTRRRKNVVQARRFSEMLERALRRYQNRAIEAAQVIEELIALAREMRKSDRRGEELGLSEEEVAFYDAL
AANESAVEVLGDKTLRKIAQELVRLVRENVTVDWAQRENVRAYLRVLVKRTLRKYGYPPDKQEEATQTVLKQAEVLGGEV
VE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 52
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 47
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 47
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 47