Gene Information

Name : hsdR1 (THI_0615)
Accession : YP_003623201.1
Strain : Thiomonas sp. 3As
Genome accession: NC_014145
Putative virulence/resistance : Unknown
Product : Type I site-specific deoxyribonuclease HsdR
Function : -
COG functional category : -
COG ID : -
EC number : 3.1.21.3
Position : 618341 - 621550 bp
Length : 3210 bp
Strand : +
Note : Evidence 2a : Function of homologous gene experimentally demonstrated in another organism; PubMedId : 8921897; Product type e : enzyme
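
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention); the function name `feature_length` is illustrative and not part of the record.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


# Values copied from the Position field of this record.
start, end = 618341, 621550

length_bp = feature_length(start, end)
codons = length_bp // 3          # includes the terminal stop codon
residues = codons - 1            # amino acids in the translated product

print(length_bp, length_bp % 3, codons, residues)  # 3210 0 1070 1069
```

Under that assumption the span reproduces the stated 3210 bp, which is a whole number of codons and corresponds to a 1069-residue protein plus one stop codon.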

DNA sequence :
ATGACGAGCGCAGTCCTTGAAGACCACCTCGAACAAGCCACGCTTCAGTGGCTCTCCGCGCTCGGCTGGCAAACCGCGCA
TGGCCCTGACATCTCACCGCCCGACGCCAGGACCGCTGGCACCGAGCGCGACACCTACCGCCAGGTCTGCCTACCCCATC
GCCTTGCGGCCACGATTCAGCGCCTGAATCCCAACATCCCGACGAGCGCACGGGACGCGGCTCTGCGCCTAGTCCTCAAC
CCCAACACCCCCGGCCTCGTCGCCGCCAACCGCCAGTTCCACCGCTGGCTGGTCGAAGGTGTCCCGGTCGAGTACCAGAG
GGACGGCGAAACCCGCGGCGACCGCGTCCGGCTCATCGACTTCAGCGACGTTGGCCGAAACGACTGGCTGGCCGTCAACC
AGTTCACCGTGCAAGGCCCCAAGCACACCCGCCGCCCCGACCTCGTGCTGTTCCTCAACGGCCTGCCCCTGGTGGTGCTG
GAACTCAAGAACCCCGGCGACGAGAACGCCGACATCTGGGGCGCCTTCAACCAGCTCCAGGCCTACAAAGACGACATTCC
CGACCTGTTCATCGACAACGAGCTGCTGGTCATCACCGACGGCATCTCCGCCCGCATGGGCTCGCTCACCGCCGACCGGG
AACGCTTCATGGCCTGGCGCACCATCGACGGCCACACCACCGACCCGCTCGGGTCCATGCGCGAACTGGAAACCCTGGTC
CAAGGTGCCTTCGACCGCCAGACCCTGCTCGACTATCTGCAGCACTTCATCCTGTTCGAGGACGATGGGGGGCTGGTCAA
GAAGGTCGCCGGCTACCACCAGTTCCATGCCGTGCGCGCCGTGGTCGACAGCGTGCTCAAGGCCAGCGCCCCTGGCGGCT
CGCGCAAGGGCGGGGTGGTGTGGCACACCCAGGGGGCTGGCAAGAGCATCGAGATGACCTGCCTGGCCGGCATGCTCATC
GGGCACCCGGAGCTCCAGAACCCCACGGTCTTGATGGTCACCGACCGCAACGACCTGGACAACCAACTGTTCGGCGTGTT
CGCCGGCGCCACCGAGCTGCTGCACGAAACCCCGGTGCAGGCCGCCACCCGCCCCAAGCTGCGCGAGCTGCTGGGCAACC
GCCCCAGCGGCGGCATCGTCTTCACCACCATCCAGAAGTTCGCACCTGGCGAGGACGAAGACAGCTTCCCGGTGCTGTCC
GAGCGCAGCAACATCATCGTCATCTGCGACGAAGCCCACCGCAGCCAGTACGGCTTTGCCGCCAAGCTGCCCGGACAGGA
CGATGCCACCCGTCGCTACCGCCAGACCGCGGCCGCGCCGCTGAGCGCGCAGGATGCCGGCGCGATTGGGGCCACCACGG
CCTTGGTCACAGCGGCCCCGGCCTCCAGCGTGCGCTACGGCTACGCCCAGTACCTGCGCGACGCGTTGCCCAACGCCACC
TTCGTCGCCTTCACCGGCACCCCCGTGTCGCTGGAGGACCGCGACACCCGCGCCGTGTTCGGCGACTACGTCCACATCTA
CGACGTCGAGCAGGCGGTCAAGGACGGCGCCACGGTGCCCATCTATTACGAGTCGCGCCTGGCGAAGCTCGAACTCGCGG
AGGAGGACACCACCTGGCTTGATGATGACGTCGACCAATTGACCGAGGACGATGAGGACGACGCCAGCAAGACCAGCAAG
CTGCGCCGCTGGGCCGCGCTGGAAAAACTCGTTGGCGCACCGCCCCGGATTCAGAAGGTGGCCGCCGACATCGTCGAACA
CTTCGAAAACCGCCTTGCCGCCCTGGACGGTAAGGCCATGGTCGTCGGCATGAGCCGCGAGATTTGCGTCCATCTCTACG
ACGCTCTCGTCGCCCTGCGGCCGGAATGGCACGACCCCGACCCCGACAAAGGCGCCATCAAAATCATCATGACCGGCTCG
GCCTCGGACAAGCCGATGCTCAAGCCGCACATCTACACCAAAGAGGTCAAGAAGCGCCTGGAGCGGCGCTACAAGGACCC
TGCCGACCCGTTCAAGCTGGTCATCGTGCGCGACATGTGGCTGACCGGCTTCGATGCGCCCTGTCTGCACACGATGTATA
TCGACAAGCCCATGCGTGGCCACAACCTGATGCAGGCCATCGCCCGGGTCAACCGGGTGTTCAAGGACAAGCCCGGCGGT
CTGGTCGTCGACTACATCGGCATTGCCAACGAGCTCAAGGCCGCCCTGAAGGACTACACCCTGGCCCACGGCAAGGGGCG
GCCGACCATCGACACCCACGAGGCCCTGCGCTTGCTGCTGGAGAAGATGGAGGTGCTCCACGGCATCCTGCACGGCTGCG
ACTACGCGGACTTCCGCACCCAAGCCTGGCAACTGCTGCCCAAGGTGGCCAACCATATCCTGGGCCAGGATGACGGCAAG
AAGCGCTTCGCCGACGCCGTCCTGTCCGCCACCAAGGCGTTCGCGCTGTGCTGCACGCTTGACGAGGCTCTTGAACACCG
GGACGAACTGGCCTTCCTGCAGGCCACCAAGGCTGCCCTGACCAAGCACACCACACAGGACAAGAAGCTCAGCGACGAGC
AGAAAGAACACGCGCTGCGGCAGATTCTGAGCAAGGCCGTGGTCAGCGCCGAGGTCGTCGACATCTTCCAGGCGGCCGGG
TTGAACAAACCGGATATCGGCATTCTGTCCGAGGAGTTCCTCGACGACGTGCGGCACATGAAGGAGCGCAATCTGGCGGT
GGAGCTGCTGGAGCGCCTGCTCAAGGATGACATCAAGACCCGATTCAAGACCAATGTGGTCAAGGCGGCCAAGTACAGCG
AATTGCTTCAGGAAAGCCTCAAGCGCTACCGCAACCGCGCCATCGAGACGGCCCAGGTCATCGAGGAACTGATTGCGATG
GCCAAGCAGTTCCAGCAGGAGGCCATCAGGGGCGCCGCGCTGGGGCTGAGTCCCGAGGAGCGGGCGTTCTACGACGCGCT
GGCCGCCAATGAGTCGGCGGTCAGAGAACTGGGCGACGAGACGCTGAAGAAGATTGCGGTCGAGCTGACGCTGAAACTGC
GTAACTCTGTAACCGTCGATTGGTCCGTGCGCGACGCGGTGCGGGCAAGCATTCGGGTGATGGTCAAGACGCTGTTACGG
CGTTACAAGTATCCGCCGGACAAGCAGGAGGAGGCAACCGAGACGGTGCTCAAGCAGGCGGAGATGCTGTCGGCGGAATG
GGTAGGGTGA

Protein sequence :
MTSAVLEDHLEQATLQWLSALGWQTAHGPDISPPDARTAGTERDTYRQVCLPHRLAATIQRLNPNIPTSARDAALRLVLN
PNTPGLVAANRQFHRWLVEGVPVEYQRDGETRGDRVRLIDFSDVGRNDWLAVNQFTVQGPKHTRRPDLVLFLNGLPLVVL
ELKNPGDENADIWGAFNQLQAYKDDIPDLFIDNELLVITDGISARMGSLTADRERFMAWRTIDGHTTDPLGSMRELETLV
QGAFDRQTLLDYLQHFILFEDDGGLVKKVAGYHQFHAVRAVVDSVLKASAPGGSRKGGVVWHTQGAGKSIEMTCLAGMLI
GHPELQNPTVLMVTDRNDLDNQLFGVFAGATELLHETPVQAATRPKLRELLGNRPSGGIVFTTIQKFAPGEDEDSFPVLS
ERSNIIVICDEAHRSQYGFAAKLPGQDDATRRYRQTAAAPLSAQDAGAIGATTALVTAAPASSVRYGYAQYLRDALPNAT
FVAFTGTPVSLEDRDTRAVFGDYVHIYDVEQAVKDGATVPIYYESRLAKLELAEEDTTWLDDDVDQLTEDDEDDASKTSK
LRRWAALEKLVGAPPRIQKVAADIVEHFENRLAALDGKAMVVGMSREICVHLYDALVALRPEWHDPDPDKGAIKIIMTGS
ASDKPMLKPHIYTKEVKKRLERRYKDPADPFKLVIVRDMWLTGFDAPCLHTMYIDKPMRGHNLMQAIARVNRVFKDKPGG
LVVDYIGIANELKAALKDYTLAHGKGRPTIDTHEALRLLLEKMEVLHGILHGCDYADFRTQAWQLLPKVANHILGQDDGK
KRFADAVLSATKAFALCCTLDEALEHRDELAFLQATKAALTKHTTQDKKLSDEQKEHALRQILSKAVVSAEVVDIFQAAG
LNKPDIGILSEEFLDDVRHMKERNLAVELLERLLKDDIKTRFKTNVVKAAKYSELLQESLKRYRNRAIETAQVIEELIAM
AKQFQQEAIRGAALGLSPEERAFYDALAANESAVRELGDETLKKIAVELTLKLRNSVTVDWSVRDAVRASIRVMVKTLLR
RYKYPPDKQEEATETVLKQAEMLSAEWVG
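
The protein sequence is the conceptual translation of the DNA sequence listed above. As a minimal sketch, the snippet below translates the first ten codons and the last five codons of the DNA sequence with the standard codon table (which assigns the same amino acids to codons as the bacterial table 11) and compares them with the start and end of the protein. The names `CODON_TABLE` and `translate` are illustrative helpers, not part of the record, and the function assumes unambiguous A/C/G/T input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()   # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]   # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGACGAGCGCAGTCCTTGAAGACCACCTC"  # first 30 nt of the DNA sequence
last_codons = "GAATGGGTAGGGTGA"                  # last 15 nt, in frame, ending in TGA

print(translate(first_codons))  # MTSAVLEDHL
print(translate(last_codons))   # EWVG
```

Both fragments agree with the listed protein, which begins with MTSAVLEDHL and ends with EWVG. Passing the full 3210 bp sequence to the same function should yield the complete 1069-residue product.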

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
api52 | CAF28526.1 | hsdr-like Type I restriction enzyme | Not tested | YAPI | Protein | 0.0 | 56
hsdR | BAH57699.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-VII SCCmec | Protein | 0.0 | 46
hsdR | BAD24840.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-V SCCmec | Protein | 0.0 | 45
hsdR | YP_251977.1 | type I restriction-modification system restriction subunit | Not tested | SCCmec | Protein | 0.0 | 45