Gene Information

Name : hsdR (GAU_0178)
Accession : YP_002759690.1
Strain : Gemmatimonas aurantiaca T-27
Genome accession: NC_012489
Putative virulence/resistance : Unknown
Product : type I restriction-modification system restriction subunit
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : 3.1.21.3
Position : 209016 - 212165 bp
Length : 3150 bp
Strand : -
Note : protein synonym: type I restriction enzyme R protein
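
The Position and Length fields can be cross-checked against each other and against the protein sequence. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention, not stated in this record) and that the final codon is a stop codon; `feature_length` is a hypothetical helper name used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
gene_bp = feature_length(209016, 212165)

# A complete coding sequence should divide evenly into codons.
codons, remainder = divmod(gene_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_aa = codons - 1

print(gene_bp, remainder, protein_aa)  # 3150 0 1049
```

Under these assumptions the computed 3150 bp agrees with the Length field, and 1049 residues agrees with the length of the protein sequence listed in this record.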

DNA sequence :
ATGCCTCGTTTCTCTGAATCGATCGTCGAAGACGCCGCCCTCGAGTGGCTGGCCGGGCTCGGTTATGAGGTCCGGCACGG
TCCCGATATGGCAGCCGGAGAACCTGGCGCCGAACGCAGCGACCCCGGCTATCGCGACGTGGTCCTGGAACACCGGCTCC
GCGCAGCCCTCCTCCGCCTCAATCCGTTGCTTCAGGCGGAGGCCATCGACGAGGCGTACCGCAAACTGACCCGCCTCGAC
GCCCCGTCGGCCATGGACCGCAATCGCGCCATGCATCGCATGCTGCTGAATGGCGTGGAGGTCGAGTTTCGGCGTCCCGA
CGGATCAATTGGCGGCGGACAGGTGCAGATCTTCGACTTCGACACGCCGACGAACAACAACTGGCTGGCCGTCAACCAGT
TCTCCGTGGCCGAGGGACAACACCACCGCCGGCCTGACGTGGTGGTGTTTGTCAACGGACTGCCACTTGCCGTGCTCGAA
CTCAAGAACGCCGCCGACGAAAACGCCACCATCTGGTCGGCTTGGCAGCAGTTGCAGACCTATCAGGCACAGATCCCGAC
CCTGTTCACCAGCAATCTGGCTCTGGTGATTTCCGACGGGGTTCAGGCGCGCATCGGCGCGCTCGGCGCCGGCAAGGAAT
GGTTCAAGCCTTGGCGCACGGTGAGCGGAACCACGAGTGCGCCGAGTTCACTGTCTGAGCTGGAAGTCTTGCTGCACGGT
GTGTTCGAACACCGTCGCTTCCTCGACCTGCTGCACTACTTCGTGGCCTTCGAGCAGGAAGATGACGGCCCGCTCGTCAA
AAAGTTGGCGGGATATCACCAGTATCACGCCGTCAACGTGGCGGTGGAAGAAACCCTGCGGGCCGCGCGCAGCGTCAACC
CCGACCGTATTGCCGAAAACATCGGGCGCTATTCATCCGGTGAGCAACCCGGTGGGGAACCCGGCGACCGCCGCGTCGGC
GTGGTGTGGCACACCCAGGGATCGGGGAAGAGTCTCACCATGGCCTTCTATGCCGGACGGGTGATCCTGCACCCCGGCAT
GGCCAACCCCACGGTGGTGGTGATCACCGACCGCAACGATCTGGACGAACAACTGTTCGGCACCTTCGCGCGTTGCCGCA
ACCTGCTGCGCCAGGAACCGGTGCAGGCCAAGGACCGCGCCGACCTGCGGAAGCAACTCGCCGTGAGCGCGGGTGGCGTG
ATTTTCACCACCATCCAGAAGTTCATGCCCGATGAACGGGGCGACAAACATCCCCCGCTCTCGGACCGCCGCAATGTGAT
CGTCATTGCCGACGAGGCGCACCGCAGTCAGTACGAATTCCACGATGGCTTCGCCGGACACATGCGCGATGCTTTGCCAA
ACGCCTCGTTCGTGGGATTCACCGGAACACCGATCGAAAAAGCCGATGCCAACACGCGGCAGGTGTTCGGCGAATACATC
AGCGTGTACGATATCCAGCGCGCGGTGGTCGACGGAGCGACTGTACCCATCTACTACGAAAGCCGCTTGGCAAAACTTGA
GTTTGCAGAATCAGAGAAGTTGGGCATCGATTCGGCATTCGAGGAAGTCACGGAAGGAGAGGAAGTCGAGCGCAAGGAGC
AACTCAAGAGCAAATGGGCGCAGCTCGAGGTCATCGCCGGCGCCGACACGCGACTCGATCTCATTGCCCGGGATATCGTG
GATCACTTCGAGCGCCGCAACGAGGTGCTCGACGGCAAGGCCATGATCGTGGTCATGAGTCGACGCATTGCGGTGGCGTT
GTACAACAGAATTGTCGCGCTGCGTCCTGAATGGCACAGTGATCGCAACGACGAAGGACTGCTGAAGATCGTCATGACCG
GATCGGCGAGCGACCCCGTCGACTGGCAGCCTCACATCCGCAACAAGACTGACCGCGAGGCCCTCGCCACCCGATTCCGC
GATGGATCGAGTGGATTTCGACTCGTCATCGTGCGCGACATGTGGCTCACGGGCTTCGACTGCCCGAGCCTCGCCACGAT
GTACGTGGACAAGCCGATGCGAGGCCACGGGCTCATGCAAGCCATCGCCCGAGTGAATCGCGTATTCAAGGACAAACCGG
GTGGGCTCGTGGTGGACTATCTGGGTCTTGCCGATGCGCTCAAGAGTGCACTCGCCACCTACACTGAAGCAGGAGGCACG
GGCAAGACGGCACTCGATCAGGAAGAGGCCGTGGCCGTAATGTTGGAGAAGCATGAGATCTGTGTCGGCCTATTCCACGG
TTTTGACCGTAGCGCGTGGTATGTTGGCACGGCGCAGCAGCGACTGTCCCTTTTGCCGGCGGCTCAGGAGCATGTGCTGG
CACTTCCAGACGGCAAGGAGCGCCTCATGCGGCATGTCAGCGAACTGACGCGCGCCTTTGCCTTGGCCGTGCCTCACGAC
ACCGCCATGCACATTCGCGACGATGTGGGATTCTTCCAGGCTGTTCGGGCTGTGCTGGCCAAGAACGAGGTGGGCAGAGC
TCGACCGAGAAGCGATGTGGAACACGCCATTCGGCAGATCGTTTCCAAGGCGCTCGTGTCGGACGAAGTCATCGACGTAT
TCACCGCGGCGGGGCTCAAGAAGCCGGATATCTCGATTCTCTCCGATGAGTTTCTGGCGGAGGTGCGTGGCATGCCGCAC
CGCAATCTGGCCGTGGAGCTGTTGCAGAAACTGCTGAAAGGAGAAATCCGGACCCGCTCACGGCGCAACGTAGTGCAGGC
GCGCTCGTTTGCCGGCCTGCTGGAGCAGGCGCTGCGCCGGTATCAGAACCGGGCGGTGGAAACCGCACAAATCATCGAAG
AACTCATCAAGCTGGCCAAGGACATGCGGGCCGCCAACGCCCGCGGCGAGGCGCTTGGGCTCACCGACGATGAACTGGCG
TTTTACGACGCGCTCGAAGTGAACGACAGCGCCGTTCAGGTGCTCGGAGACGAGCAACTGCGGGTCATTGCCCGTGAACT
CGTAGCCACAATCCGAAAGAATGTGTCGATCGACTGGGCCATCCGCGACAACATGCGCGCGCAATTGCGCGTGTACGTCA
AACGCATCCTCCGCAAGTACGGCTATCCACCTGACAAACAGGAGCGGGCCACGCAGACGGTGTTGGAGCAGGCGGAATTG
CTAAGCAGTGAGTGGGCGATCGCTGCGTGA

Protein sequence :
MPRFSESIVEDAALEWLAGLGYEVRHGPDMAAGEPGAERSDPGYRDVVLEHRLRAALLRLNPLLQAEAIDEAYRKLTRLD
APSAMDRNRAMHRMLLNGVEVEFRRPDGSIGGGQVQIFDFDTPTNNNWLAVNQFSVAEGQHHRRPDVVVFVNGLPLAVLE
LKNAADENATIWSAWQQLQTYQAQIPTLFTSNLALVISDGVQARIGALGAGKEWFKPWRTVSGTTSAPSSLSELEVLLHG
VFEHRRFLDLLHYFVAFEQEDDGPLVKKLAGYHQYHAVNVAVEETLRAARSVNPDRIAENIGRYSSGEQPGGEPGDRRVG
VVWHTQGSGKSLTMAFYAGRVILHPGMANPTVVVITDRNDLDEQLFGTFARCRNLLRQEPVQAKDRADLRKQLAVSAGGV
IFTTIQKFMPDERGDKHPPLSDRRNVIVIADEAHRSQYEFHDGFAGHMRDALPNASFVGFTGTPIEKADANTRQVFGEYI
SVYDIQRAVVDGATVPIYYESRLAKLEFAESEKLGIDSAFEEVTEGEEVERKEQLKSKWAQLEVIAGADTRLDLIARDIV
DHFERRNEVLDGKAMIVVMSRRIAVALYNRIVALRPEWHSDRNDEGLLKIVMTGSASDPVDWQPHIRNKTDREALATRFR
DGSSGFRLVIVRDMWLTGFDCPSLATMYVDKPMRGHGLMQAIARVNRVFKDKPGGLVVDYLGLADALKSALATYTEAGGT
GKTALDQEEAVAVMLEKHEICVGLFHGFDRSAWYVGTAQQRLSLLPAAQEHVLALPDGKERLMRHVSELTRAFALAVPHD
TAMHIRDDVGFFQAVRAVLAKNEVGRARPRSDVEHAIRQIVSKALVSDEVIDVFTAAGLKKPDISILSDEFLAEVRGMPH
RNLAVELLQKLLKGEIRTRSRRNVVQARSFAGLLEQALRRYQNRAVETAQIIEELIKLAKDMRAANARGEALGLTDDELA
FYDALEVNDSAVQVLGDEQLRVIARELVATIRKNVSIDWAIRDNMRAQLRVYVKRILRKYGYPPDKQERATQTVLEQAEL
LSSEWAIAA
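
The protein sequence can be reproduced from the DNA sequence by codon-wise translation. The following is a minimal sketch, assuming the standard codon assignments (shared by the bacterial genetic code, NCBI translation table 11) and assuming the DNA above is already shown in coding orientation, which its leading ATG suggests even though the Strand field is "-". The names `CODON_TABLE`, `reverse_complement`, and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(dna: str) -> str:
    """Reverse complement; only needed when starting from the genome's forward strand."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the DNA sequence in this record.
print(translate("ATGCCTCGTTTCTCTGAATCGATCGTCGAA"))  # MPRFSESIVE
```

Passing the full 3150 bp sequence, with line breaks removed, to `translate` should yield the protein listed above. `reverse_complement` would apply first only if the input were cut from NC_012489 in forward-strand orientation.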

Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
api52 | CAF28526.1 | hsdr-like Type I restriction enzyme | Not tested | YAPI | Protein | 0.0 | 49
hsdR | BAH57699.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-VII SCCmec | Protein | 0.0 | 46
hsdR | YP_251977.1 | type I restriction-modification system restriction subunit | Not tested | SCCmec | Protein | 0.0 | 46
hsdR | BAD24840.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-V SCCmec | Protein | 0.0 | 46