Gene Information

Name : hsdR (R2846_0805)
Accession : YP_005829269.1
Strain : Haemophilus influenzae R2846
Genome accession: NC_017452
Putative virulence/resistance : Unknown
Product : Type I restriction enzyme HindVIIP, R protein
Function : -
COG functional category : -
COG ID : -
EC number : 3.1.21.3
Position : 848078 - 851245 bp
Length : 3168 bp
Strand : -
Note : -

DNA sequence :
ATGCTCAACGAAAACGACATAGAACAACTCACTCTTCAGCGCCTGCAATCCCTCGGTTGGGAATATCGCTATGGTAAAGA
CTTGCCTGTTCATGAGGGCAAGTTTGCCCGTGGCGATTTGAGCGGCGTAGTCTTTGTTGAGCAACTGCGTGAGGCGGTGC
GTAAACTTAATCCTCAGCTGCCTGAAAGTGCGGTTGATTCTGTAGTGAAATCGGCAACGAAAAGCGATATTGGCGACTTG
GTTGTGCGTAATCAGACGTTTTATAAACTGCTGCGTGATGGCGTGCGGGTCGAATATACGCAAAACGGCGAACAGAAAAT
TGAGATGGTGCGTTTGGTGGATTTCGAGCATTGGGGAAACAACCGTTTTGTCGCCGTCAATCAGCTGGAAATCCGCAGCC
GTAAAGGGGGCAAGCGGATTCCCGATATTATCGGCTTTGTAAATGGCTTACCGTTGGTGGTATTTGAGCTCAAAAATCCA
CTACGTGAATCGGCGGATTTGTTGCAAGCGTTTAATCAGTTTGAAACCTATAAAGATGAAATTGCCGAGCTGTTTGTTTA
CAACCAAGCTCTGATTATTTCAGACGGCATTGTCGCCCGTTTGGGTTCGCTTTCGGCAGATTTCCAACGCTTTACGCCGT
GGAAAGTGGTCGATGAAAAAAATAAAAGCGCGCGGTTATATTTTGACGATGAGTTGCAAAGCCTGCTTAATGGCCTGTTG
CAGCCTAAGGATTTACTCGACTATATCCGCTATTTCGTCTTGTTTGAACGGGATTCCGTTGGCAAAACCATTAAAAAAAT
CGCGGCGTACCATCAATATTACGGCGTAAATGAAGCGGTAGAATCCACGATTTTTGCCACAAGCGAGCAAGGCGATAAAC
GCATTGGTGTGATGTGGCATACGCAGGGTTCGGGCAAGTCGATTTCGATGCTGTTTTATGCAGGCAAACTGCTTGCACAG
CCTGAATTGAAAAATCCTACCATTGTAGTGGTTACCGACCGCAACGATTTAGACGGTCAGCTCTTCCAAACCTTTTCTTC
AGGCAAAGATTTAATCAAGCAAACACCGCAACAAGTGGAAGACCGTGATCAACTGCGCCAACTGCTCGCACAAAATGAAG
TCGGCGACGTATTTTTTACCACGATTCAGAAATTCGCCCTAAATGAGGAAGAAAGCCGCTTCCCTATTTTAAATGAGCGC
AACAATATTATTGTGATCAGCGATGAGGCTCACCGCAGCCAATATGGCTTTACGCAAAAGCTGCATAACGGCAAGTTTCA
GACAGGTTATGCTCGCCATTTGCGTGATGCTTTACCTAATGCCTCGTTTATTGGTTTTACAGGTACGCCAATTAGCCTTG
AAGATAAGGACACGCAAGATGTGTTCGGTCGTTATGTGTCCATTTATGACTTGCAAGATGCGGTGGAAGATGGTGCAACT
GTGCCGATTGTGTATGAAGCACGCCAAATCAAGCTAGCGGAGAATGCTAATCACGATGAATTATTTGCAGAAATTGATGA
ACTGCTGGAAGGCGAAGAAAACCCGAAATTACGCTTGCGAGAAAAATTGTTCGGCTCAGAAGCACGATTGCATGATTTAG
CGATCGATTTTGTGCAACATTTTGCAAAACGCAATGAAGTGGTTGACAGCAAAGCGATGATGGTGGTTTCCAGCCGTCAG
ATTTGCGTGGACTTATACAATGAAATCATCAAATTGCGCCCTGAATGGCATTCGGACAATATCAACGAAGGGGCGATTAA
AATTGTGATGACAGGTTCTGCTTCCGATGCGTCTGAAATGCAGAAACACGTTTACAGTAAGCAGGAAAAACAAACGCTAG
AACGCCGCTTTAAAGACCCGAACGATCCGCTGAAGGTGGTGATTGTGCGTGATATGTGGCTGACAGGCTTTGATGCACCG
TGCTGTAACACTATGTATCTTGATAAGCCAATGAAGGGGCATAACCTAATGCAAGCCATCGCACGAGTAAACCGTGTATT
CCGCAATAAAAGCCGAGAAAATGGCGGCTTGATTGTGGATTATGTAGGCATTGCTGACGAGCTCGAAAAAGCCACTCGGC
AATATACAAACTCACAAGGCAAGGGCAAACTGGCTGATAGCGTGATTGATGTATTCTTTAAAATGAAAGAGCATTTAGAA
GTTATCCGCAGCCTGTTTGCAACGCCAGTTGAGGGGAAAACCTTTGATGTTCAGACGGTCTTAGAAAAAGATAATCCGAA
TGATCTTTTGATGGCGATTCGTTTTGCCGCCAACCATATTTTAAGCCTTGATCAATTATCGTTTGATGGCAAAGCGCACG
AGCAGCATTGGTTTAATAAAAAAGAAACCGAACCACGCAAAAAAGCCTTTTTGAAAGCGGCAGGCTTGGTAAAAAAAGGC
TATATGCTGTGCGGCACATTGGCTGAAGTTGAGCCGTATAACCAAGAAATCGCCTTTTATGATGCCGTACGGGCAATTTT
AACTAAACGTGAACAAAAAGGCACAGGCACAAATGAAAGACAGATTTTATTGAAAAAATTGGTTAATCAAACTGTGTATT
CTGAAGGCGTGATTGATTTATTTGATCTGCTAGAAAAACCACAACCACAAATTAGCTTGCTTTCCGAGGAATTTTTACAA
ACTGTAAAAAATAGCCCGACTAAAAATTTATGGGTTACGGCAATGGAACGTTATTTAGCAAGTGAAATTAAAGTTAAATC
AGGCACAAACTTAACATTGCAAAAAGATTTTGAACGGCGTTTGAAGGAAGCATTGAATCAATACCACAATCACAATTTGA
CTGTGGTAGAGATTTTGGACGAACTCTTTAAAATGAGCCAAGATTTCCAAGAACGTTTAGCATTAGGGAAAAAACTAGGA
TTAACCAAGGAAGAACTAGCCTTCTATGAAGCTCTATCTCAAAATCAAAGTGCAAAAGATTTGATGGGTGATGAAGTGCT
TTCTAAACTGGCGAAAGAAATCACGGAAACACTTAGAAAATCGGTCACAATCGACTGGCAGTACAAAGAAGCGGTGCGGG
CAAAAATGCGTATTCTCGTTAAACGCGCACTACAACGCTACAAATATCCACCCGATAAACAGGAAGAAGCGATAACTTAT
GTGATTAAACAAGCTGAAGAAATTGCTGAGGATTTAACTGGTTTATAA

Protein sequence :
MLNENDIEQLTLQRLQSLGWEYRYGKDLPVHEGKFARGDLSGVVFVEQLREAVRKLNPQLPESAVDSVVKSATKSDIGDL
VVRNQTFYKLLRDGVRVEYTQNGEQKIEMVRLVDFEHWGNNRFVAVNQLEIRSRKGGKRIPDIIGFVNGLPLVVFELKNP
LRESADLLQAFNQFETYKDEIAELFVYNQALIISDGIVARLGSLSADFQRFTPWKVVDEKNKSARLYFDDELQSLLNGLL
QPKDLLDYIRYFVLFERDSVGKTIKKIAAYHQYYGVNEAVESTIFATSEQGDKRIGVMWHTQGSGKSISMLFYAGKLLAQ
PELKNPTIVVVTDRNDLDGQLFQTFSSGKDLIKQTPQQVEDRDQLRQLLAQNEVGDVFFTTIQKFALNEEESRFPILNER
NNIIVISDEAHRSQYGFTQKLHNGKFQTGYARHLRDALPNASFIGFTGTPISLEDKDTQDVFGRYVSIYDLQDAVEDGAT
VPIVYEARQIKLAENANHDELFAEIDELLEGEENPKLRLREKLFGSEARLHDLAIDFVQHFAKRNEVVDSKAMMVVSSRQ
ICVDLYNEIIKLRPEWHSDNINEGAIKIVMTGSASDASEMQKHVYSKQEKQTLERRFKDPNDPLKVVIVRDMWLTGFDAP
CCNTMYLDKPMKGHNLMQAIARVNRVFRNKSRENGGLIVDYVGIADELEKATRQYTNSQGKGKLADSVIDVFFKMKEHLE
VIRSLFATPVEGKTFDVQTVLEKDNPNDLLMAIRFAANHILSLDQLSFDGKAHEQHWFNKKETEPRKKAFLKAAGLVKKG
YMLCGTLAEVEPYNQEIAFYDAVRAILTKREQKGTGTNERQILLKKLVNQTVYSEGVIDLFDLLEKPQPQISLLSEEFLQ
TVKNSPTKNLWVTAMERYLASEIKVKSGTNLTLQKDFERRLKEALNQYHNHNLTVVEILDELFKMSQDFQERLALGKKLG
LTKEELAFYEALSQNQSAKDLMGDEVLSKLAKEITETLRKSVTIDWQYKEAVRAKMRILVKRALQRYKYPPDKQEEAITY
VIKQAEEIAEDLTGL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 48
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 42
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 42
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 42