Gene Information

Name : hsdR (R2866_0867)
Accession : YP_005827547.1
Strain : Haemophilus influenzae R2866
Genome accession: NC_017451
Putative virulence/resistance : Unknown
Product : Type I restriction enzyme HindVIIP, R protein
Function : -
COG functional category : -
COG ID : -
EC number : 3.1.21.3
Position : 888251 - 891433 bp
Length : 3183 bp
Strand : -
Note : -

DNA sequence :
ATGCTCAACGAAAACGACATCGAACAACTCACTCTTCAACGCCTGCAATCCCTCGGTTGGGAATATCGCTATGGTAAAGA
CTTGCCTGTTCATGAGGGCAAGTTTGCCCGTGGCGATTTGAGCGGCGTAGTCTTTGTTGAGCAACTGCGTGAGGCGGTGC
GTAAACTTAATCCGCAGCTGCCTGAAAGTGCGGTGGATTCTGTGGTGAAATCCGCGACGAAAAGCGATATTGGCGATTTG
GTTGTGCGTAATCAGACGTTTTATAAACTGCTGCGTGATGGCGTGCGGGTGGAATATCAGGCTCCCAATAGCCACGGACA
AAATGAGCAGAAAATTGAGATGGTGCGTTTGGTGGATTTCGAGCATTGGGAAAACAACCGTTTTGTTGCCGTCAATCAGC
TGGAAATCCGCAGCCGCAAAGGAGGTAAACGAATTCCCGATATTATCGGCTTTGTCAATGGCTTACCGTTGGTGGTATTT
GAGCTCAAAAATCCACTACGTGAATCGGCGGATTTGTTGCAGGCGTTTAATCAGTTTGAAACCTATAAAGATGAAATTGC
CGAGCTGTTTGTTTACAACCAAGCCCTGATTATTTCAGACGGTATTGTCGCCCGTTTGGGTTCGCTTTCGGCAGATTTCC
AACGCTTTACGCCGTGGAAAGTGGTCGATGAAAAAAATAAAAGCGTGCGGTTATATTTTGACGATGAGTTGCAAAGCCTG
CTTAATGGCTTGCTAAAGCCTGAGGATTTATTGGACTATATCCGCTATTTCGTCTTATTTGAACGGGATTCCGTTGGCAA
AACCATTAAAAAAATCGCAGCGTACCATCAATATTACGGCGTAAATGAAGCGGTAGATTCCACTATTTGGGCGACCTCAG
AAAAAGGCGACCGCCGCATTGGCGTGATGTGGCATACGCAAGGTTCGGGTAAATCAATTTCCATGTTGTTTTATGCAGGC
AAACTGCTTGCACAGCCTGAATTGAAAAATCCCACTATTGTGGTGGTCACCGACCGTAACGATTTGGACGGTCAGCTTTT
CCAAACTTTTTCTTCAGGCAAGGATTTAATCAAACAAACACCGCAGCAAGTAGAAGACCGTGATCAACTGCGCCAACTGC
TCGCGCAAAATGAAGTCGGCGGCGTGTTTTTTACTACGATTCAGAAATTCGCCCTAAATGAGGAAGAAAGCCGCTTCCCT
GTTTTAAATGAGCGCAGCAATATTATTGTGATCAGCGATGAGGCTCACCGCAGCCAATATGGCTTTACGCAAAAGCTGCA
TAACGGCAAGTTTCAGGCAGGCTACGCCCGCCATTTACGCGATGCGCTGCCCAATGCATCGTTTATCGGTTTTACAGGTA
CGCCAATTAGCCTAGAAGATAAGGACACGCAAGATGTGTTCGGTCGTTATGTGTCCATTTATGACTTGCAAGATGCGGTG
GAAGATGGCGCAACCGTGCCGATTGTGTATGAAGCACGCCAAATCAAGCTAGCGGAGAATGCTAATCACGATGAATTATT
TGCAGAAATTGATGAACTGCTGGAAGGCGAAAAAAACCCGAAATTACGCTTGCGAGAAAAATTGCTCGGCTCAGAAGCTC
GATTGCATGATTTAGCGGTCGATTTTGTGCAACATTTTGCCAAACGCAATGAAGTGGTGGACAGCAAAGCGATGATGGTG
GTTTCCAGCCGTCAGATTTGCGTGGATTTGTATAATCAGATCATCGCTCTGCACCCTGAATGGCATTCGGATAATATCAA
TGAGGGTGCGATTAAAATTGTGATGACAGGTTCTGCTTCCGATGCGTCTGAAATGCAGAAACACGTTTACAGTAAGCAGG
AAAAACAAACGCTAGAACGCCGCTTTAAAGACCCGAACGATCCGCTGAAGGTGGTGATTGTGCGTGATATGTGGCTGACA
GGCTTTGATGCACCTTGCTGTAACACGATGTATATCGACAAGCCGATGCAGGGGCATAACCTAATGCAAGCCATCGCACG
AGTAAACCGTGTATTCGCTAACAAAAGTCGTGAAAACGGTGGGCTTATCGTGGATTATGTTGGTTTGGCAGAAGAATTAC
GAGCAGCCACACAGCAATACACCAACTCTACTGGCAAAGGGCAATTAGCGGAAGATGTGCAAAGCGTGTTCTTTAAAATG
AAAGAACAGCTTGAATTTATCCGAACTTTGTTTGCGACACCAATTGAAGGAAAAACTTTTGATGTTCAGATTGCCTTAGA
AAAAGATAATCCGAATGATCTTTTGATGGCGATTCGTTTTGCCGCCAACCATATTTTAAGCCTTGATCAATTATCGTTTG
ATGGCAAAGCGCACGAGCAGCATTGGTTTAATAAAAAAGAAACCGAGCCACGCAAAAAAGCCTTTTTGAAGGCGGCAGGT
TTGGTAAAAAAAGGCTATATGCTGTGCGGTACATTGGCTGAAGTTGAGCCGTATAACCAAGAAATCGCCTTTTATGATGC
CGTACGGGCAATTTTAACTAAACGTGAACAAAAAGGCACAGGCACAAATGAAAGACAGATTTTATTGAAAAAATTGGTCA
ATCAAACTGTGTATTCTGAAGGCGTGATTGATTTATTCGATCTGCTAGAAAAACCACAACCACAAATTAGCTTACTTTCC
GAGGAATTTTTACAAACTGTAAAAAATAGCCCAACTAAAAATTTATGGGTTAGCGCGATGGAGCGTTATTTGGCAAGTGA
AATTAAAGTTAAATCAGGCACAAACTTAACATTGCAAAAAGATTTTGAACGGCGTTTGAAGGAAGCATTGAATCAATACC
ACAATCACAATTTGACTGTGGTAGAGATTTTGGACGAACTCTTTAAAATGAGCCAAGATTTCCAAGAACGTTTAGCATTA
GGGAAAAAACTAGGATTAACCAAGGAAGAACTAGCCTTCTATGAAGCTCTATCTCAAAATCAAAGTGCAAAAGATTTGAT
GGGTGATGAAGTGCTTTCTAAACTGGCGAAAGAAATCACGGAAACACTTAGAAAATCGGTCACAATCGACTGGCAGTACA
AAGAAGCGGTGCGGGCAAGAATTAGATTACTCGTTCGACGTGCCTTACAAAAATATAAATACCCGCCTGATAAACAGGAA
GAAGCGGTAACTTATGTGATTAAACAAGCTGAAGAAATTGCTGAGGATTTAACTGGTTTATAA

Protein sequence :
MLNENDIEQLTLQRLQSLGWEYRYGKDLPVHEGKFARGDLSGVVFVEQLREAVRKLNPQLPESAVDSVVKSATKSDIGDL
VVRNQTFYKLLRDGVRVEYQAPNSHGQNEQKIEMVRLVDFEHWENNRFVAVNQLEIRSRKGGKRIPDIIGFVNGLPLVVF
ELKNPLRESADLLQAFNQFETYKDEIAELFVYNQALIISDGIVARLGSLSADFQRFTPWKVVDEKNKSVRLYFDDELQSL
LNGLLKPEDLLDYIRYFVLFERDSVGKTIKKIAAYHQYYGVNEAVDSTIWATSEKGDRRIGVMWHTQGSGKSISMLFYAG
KLLAQPELKNPTIVVVTDRNDLDGQLFQTFSSGKDLIKQTPQQVEDRDQLRQLLAQNEVGGVFFTTIQKFALNEEESRFP
VLNERSNIIVISDEAHRSQYGFTQKLHNGKFQAGYARHLRDALPNASFIGFTGTPISLEDKDTQDVFGRYVSIYDLQDAV
EDGATVPIVYEARQIKLAENANHDELFAEIDELLEGEKNPKLRLREKLLGSEARLHDLAVDFVQHFAKRNEVVDSKAMMV
VSSRQICVDLYNQIIALHPEWHSDNINEGAIKIVMTGSASDASEMQKHVYSKQEKQTLERRFKDPNDPLKVVIVRDMWLT
GFDAPCCNTMYIDKPMQGHNLMQAIARVNRVFANKSRENGGLIVDYVGLAEELRAATQQYTNSTGKGQLAEDVQSVFFKM
KEQLEFIRTLFATPIEGKTFDVQIALEKDNPNDLLMAIRFAANHILSLDQLSFDGKAHEQHWFNKKETEPRKKAFLKAAG
LVKKGYMLCGTLAEVEPYNQEIAFYDAVRAILTKREQKGTGTNERQILLKKLVNQTVYSEGVIDLFDLLEKPQPQISLLS
EEFLQTVKNSPTKNLWVSAMERYLASEIKVKSGTNLTLQKDFERRLKEALNQYHNHNLTVVEILDELFKMSQDFQERLAL
GKKLGLTKEELAFYEALSQNQSAKDLMGDEVLSKLAKEITETLRKSVTIDWQYKEAVRARIRLLVRRALQKYKYPPDKQE
EAVTYVIKQAEEIAEDLTGL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 49
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 42
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 42
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 42