Gene Information

Name : hsdR3 (NTHI1843)
Accession : YP_249269.1
Strain : Haemophilus influenzae 86-028NP
Genome accession: NC_007146
Putative virulence/resistance : Unknown
Product : type I restriction enzyme HindVIIP R protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : 3.1.21.3
Position : 1694179 - 1697346 bp
Length : 3168 bp
Strand : +
Note : Similar to: HI1285, T1RH_HAEIN

DNA sequence :
ATGCTCAACGAAAACGACATCGAACAACTCACCCTTCAACGCCTGCAATCCCTCGGTTGGGAATATCGCTATGGTAAAGA
TTTGCCTGTTCATGAGGGCGAATTTTCCCGTGGCAATTTGAGCGGTGTAGTTTTTATTGAGCAACTGCGTGAGGCGGTGC
GTAAACTTAATCCGCAGCTGCCTGAAAGTGCGGTTGATTCTGTGGTGAAATCCGCGACAAAAAGCGATATTGGCGACTTG
GTTGTGCGTAATCAGACGTTTTATAAATTGTTGCGTGATGGTGTACGGGTAGAATATCAAGATGATGGTGAGCAGAAAAT
CGAGATGGCGCGTTTGGTGGATTTTGAGCATGGGGAAAACAACCGCTTTGTCGCCGTCAATCAGCTGGAAATCCGCAGCC
GTAAAGGAGGTAAACGGATTCCCGATATTATCGGCTTTGTAAATGGGCTGCCTTTGGTGGTATTTGAGCTCAAAAATCCG
CTGCGTGAATCGGCGGATTTATTGCAGGCGTTTAATCAGTTTGAAACCTATAAAGATGAAATTGCCGAGCTGTTTGTTTA
CAACCAAGCCCTGATTATTTCAGACGGCATTGCCGCCCGTTTGGGTTCGCTTTCGGCAGATTTTCAACGCTTTACGCCGT
GGAAAGTGGTCGATGAAAAAAATAAAAGCGTGCGGTTATATTTTGACGATGAGTTGCAAAGCCTGCTCAATGGCTTAATG
CAGCCTGAGGATTTATTGGACTATATCCGCTATTTCGTCTTGTTTGAACGGGATTCCGTTGGCAAAACCATTAAAAAAAT
CGCGGCATACCATCAATATTACGGCGTAAATGAAGCAGTAGAATCCACGATTTTTGCCACAAGCGAGCAAGGCGATAAAC
GCATCGGTGTGATGTGGCACACGCAGGGTTCGGGTAAGTCGATTTCGATGCTGTTTTATGCAGGCAAACTGCTTGCACAG
CCTGAATTGAAAAATCCTACCATTGTAGTGGTTACCGACCGCAACGATTTAGATGGTCAGCTTTTCCAAACTTTTTCTTC
AGGCAAAGATTTAATCAAACAAACACCGCAGCAAGTAGAAGACCGTGATCAACTGCGCCAACTGCTCGCACAAAATGAAG
TCGGCGGCGTGTTTTTTACTACGATTCAGAAATTCGCCCTAAATGAGGAAGAAAGCCGCTTCCCTATTTTAAATGAGCGC
AACAATATTATTGTGATCAGCGATGAGGCTCACCGCAGCCAATATGGCTTTACGCAAAAGCTGCATAACGGCAAGTTTCA
GACAGGTTATGCTCGCCATTTGCGTGATGCTTTACCTAATGCCTCGTTTATTGGTTTTACAGGTACGCCAATTAGCCTTG
AAGATAAGGACACGCAAGATGTGTTCGGTCGTTATGTGTCCATTTATGACTTGCAAGATGCGGTGGAAGATGGCGCAACC
GTGCCGATTGTGTATGAAGCACGCCAAATCAAGCTAGCGGAGAATGCTAATCACGATGAATTATTTGCAGAAATTGATGA
ACTGCTGGAAGGCGAAGAAAACCCGAAATTACGCTTGCGAGAAAAATTGCTCGGCTCAGAAGCTCGATTGCATGATTTAG
CGATCGATTTTGTGCAACATTTTGCAAAACGCAATGAAGTGGTGGACAGCAAAGCGATGATGGTGGTTTCCAGCCGTCAG
ATTTGCGTGGATTTGTATAATCAGATCATCGCTCTGCACCCTGAATGGCATTCGGACAATATTAACGAAGGGGCGATTAA
AATTGTGATGACAGGTTCTGCTTCCGATGCGTCTGAAATGCAGAAACACGTTTACAGTAAACAGGAAAAGCAAACGTTAG
AACGTCGTTTTAAAGACCCGAACGATCCGCTGAAAGTGGTGATTGTGCGTGATATGTGGCTGACAGGCTTTGATGCACCG
TGCTGTAACACTATGTATCTTGATAAGCCAATGAAAGGGCATAACCTAATGCAAGCCATCGCACGAGTAAACCGTGTATT
CCGCAATAAAAGCCGAGAAAATGGCGGCTTGATTGTGGATTATGTAGGCATTGCTGACGAGCTCGAAAAAGCCACTCGGC
AATATACAAACTCACAAGGCAAGGGCAAACTGGCTGATAGCGTGATTGATGTATTCTTTAAAATGAAAGAGCATTTAGAA
GTTATCCGCAGCCTGTTTGCAACGCCAGTTGAGGGGAAAACCTTTGATGTTCAGGCTGCCTTAGAAAAAGATAATCCGAA
TGATCTTTTGATGGCGATTCGTTTTGCCGCCAACCATATTTTAAGCCTTGATCAATTATCGTTTGATGGCAAAGCGCACG
AGCAGCATTGGTTTAATAAAAAAGAAACCGAGCCACGCAAAAAAGCCTTTTTGAAAACGGCAGGCTTGGTAAAAAAAGGC
TATATGCTGTGCGGCACATTGGCTGAAGTTGAGCCGTATAACCAAGAAATCGCCTTTTATGATGCCGTACGGGCAATTTT
AACTAAACGTGAACAAAAAGGCACAGGCACAAATGAAAGACAGATTTTATTGAAAAAATTGGTCAATCAAACTGTGTATT
CTGAAGGCGTGATTGATTTATTCGATCTGCTAGAAAAACCACAACCACAAATCAGTTTGCTTTCCGAGGAATTTTTACAA
ACTGTAAAAAATAGCCCAACTAAAAATTTATGGGTTAGCGCGATGGAGCGTTATTTGGCAAGTGAAATTAAAGTTAAATC
AGGCACAAACTTAACCTTGCAAAAAGATTTTGAACGGCGTTTGAAGGAAGCATTGAATCAATACCACAATCACAATTTGA
CTGTGGTAGAGATTTTGGACGAACTCTTTAAAATGAGCCAAGATTTCCAAGAACGTTTAGCATTAGGGAAAAAACTAGGA
TTAACCAAGGAAGAACTAGCCTTCTATGAAGCTCTATCTCAAAATCAAAGTGCAAAAGATTTGATGGGTGATGAAGTGCT
TTCTAAACTGGCGAAAGAAATCACGGAAACACTTAGAAAATCGGTCACAATCGACTGGCAGTACAAAGAAGCGGTGCGGG
CAAGAATTAGATTACTCGTTCGACGTGCCTTACAAAAATATAAATACCCGCCTGATAAACAGGAAGAAGCGGTAACTTAT
GTGATTAAACAAGCTGAAGAAATTGCTGAGGATTTAACTGGTTTATAA

Protein sequence :
MLNENDIEQLTLQRLQSLGWEYRYGKDLPVHEGEFSRGNLSGVVFIEQLREAVRKLNPQLPESAVDSVVKSATKSDIGDL
VVRNQTFYKLLRDGVRVEYQDDGEQKIEMARLVDFEHGENNRFVAVNQLEIRSRKGGKRIPDIIGFVNGLPLVVFELKNP
LRESADLLQAFNQFETYKDEIAELFVYNQALIISDGIAARLGSLSADFQRFTPWKVVDEKNKSVRLYFDDELQSLLNGLM
QPEDLLDYIRYFVLFERDSVGKTIKKIAAYHQYYGVNEAVESTIFATSEQGDKRIGVMWHTQGSGKSISMLFYAGKLLAQ
PELKNPTIVVVTDRNDLDGQLFQTFSSGKDLIKQTPQQVEDRDQLRQLLAQNEVGGVFFTTIQKFALNEEESRFPILNER
NNIIVISDEAHRSQYGFTQKLHNGKFQTGYARHLRDALPNASFIGFTGTPISLEDKDTQDVFGRYVSIYDLQDAVEDGAT
VPIVYEARQIKLAENANHDELFAEIDELLEGEENPKLRLREKLLGSEARLHDLAIDFVQHFAKRNEVVDSKAMMVVSSRQ
ICVDLYNQIIALHPEWHSDNINEGAIKIVMTGSASDASEMQKHVYSKQEKQTLERRFKDPNDPLKVVIVRDMWLTGFDAP
CCNTMYLDKPMKGHNLMQAIARVNRVFRNKSRENGGLIVDYVGIADELEKATRQYTNSQGKGKLADSVIDVFFKMKEHLE
VIRSLFATPVEGKTFDVQAALEKDNPNDLLMAIRFAANHILSLDQLSFDGKAHEQHWFNKKETEPRKKAFLKTAGLVKKG
YMLCGTLAEVEPYNQEIAFYDAVRAILTKREQKGTGTNERQILLKKLVNQTVYSEGVIDLFDLLEKPQPQISLLSEEFLQ
TVKNSPTKNLWVSAMERYLASEIKVKSGTNLTLQKDFERRLKEALNQYHNHNLTVVEILDELFKMSQDFQERLALGKKLG
LTKEELAFYEALSQNQSAKDLMGDEVLSKLAKEITETLRKSVTIDWQYKEAVRARIRLLVRRALQKYKYPPDKQEEAVTY
VIKQAEEIAEDLTGL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 49
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 42
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 42
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 42