Gene Information

Name : hsdR3 (HIBPF06820)
Accession : YP_004135181.1
Strain : Haemophilus influenzae F3031
Genome accession: NC_014920
Putative virulence/resistance : Unknown
Product : type i restriction enzyme hindviip r protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 634079 - 637246 bp
Length : 3168 bp
Strand : -
Note : -

DNA sequence :
ATGCTCAACGAAAACGACATCGAACAACTCACCCTGCAACGCCTGCAATCCCTCGGTTGGGAATATCGCTATGGTAAAGA
CTTGCCTGTTCATGAGGGCGAGTTTGCCCGTGGCGATTTGAGCGGCGTAGTCTTTGTTGAGCAACTGCGTGAGGCGGTGC
GTAAACTTAATCCTCAGCTGCCTGAAAGTGCGGTGGATTCTGTGGTGAAATCGGCGACGAAAAGCGATATTGGCGACTTG
GTGGTACGCAATCAGGCGTTTTATAAACTGTTGCGTGATGGTGTGCGGGTAGAATATACGTTAAACGGTGAGCAAAAAAT
CGAGATGGTGCGCTTGGTGGATTTCGAGCATTGGGGAAACAACCGTTTTGTCGCCGTCAATCAATTGGAAATCCGCAGCC
GCAAAGGAGGTAAACGGATTCCCGATATTATCGGCTTTGTAAATGGCTTACCGTTGGTGGTATTTGAGCTCAAAAATCCG
CTGCGTAAATCGGCGGATTTGTTGCAAGCGTTTAATCAGTTTGAAACCTATAAAGATGAAATTGCCGAGCTGTTTGTTTA
TAACCAAGCCTTGATTATTTCAGACGGCATTGTCGCCCGTTTAGGCTCGCTTTCGGCAGATTTCCAACGCTTTACGCCGT
GGAAAGTGGTCGATGAAAAAAATAAAAGCGCACGCTTATATTTTGACGATGAATTGCAAAGCCTGCTTAATGGCTTGCTA
AAGCCTGAGGATTTATTGGACTATATCCGCTATTTCGTCTTGTTTGAATGGGATTCCGTTGGCAAAACCATTAAAAAAAT
CGCAGCGTACCATCAATATTACGGCGTAAATGAAGCGGTAGATTCCACTATTTGGGCGACCTCAGAAAAAGGCGACCGCC
GCATTGGCGTGATGTGGCATACGCAAGGTTCGGGTAAATCAATTTCCATGTTGTTTTATGCAGGCAAACTGCTTGCACAG
CCTGAATTGAAAAATCCCACTATTGTGGTGGTCACCGACCGTAACGATTTGGACGGTCAGCTTTTCCAAACTTTTTCTTC
AGGCAAGGATTTAATCAAACAAACACCGCAGCAAGTAGAAGACCGTGATCAACTGCGCCAACTGCTCGCGCAAAATGAAG
TCGGCGGCGTGTTTTTTACTACGATTCAGAAATTCGCCCTAAATGAGGAAGAAAGCCGCTTCCCTATTTTAAATGAGCGC
AACAATATTATTGTGATCAGCGATGAGGCTCACCGCAGCCAATATGGCTTTACGCAAAAGCTGCATAACGGCAAGTTTCA
GACAGGCTACGCCCGCCATTTGCGTGATGCTTTACCTAATGCCTCGTTTATCGGTTTTACTGGTACGCCAATTAGCCTAG
AAGATAAGGACACGCAAGATGTGTTCGGTCGTTATGTGTCCATTTATGACTTGCAAGATGCGGTGGAAGATGGAGCGACC
GTGCCGATTATTTACGAAGCTCGCCAAATCAAGCTAGCGGAGAATGCTAATCACGATGAATTATTTGCAGAAATTGATGA
ACTGCTGGAAGGCGAAGAAAACCCGAAATTACGCTTGCGAGAAAAATTGCTCGGCTCAGAAGCACGATTGCATGATTTAG
CGGTCGATTTTGTGCAACATTTTGCCAAACGTAATGAAGTGGCGGACAGCAAAGCGATGATGGTGGTTTCCAGCCGTCAG
ATTTGCGTGGACTTATACAATGAAATCATCAAACTGCGCCCTGAATGGCATTCGGACAATATCAACGAAGGGGCGATTAA
AATTGTGATGACAGGTTCTGCTTCCGATGCGCCGGAAATGCAGAAACACGTTTACAGTAAACAGGAAAAACAAACGCTAG
AACGCCGCTTTAAAGACCCGAACGATCCGCTGAAAGTGGTGATTGTGCGTGATATGTGGCTGACGGGATTTGATGCGCCT
TGCTGTAATACGATGTATATCGACAAGCCGATGCAAGGGCATAACTTAATGCAGGCGATCGCCCGAGTAAACCGTGTATT
CCGCAATAAAAGCCGAGAAAATGGCGGCTTGATTGTGGATTATGTAGGCATTGCTGACGAGCTCAAAGATGCCACCCAAC
AATATACTAACTCGCAAGGCAAGGGAAAGCTGGCTGATAGCGTGATTGATGTATTCTTTAAAATGAAAGAGCATTTAGAA
GTTATCCGCAGCCTGTTTGCAACGCCAGTTGAGGGGAAAACCTTTTATGTTCAGACGGTCTTAGAAAAAGATAATCCGAA
TGAGCTTTTGATGGCGATTCGTTTTGCTGCCAACCATATTTTAAGCCTTGATCAATTACCGTTTGATGGCAAAGCGCACG
AGCAGCATTGGTTTAATAAAAAAGAAACCGAACCACGCAAAAAAGCCTTTTTGAAAGCGGCAGGCTTGGTAAAAAAAGGC
TATATGCTGTGCGGCGCATTGGCTGAAGTTGAGCCGTATAACCAAGAAATAGCCTTTTATGATGCCGTACGGGCAATTTT
AACTAGACGTGAACAAAAAGGCACAGGCACAAATGAAAGACAGATTTTATTGAAAAAATTGGTCAATCAAACTGTGTATT
CTGAAGGCGTGATTGATTTATTCGATCTGCTAGAAAAACCACAACCACAAATTAGCTTACTTTCCGAGGAATTTTTACAA
ACTGTAAAAAATAGCCCGACTAAAAATTTATGGGTTACGGCAATGGAACGTTATTTAGCAAGCGAAATCAAAGCCAAATC
AGGCGCGAACCTCACCTTGCAAAAAGACTTCGAGCAGCGCTTAAAAGAAGCCCTAAATCAATACCACAATCACAATTTGA
CTGTGGTAGAGATTTTGGACGAACTCTTTAAAATGAGCCAAGATTTCCAAGAACGTCTAGCATTAGGGAAAAAACTAGGA
TTAACCAAGGAAGAACTTGCCTTCTATGAAGCCCTATCTCAAAATCAAAGTGCAAAAGATTTGATGGGTGATGAAGTGCT
TTCTAAACTGGCGAAAGAAATCACGGAAACACTTAGAAAATCAGTCACAATCGACTGGCAATACAAAGAGGCGGTACGAG
CAAAAATGCGTATTCTCGTTAAACGCACCCTACAACGCTACAAATACCCTCCCGATAAACAGGAAGAAGCGGTAACTTAT
GTGATTAAACAAGCTGAAGAAATTGCTGAGGATTTAACTGGTTTATAA

Protein sequence :
MLNENDIEQLTLQRLQSLGWEYRYGKDLPVHEGEFARGDLSGVVFVEQLREAVRKLNPQLPESAVDSVVKSATKSDIGDL
VVRNQAFYKLLRDGVRVEYTLNGEQKIEMVRLVDFEHWGNNRFVAVNQLEIRSRKGGKRIPDIIGFVNGLPLVVFELKNP
LRKSADLLQAFNQFETYKDEIAELFVYNQALIISDGIVARLGSLSADFQRFTPWKVVDEKNKSARLYFDDELQSLLNGLL
KPEDLLDYIRYFVLFEWDSVGKTIKKIAAYHQYYGVNEAVDSTIWATSEKGDRRIGVMWHTQGSGKSISMLFYAGKLLAQ
PELKNPTIVVVTDRNDLDGQLFQTFSSGKDLIKQTPQQVEDRDQLRQLLAQNEVGGVFFTTIQKFALNEEESRFPILNER
NNIIVISDEAHRSQYGFTQKLHNGKFQTGYARHLRDALPNASFIGFTGTPISLEDKDTQDVFGRYVSIYDLQDAVEDGAT
VPIIYEARQIKLAENANHDELFAEIDELLEGEENPKLRLREKLLGSEARLHDLAVDFVQHFAKRNEVADSKAMMVVSSRQ
ICVDLYNEIIKLRPEWHSDNINEGAIKIVMTGSASDAPEMQKHVYSKQEKQTLERRFKDPNDPLKVVIVRDMWLTGFDAP
CCNTMYIDKPMQGHNLMQAIARVNRVFRNKSRENGGLIVDYVGIADELKDATQQYTNSQGKGKLADSVIDVFFKMKEHLE
VIRSLFATPVEGKTFYVQTVLEKDNPNELLMAIRFAANHILSLDQLPFDGKAHEQHWFNKKETEPRKKAFLKAAGLVKKG
YMLCGALAEVEPYNQEIAFYDAVRAILTRREQKGTGTNERQILLKKLVNQTVYSEGVIDLFDLLEKPQPQISLLSEEFLQ
TVKNSPTKNLWVTAMERYLASEIKAKSGANLTLQKDFEQRLKEALNQYHNHNLTVVEILDELFKMSQDFQERLALGKKLG
LTKEELAFYEALSQNQSAKDLMGDEVLSKLAKEITETLRKSVTIDWQYKEAVRAKMRILVKRTLQRYKYPPDKQEEAVTY
VIKQAEEIAEDLTGL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 49
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 42
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 42
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 42