Gene Information

Name : hsdR1 (NTHI0193)
Accession : YP_247831.1
Strain : Haemophilus influenzae 86-028NP
Genome accession: NC_007146
Putative virulence/resistance : Virulence
Product : type I site-specific restriction-modification system, R subunit
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 175311 - 178337 bp
Length : 3027 bp
Strand : +
Note : -
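
Coordinate check : the Position and Length fields above agree if the coordinates are read as 1-based and inclusive. That reading is an assumption, not something the record states. The sketch below redoes the arithmetic; the helper name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Values copied from the Position field of this record.
start, end = 175311, 178337

length_bp = feature_length(start, end)   # compare with the Length field
codons = length_bp // 3                  # complete codons in the reading frame
residues = codons - 1                    # assumes the final codon is a stop codon

print(length_bp, codons, residues)
```

Under these assumptions the computed length matches the 3027 bp in the Length field, and the residue count matches the number of residues in the protein sequence listed in this record.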

DNA sequence :
ATGGTTTCAGGAACTAAGGAAAAAGATTTAGAAATTGCCATCGAAAAAGCCTTAACTGGCACTTGGCGTGAAAACATGGA
AAATAAGCTGGGCGAGCCGAAGGCTGAATACCTGCCGCGCCATCATGGTTTTAAACTGGCATTTTCACAGGATTTTGATG
CGCAGTTTGCCATCGACACACGTCTGTTTTGGCAATTCCTGCAAACCAGCCAAGAGGCAGAACTTGCCCGTTTTCAACAA
CTCAACCCAAACGACTGGCAGCGTAAAATTTTGGAGCGATTAGACCGCCAAATAAAGAAAAACGGCGTGTTGCACCTGCT
GAAAAAAGGCTTGGATATTGATAGCGCCCATTTTGATTTGCTCTACCCCGTTCCGCTTGCCAGCAGCGGCGAAAAGGTCA
AGCAGCGTTTTGAACAGAATTTGTTTAGCTGTATGCGTCAAGTGCCTTATTCTGCCTCAAGCAATGAAACGGTGGATATG
GTGCTGTTTGCCAATGGCTTGCCGATTATTGCCCTTGAGCTGAAAAACCATTGGACAGGTCAGACAGCCATTGATGCGCA
AAAACAATACCTCAACCGTGATTTAAGCCAAACGTTGTTCCATTTCGGGCGTTGTTTGGCGCATTTTGCCTTAGATACGG
AAGAAGCTTATATGACCACCAAATTGGCGGGGCCTGCTACGTTTTTCTTGCCGTTTAACTTGGGCAACAACTGCGGTAAG
GGTAATCCGCCCAATCCCAATGGACACCGCACGGCGTATTTATGGCAAGAGGTGTTCGGCAAAGCAAGCCTTGCCAACAT
TATTCAGCATTTTATGCGCTTAGACGGTTCAACCAAAGATCCGTTGGATAAACGTACCCTCTTTTTCCCTCGCTATCACC
AATTAGATGTGGTCCGCCGTTTGATTGCTGATGTCAGTGAACATGGCGTGGGTAAACGTTATTTGATTCAACATTCTGCC
GGTTCGGGCAAGTCTAATTCCATTACTTGGCTGGCGTATCAGTTGATTGAGGCATATCCGCGCAATGAAAAGGCGGCAAA
CGGTAGAGAGGCAGACCGCCCGATTTTTGATTCGGTGATTGTCGTAACCGACCGTCGTTTGTTGGATAAGCAACTGCGCG
ACAATATCAAAGATTTTTCAGAAGTTAAAAACATTGTTGCGCCGGCGTTGAGTTCGGCAGAGTTGCGCCAATCGCTTGAG
CAGGGCAAAAAAATCATTATTACCACGATTCAAAAATTCCCGTTTATTGTCGATGGCATTGCTGATTTAGGCGACAAACA
ATTTGCGGTGATTATTGATGAGGCACACAGCTCACAATCAGGTTCGGCACACGACAATATGAACCGGGCCATCGGCAAAA
CGGAAGACCTTGATGCTGAAGATGTGCAAGATTTGATTTTACAAACCATGCAATCCCGCAAAATGCACGGCAATGCGTCG
TATTTTGCTTTCACCGCCACACCGAAAAACAGCACTTTGGAAAAATTCGGCGAAAAACAGGCGGATGGCAAGTTTAAGCC
GTTCCACCTTTATTCTATGAAGCAGGCGATTGAAGAAGGCTTTATTTTGGATGTAATCGCCAATTACACCACCTATAAAA
GTTTTTATGAGATCACTAAGTCGATTGAAGATAATCCGGAGTTTGATAGTAAAAAGGCTCAAAGCCGTCTGAAAGCCTAT
GTGGAGCGTTCGCAACAAACGATTGATACTAAAGCGGAGATAATGCTGGATCATTTTATTTACCAAGTTTTCAACCGTAA
AAAACTCAAAGGCAAAGCCAAGGGAATGGTGGTAACGCAAAATATTGAAACCGCCATCCGCTATTTTCAGGCGTTAAAAC
ATTTGCTGGCCGGGCGGGGTAATCCGTTTAAAATTGCGATTGCGTTTTCAGGCAGTAAAGTGGTTGACGGTGTCGAATAC
ACCGAAGCGGAAATGAACGGCTTTGCAGAAAGCGAAACCAAAGAGTATTTCGATCAAGATGAATATCGTTTGCTGGTGGT
CGCCAATAAATATCTGACCGGTTTCGATCAGCCGAAATTGTGTGCCATGTATGTGGATAAGAAACTCTCCGGCGTGCTTT
GCGTGCAGGCTTTATCTCGTTTGAATCGCAGTGCGAATAAGTTGAGTAAACGCACGGAAGATTTGTTTGTATTGGACTTT
TTTAACAGCGTTGAAGATATTCAGCAGGCATTTGAGCCGTTTTATACTTCTACTTCGTTGTCGCAGGCAACCGATGTCAA
TGTCTTGCATGATTTGAAAGACCGGTTGGATGAAACCGGCGTGTACGAACAAGCGGAGGTCAACGATTTTACTGAAGGCT
ATTTTGCCAATAAAGACGCACAGCAATTAAGCAGTATGATTGATGTGGCTGTCCAACGTTTTGATGATGAATTGGAATTG
GATTTGGATCGAAATGAAAAAGTTGATTTTAAAATCAAGGCAAAACAGTTTTTAAAAATTTACGGGCAAATGGCCTCCAT
CATCAATTTTGAAAATATCGCTTGGGAAAAGCTCTATTGGTTCCTCAAATTCTTAGTACCCAAATTAAAAGTACAAGACC
CGATGGATGAATTTGATGAAATTTTAGATGCAGTGGATTTAAGCTCTTACGGCTTGGCGCGCACCAAGCTGAATTACAGC
ATTAAATTAGATGATGAAGAAACAGAGCTTGACCCGCAAAACCCCAATCCGCGCGGTACGCATGGTGAAGATAAAGAAAA
AGATCCGATTGATGAAATTATTCGTGTATTTAACGAAAGATGGTTTCAAGATTGGAGCGCAACGCCGGATGAGCAACGGG
TAAAATTTATCAATATTACCGAGCGCATCCGCAGCCATAAAGACTTTGAGCAGAAATATCAAAATAACCCGGATATTCAT
ACCCGTGAATTGGCTTTCCAAGCCATTTTGCGCGATGTGATGAGCGAACGCCATAGGGATGAATTAGAGCTATACAAACT
TTTTGCCAAAGATGCCGCATTTAGAACCGCTTGGACGCAAAGTTTGCAACGGGCTTTGGCTGGATAG

Protein sequence :
MVSGTKEKDLEIAIEKALTGTWRENMENKLGEPKAEYLPRHHGFKLAFSQDFDAQFAIDTRLFWQFLQTSQEAELARFQQ
LNPNDWQRKILERLDRQIKKNGVLHLLKKGLDIDSAHFDLLYPVPLASSGEKVKQRFEQNLFSCMRQVPYSASSNETVDM
VLFANGLPIIALELKNHWTGQTAIDAQKQYLNRDLSQTLFHFGRCLAHFALDTEEAYMTTKLAGPATFFLPFNLGNNCGK
GNPPNPNGHRTAYLWQEVFGKASLANIIQHFMRLDGSTKDPLDKRTLFFPRYHQLDVVRRLIADVSEHGVGKRYLIQHSA
GSGKSNSITWLAYQLIEAYPRNEKAANGREADRPIFDSVIVVTDRRLLDKQLRDNIKDFSEVKNIVAPALSSAELRQSLE
QGKKIIITTIQKFPFIVDGIADLGDKQFAVIIDEAHSSQSGSAHDNMNRAIGKTEDLDAEDVQDLILQTMQSRKMHGNAS
YFAFTATPKNSTLEKFGEKQADGKFKPFHLYSMKQAIEEGFILDVIANYTTYKSFYEITKSIEDNPEFDSKKAQSRLKAY
VERSQQTIDTKAEIMLDHFIYQVFNRKKLKGKAKGMVVTQNIETAIRYFQALKHLLAGRGNPFKIAIAFSGSKVVDGVEY
TEAEMNGFAESETKEYFDQDEYRLLVVANKYLTGFDQPKLCAMYVDKKLSGVLCVQALSRLNRSANKLSKRTEDLFVLDF
FNSVEDIQQAFEPFYTSTSLSQATDVNVLHDLKDRLDETGVYEQAEVNDFTEGYFANKDAQQLSSMIDVAVQRFDDELEL
DLDRNEKVDFKIKAKQFLKIYGQMASIINFENIAWEKLYWFLKFLVPKLKVQDPMDEFDEILDAVDLSSYGLARTKLNYS
IKLDDEETELDPQNPNPRGTHGEDKEKDPIDEIIRVFNERWFQDWSATPDEQRVKFINITERIRSHKDFEQKYQNNPDIH
TRELAFQAILRDVMSERHRDELELYKLFAKDAAFRTAWTQSLQRALAG
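
Translation check : the protein sequence above can be compared with a translation of the DNA sequence. The sketch below is a minimal illustration that assumes the standard genetic code and a reading frame starting at the first base. The record does not state which translation table was used, and the function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1 and stop at the first stop codon."""
    dna = "".join(dna.split()).upper()   # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]   # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 bases of the DNA sequence in this record.
print(translate("ATGGTTTCAGGAACTAAGGAAAAAGAT"))
```

The printed residues can be compared with the start of the protein sequence. Passing the full DNA sequence, with or without line breaks, translates the whole gene the same way.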

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
VC1765 | NP_231400.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 0.0 | 68
VC0395_A1363 | YP_001217306.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 0.0 | 68
VPI2_0013c | ACA01830.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 0.0 | 68

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
hsdR1 | YP_247831.1 | type I site-specific restriction-modification system, R subunit | VFG1098 | Protein | 0.0 | 68