Gene Information

Name : hsdR2 (R2846_0537)
Accession : YP_005829019.1
Strain : Haemophilus influenzae R2846
Genome accession: NC_017452
Putative virulence/resistance : Virulence
Product : type I restriction modification system, restriction enzyme component HsdR2
Function : -
COG functional category : -
COG ID : -
EC number : 3.1.21.3
Position : 537585 - 540608 bp
Length : 3024 bp
Strand : -
Note : -

DNA sequence :
ATGGTTTCAGGAACTAAGGAAAAAGATTTGGAAATTGCCATCGAAAAAGCCTTAACCGGCACTTGGCGTGAAAACATGGA
AAATAAGCTGGGCGAGCCGAAAGCAGAATACCACCTGCCACGCCATCATGGTTTTGAACTGGCATTTTCACAGGATTTTG
ATGCACAGTTTGCCATTGACACGCGTCTATTTTGGCAATTCCTGCAAACCAGCCAAAAGGCAGAACTTGCCCGTTTTCAA
CAGCTAAACCCAAACGACTGGCAACGTAAAATTCTAGAGCGGTTAGACCGCCAAATAAAGAAAAACGGCGTACTACATTT
GCTCAAAAAAGGCTTGGATATTGATAGCGCTCATTTTGATTTGCTCTACCCCGTTCCGCTTGCCAGCAGCGGCGAAAAGG
TCAAACAGCGTTTTGAGCAAAATCTGTTTAGCTGTATGCGACAAGTGCCTTATTCTGCCTCAAGCAATGAAACGGTGGAT
ATGGTGCTGTTTGTCAATGGCTTGCCAATCATTACCCTTGAGCTGAAAAACCATTGGACAGGTCAGACGGCCATTGATGC
CCAAAAACAATACCGCAACCGAGATTTAAGCCAAACACTGTTTCACTTCGGGCGTTGTTTGGCGTATTTTGCCTTAGATA
CGGAAGAAGCTTATATGACCACCAAATTGGCGGGGCCTGCTACGTTTTTCTTGCCGTTTAATTTGGGCAACAACTGCGGT
AAGGGCAATCCACTCAATCCCAACGGACACCGCACGTCGTATTTATGGCAAGAGGTGTTCAGCAAAGCCAGCCTTGCCAA
CATTATTCAGCATTTTATGCGTTTAGATGGGTCAACCAAAGATCCACTGGAGAAACGCTCGCTCTTTTTCCCTCGCTATC
ATCAATTAGAGGTGGTGCGCCGTTTGATTGCTGATGTCAGTGAACAAGGCGTGGGCAAACGCTATTTAATTCAACACTCT
GCCGGCTCGGGTAAATCCAATTCCATTACTTGGTTAGCGTATCAGTTGATTGAGGCTTATCCGTGCAATGAAAAGGCGGC
AAACGGCAGGGAGGCAGACCGCCCGATTTTTGATTCGGTGATTGTTGTTACCGACCGTCGTTTGCTTGATAAACAGTTGC
GTGACAACATTAAAGATTTTTCTGAAGTCAAAAATATTGTTGCGCCGGCGTTGAGTTCGGCAGAATTGCGCCAATCGCTT
GAGCAGGGCAAAAAAATCATTATTACCACGATTCAAAAATTCCCGTTTATTGTGGATGGCATTGCTGATTTGGGTGATAA
ACAATTTGCGGTGATTATTGATGAGGCGCACAGCTCACAATCAGGTTCGGCACACGACAATATGAACCGAGCCATCGGTA
AAACGGAAGACCTTGATGCCGAAGATGTGCAAGATTTGATTTTACAAGCGATGCAATCCCGCAAAATGCGCGGCAATGCG
TCGTATTTTGCTTTCACCGCCACACCGAAAAACAGCACTTTGGAAAAATTCGGCGAAAAACAGGCGGATGGCAAGTTTAA
GCCGTTCCACCTTTATTCTATGAAGCAGGCGATTGAAGAAGGCTTTATTTTGGATGTAATCGCGAATTACACCACTTATA
AAAGTTTTTATGAGATCACTAAATCGATTGAAGATAATCCGGAGTTTGACAGCAAGAAGGCTCAAGGCCGTCTGAAGGCC
TATGTGGAGCGTTCGCAACAAACGATTGATACCAAGGCGGAAATTATGCTGGATCATTTTATCCGGCAAATCTTTAGCCG
TAAAAAACTCAAAGGCAAAGCCAAGGGAATGGTGGTAACGCAAAATATTGAAACCGCCATCCGCTATTTTCAGGCGTTAA
AACACTTGCTGGCCGGGCGGGGTAATCCGTTTAAAATTGCGATTGCGTTTTCAGGCAGTAAAGTGGTTGACGGTGTCGAA
TACACCGAAGCGGAAATGAACGGCTTTGCAGAAAGCGAAACCAAAGAGTATTTCGATCAAGATGAATATCGTTTGCTGGT
GGTCGCCAATAAATATCTGACCGGTTTCGATCAGCCGAAATTGTGTGCCATGTATGTGGATAAGAAACTCTCCGGCGTGC
TTTGCGTGCAGGCTTTATCTCGTTTGAATCGCAGTGCGAATAAGTTGAGTAAACGCACGGAAGATTTGTTTGTATTGGAC
TTTTTTAACAGCGTTGAAGATATTCAGCAGGCATTTGAGCCGTTTTATACTTCTACTTCGTTGTCGCAGGCAACCGATGT
CAATGTCTTGCATGATTTGAAAGACCGGTTGGATGAAACCGGCGTGTACGAACAAGCGGAGGTCAACGATTTTACTGAAG
GCTATTTTGCCAATAAAGACGCACAGCAATTAAGCAGTATGATTGATGTGGCTGTCCAACGTTTTGATGATGAATTGGAT
TTGGATCGAAATGAAAAAGTTGATTTTAAAATCAAGGCAAAACAGTTTTTAAAAATCTACGGGCAAACGGCCTCCATCAT
CAATTTTGAAAATATCGCTTGGGAAAAGCTCTATTGGTTCCTCAAATTCTTAGTGCCGAAATTAAAAGTACAAGACCCGA
TGGATGAATTTGATGAAATTTTAGATGCTGTGGATTTAAGCTCTTACGGCTTGGCGCGTACCAAACTGAATTACAGCATT
AAATTAGATGATGAAGAAACAGGGCTTGACCCGCAAAACCCCAATCCGCGCGGTACGCATAGTGAAGATAAAGAAAAAGA
TCCGATTGATGAAATTATTCGTGTATTTAACGAAAGATGGTTTCAAGATTGGAGCGCAACGCCGGATGAGCAACGGGTAA
AATTTATCAATATTACCGAGCGCATCCGTAGCCATAAAGACTTTGAGCAAAAATATCAAAACAACCCGGATATTCATACC
CGTGAATTGGCTTTCCAAGCCATTTTGCGCGATGTCATGAGCGAACGCCATAGGGATGAATTAGAGCTATACAAACTCTT
TGCCAAAGATGCCGCATTTAGAACCGCTTGGACGCAAAGTTTGCAACGGGCTTTGGCTGGATAG

Protein sequence :
MVSGTKEKDLEIAIEKALTGTWRENMENKLGEPKAEYHLPRHHGFELAFSQDFDAQFAIDTRLFWQFLQTSQKAELARFQ
QLNPNDWQRKILERLDRQIKKNGVLHLLKKGLDIDSAHFDLLYPVPLASSGEKVKQRFEQNLFSCMRQVPYSASSNETVD
MVLFVNGLPIITLELKNHWTGQTAIDAQKQYRNRDLSQTLFHFGRCLAYFALDTEEAYMTTKLAGPATFFLPFNLGNNCG
KGNPLNPNGHRTSYLWQEVFSKASLANIIQHFMRLDGSTKDPLEKRSLFFPRYHQLEVVRRLIADVSEQGVGKRYLIQHS
AGSGKSNSITWLAYQLIEAYPCNEKAANGREADRPIFDSVIVVTDRRLLDKQLRDNIKDFSEVKNIVAPALSSAELRQSL
EQGKKIIITTIQKFPFIVDGIADLGDKQFAVIIDEAHSSQSGSAHDNMNRAIGKTEDLDAEDVQDLILQAMQSRKMRGNA
SYFAFTATPKNSTLEKFGEKQADGKFKPFHLYSMKQAIEEGFILDVIANYTTYKSFYEITKSIEDNPEFDSKKAQGRLKA
YVERSQQTIDTKAEIMLDHFIRQIFSRKKLKGKAKGMVVTQNIETAIRYFQALKHLLAGRGNPFKIAIAFSGSKVVDGVE
YTEAEMNGFAESETKEYFDQDEYRLLVVANKYLTGFDQPKLCAMYVDKKLSGVLCVQALSRLNRSANKLSKRTEDLFVLD
FFNSVEDIQQAFEPFYTSTSLSQATDVNVLHDLKDRLDETGVYEQAEVNDFTEGYFANKDAQQLSSMIDVAVQRFDDELD
LDRNEKVDFKIKAKQFLKIYGQTASIINFENIAWEKLYWFLKFLVPKLKVQDPMDEFDEILDAVDLSSYGLARTKLNYSI
KLDDEETGLDPQNPNPRGTHSEDKEKDPIDEIIRVFNERWFQDWSATPDEQRVKFINITERIRSHKDFEQKYQNNPDIHT
RELAFQAILRDVMSERHRDELELYKLFAKDAAFRTAWTQSLQRALAG

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
VC1765 NP_231400.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 0.0 67
VC0395_A1363 YP_001217306.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 0.0 67
VPI2_0013c ACA01830.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 0.0 67

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
hsdR2 YP_005829019.1 type I restriction modification system, restriction enzyme component HsdR2 VFG1098 Protein 0.0 67