Gene Information

Name : hsdR2 (R2866_0493)
Accession : YP_005827190.1
Strain : Haemophilus influenzae R2866
Genome accession: NC_017451
Putative virulence/resistance : Virulence
Product : Probable type I restriction modification system, restriction enzyme component HsdR2
Function : -
COG functional category : -
COG ID : -
EC number : 3.1.21.3
Position : 493994 - 497014 bp
Length : 3021 bp
Strand : -
Note : -

DNA sequence :
ATGGTTTCAGGAACTAAGGAAAAAGATTTGGAAATTGCCATCGAAAAAGCCTTAACTGGCACTTGGCGTGAAAACATGGA
AAATAATCTGGGCGAGCCTAAGGCTGAATACCTGCCGCGCCATCATGGTTTTGAGCTGGCATTTTCACAGGATTTTGATG
CGCTGTTTGCTATTGATACGCGCCTGTTTTGGCAATTCCTGCAAACCAGCCAAAAGGCAGAACTTGCCCGTTTTCAACAG
CTAAACCCAAACGACTGGCAACGTAAAATTCTAGAGCGGTTAGACCGCCAAATAAAGAAAAACAGCGTACTACATTTGCT
CAAAAAAGGCGTGGATATTGATAGCGCTCATTTTGATTTGCTCTACCCCGTTCCGCTTGCCAGCAGCGGCGAAAAGATCA
AACAGCGTTTTGAGCAAAATCTGTTTAGCTGTATGCGCCAAGTGCCTTATTCTGCCTCAAGCAATGAAACGGTGGATATG
GTGCTGTTTGTCAATGGCTTGCCAATCATTACCCTTGAGCTGAAAAACCATTGGACAGGTCAGACGGCCATTGATGCCCA
AAAACAATACCGCAACCGAGATTTAAGCCAAACACTGTTTCACTTCGGGCGTTGTTTGGCGTATTTTGCCTTAGATACGG
AAGAAGCTTATATGACCACCAAATTGGCGGGGCCTGCTACGTTTTTCTTGCCGTTTAACTTGGGCAACAACTGCGGTAAG
GGCAATCCGCCCAATCCAAATGGGCACCGCACAGCATATTTATGGCAAGAGGTGTTCGGCAAAGCCAGCCTTGCCAACAT
TATTCAGCATTTTATGCGTTTAGACGGTTCAACCAAAGATCCGCTGGAGAAACGCTCGCTCTTTTTCCCACGCTATCATC
AATTAGAGGTGGTACGCCGTTTGATTGCTGATGTCAGTGAACAAGGCGTGGGCAAACGCTATTTGATTCAACACTCTGCC
GGCTCGGGCAAATCTAATTCTATTACTTGGCTGGCGTATCAGTTGATTGAGGCTTATCCGTGCAATGAAAAGGCGGCAAA
CGGCAGGGAGGCAGACCGCCCGATTTTTGATTCGGTGATTGTCGTTACCGACCGCCGTTTGCTTGATAAACAGTTGCGTG
ACAACATTAAAGATTTTTCTGAAGTCAAAAATATTGTTGCGCCGGCGTTGAGTTCGGCAGAATTGCGCCAATCGCTTGAG
CAGGGCAAAAAAATCATTATTACCACGATTCAAAAATTCCCGTTTATTGTCGATGGCATTGCTGATTTAGGCGACAAACA
ATTTGCGGTGATTATTGACGAGGCACACAGCTCGCAATCCGGTTCGGCACACGACAATATGAACCGAGCCATCGGTAAAA
CGGAAGACCTTGATGCCGAAGATGTGCAAGATTTGATTTTACAAGCCATGCAATCCCGCAAAATGCGCGGTAATGCGTCG
TATTTTGCTTTTACCGCCACCCCTAAAAACAGCACTTTGGAAAAATTCGGCGAGAAACAGGCGGATGGTAAATTTAAGCC
GTTCCACCTTTATTCTATGAAGCAGGCGATTGAAGAAGGCTTTATTTTGGATGTGATTGCAAATTACACCACTTATAAAA
GTTTTTATGAGATCACTAAATCGATTGAAGATAATCCGGAGTTTGATAGTAAAAAGGCACAAAGCCGTCTGAAAGCTTAT
GTAGAGCGTTCGCAACAAACGATTGATACCAAGGCGGAAATTATGCTGGATCATTTTATCCGGCAAATCTTTAACCGTAA
AAAACTCAAAGGCAAAGCCAAGGGAATGGTGGTAACGCAAAATATTGAAACCGCCATCCGCTATTTTCAGGCGTTAAAAC
GCTTGCTGGCCGGACGGGGCAATCCGTTTAAAATTGCGATTGCGTTTTCAGGCAGTAAAGTGGTTGATGGCGTTGAATAC
ACCGAAGCGGAAATGAACGGCTTTGCAGAAAGCGAAACCAAAGAGTATTTCGATCAAGATGAATATCGTTTGCTGGTGGT
CGCCAATAAATATCTGACCGGTTTCGATCAGCCGAAATTGTGTGCCATGTATGTGGATAAGAAACTCTCCGGGGTACTTT
GCGTGCAGGCTTTATCTCGTTTGAATCGCAGTGCGAATAAGTTGGGTAAACGCACGGAAGATTTGTTTGTATTGGACTTT
TTTAACAGTGTTGAAGATATTCAGCAGGCATTTGAGCCGTTTTATACTTCTACTTCGTTGTCGCAGGCAACCGATGTCAA
TGTCTTGCATGATTTGAAAGACCGGTTGGATGAAACCGGCGTGTACGAACAAGCGGAGGTCGACGATTTTACCGAAGGCT
ATTTTGCCAATAAAGACGCACAGCAATTAAGCAGTATGATTGATGTGGCTGTCCGACGTTTTGATGATGAATTGGATTTG
GATCGAAATGAAAAAGTTGATTTTAAAATCAAGGCAAAACAGTTTTTAAAAATCTACGGGCAAATGGCCTCCATCATCAA
TTTTGAAAATATCGCTTGGGAGAAACTCTATTGGTTCCTCAAATTCTTAGTACCGAAATTAAAAGTACAAGACCCGATGG
ATGAATTTGATGAAATTTTAGATGCTGTGGATTTAAGCTCTTACGGCTTGGCGCGTACCAAACTGAATTACAGCATTAAA
TTAGATGATGAAGAAACAGGGCTTGACCCGCAAAACCCCAATCCGCGCGGTACGCATGGCGAAGATAAAGAAAAAGATCC
GATTGATGAAATTATTCGTGTATTTAACGAAAGATGGTTTCAAGGCTGGAGCGCAACGCCGGATGAGCAACGGGTAAAAT
TTATCAATATTACCGAGCGCATCCGTAGCCATAAAGACTTTGAGCAAAAATATCAAAATAACCCGGATATTCATACCCGT
GAATTGGCTTTCCAAGCCATTTTGCGCGATGTGATGAGCGAACGCCATCGGGATGAATTGGAACTATACAAACTTTTTGC
CAAAGATGCCGCATTTAGAACCGCTTGGACGCAAAGTTTGCAACGGGCCTTGGCTGGATAG

Protein sequence :
MVSGTKEKDLEIAIEKALTGTWRENMENNLGEPKAEYLPRHHGFELAFSQDFDALFAIDTRLFWQFLQTSQKAELARFQQ
LNPNDWQRKILERLDRQIKKNSVLHLLKKGVDIDSAHFDLLYPVPLASSGEKIKQRFEQNLFSCMRQVPYSASSNETVDM
VLFVNGLPIITLELKNHWTGQTAIDAQKQYRNRDLSQTLFHFGRCLAYFALDTEEAYMTTKLAGPATFFLPFNLGNNCGK
GNPPNPNGHRTAYLWQEVFGKASLANIIQHFMRLDGSTKDPLEKRSLFFPRYHQLEVVRRLIADVSEQGVGKRYLIQHSA
GSGKSNSITWLAYQLIEAYPCNEKAANGREADRPIFDSVIVVTDRRLLDKQLRDNIKDFSEVKNIVAPALSSAELRQSLE
QGKKIIITTIQKFPFIVDGIADLGDKQFAVIIDEAHSSQSGSAHDNMNRAIGKTEDLDAEDVQDLILQAMQSRKMRGNAS
YFAFTATPKNSTLEKFGEKQADGKFKPFHLYSMKQAIEEGFILDVIANYTTYKSFYEITKSIEDNPEFDSKKAQSRLKAY
VERSQQTIDTKAEIMLDHFIRQIFNRKKLKGKAKGMVVTQNIETAIRYFQALKRLLAGRGNPFKIAIAFSGSKVVDGVEY
TEAEMNGFAESETKEYFDQDEYRLLVVANKYLTGFDQPKLCAMYVDKKLSGVLCVQALSRLNRSANKLGKRTEDLFVLDF
FNSVEDIQQAFEPFYTSTSLSQATDVNVLHDLKDRLDETGVYEQAEVDDFTEGYFANKDAQQLSSMIDVAVRRFDDELDL
DRNEKVDFKIKAKQFLKIYGQMASIINFENIAWEKLYWFLKFLVPKLKVQDPMDEFDEILDAVDLSSYGLARTKLNYSIK
LDDEETGLDPQNPNPRGTHGEDKEKDPIDEIIRVFNERWFQGWSATPDEQRVKFINITERIRSHKDFEQKYQNNPDIHTR
ELAFQAILRDVMSERHRDELELYKLFAKDAAFRTAWTQSLQRALAG

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
VC1765 NP_231400.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 0.0 68
VC0395_A1363 YP_001217306.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 0.0 68
VPI2_0013c ACA01830.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 0.0 67

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
hsdR2 YP_005827190.1 Probable type I restriction modification system, restriction enzyme component HsdR2 VFG1098 Protein 0.0 68