Gene Information

Name : CGSHiEE_02750 (CGSHiEE_02750)
Accession : YP_001290371.1
Strain : Haemophilus influenzae PittEE
Genome accession: NC_009566
Putative virulence/resistance : Virulence
Product : putative type I site-specific restriction-modification system, R subunit
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 548632 - 551628 bp
Length : 2997 bp
Strand : -
Note : COG0610 Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
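
The Position and Length fields above agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption here, and the helper name feature_length is hypothetical. A minimal Python sketch of the check:

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
start, end = 548632, 551628

length_bp = feature_length(start, end)

# A coding sequence of this length holds length_bp // 3 codons;
# the last codon is the stop codon and does not add a residue.
codons = length_bp // 3
protein_residues = codons - 1

print(length_bp)         # 2997
print(protein_residues)  # 998
```

Under that assumption the computed 2997 bp matches the Length field, and the 999 codons correspond to 998 residues plus one stop codon.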

DNA sequence :
TTGGAAATTGCCATCGAAAAAGCCTTAACCGGCACTTGGCGTGAAAACATGGAAAATAAGCTGGGCGAGCCGAAAGCAGA
ATACCACCTGCCACGCCATCATGGTTTTGAACTGGCATTTTCACAGGATTTTGATGCACAGTTTGCCATTGACACGCGTC
TATTTTGGCAATTCCTGCAAACCAGCCAAAAGGCAGAACTTGCCCGTTTTCAACAGCTAAACCCAAACGACTGGCAACGT
AAAATTCTAGAGCGGTTAGACCGCCAAATAAAGAAAAACGGCGTACTACATTTGCTCAAAAAAGGCTTGGATATTGATAG
CGCTCATTTTGATTTGCTCTACCCCGTTCCGCTTGCCAGCAGCGGCGAAAAGGTCAAACAGCGTTTTGAGCAAAATCTGT
TTAGCTGTATGCGACAAGTGCCTTATTCTGCCTCAAGCAATGAAACGGTGGATATGGTGCTGTTTGTCAATGGCTTGCCA
ATCATTACCCTTGAGCTGAAAAACCATTGGACAGGTCAGACGGCCATTGATGCCCAAAAACAATACCGCAACCGAGATTT
AAGCCAAACACTGTTTCACTTCGGGCGTTGTTTGGCGTATTTTGCCTTAGATACGGAAGAAGCTTATATGACCACCAAAT
TGGCGGGGCCTGCTACGTTTTTCTTGCCGTTTAATTTGGGCAACAACTGCGGTAAGGGCAATCCACTCAATCCCAACGGA
CACCGCACGTCGTATTTATGGCAAGAGGTGTTCAGCAAAGCCAGCCTTGCCAACATTATTCAGCATTTTATGCGTTTAGA
TGGGTCAACCAAAGATCCACTGGAGAAACGCTCGCTCTTTTTCCCTCGCTATCATCAATTAGAGGTGGTGCGCCGTTTGA
TTGCTGATGTCAGTGAACAAGGCGTGGGCAAACGCTATTTAATTCAACACTCTGCCGGCTCGGGTAAATCCAATTCCATT
ACTTGGTTAGCGTATCAGTTGATTGAGGCTTATCCGTGCAATGAAAAGGCGGCAAACGGCAGGGAGGCAGACCGCCCGAT
TTTTGATTCGGTGATTGTTGTTACCGACCGTCGTTTGCTTGATAAACAGTTGCGTGACAACATTAAAGATTTTTCTGAAG
TCAAAAATATTGTTGCGCCGGCGTTGAGTTCGGCAGAATTGCGCCAATCGCTTGAGCAGGGCAAAAAAATCATTATTACC
ACGATTCAAAAATTCCCGTTTATTGTGGATGGCATTGCTGATTTGGGTGATAAACAATTTGCGGTGATTATTGATGAGGC
GCACAGCTCACAATCAGGTTCGGCACACGACAATATGAACCGAGCCATCGGTAAAACGGAAGACCTTGATGCCGAAGATG
TGCAAGATTTGATTTTACAAGCGATGCAATCCCGCAAAATGCGCGGCAATGCGTCGTATTTTGCTTTCACCGCCACACCG
AAAAACAGCACTTTGGAAAAATTCGGCGAAAAACAGGCGGATGGCAAGTTTAAGCCGTTCCACCTTTATTCTATGAAGCA
GGCGATTGAAGAAGGCTTTATTTTGGATGTAATCGCGAATTACACCACTTATAAAAGTTTTTATGAGATCACTAAATCGA
TTGAAGATAATCCGGAGTTTGACAGCAAGAAGGCTCAAGGCCGTCTGAAGGCCTATGTGGAGCGTTCGCAACAAACGATT
GATACCAAGGCGGAAATTATGCTGGATCATTTTATCCGGCAAATCTTTAGCCGTAAAAAACTCAAAGGCAAAGCCAAGGG
AATGGTGGTAACGCAAAATATTGAAACCGCCATCCGCTATTTTCAGGCGTTAAAACACTTGCTGGCCGGGCGGGGTAATC
CGTTTAAAATTGCGATTGCGTTTTCAGGCAGTAAAGTGGTTGACGGTGTCGAATACACCGAAGCGGAAATGAACGGCTTT
GCAGAAAGCGAAACCAAAGAGTATTTCGATCAAGATGAATATCGTTTGCTGGTGGTCGCCAATAAATATCTGACCGGTTT
CGATCAGCCGAAATTGTGTGCCATGTATGTGGATAAGAAACTCTCCGGCGTGCTTTGCGTGCAGGCTTTATCTCGTTTGA
ATCGCAGTGCGAATAAGTTGAGTAAACGCACGGAAGATTTGTTTGTATTGGACTTTTTTAACAGCGTTGAAGATATTCAG
CAGGCATTTGAGCCGTTTTATACTTCTACTTCGTTGTCGCAGGCAACCGATGTCAATGTCTTGCATGATTTGAAAGACCG
GTTGGATGAAACCGGCGTGTACGAACAAGCGGAGGTCAACGATTTTACTGAAGGCTATTTTGCCAATAAAGACGCACAGC
AATTAAGCAGTATGATTGATGTGGCTGTCCAACGTTTTGATGATGAATTGGATTTGGATCGAAATGAAAAAGTTGATTTT
AAAATCAAGGCAAAACAGTTTTTAAAAATCTACGGGCAAACGGCCTCCATCATCAATTTTGAAAATATCGCTTGGGAAAA
GCTCTATTGGTTCCTCAAATTCTTAGTGCCGAAATTAAAAGTACAAGACCCGATGGATGAATTTGATGAAATTTTAGATG
CTGTGGATTTAAGCTCTTACGGCTTGGCGCGTACCAAACTGAATTACAGCATTAAATTAGATGATGAAGAAACAGGGCTT
GACCCGCAAAACCCCAATCCGCGCGGTACGCATAGTGAAGATAAAGAAAAAGATCCGATTGATGAAATTATTCGTGTATT
TAACGAAAGATGGTTTCAAGATTGGAGCGCAACGCCGGATGAGCAACGGGTAAAATTTATCAATATTACCGAGCGCATCC
GTAGCCATAAAGACTTTGAGCAAAAATATCAAAACAACCCGGATATTCATACCCGTGAATTGGCTTTCCAAGCCATTTTG
CGCGATGTCATGAGCGAACGCCATAGGGATGAATTAGAGCTATACAAACTCTTTGCCAAAGATGCCGCATTTAGAACCGC
TTGGACGCAAAGTTTGCAACGGGCTTTGGCTGGATAG

Protein sequence :
MEIAIEKALTGTWRENMENKLGEPKAEYHLPRHHGFELAFSQDFDAQFAIDTRLFWQFLQTSQKAELARFQQLNPNDWQR
KILERLDRQIKKNGVLHLLKKGLDIDSAHFDLLYPVPLASSGEKVKQRFEQNLFSCMRQVPYSASSNETVDMVLFVNGLP
IITLELKNHWTGQTAIDAQKQYRNRDLSQTLFHFGRCLAYFALDTEEAYMTTKLAGPATFFLPFNLGNNCGKGNPLNPNG
HRTSYLWQEVFSKASLANIIQHFMRLDGSTKDPLEKRSLFFPRYHQLEVVRRLIADVSEQGVGKRYLIQHSAGSGKSNSI
TWLAYQLIEAYPCNEKAANGREADRPIFDSVIVVTDRRLLDKQLRDNIKDFSEVKNIVAPALSSAELRQSLEQGKKIIIT
TIQKFPFIVDGIADLGDKQFAVIIDEAHSSQSGSAHDNMNRAIGKTEDLDAEDVQDLILQAMQSRKMRGNASYFAFTATP
KNSTLEKFGEKQADGKFKPFHLYSMKQAIEEGFILDVIANYTTYKSFYEITKSIEDNPEFDSKKAQGRLKAYVERSQQTI
DTKAEIMLDHFIRQIFSRKKLKGKAKGMVVTQNIETAIRYFQALKHLLAGRGNPFKIAIAFSGSKVVDGVEYTEAEMNGF
AESETKEYFDQDEYRLLVVANKYLTGFDQPKLCAMYVDKKLSGVLCVQALSRLNRSANKLSKRTEDLFVLDFFNSVEDIQ
QAFEPFYTSTSLSQATDVNVLHDLKDRLDETGVYEQAEVNDFTEGYFANKDAQQLSSMIDVAVQRFDDELDLDRNEKVDF
KIKAKQFLKIYGQTASIINFENIAWEKLYWFLKFLVPKLKVQDPMDEFDEILDAVDLSSYGLARTKLNYSIKLDDEETGL
DPQNPNPRGTHSEDKEKDPIDEIIRVFNERWFQDWSATPDEQRVKFINITERIRSHKDFEQKYQNNPDIHTRELAFQAIL
RDVMSERHRDELELYKLFAKDAAFRTAWTQSLQRALAG
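
The DNA sequence begins with TTG while the protein begins with M. This is consistent with TTG acting as an alternative bacterial start codon that is read as methionine at the first position. The sketch below illustrates this on the first and last codons of the sequence. It assumes the standard codon table, and the names translate, START_CODONS and is_cds_start are hypothetical, not part of any database tool.

```python
from itertools import product

# Standard genetic code in the usual compact order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Assumption: a common subset of bacterial start codons, each read as Met when it opens a CDS.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str, is_cds_start: bool = True) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 24 nt of the DNA sequence in this record.
print(translate("TTGGAAATTGCCATCGAAAAAGCC"))  # MEIAIEKA

# Last 39 nt of the DNA sequence (in frame), ending in the TAG stop codon.
print(translate("GCTTGGACGCAAAGTTTGCAACGGGCTTTGGCTGGATAG", is_cds_start=False))  # AWTQSLQRALAG
```

Both outputs match the start and end of the protein sequence listed in this record.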

• Homologs from PAI DB

Gene          GenBank Accn    Product                         Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity (%)
VC1765        NP_231400.1     type I restriction enzyme HsdR  Not tested               VPI-2       Protein         0.0    67
VC0395_A1363  YP_001217306.1  type I restriction enzyme HsdR  Not tested               VPI-2       Protein         0.0    67
VPI2_0013c    ACA01830.1      type I restriction enzyme HsdR  Not tested               VPI-2       Protein         0.0    67

• Homologs from VFDB (virulence genes)

Gene           GenBank Accn    Product                                                                  ID of source DB  Alignment Type  E-val  Identity (%)
CGSHiEE_02750  YP_001290371.1  putative type I site-specific restriction-modification system, R subunit  VFG1098          Protein         0.0    67