Gene Information

Name : CGSHiGG_03090 (CGSHiGG_03090)
Accession : YP_001292001.1
Strain : Haemophilus influenzae PittGG
Genome accession : NC_009567
Putative virulence/resistance : Virulence
Product : putative type I site-specific restriction-modification system, R subunit
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 565268 - 568288 bp
Length : 3021 bp
Strand : +
Note : COG0610 Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
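
The Position and Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a stop codon; the function names are hypothetical and not part of the database.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(nt_length: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of nt_length nucleotides."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = nt_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop else codons


length_bp = feature_length(565268, 568288)
print(length_bp)                           # 3021, matching the Length field
print(expected_protein_length(length_bp))  # 1006 amino acids
```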

DNA sequence :
ATGGTTTCAGGAACTAAGGAAAAAGATTTGGAAATTGCCATCGAGAAAGCTTTAACCGGCACTTGGCGTGAAAACATGGA
AAATAAGCTGGGCGAGCCGAAGGCGGAATACCTGCCACGCCATCATGGCTTTAAGTTGGCATTTTCACAGGATTTTGATG
CGAAGTTTGCGATTGACTTGCGTCTGTTTTGGCAATTCCTGCAAACCGGCCAAAAGGCAGAACTTGTCCGTTTTCAACAG
CTCAATCCAAACGACTGGCAACGTAAAATTCTAGAGCGGTTAGACCGCCAAATAAAGAAATATGGCGTATTGCAACTCTT
GAAAAAAGGCTTGGACATTGATAGCGCCCATTTTGATTTGCTCTACCCCGTTCCGCTTGCCAGCAGCGGTGAAAAGGTCA
AGCAGCGTTTTGAACAGAATCTGTTTAGCTGTATGCGACAAGTGCCTTATTCCGCATCAAGCAATGAAACAGTGGATATG
GTGCTGTTTGTCAATGGCTTGCCAATCATTACCCTTGAGCTGAAAAACCATTGGACGGGTCAGACGGCTATTGATGCCCA
AAAACAATATCGCAACCGAGATTTAAGCCAAACGCTGTTTCATTTTGGGCGTTGTTTGGCGCATTTTGCCTTAGATACGG
AAGAAGCTTATATGACCACCAAGCTGGCGGGGTCTGATACGTTTTTCTTGCCGTTCAATTTGGGCAACAACTATGGCAAA
GGCAACCTGGCCAATCCAAATGGGCACCGCACGGCGTATTTATGGCAAGAGGTGTTCAGCAAAGCCAGCCTTGCCAACAT
TATTCAGCATTTTATGCGTTTAGATGGGTCAACCAAAGATCCACTGGAGAAACGCTCGCTCTTTTTCCCTCGTTATCATC
AATTAGAGGTGGTGCGCCGTTTGATTGCGGATGTCAGTGAACAAGGCGTGGGCAAACGCTATTTGATTCAACACTCTGCC
GGCTCGGGTAAGTCTAATTCCATTACTTGGTTGGCGTATCAGTTGATTGAGGCTTATCCGCGCAATGAAAAGGCGGCAAA
CGGCAGGGAGGCAGACCGCCCGATTTTTGATTCGGTGATTGTCGTAACCGACCGTCGTTTGCTGGATAAACAGCTGCGCG
ACAATATCAAAGATTTTTCGGAAGTTAAAAACATTGTTGCGCCGGCGTTGAGTTCGGCGGAATTGCGCCAATCGCTTGAG
CAGGGCAAAAAAATCATTATTACCACGATTCAAAAATTTCCGTTTATTGTCGATGGCATTGCTGATTTAGGCGACAAACA
ATTTGCGGTGATTATTGATGAGGCACACAGCTCACAATCAGGTTCGGCACACGACAATATGAACCGGGCCATCGGCAAAA
CGGAAGACCTTGATGCCGAAGATGTGCAAGATTTGATTTTACAAACCATGCAATCCCGCAAAATGCGCGGTAATGCGTCG
TATTTTGCTTTCACCGCCACACCGAAAAACAGCACTTTGGAAAAATTCGGCGAGAAACAGGCGGATGGCAAGTTTAAGCC
GTTCCACCTTTATTCTATGAAGCAGGCGATTGAAGAAGGCTTTATTTTGGATGTAATCGCCAATTACACCACCTATAAAA
GTTTTTATGAGATCACTAAATCGATTGAAGATAATCCGGAGTTTGATAGTAAGAAGGCTCAAGGCCGTCTGAAAGCTTAT
GTGGAGTGCTCTCAACAAACGATTGATACCAAGGCAGAAATCATGCTGGATCATTTTATCCAGCAAATCTTGAACCGTAA
AAAACTCAAAGGCAAAGCCAAGGGAATGGTGGTAACACAAAATATTGAAACGGCCATTCGTTATTTTCAGGCGTTAAACC
GCCTTTTAGCTGAACGAGGTAATCCGTTTAAAATCGCCATTGCGTTTTCAGGCAGTAAAGTGGTTGATGGTGTTGAATAC
ACCGAAGCGGAAATGAACGGCTTTGCAGAAAGCGAAACCAAAGAGTATTTCGATCAAGATGAATATCGTTTGCTGGTGGT
CGCCAATAAATATCTGACCGGTTTTGACCAGCCGAAATTGTGTGCGATGTATGTGGATAAGAAACTCTCCGGCGTACTTT
GTGTGCAAGCTTTATCTCGTTTAAATCGCAGTGCGAATAAGTTGGGTAAACGCACGGAAGATTTGTTTGTGTTGGACTTT
TTTAACAGCGTTGAAGATATTCAGCAAGCATTTGAGCCGTTTTATACTTCTACGTCGTTGTCGCAAGCAACCGATGTCAA
TGTCTTGCATGATTTGAAAGATCAGTTGGATGAAACAGGCGTGTATGAACAAGCGGAGGTCAACGATTTTACTGAAGGCT
ATTTTGCCAATAAAGACGCACAACAATTAAGCAGTATTATTGATGTAGCTGTCCAACGTTTCGACCATGAGTTGGCGCTT
GAGCCAACCCAAAAAGTCGATTTTAAAATCAAGGCAAAACAGTTTTTAAAAATTTACGGGCAAATGGCTTCCATCATCAA
TTTTGAAAATATCGCTTGGGAAAAGCTCTATTGGTTCCTCAAATTCTTAGTACCCAAATTAAAAGTACAAGACCCGATGG
ATGAATTTGATGAAATTTTAGATGCGGTGGATTTAAGCTCTTACGGCTTGGCACGTACCAAACTGAATTACAGCATTAAA
TTAGATGATGAAGAAACAGAGCTTGACCCGCAAAACCCCAATCCGCGTGGTACGCATGGTGAAGATAAAGAAAAAGATCC
GATTGATGAAATTATTCGTGTATTTAACGAAAGATGGTTTCAAGATTGGAGCGCAACGCCGGATGAGCAACGGGTAAAAT
TTATCAATATTACCGAGCGCATCCGTAGCCATAAAGACTTTGAGCAGAAATATCAAAATAACCCGGATATTCATACCCGT
GAATTGGCTTTCCAAGCCATTTTGCGCGATGTGATGAGCGAACGCCACCGAGATGAACTAGAGCTATACAAACTTTTTGC
CAAAGATGCCGCATTTAGAACCGCTTGGACGCAAAGTTTGCAACGGGCTTTGGCTGGATAG

Protein sequence :
MVSGTKEKDLEIAIEKALTGTWRENMENKLGEPKAEYLPRHHGFKLAFSQDFDAKFAIDLRLFWQFLQTGQKAELVRFQQ
LNPNDWQRKILERLDRQIKKYGVLQLLKKGLDIDSAHFDLLYPVPLASSGEKVKQRFEQNLFSCMRQVPYSASSNETVDM
VLFVNGLPIITLELKNHWTGQTAIDAQKQYRNRDLSQTLFHFGRCLAHFALDTEEAYMTTKLAGSDTFFLPFNLGNNYGK
GNLANPNGHRTAYLWQEVFSKASLANIIQHFMRLDGSTKDPLEKRSLFFPRYHQLEVVRRLIADVSEQGVGKRYLIQHSA
GSGKSNSITWLAYQLIEAYPRNEKAANGREADRPIFDSVIVVTDRRLLDKQLRDNIKDFSEVKNIVAPALSSAELRQSLE
QGKKIIITTIQKFPFIVDGIADLGDKQFAVIIDEAHSSQSGSAHDNMNRAIGKTEDLDAEDVQDLILQTMQSRKMRGNAS
YFAFTATPKNSTLEKFGEKQADGKFKPFHLYSMKQAIEEGFILDVIANYTTYKSFYEITKSIEDNPEFDSKKAQGRLKAY
VECSQQTIDTKAEIMLDHFIQQILNRKKLKGKAKGMVVTQNIETAIRYFQALNRLLAERGNPFKIAIAFSGSKVVDGVEY
TEAEMNGFAESETKEYFDQDEYRLLVVANKYLTGFDQPKLCAMYVDKKLSGVLCVQALSRLNRSANKLGKRTEDLFVLDF
FNSVEDIQQAFEPFYTSTSLSQATDVNVLHDLKDQLDETGVYEQAEVNDFTEGYFANKDAQQLSSIIDVAVQRFDHELAL
EPTQKVDFKIKAKQFLKIYGQMASIINFENIAWEKLYWFLKFLVPKLKVQDPMDEFDEILDAVDLSSYGLARTKLNYSIK
LDDEETELDPQNPNPRGTHGEDKEKDPIDEIIRVFNERWFQDWSATPDEQRVKFINITERIRSHKDFEQKYQNNPDIHTR
ELAFQAILRDVMSERHRDELELYKLFAKDAAFRTAWTQSLQRALAG
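
The protein sequence above should correspond to the translation of the DNA sequence. The following is a minimal sketch of that check using the standard genetic code, which assigns the same amino acids to codons as the bacterial translation table 11. The `translate` helper is hypothetical, and stopping at the first stop codon is an assumption of this sketch.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a + strand CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 9 nt of the DNA sequence shown above.
print(translate("ATGGTTTCAGGAACTAAGGAAAAAGATTTG"))  # MVSGTKEKDL
print(translate("GCTGGATAG"))                       # AG
```

Pasting the full DNA sequence into `translate` and comparing the result with the listed protein sequence (with line breaks removed) would complete the check.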

• Homologs from PAI DB

Gene          GenBank Accn    Product                         Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity
VC1765        NP_231400.1     type I restriction enzyme HsdR  Not tested               VPI-2       Protein         0.0    68
VC0395_A1363  YP_001217306.1  type I restriction enzyme HsdR  Not tested               VPI-2       Protein         0.0    68
VPI2_0013c    ACA01830.1      type I restriction enzyme HsdR  Not tested               VPI-2       Protein         0.0    68

• Homologs from VFDB (virulence genes)

Gene           GenBank Accn    Product                                                                   ID of source DB  Alignment Type  E-val  Identity
CGSHiGG_03090  YP_001292001.1  putative type I site-specific restriction-modification system, R subunit  VFG1098          Protein         0.0    68