Gene Information

Name : HICON_03820 (HICON_03820)
Accession : YP_004137536.1
Strain : Haemophilus influenzae F3047
Genome accession: NC_014922
Putative virulence/resistance : Virulence
Product : restriction endonuclease, type I, R subunit
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 114344 - 117364 bp
Length : 3021 bp
Strand : +
Note : -

DNA sequence :
ATGGTTTCAGGAACTAAGGAAAAAGATTTAGAAATTGCCATCGAAAAAGCCTTAACTGGCACTTGGCGTGAAAACATGGA
AAATAAGCTGGGCGAGCCGAAGGCGGAATACCTGCCGCGCCATCATGGTTTTGAGCTGGCATTTTCACAGGATTTTGATG
CGCAGTTTGCCATTGACACGCGCCTGTCTTGGCAATTCCTGCAAACCAGCCAAGAGGCAGAATTAGGTCGTTTTCAACAG
CTCAATCCCAACGATTGGCAACGTAAAATTCTAGAACGGTTAGACCGACAAATAAAGAAAAACGGCGTGTTGCACCTGCT
GAAAAAAGGCTTGGACATTGATAGCGCCCATTTTGATTTGCTCTACCCCGTTCCGCTTGCCAGCAGTGGCGAAAAGGTCA
AGCAGCGTTTTGAACAGAATCTGTTTAGCTGTATGCGACAAGTGCCTTATTCCGCATCAAGCAATGAAACGGTGGATATG
GTGCTGTTTGTCAATGGCTTGCCGATTATTACCCTTGAGCTGAAAAACCATTGGACGGGTCAGACGGCCATTGATGCCCA
AAAACAATATCGCAACCGAGATTTAAGCCAAACGTTGTTCCATTTCGGGCGTTGTTTGGCGCATTTTGCCTTAGATACGG
AAGAAGCTTATATGACCACCAAATTGGCGGGGCCTGCTACGTTTTTCTTGCCGTTTAACTTGGGCAACAACTGCGGTAAG
GGTAATCCGCCCAATCCCAATGGACACCGCACGGCGTATTTATGGCAAGAGGTGTTCGGCAAAGCAAGCCTTGCCAACAT
TATTCAGCATTTTATGCGTTTAGATGGGTCAACCAAAGATCCTTTGGATAAACGCACACTCTTTTTCCCTCGCTATCATC
AATTAGAGGTGGTGCGCCATTTGATTGCTGATGTCAGTGAACAAGGCGTGGGCAAACGCTATTTGATTCAACACTCTGCT
GGCTCGGGTAAATCTAATTCTATTACTTGGTTGGCGTATCAGTTGATTGAGGCTTATCCGCGCAATAAACAGGCGGCAAA
CGGCAGGGAGGCAGACCGCCCGATTTTTGATTCGGTCATTGTCGTTACCGACCGCCGTTTGTTGGATAAGCAACTGCGCG
ACAATATCAAAGATTTTTCAGAAGTTAAAAATATTGTTGCGCCGGCGTTGAGTTCGGCGGAATTGCGCCAATCCCTTGAG
CAGGGCAAAAAAATCATTATTACCACGATTCAAAAATTCCCGTTTATTGTCGATGGCATTGCTGATTTAGGCGACAAACA
ATTTGCGGTGATTATTGATGAGGCACACAGCTCACAATCAGGTTCGGCACACGACAATATGAACCGAGCCATCGGCAAAA
CGGAAGACCTTGATGCTGAAGATGTGCAAGATTTGATTTTACAAACCATACAATCCCGCAAAATGCACGGCAATGCGTCG
TATTTTGCTTTCACCGCCACACCGAAAAACAGCACTTTGGAAAAATTCGGCGAAAAACAGGCGGATGGCAAGTTTAAGCC
GTTCCACCTTTATTCTATGAAGCAGGCAATTGAAGAAGGCTTTATTTTGGATGTAATCGCCAATTACACCACCTATAAAA
GTTTTTATGAGATCACTAAATCGATTGAAGATAATCCGGAGTTTGACAGTAAAAAGGCTCAAAGCCGTCTGAAAGCTTAT
GTGGAGCGTTCGCAACAAACGATTGATACCAAGGCGGAAATTATGCTGGATCATTTTATCCGGCAAATCTTTAACCGTAA
AAAACTCAAAGGCAAAGCCAAGGGAATGGTGGTAACGCAAAATATTGAAACCGCCATCCGCTATTTTCAGGCGTTAAAAC
ACTTGCTGGCCGGGCGGGGTAATCCGTTTAAAATTGCGATTGCGTTTTCAGGCAGTAAAGTGGTTGACGGTGTCGAATAC
ACCGAAGCGGAAATGAACGGCTTTGCAGAAAGCGAAACCAAAGAGTATTTCGATCAAGATGAATATCGTTTGCTGGTGGT
CGCCAATAAATATCTGACCGGTTTCGATCAGCCGAAATTGTGTGCCATGTATGTGGATAAGAAACTCTCCGGCGTGCTTT
GCGTGCAGGCTTTATCTCGTTTGAATCGCAGTGCGAATAAGTTGAGTAAACGCACGGAAGATTTGTTTGTATTGGACTTT
TTTAACAGCGTTGAAGATATTCAGCAGGCATTTAAGCCGTTTTATACTTCTACTTCGTTGTCGCAGGCAACCGATGTCAA
TGTCTTGCATGATTTGAAAGACCGGTTGGATGAAACCGGCGTGTACGAACAAGCGGAGGTCAACGATTTTACTGAAGGCT
ATTTTGCCAATAAAGACGCACAGCAATTAAGCAGTATGATTGATGTGGCTGTCCAACGTTTTGATGATGAATTGGATTTG
GATCGAAATGAAAAAGTTGATTTTAAAATCAAGGCAAAACAGTTTTTAAAAATCTACGGGCAAACGGCCTCCATCATCAA
TTTTGAAAATATCGCTTGGGAAAAGCTCTATTGGTTCCTCAAATTCTTAGTGCCGAAATTAAAAGTACAAGACCCGATGG
ATGAATTTGATGAAATTTTAGATGCTGTGGATTTAAGCTCTTACGGCTTGGCGCGTACCAAACTGAATTACAGCATTAAA
TTAGATGATGAAGAAACAGGGCTTGACCCGCAAAACCCCAATCCGCGCGGTACGCATGGCGAAGATAAAGAAAAAGATCC
GATTGATGAAATTATTCGTGTATTTAACGAAAGATGGTTTCAAGGCTGGAGCGCAATGCCGGATGAGCAACGGGTAAAAT
TTATCAATATTACCGAGCGCATCCGCAACCATAAAGACTTTGAGCAAAAATATCAAAATAACCCGGATATTCATACCCGT
GAATTGGCTTTCCAAGCCATTTTGCGCGATGTGATGAGCGAACGCCATCGGGATGAATTGGAACTATACAAACTTTTTGC
CAAAGATGCCGCATTTAGAACCGCTTGGACGCAAAGTTTGCAACGGGCTTTGGCTGGATAG

Protein sequence :
MVSGTKEKDLEIAIEKALTGTWRENMENKLGEPKAEYLPRHHGFELAFSQDFDAQFAIDTRLSWQFLQTSQEAELGRFQQ
LNPNDWQRKILERLDRQIKKNGVLHLLKKGLDIDSAHFDLLYPVPLASSGEKVKQRFEQNLFSCMRQVPYSASSNETVDM
VLFVNGLPIITLELKNHWTGQTAIDAQKQYRNRDLSQTLFHFGRCLAHFALDTEEAYMTTKLAGPATFFLPFNLGNNCGK
GNPPNPNGHRTAYLWQEVFGKASLANIIQHFMRLDGSTKDPLDKRTLFFPRYHQLEVVRHLIADVSEQGVGKRYLIQHSA
GSGKSNSITWLAYQLIEAYPRNKQAANGREADRPIFDSVIVVTDRRLLDKQLRDNIKDFSEVKNIVAPALSSAELRQSLE
QGKKIIITTIQKFPFIVDGIADLGDKQFAVIIDEAHSSQSGSAHDNMNRAIGKTEDLDAEDVQDLILQTIQSRKMHGNAS
YFAFTATPKNSTLEKFGEKQADGKFKPFHLYSMKQAIEEGFILDVIANYTTYKSFYEITKSIEDNPEFDSKKAQSRLKAY
VERSQQTIDTKAEIMLDHFIRQIFNRKKLKGKAKGMVVTQNIETAIRYFQALKHLLAGRGNPFKIAIAFSGSKVVDGVEY
TEAEMNGFAESETKEYFDQDEYRLLVVANKYLTGFDQPKLCAMYVDKKLSGVLCVQALSRLNRSANKLSKRTEDLFVLDF
FNSVEDIQQAFKPFYTSTSLSQATDVNVLHDLKDRLDETGVYEQAEVNDFTEGYFANKDAQQLSSMIDVAVQRFDDELDL
DRNEKVDFKIKAKQFLKIYGQTASIINFENIAWEKLYWFLKFLVPKLKVQDPMDEFDEILDAVDLSSYGLARTKLNYSIK
LDDEETGLDPQNPNPRGTHGEDKEKDPIDEIIRVFNERWFQGWSAMPDEQRVKFINITERIRNHKDFEQKYQNNPDIHTR
ELAFQAILRDVMSERHRDELELYKLFAKDAAFRTAWTQSLQRALAG

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
VPI2_0013c ACA01830.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 0.0 68
VC1765 NP_231400.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 0.0 68
VC0395_A1363 YP_001217306.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 0.0 68

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
HICON_03820 YP_004137536.1 restriction endonuclease, type I, R subunit VFG1098 Protein 0.0 68