Gene Information

Name : CGSHiEE_04165 (CGSHiEE_04165)
Accession : YP_001290621.1
Strain : Haemophilus influenzae PittEE
Genome accession: NC_009566
Putative virulence/resistance : Unknown
Product : putative type I restriction enzyme HindVIIP R protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 860295 - 863480 bp
Length : 3186 bp
Strand : -
Note : -

DNA sequence :
ATGATGGAGAACAATAAAATGCTCAACGAAAACGACATAGAACAACTCACTCTTCAGCGCCTGCAATCCCTCGGTTGGGA
ATATCGCTATGGTAAAGACTTGCCTGTTCATGAGGGCAAGTTTGCCCGTGGCGATTTGAGCGGCGTAGTCTTTGTTGAGC
AACTGCGTGAGGCGGTGCGTAAACTTAATCCTCAGCTGCCTGAAAGTGCGGTTGATTCTGTAGTGAAATCGGCAACGAAA
AGCGATATTGGCGACTTGGTTGTGCGTAATCAGACGTTTTATAAACTGCTGCGTGATGGCGTGCGGGTCGAATATACGCA
AAACGGCGAACAGAAAATTGAGATGGTGCGTTTGGTGGATTTCGAGCATTGGGGAAACAACCGTTTTGTCGCCGTCAATC
AGCTGGAAATCCGCAGCCGTAAAGGGGGCAAGCGGATTCCCGATATTATCGGCTTTGTAAATGGCTTACCGTTGGTGGTA
TTTGAGCTCAAAAATCCACTACGTGAATCGGCGGATTTGTTGCAAGCGTTTAATCAGTTTGAAACCTATAAAGATGAAAT
TGCCGAGCTGTTTGTTTACAACCAAGCTCTGATTATTTCAGACGGCATTGTCGCCCGTTTGGGTTCGCTTTCGGCAGATT
TCCAACGCTTTACGCCGTGGAAAGTGGTCGATGAAAAAAATAAAAGCGCGCGGTTATATTTTGACGATGAGTTGCAAAGC
CTGCTTAATGGCCTGTTGCAGCCTAAGGATTTACTCGACTATATCCGCTATTTCGTCTTGTTTGAACGGGATTCCGTTGG
CAAAACCATTAAAAAAATCGCGGCGTACCATCAATATTACGGCGTAAATGAAGCGGTAGAACCCACGATTTTTGCCACAA
GCGAGCAAGGCGATAAACGCATTGGTGTGATGTGGCATACGCAGGGTTCGGGCAAGTCGATTTCGATGCTGTTTTATGCA
GGCAAACTGCTTGCACAGCCTGAATTGAAAAATCCTACCATTGTAGTGGTTACCGACCGCAACGATTTAGACGGTCAGCT
CTTCCAAACCTTTTCTTCAGGCAAAGATTTAATCAAGCAAACACCGCAACAAGTGGAAGACCGTGATCAACTGCGCCAAC
TGCTCGCACAAAATGAAGTCGGCGACGTATTTTTTACCACGATTCAGAAATTCGCCCTAAATGAGGAAGAAAGCCGCTTC
CCTATTTTAAATGAGCGCAACAATATTATTGTGATCAGCGATGAGGCTCACCGCAGCCAATATGGCTTTACGCAAAAGCT
GCATAACGGCAAGTTTCAGACAGGTTATGCTCGCCATTTGCGTGATGCTTTACCTAATGCCTCGTTTATTGGTTTTACAG
GTACGCCAATTAGCCTTGAAGATAAGGACACGCAAGATGTGTTCGGTCGTTATGTGTCCATTTATGACTTGCAAGATGCG
GTGGAAGATGGTGCAACTGTGCCGATTGTGTATGAAGCACGCCAAATCAAGCTAGCGGAGAATGCTAATCACGATGAATT
ATTTGCAGAAATTGATGAACTGCTGGAAGGCGAAGAAAACCCGAAATTACGCTTGCGAGAAAAATTGTTCGGCTCAGAAG
CACGATTGCATGATTTAGCGATCGATTTTGTGCAACATTTTGCAAAACGCAATGAAGTGGTTGACAGCAAAGCGATGATG
GTGGTTTCCAGCCGTCAGATTTGCGTGGACTTATACAATGAAATCATCAAATTGCGCCCTGAATGGCATTCGGACAATAT
CAACGAAGGGGCGATTAAAATTGTGATGACAGGTTCTGCTTCCGATGCGTCTGAAATGCAGAAACACGTTTACAGTAAGC
AGGAAAAACAAACGCTAGAACGCCGCTTTAAAGACCCGAACGATCCGCTGAAGGTGGTGATTGTGCGTGATATGTGGCTG
ACAGGCTTTGATGCACCGTGCTGTAACACTATGTATCTTGATAAGCCAATGAAGGGGCATAACCTAATGCAAGCCATCGC
ACGAGTAAACCGTGTATTCCGCAATAAAAGCCGAGAAAATGGCGGCTTGATTGTGGATTATGTAGGCATTGCTGACGAGC
TCGAAAAAGCCACTCGGCAATATACAAACTCACAAGGCAAGGGCAAACTGGCTGATAGCGTGATTGATGTATTCTTTAAA
ATGAAAGAGCATTTAGAAGTTATCCGCAGCCTGTTTGCAACGCCAGTTGAGGGGAAAACCTTTGATGTTCAGACGGTCTT
AGAAAAAGATAATCCGAATGATCTTTTGATGGCGATTCGTTTTGCCGCCAACCATATTTTAAGCCTTGATCAATTATCGT
TTGATGGCAAAGCGCACGAGCAGCATTGGTTTAATAAAAAAGAAACCGAACCACGCAAAAAAGCCTTTTTGAAAGCGGCA
GGCTTGGTAAAAAAAGGCTATATGCTGTGCGGCACATTGGCTGAAGTTGAGCCGTATAACCAAGAAATCGCCTTTTATGA
TGCCGTACGGGCAATTTTAACTAAACGTGAACAAAAAGGCACAGGCACAAATGAAAGACAGATTTTATTGAAAAAATTGG
TTAATCAAACTGTGTATTCTGAAGGCGTGATTGATTTATTTGATCTGCTAGAAAAACCACAACCACAAATTAGCTTGCTT
TCCGAGGAATTTTTACAAACTGTAAAAAATAGCCCGACTAAAAATTTATGGGTTACGGCAATGGAACGTTATTTAGCAAG
TGAAATTAAAGTTAAATCAGGCACAAACTTAACATTGCAAAAAGATTTTGAACGGCGTTTGAAGGAAGCATTGAATCAAT
ACCACAATCACAATTTGACTGTGGTAGAGATTTTGGACGAACTCTTTAAAATGAGCCAAGATTTCCAAGAACGTTTAGCA
TTAGGGAAAAAACTAGGATTAACCAAGGAAGAACTAGCCTTCTATGAAGCTCTATCTCAAAATCAAAGTGCAAAAGATTT
GATGGGTGATGAAGTGCTTTCTAAACTGGCGAAAGAAATCACGGAAACACTTAGAAAATCGGTCACAATCGACTGGCAGT
ACAAAGAAGCGGTGCGGGCAAAAATGCGTATTCTCGTTAAACGCGCACTACAACGCTACAAATATCCACCCGATAAACAG
GAAGAAGCGATAACTTATGTGATTAAACAAGCTGAAGAAATTGCTGAGGATTTAACTGGTTTATAA

Protein sequence :
MMENNKMLNENDIEQLTLQRLQSLGWEYRYGKDLPVHEGKFARGDLSGVVFVEQLREAVRKLNPQLPESAVDSVVKSATK
SDIGDLVVRNQTFYKLLRDGVRVEYTQNGEQKIEMVRLVDFEHWGNNRFVAVNQLEIRSRKGGKRIPDIIGFVNGLPLVV
FELKNPLRESADLLQAFNQFETYKDEIAELFVYNQALIISDGIVARLGSLSADFQRFTPWKVVDEKNKSARLYFDDELQS
LLNGLLQPKDLLDYIRYFVLFERDSVGKTIKKIAAYHQYYGVNEAVEPTIFATSEQGDKRIGVMWHTQGSGKSISMLFYA
GKLLAQPELKNPTIVVVTDRNDLDGQLFQTFSSGKDLIKQTPQQVEDRDQLRQLLAQNEVGDVFFTTIQKFALNEEESRF
PILNERNNIIVISDEAHRSQYGFTQKLHNGKFQTGYARHLRDALPNASFIGFTGTPISLEDKDTQDVFGRYVSIYDLQDA
VEDGATVPIVYEARQIKLAENANHDELFAEIDELLEGEENPKLRLREKLFGSEARLHDLAIDFVQHFAKRNEVVDSKAMM
VVSSRQICVDLYNEIIKLRPEWHSDNINEGAIKIVMTGSASDASEMQKHVYSKQEKQTLERRFKDPNDPLKVVIVRDMWL
TGFDAPCCNTMYLDKPMKGHNLMQAIARVNRVFRNKSRENGGLIVDYVGIADELEKATRQYTNSQGKGKLADSVIDVFFK
MKEHLEVIRSLFATPVEGKTFDVQTVLEKDNPNDLLMAIRFAANHILSLDQLSFDGKAHEQHWFNKKETEPRKKAFLKAA
GLVKKGYMLCGTLAEVEPYNQEIAFYDAVRAILTKREQKGTGTNERQILLKKLVNQTVYSEGVIDLFDLLEKPQPQISLL
SEEFLQTVKNSPTKNLWVTAMERYLASEIKVKSGTNLTLQKDFERRLKEALNQYHNHNLTVVEILDELFKMSQDFQERLA
LGKKLGLTKEELAFYEALSQNQSAKDLMGDEVLSKLAKEITETLRKSVTIDWQYKEAVRAKMRILVKRALQRYKYPPDKQ
EEAITYVIKQAEEIAEDLTGL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 48
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 42
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 42
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 42