Gene Information

Name : HICON_18090 (HICON_18090)
Accession : YP_004138718.1
Strain : Haemophilus influenzae F3047
Genome accession: NC_014922
Putative virulence/resistance : Unknown
Product : type I restriction enzyme HindVIIP R protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 1431932 - 1435132 bp
Length : 3201 bp
Strand : -
Note : -
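
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the listed length, and assumes the final codon is a stop codon. The function and variable names are illustrative and not part of the database.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1

# Coordinates taken from the Position field of this record.
start, end = 1431932, 1435132

length_bp = feature_length(start, end)   # should agree with the Length field
codons = length_bp // 3                  # number of complete codons
protein_len = codons - 1                 # assumes the last codon is a stop codon

print(length_bp, codons, protein_len)    # 3201 1067 1066
```

Under these assumptions the gene spans 3201 bp, or 1067 codons, which corresponds to a 1066-residue protein.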

DNA sequence :
ATGATGGAGAACAATAAAATGCTCAACGAAAACGACATAGAACAACTCACTCTTCAACGCCTGCAACCCCTCGGTTGGGA
ATATCGCTACGGTAAAGACTTGCCTGTTCATGAGGGCAAGTTTGCCCGTGGCGATTTGAGCGGCGTAGTCTTTGTTGAGC
AACTGCGTGAGGCGGTGCGTAAACTTAATCCTCAGCTGCCTGAAAGTGCGGTGGATTCTGTAGTGAAATCGGCAACGAAA
AGCGATATTGGCGACTTGGTGGTACGCAATCAGGCGTTTTATAAACTGCTGCGTGATGGCGTGCGGGTGGAATATCAGGC
TCCCAATAGCCACGGACAAAATGAGCAGAAAATCGAGATGGCACGTTTGGTGGATTTCGAGCATTGGGGAAACAACCGTT
TTGTCGCCGTCAATCAATTGGAAATCCGCAGCCGCAAAGGAGGTAAACGAATTCCCGATATTATCGGCTTTGTCAATGGC
TTACCGTTGGTGGTATTTGAGCTCAAAAATCCGCTGCGTGAATCGGCGGATTTGTTGCAGGCGTTTAATCAATTTGAAAC
CTATAAAGATGAAATTGCCGAGCTGTTTGTTTACAACCAAGCTCTGATTATTTCAGACGGCATTGTCGCCCTTTTGGGTT
CGCTTTCGGCAGATTTCCAACGTTTTACGCCGTGGAAAGTGGTCGATGAAAAAAATAAAAGCGCGCGGTTATATTTTGAC
GATGAGTTGCAAAGCCTGCTTAATGGCTTGCTAAAGCCTGAGGATTTATTGGACTATATCCGCTATTTCGTCTTGTTTGA
ACGGGATTCCGTTGGCAAAACCATTAAAAAAATCGCGGCGTACCATCAATATTACGGCGTAAATGAAGCAGTCGATTCCA
CCATTTGGGCGACTTCAGAACAAGGCGACCGCCGCATTGGTGTGATGTGGCACACGCAGGGTTCGGGCAAGTCGATTTCG
ATGCTGTTTTATGCAGGCAAACTGCTTGCACAGCCTGAATTGAAAAATCCTACCATTGTAGTGGTTACCGACCGCAACGA
TTTAGATGGTCAGCTTTTCCAAACTTTTTCTTCAGGCAAAGATTTAATCAAACAAACACCGCAGCAAGTAGAAGACCGTG
ATCAACTGCGCCAACTGCTCGCACAAAATGAAGTCGGCGGCGTGTTTTTTACTACGATTCAGAAATTCGCCCTAAATGAG
GAAGAAAGCCGCTTCCCTATTTTAAATGAGCGCAGTAATATTATTGTGATCAGCGATGAGGCTCACCGCAGCCAATATGG
CTTTACGCAAAAGCTGCATAACGGCAAGTTTCAGACAGGTTATGCTCGCCATTTGCGTGATGCTTTACCTAATGCCTCGT
TTATTGGTTTTACAGGTACGCCAATTAGCCTAGAAGATAAGGACACGCAAGATGTGTTCGGTCGTTATGTGTCCATTTAT
GACTTGCAAGATGCGGTGGAAGATGGCGCAACCGTGCCGATTGTGTATGAAGCACGCCAAATCAAGCTAGCGGAGAATGC
TAATCACGATGAATTATTTGCAGAAATTGATGAACTGCTGGAAGGCGAAGAAAACCCGAAATTACGCTTGCGAGAAAAAT
TGCTCGGCTCAGAAGCTCGATTGCATGATTTAGCGGTCGATTTTGTGCAACATTTTGCCAAACGCAATGAAGTGGTGGAC
AGCAAATCGATGATGGTGGTTTCCAGCCGTCAGATTTGCGTGGATTTGTATAATCAGATCATCGCTCTGCACCCTGAATG
GCATTCGGACAATATTAACGAAGGGGCGATTAAAATTGTGATGACAGGTTCTGCTTCCGATGCGTCTGAAATGCAGAAAC
ACGTTTACAGTAAGCAGGAAAAACAAACGCTAGAACGCCGCTTTAAAGACCCGAACGATCCGCTAAAAGTGGTGATTGTG
CGTGATATGTGGCTGACAGGCTTTGATGCACCGTGCTGTAACACTATGTATCTTGATAAGCCAATGAAGGGGCATAACCT
AATGCAAGCCATCGCCCGAGTAAACCGTGTATTCCGCAATAAAAGCCGAGAAAATGGCGGCTTGATTGTGGATTATGTAG
GCATTGCTGACGAGCTCAAAGATGCCACCCAACAATATACTAACTCGCAAGGCAAGGGCAAGCTGGCTGATAGCGTGATT
GATGTATTCTTTAAAATGAAAGAGCATTTAGAAGTTATCCGCAGCCTGTTTACAACGCCAGTTGAGGGGAAAACCTTTGA
TGTTCAGACGGTCTTAGAAAAAGATAATCCGAATGAGCTTTTGATGGCGATTCGTTTTGCTGCCAACCATATTTTAAGCC
TTGATCAATTATCGTTTGATGGCAAAGCGCACGAGCAGCATTGGTTTAATAAAAAAGAAACAGAACCACGCAAAAAAGCC
TTTTTGAAGGCGGCAGGCTTGGTAAAAAAAGGCTATATGCTGTGCGGCGCATTGGCTGAAGTTGAGCCGTATAACCAAGA
AATCGCCTTTTATGATGCCGTACGGGCAATTTTAACTAAACGTGAACAGAAAGGCACAGGCACAAATGAAAGACAGATTT
TATTGAAAAAATTGGTTAATCAAACTGTGTATTCTGAAGGCGTGATTGATTTATTTGATCTGCTAGAAAAACCACAACCA
CAAATTAGCTTGCTTTCCGAGGAATTTTTACAAACTGTAAAAAATAGCCCAACTAAAAATTTATGGGTTACGGCAATGGA
ACGTTATTTAGCAAGTGAAATTAAAGTTAAATCAGGCACAAACTTAACATTGCAAAAAGATTTTGAACAGCGTTTGAAAG
AAGCATTGAATCAATACCACAACCATAATTTGACTGTGGTAGAGATTTTGGACCAACTCTTTAAAATGAGCCAAGATTTC
CAAGAACGTCTAGCATTAGGGAAAAAACTAGGATTAACCAAAGAAGAACTTGCCTTCTATGAAGCTCTATCTCAAAATCA
AAGTGCAAAAGATTTGATGGGTGATGAAGTGCTTTCTAAACTGGCGAAAGAAATCACGGAAACACTTAGAAAATCGGTCA
CAATCGACTGGCAGTACAAAGAAGCGGTGCGGGCAAGAATTAGATTACTCGTTCGACGTGCCTTACAAAAATATAAATAC
CCGCCTGATAAACAGGAAGAAGCGGTAACTTATGTGATTAAACAAGCTGAAGAAATTGCTGAGGATTTAACTGGTTTATA
A

Protein sequence :
MMENNKMLNENDIEQLTLQRLQPLGWEYRYGKDLPVHEGKFARGDLSGVVFVEQLREAVRKLNPQLPESAVDSVVKSATK
SDIGDLVVRNQAFYKLLRDGVRVEYQAPNSHGQNEQKIEMARLVDFEHWGNNRFVAVNQLEIRSRKGGKRIPDIIGFVNG
LPLVVFELKNPLRESADLLQAFNQFETYKDEIAELFVYNQALIISDGIVALLGSLSADFQRFTPWKVVDEKNKSARLYFD
DELQSLLNGLLKPEDLLDYIRYFVLFERDSVGKTIKKIAAYHQYYGVNEAVDSTIWATSEQGDRRIGVMWHTQGSGKSIS
MLFYAGKLLAQPELKNPTIVVVTDRNDLDGQLFQTFSSGKDLIKQTPQQVEDRDQLRQLLAQNEVGGVFFTTIQKFALNE
EESRFPILNERSNIIVISDEAHRSQYGFTQKLHNGKFQTGYARHLRDALPNASFIGFTGTPISLEDKDTQDVFGRYVSIY
DLQDAVEDGATVPIVYEARQIKLAENANHDELFAEIDELLEGEENPKLRLREKLLGSEARLHDLAVDFVQHFAKRNEVVD
SKSMMVVSSRQICVDLYNQIIALHPEWHSDNINEGAIKIVMTGSASDASEMQKHVYSKQEKQTLERRFKDPNDPLKVVIV
RDMWLTGFDAPCCNTMYLDKPMKGHNLMQAIARVNRVFRNKSRENGGLIVDYVGIADELKDATQQYTNSQGKGKLADSVI
DVFFKMKEHLEVIRSLFTTPVEGKTFDVQTVLEKDNPNELLMAIRFAANHILSLDQLSFDGKAHEQHWFNKKETEPRKKA
FLKAAGLVKKGYMLCGALAEVEPYNQEIAFYDAVRAILTKREQKGTGTNERQILLKKLVNQTVYSEGVIDLFDLLEKPQP
QISLLSEEFLQTVKNSPTKNLWVTAMERYLASEIKVKSGTNLTLQKDFEQRLKEALNQYHNHNLTVVEILDQLFKMSQDF
QERLALGKKLGLTKEELAFYEALSQNQSAKDLMGDEVLSKLAKEITETLRKSVTIDWQYKEAVRARIRLLVRRALQKYKY
PPDKQEEAVTYVIKQAEEIAEDLTGL
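
The relationship between the DNA and protein sequences above can be illustrated with a minimal translation sketch. It assumes the standard genetic code (bacterial codon assignments are the same) and that the DNA is given 5' to 3' as the coding strand. The helper name `translate` and the two short excerpts are chosen here for illustration only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3      # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":                     # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 36 bases and last 18 bases of the DNA sequence in this record.
dna_prefix = "ATGATGGAGAACAATAAAATGCTCAACGAAAACGAC"
dna_suffix = "GATTTAACTGGTTTATAA"

print(translate(dna_prefix))  # MMENNKMLNEND
print(translate(dna_suffix))  # DLTGL
```

Both outputs match the start and end of the listed protein sequence, and the final TAA codon is read as a stop.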

Homologs from PAI DB

Gene | GenBank accession | Product | Virulence or resistance | PAI or REI | Alignment type | E-value | Identity (%)
api52 | CAF28526.1 | hsdr-like Type I restriction enzyme | Not tested | YAPI | Protein | 0.0 | 48
hsdR | BAH57699.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-VII SCCmec | Protein | 0.0 | 42
hsdR | BAD24840.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-V SCCmec | Protein | 0.0 | 42
hsdR | YP_251977.1 | type I restriction-modification system restriction subunit | Not tested | SCCmec | Protein | 0.0 | 41