Gene Information

Name : A7H1H_0910 (A7H1H_0910)
Accession : YP_008330844.1
Strain : Arcobacter butzleri 7h1h
Genome accession: NC_021878
Putative virulence/resistance : Unknown
Product : type I restriction-modification system, R subunit
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 868895 - 871753 bp
Length : 2859 bp
Strand : +
Note : Pfam matches to PF12008.3 EcoR124_C, and to PF04313.9 HSDR_N, and to PF04851.10 ResIII

DNA sequence :
ATGAGTAAACAAAGTGAAGCATTATTGGAAGAGAGTTTAATCAAACAATTAGAGAGTTTGGATTATGAAAGAGTTTATAT
AAAAGATGAAAAAGAGCTTTTAGAAAATCTAAAAAAACAACTTGAAATTCACAATAAAACAACTTTAAGCCAAACAGAAT
TTCAAAGAGTTTTGAATCATCTAAATAAAGGTGATATTTTTGAAAAGGCTACTATTTTAAGAGATAAATATGCACTTTTA
AAAGATGACAATAAAACAACTATCTATATAGAGTTTTTAGATAGCATAAACTGGTGTCAAAACATCTTTCAAGTTACTTC
TCAAGTAACAATAGAAGGAAAATATAAAAATAGATATGATGTAACACTTCTAATAAATGGTTTACCACTTATACAAATAG
AGTTAAAAAGAAGAGGATTAGAGCTAAAAGAAGCTTTTAATCAAATAAACAGATATCAAAGACACTCATATAGTTTTAAT
AGTGCTTTATTCTCATATATTCAAATTTTTGTAATTTCAAATGGAGTAAATACAAAATACTACTCAAACAACAAAAAACA
GTCATTTAAACAAACATTTTTTTGGGCAGATGAAGACAACAACAATATCACAGATTTAAGTGCTTTTACAAAAGTTTTTT
TAGAAAAGTGTCATATATCTAAAATGATATGTAGATATATTGTTTTAGCACAAGCTAAAAAAATACTTATGGTTTTAAGA
CCATATCAATTTTGGGCAGTTGAAGCTATCATAAATAGAGTAAAAAATACAAATAAAAATGGCTATATTTGGCATACAAC
AGGAAGCGGAAAAACTCTAACATCTTTTAAAGCTTCTCAAATTCTTACAAATCTTAGTGAAGTTGAAAAAGTAGTATTTG
TAGTAGATAGAAAAGATTTGGATTCTCAAACTACAAAAGAATTCAATAGCTTTAGTTCAGGAAGTGTAGATGGTACAGAT
AATACAAAAATTCTTGTAAATCAGTTTTTAGATAAGTATAAAGATAAAAAAGGTGAGTTGAGAAATAGTAAACTTATAAT
CACAACTATTCAAAAACTAAATGGTGCTATTTCTAAAAAAAGATATTTAGATGAGATGATGACAATCAAAGATAAAAAGA
TAGTATTTATTTTTGATGAGTGCCATAGAAGTCAGTTTGGAGAAACTCACAAAAATATTGTTAAGTTTTTTACAAATGCA
CAACTTATAGGTTTTACAGGAACACCAATATTTGAAGAAAATGCTTTAGGAAATAAATTTGGTAAAAAAACGACAGCAGA
ACTTTTTGGAGAAAAACTTCATAAATACATAATAACAAATGCAATAAAAGATGAAAATGTTTTAAAATTTTGTGTTGAAT
ATGTAGGAAGATATAAGAAAAAAGATAGTGCAAATGAGATAGATATTGAAGTTGAAGGTATAGATACAAAAGAACTTTTA
GAAAGTGATGCAAGAATTGAAAAAATAGTTGATTATATTTTGGTAAATCACGATAGAAAAACTCACTCAAAAGAGTTTAA
TTCTATGATGTGTGTTAGTAGCGTTGAGGTTTTATGTAAGTATTATGAATCTTTTAAAGCAAAAAGCCATAATCTTAAAA
TTGCAACCATTTTTTCATATAGTACAAATGAAGATGACAAAGATGCAAATGGAGTTTATAGTATAGATGAGAGTGATTTT
ATAGTAGTTGAATCAAATATAAATGAACACTCAAGAGATAAGTTGGAAGAGTATATAAGTGATTATAATAAACTTTTTGA
TACAAACTATACAACAAAAGATACAAAAAGTTTTTATAACTATTATGATGATATTTCAAGAAGAACAAAGAAAAAAGAGA
TAGATATTTTACTTGTTGTAAATATGTTTTTAACTGGATTTGATGCTCCTATATTAAATACTTTATATGTAGATAAAAAT
CTGAAATATCATGGATTAATCCAAGCTTTTAGTAGAACAAATAGGATTTTAAATGAAAAAAAATCACAAGGAAATATAAT
TTGTTTTAGAAATTTGAAAAATAATACAGATGAAGCGATAGCTCTTTTTTCAAATCAAGATAATGAAGATAAAGTTTTGA
TGGAACCATATGAGTATTATATGGAGAAATGTAATGAAATATTTATGAAACTTTTACAAATAGCTCCAAGTGTAAGTAGT
ATAGATAAGATTATAGATGAAAATATTCAGCTTGAATTTATAAAAACTTTTAGAGAACTAATAAGAGTTAAAAATATTTT
AGAGGGATTTGCTGATTTTAAATGGTTTGATTTATCTATGAGTGAACAACAATTTGAAGATTATAAATCAAAATATCTTG
ATTTATATGACAAAATCAGAAGTGAAAAAAATGCAAAAGATGGAAAAGTATCTATTTTGAAAGATGTAGATTTTGAATTA
GAACTTATTCATAAAGATGAGATAAATGTTTCATATATTTTAAAATTATTAGCTAAATATAGTAAAACCAAAACTAAAGA
TAAAACAGTACAAAAAGAAAATATTAATAATCTAATAAATTCAAATCCAAAATTAAGAAGTAAAAAAGAACTAATTGAAG
AGTTTATAAATACTACTTTAGATGGAATTGAGATTGAAAATATAGAAGACGAATTTTTTAGATTTATAGATAGTAAAAAA
GATATTGCTTTTAATGAGTTATGTTTAGAAGAAAAACTTGATATAGGGAAAACAAAAGAATTAGTTGAAAACTATCTTTA
TGATGGAAGAAAACCTATGAGTGATGATATTGTGAATATCCTTGAAACAAAACCAAAATTATTAGAAAGAAAAATAGTAA
TACCTAAAGTTTTAAATAAAATAGTTGAATTTGTAGAAAAATTTTATAATTTTGTTTAA

Protein sequence :
MSKQSEALLEESLIKQLESLDYERVYIKDEKELLENLKKQLEIHNKTTLSQTEFQRVLNHLNKGDIFEKATILRDKYALL
KDDNKTTIYIEFLDSINWCQNIFQVTSQVTIEGKYKNRYDVTLLINGLPLIQIELKRRGLELKEAFNQINRYQRHSYSFN
SALFSYIQIFVISNGVNTKYYSNNKKQSFKQTFFWADEDNNNITDLSAFTKVFLEKCHISKMICRYIVLAQAKKILMVLR
PYQFWAVEAIINRVKNTNKNGYIWHTTGSGKTLTSFKASQILTNLSEVEKVVFVVDRKDLDSQTTKEFNSFSSGSVDGTD
NTKILVNQFLDKYKDKKGELRNSKLIITTIQKLNGAISKKRYLDEMMTIKDKKIVFIFDECHRSQFGETHKNIVKFFTNA
QLIGFTGTPIFEENALGNKFGKKTTAELFGEKLHKYIITNAIKDENVLKFCVEYVGRYKKKDSANEIDIEVEGIDTKELL
ESDARIEKIVDYILVNHDRKTHSKEFNSMMCVSSVEVLCKYYESFKAKSHNLKIATIFSYSTNEDDKDANGVYSIDESDF
IVVESNINEHSRDKLEEYISDYNKLFDTNYTTKDTKSFYNYYDDISRRTKKKEIDILLVVNMFLTGFDAPILNTLYVDKN
LKYHGLIQAFSRTNRILNEKKSQGNIICFRNLKNNTDEAIALFSNQDNEDKVLMEPYEYYMEKCNEIFMKLLQIAPSVSS
IDKIIDENIQLEFIKTFRELIRVKNILEGFADFKWFDLSMSEQQFEDYKSKYLDLYDKIRSEKNAKDGKVSILKDVDFEL
ELIHKDEINVSYILKLLAKYSKTKTKDKTVQKENINNLINSNPKLRSKKELIEEFINTTLDGIEIENIEDEFFRFIDSKK
DIAFNELCLEEKLDIGKTKELVENYLYDGRKPMSDDIVNILETKPKLLERKIVIPKVLNKIVEFVEKFYNFV

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
SSP0054 YP_300144.1 type I site-specific restriction-modification system restriction subunit Not tested SCC15305cap Protein 0.0 45