Gene Information

Name : Alfi_1812 (Alfi_1812)
Accession : YP_006410821.1
Strain : Alistipes finegoldii DSM 17242
Genome accession: NC_018011
Putative virulence/resistance : Unknown
Product : HsdR family type I site-specific deoxyribonuclease
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2024284 - 2027325 bp
Length : 3042 bp
Strand : +
Note : PFAM: Type I restriction enzyme R protein N terminus (HSDR_N); Domain of unknown function (DUF3387); Type III restriction enzyme, res subunit; TIGRFAM: type I site-specific deoxyribonuclease, HsdR family
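
The Position and Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this explicitly) and that the final codon of the gene is a stop codon. The function names span_length and protein_length are hypothetical helpers introduced for illustration only.

```python
# Values copied from the Position and Length fields of this record.
START, END = 2024284, 2027325
REPORTED_LENGTH = 3042


def span_length(start: int, end: int) -> int:
    """Length in bp of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Number of residues encoded, assuming the last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


gene_length = span_length(START, END)
print(gene_length == REPORTED_LENGTH)  # True
print(protein_length(gene_length))     # 1013
```

Under these assumptions the coordinates agree with the reported length of 3042 bp, and the expected protein length of 1013 residues matches the protein sequence listed in this record.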

DNA sequence :
ATGCACTTCACCGAAGATGATTTTGAAAACGCTATTCTCGAGTTGTTTCGAGAGCAATTAGGCTATGATTATGTGTACGG
TCCCAATGTAATGCGCGACTATGCAGAACCGCTTTACGTGGAGGTGCTGGAGGCTGTGTTGCCGCAGATCAATCGTGGAC
TGCCACAGGCCGCTATTGACGAAGCTATGGTGAAAATTCGGACCTATGAGGGCGGAACGTTGGTACAGAAGAATGAATTG
TTCACGGATTATTTGCAAAATGGCGTAGCTGTCAATTATTTCGATGGCCGCGAGCAATGTTCCGCAAATGTCCGGCTTGT
CGATTACGATTCACCGTTACATAATCGGTTTACGATCGCTAATCAATGGACGGTCGATGGGCACTCGGTAAGGCGTGCGG
ATATGATCGTATTTGTCAATGGATTGCCGCTGGTGGTGGTCGAACTCAAATCGCCATCGCGTGAGAATACAGACGTGTCG
GAAGCTTATGCACAATTGCGTAACTATATGCAGGAGATTCCGTCACTCTTTATCTATAACGCTTTTTGTGTGATGAGCGA
TCAGGCGATGACTAAGGCGGGGACGATCACAGCGGGTGAAGACCGTTTTATGCAGTGGAAAACGGTAGATGGGAGTTATG
AAGATACCCATAGCGCGAATTTCGATGTGCTTTTCGCGGGAATGTTCGAAAAAACGCGGTTTGTTGAATTGTTGCGGAAT
TTTGTTTGTTATTCGAAAGACGGTAAACAGGATATTAAGATATTAAGTGCCTATCATCAGTTTTATGCCGTACACAAGGC
TGTGCTTTCGACGGTTAAAGCAGCTGAGACAGATGGTCGGGGTGGCGTGTTTTGGCATACGCAGGGCAGCGGAAAGTCAT
TGTCGATGGTCTTTTTCGCCAAGCAGTTGCAGCAGGCGATGTCGTCGCCGACTATCGTCGTGCTGACAGACCGTAACGAT
TTGGACGGTCAGTTGTACCGGCAGTTCGCTTGTTGCAGGGATTTTTTGCGTCAGACACCCGTGCAGGCCGAAAGTCGGGC
TCATCTTCGGGAATTATTGGCGGGACGCGAAGCGAACGGTATCTTTTTCTCGACGATGCAGAAATTCGAAGAGAGCGAAG
AACCTCTTTCGACGCGACGAAATATAGTTGTTATGGCCGACGAGGCGCATCGCAGTCAATACGGATTGGAGGAGAGGGTC
AGGATGGTTACGGATGCTGACGGGGTAACGCAAGCCAAAGTTGTAATCGGCGCGGCGCGTCTGGTGCGTAATGCGTTGCC
GAATGCTACCTATATCGGGTTTACCGGAACGCCTATTTCGCAAAAAGATCGGTCGACACGCGAAGTGTTTGGAGATTACA
TCGACGTGTACGATATGACGCAGTCGGTGGAGGACGGTGCGACACGGCCGGTATTTTACGAGAGCCGTGTAATCAATCTG
AAACTCGACGAGCAGTCCTTGCGGCGTATTGATGCGGAGTATGATGCGATGGCTGAGGAGGCGGAAGAGTATGTCATTGA
GAAAAGCAAGCGTGAATTGGGGCGGCTCGACTCGATCCTTGGAGCTGACGCGACGGTGGCATCATTGTGCGAGGACATCG
TAAAACACTATGAGGAATTCCGGCAATACGAGCAGACGGGTAAGGCGATGATAGTAGCCTATTCGCGGCCAATAGCGATC
AAGATTTACCGTCGGATTCTCGAAATGCGTCCGGTGTGGAGTGACAAGCTGGCTGTTGTGATGACTTCCGGTAATAAAGA
TCCGGAAGACTGGCGGGCGATTATTGGAAATGATTCCCACAAGAAAGAGTTGGAGAAGCGGTTCAAAGACAACGATAGCT
CGTTGAAAATCGTCATCGTAGTTGATATGTGGCTTACGGGTTTCGACGTACCTTCGCTTTCGACGATGTATGTCTATAAA
CCGATGTCCGGACACAATCTAATGCAGGCTATTGCTCGTGTGAATCGTGTGTTTGGGGATAAACAAGGTGGTTTGGTTGT
GGATTATGTGGGTATCGCTTCGGCGTTGAAGACGGCGATGAACGATTATACATACCGTGACCGCAAAAATTATGGTGATA
CGGATGTGGCTAAAACCGCCTATCCGGAGTTTCAGAAGAAACTGGACGTTTGCCGTGATCTGATGTATGGATTCGATTAT
GGTGCTTTCTTCGGTAAGTCTGATTTGGAGCGGGCGAAAGCCATCAGCGGAGGTGTCGATTTCATGCAGTCCCCCGAGCG
GATGGAAACGAAAAAACTCTATATCAAAGAGGCGCTGCTGCTGCGGCAGGCATTGTCGCTTTGTCAGAGTTTACTGAATT
ATGAGCAACGTATCGAAGCTGCCTATTTTGAGGCGGTTCGCACATTACTGACGCGCGTGGAAGCCAAGGGCAAGATTTCG
TTCCGTGAGATTAACGGGCGTATTAATGAATTGCTCAAGCAGAGTATCAAGAGCGAAGGGGTAATTAATCTTTTCTCCGA
TATCAAGGAGGAGTTCTCTTTGTTCGATTCGAAATTCCTGGAAGAGGTTGCCCGGATGAAGGAACGGAACTTCGCCGTAG
AATTATTGCGTAGGTTGATTGCAGAGCAGGTACAACTATATCAGCGAACGAATACGGTACGAGCCGAGAAGTTTTCGGAA
ATTCTGGCCGATGCCATGAGCCGCTATTTGAAAGGGATGCTGACGAACGAAGAGGTTATCGAAGAACTGCTGAAAATAGC
CCGTGAGATCGTTCACGGCGAAAAGGCCGGCAAGTCGCTTAATCTGAACAGCGAAGAACTTGCCTTTTATGATGCGTTGA
CCAAGCCTGAGGCTGTAAAAGATTTCTATTCCAACGATCAGTTGGTCGCTATTACGCGAGAGTTGACAGATGCGCTTCGG
CGAAACAAGACGATTGACTGGAATATGAAGGAGAGTGCTCGTGCCGGAATGCGGCGTATTGTCAAACGATTGTTGAAAAA
GTATAATTATCCGCCTGCTGGGCAGGAAGATGCTTTGAATACGATTATGGAGCAGTGTAAGAAGTGGAACGAAAATAATT
GA

Protein sequence :
MHFTEDDFENAILELFREQLGYDYVYGPNVMRDYAEPLYVEVLEAVLPQINRGLPQAAIDEAMVKIRTYEGGTLVQKNEL
FTDYLQNGVAVNYFDGREQCSANVRLVDYDSPLHNRFTIANQWTVDGHSVRRADMIVFVNGLPLVVVELKSPSRENTDVS
EAYAQLRNYMQEIPSLFIYNAFCVMSDQAMTKAGTITAGEDRFMQWKTVDGSYEDTHSANFDVLFAGMFEKTRFVELLRN
FVCYSKDGKQDIKILSAYHQFYAVHKAVLSTVKAAETDGRGGVFWHTQGSGKSLSMVFFAKQLQQAMSSPTIVVLTDRND
LDGQLYRQFACCRDFLRQTPVQAESRAHLRELLAGREANGIFFSTMQKFEESEEPLSTRRNIVVMADEAHRSQYGLEERV
RMVTDADGVTQAKVVIGAARLVRNALPNATYIGFTGTPISQKDRSTREVFGDYIDVYDMTQSVEDGATRPVFYESRVINL
KLDEQSLRRIDAEYDAMAEEAEEYVIEKSKRELGRLDSILGADATVASLCEDIVKHYEEFRQYEQTGKAMIVAYSRPIAI
KIYRRILEMRPVWSDKLAVVMTSGNKDPEDWRAIIGNDSHKKELEKRFKDNDSSLKIVIVVDMWLTGFDVPSLSTMYVYK
PMSGHNLMQAIARVNRVFGDKQGGLVVDYVGIASALKTAMNDYTYRDRKNYGDTDVAKTAYPEFQKKLDVCRDLMYGFDY
GAFFGKSDLERAKAISGGVDFMQSPERMETKKLYIKEALLLRQALSLCQSLLNYEQRIEAAYFEAVRTLLTRVEAKGKIS
FREINGRINELLKQSIKSEGVINLFSDIKEEFSLFDSKFLEEVARMKERNFAVELLRRLIAEQVQLYQRTNTVRAEKFSE
ILADAMSRYLKGMLTNEEVIEELLKIAREIVHGEKAGKSLNLNSEELAFYDALTKPEAVKDFYSNDQLVAITRELTDALR
RNKTIDWNMKESARAGMRRIVKRLLKKYNYPPAGQEDALNTIMEQCKKWNENN

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
hsdR | YP_251977.1 | type I restriction-modification system restriction subunit | Not tested | SCCmec | Protein | 0.0 | 42
hsdR | BAH57699.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-VII SCCmec | Protein | 0.0 | 42
hsdR | BAD24840.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-V SCCmec | Protein | 0.0 | 42