Gene Information

Name : hsdR (BF1836)
Accession : YP_211471.1
Strain : Bacteroides fragilis NCTC 9343
Genome accession: NC_003228
Putative virulence/resistance : Unknown
Product : type I restriction enzyme R protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 2141547 - 2144312 bp
Length : 2766 bp
Strand : +
Note : Similar to Campylobacter jejuni type I restriction enzyme R protein HsdR SWALL:Q8RJ98 (EMBL:AF486635) (987 aa) fasta scores: E(): 2.1e-43, 40.94% id in 977 aa, and to Bacteroides thetaiotaomicron type I restriction enzyme EcoR124II R protein BT4535 SWALL:

DNA sequence :
ATGACAATACAGTCAGAACAAGCATTAGAGCAAGGTTTGATAAAAACACTAGTCTCTATGAATTATCAGAAGATTGAAAT
AGCAGATGAAAATGCTCTGATTGTCAATTTTCGTAATCAACTTAATATACATAACAATATCGAGCTTACCGATGATGAGT
TTAATCGAATAATGATTCATTTGGACAGTGGCTCAATATTCGTAAAAGCAGAAAAACTTCGTAATCGCTTCCCACTCACT
CGTGATGATGAGTCGGTAAAGTGGATTGAGTTCCTAAACACACAAGAGTGGTGTAAGAATGAATTTCAGGTTTCCAATCA
GATTACCGATGAGGGTAGACGTAAATGTAGGTATGATGTTACGATTTTGGTTAATGGACTACCGCTGGTACAAGTGGAGC
TGAAAAAACGTGGTGTAGAACTTAAACAGGCATACAGCCAAGTTCAACGCTACCACAAAACAGCATTCAAGGGACTATTT
AACTATATTCAGATATTTGTCATCTCAAACGGTGTAAACACTCGCTACTTCGCCAATAATCCAAATCAAGGATATAAGTT
TACATTCCCTTGGGCAGATTTCAAAAACAACCATATTGACAGATTAGATCTCTTTGCTGCGATGTTTTTTGAACAATGTA
CACTGGGGAAAATGCTTGCCAAATATGTTGTGTTGCACCAATCAGACAAGTGCCTGATGATTCTCAGACCGTATCAATTT
TATGCAGTTGAGGCTTTGCTTGACAAGGTTGCTAACTCTGTCAAGAACGGATATATTTGGCACACTACAGGCAGTGGTAA
GACGCTTACATCCTTTAAAGCGGCTCAACTAATAGCCGAAATGAGTGATATTGATAAAGTGCTATTTGTTGTTGACAGGC
ACGATTTGGATACCCAAACCAAGAAAGAGTATGATGCTTTTGCTCCGGGAGCAGTGGATAGTACCGACAATACAAAAGAG
TTGGTGAAGGCATTGCAGGGCAAGAAAAAACTGATGATTACTACAATTCAAAAACTGAATAACGCAGTTCAGAAAGATAG
ATACAGTAAAGGCTTACAAGGGGTAAAAGATAAAAATATTGTAATGATATTTGATGAGTGTCACCGCAGCCAGTTCGGAG
AAATGCATTCTAATATAACAGGCTTTTTTAGCAAAATCCGCTACTTTGGATTTACAGGAACCCCCATTTTTGCGGACAAT
GCCAACAATGGTTGCACAACAAAAGATATTTTTGGTGAAAGACTTCACGAATATCTAATCAAAGATGCTATTGCAGATGA
AAACGTACTCGGATTCTTGGTGGAGTATCATGGCAAATGGAAACGAAAAAGTGAAAACGACAAACAGGTTAAATCTATTG
ATACAGCAGAAGTACTATTGAAGGATGAAAGGATAGCCTCAATAACCGATTTTATACTTTCCAACTATAACTCTTCGACA
TACGAAAGAGATTTCAATGCGATGTTGGCAGTTGGTGGAGTTAAAATGCTCATTAAATATTACGATATGTTTAAGAGTCG
CAACCACGACTTAAAGATTGCTACGATCTTCACATATACTCAAAATGAAGAATCAGAGGACGAATTTACAGGTTTAGGGC
AAGGATTTGTTTCTGTTCAAAACACCCGAGATATTTTAGAGAGCTATATCAAAGATTACAACGAGACATTTGGGACGGAA
TTTGACACAGACCATTTCGGGCTATACTACGATGATATCAATAAGCGAATGAAAAACCGTCAAATCGACCTCTTGATAGT
CTCCGATATGTTTCTTACGGGGTTCGATGCTAAGAAACTCAATACACTTTACGTCGACAAAAACCTCAACTATCACGGTC
TATTACAAGCATTTTCACGAACAAATCGCGTGCTGAATGAGAAGAAAAAATTTGGTAAAATCACTTGTTTCAGAGACCTT
AAACAACAGACGGACGATGCAATAAAACTTTACTCCAATAACAAATCGTCAGAGGTAGTCTTAATGAAACCTTACGAAAA
ACTTGTTGAGCGATTCAATGAGATGGCAGCCGAATTTTTATCCTATTTCCCTACGGTAAAAAGCGTTGGCAACTTAGAAT
CAGAGTTAGACAAACGCCGCTTTGTCATTCTCTTCAGAGCAATGCTAAGACTAAGAAATGAGGTGAAGGGATATAATGAA
TTTGATGCAGAAGATCTTACAATTGAAGAACAACGATTTGCGGACTATCAAAGCAAATACCTTGATATGTCCAATGAGTT
TGCTATTACCTCGGAAAAGGAAGATGCCGAAAGTATTCTGCAAGATATAGATTTTGAGTTAGAATTAGTGCACCGAGATA
TTATCAATGTGATGTATATTCTTGCTCTGCTTCAAGACCTCAAGCCGGAGTCTTCATCATATCCAAAAGACAGGAAAGCG
GTTCTTGACACAATGGACTCCAACCCTGAACTGCGTTCTAAGATTGCCCTAATCGACAACTTTATCAAGTTGCATATTGA
CGGACGGCAAAGCAATGATTTACCTGCTGATATGGAGAGCGATTTGGATAAATACATTGCCACTCAAAAGGCTATTGCCA
TAGAGCAAGTTGCCACAGAGGAGGGGATTGATAGTACCCTATTGCACGAATATATTTCTGAATATGAGTATTTAGGGAAA
CCTAAAAACGAAATTATCAAACGAGCCATTGATCCGCTTAAATTATCATTTATGGATGCTCAATCCAAAAAAAAGAGTCT
AATCGACAAAATGAAAGATATTATTAAACTGTTTAGCTGGAATTAA

Protein sequence :
MTIQSEQALEQGLIKTLVSMNYQKIEIADENALIVNFRNQLNIHNNIELTDDEFNRIMIHLDSGSIFVKAEKLRNRFPLT
RDDESVKWIEFLNTQEWCKNEFQVSNQITDEGRRKCRYDVTILVNGLPLVQVELKKRGVELKQAYSQVQRYHKTAFKGLF
NYIQIFVISNGVNTRYFANNPNQGYKFTFPWADFKNNHIDRLDLFAAMFFEQCTLGKMLAKYVVLHQSDKCLMILRPYQF
YAVEALLDKVANSVKNGYIWHTTGSGKTLTSFKAAQLIAEMSDIDKVLFVVDRHDLDTQTKKEYDAFAPGAVDSTDNTKE
LVKALQGKKKLMITTIQKLNNAVQKDRYSKGLQGVKDKNIVMIFDECHRSQFGEMHSNITGFFSKIRYFGFTGTPIFADN
ANNGCTTKDIFGERLHEYLIKDAIADENVLGFLVEYHGKWKRKSENDKQVKSIDTAEVLLKDERIASITDFILSNYNSST
YERDFNAMLAVGGVKMLIKYYDMFKSRNHDLKIATIFTYTQNEESEDEFTGLGQGFVSVQNTRDILESYIKDYNETFGTE
FDTDHFGLYYDDINKRMKNRQIDLLIVSDMFLTGFDAKKLNTLYVDKNLNYHGLLQAFSRTNRVLNEKKKFGKITCFRDL
KQQTDDAIKLYSNNKSSEVVLMKPYEKLVERFNEMAAEFLSYFPTVKSVGNLESELDKRRFVILFRAMLRLRNEVKGYNE
FDAEDLTIEEQRFADYQSKYLDMSNEFAITSEKEDAESILQDIDFELELVHRDIINVMYILALLQDLKPESSSYPKDRKA
VLDTMDSNPELRSKIALIDNFIKLHIDGRQSNDLPADMESDLDKYIATQKAIAIEQVATEEGIDSTLLHEYISEYEYLGK
PKNEIIKRAIDPLKLSFMDAQSKKKSLIDKMKDIIKLFSWN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
SSP0054 YP_300144.1 type I site-specific restriction-modification system restriction subunit Not tested SCC15305cap Protein 1e-179 43