Gene Information

Name : Clocl_1698 (Clocl_1698)
Accession : YP_005046236.1
Strain : Clostridium clariflavum DSM 19732
Genome accession: NC_016627
Putative virulence/resistance : Unknown
Product : HsdR family type I site-specific deoxyribonuclease
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1937122 - 1940337 bp
Length : 3216 bp
Strand : +
Note : PFAM: Domain of unknown function (DUF3387); Type I restriction enzyme R protein N terminus (HSDR_N); Type III restriction enzyme, res subunit; TIGRFAM: type I site-specific deoxyribonuclease, HsdR family; manually curated

DNA sequence :
ATGCCTAAGCTTTGTGAATCTGAAATTGAAAAAATGGCAATAGAAGAACTGGTGAAATTGGGCTATGAATACTTTTCTGG
CCCGGATATAGCACCGGATGCACCTTTTGCAGAGCGTAAAGGTTATGGGGATGTTCTGCTTAAGAAAAGGCTGATTGACG
CGGTAATACAGCTTAATCCCGGACTTCCTTATGATGTGGTTATAGAGGCGGTAAATAAAGTTTCCCGTATCAGTTCATCA
AATCTGATTGGCGACAATGAAACTTTCCATAAAATGCTGGTTGACGGTGTGCCTGTCGAATACCGCAAAAACGGAGATAT
TGTTGGCGATTATGTAAAGCTTGTGGATTTCTCGGAAGATGGAGTTGACAACAACGAATTTCTTGTAGTCAACCAGTTTA
CCGTTATTGAGAACAATAACAATAAGCGGCCTGATATTCTTCTATTTATCAACGGTATTCCTATGGTGCTGTTTGAGCTT
AAAAATCCGGTTGATGAAAATGCCACAATTCGCAAAGCTTATGACCAGATCTGCACATACAAGGCAATTATACCAAGCCT
TTTCACATATAACGAGATATGCGTTATTTCAGATGGGCTGGAAGCTAAGGCAGGTTCATTAACCGCTCCATTCTCACGTT
TTTCCACATGGAAAACAAAAGACGGGCTGAATGAAGCCTCACGATTTGAAGATCAGCTTACCACCCTTATTCACGGGCTG
TGCAACAAGAAAACCTTGCTCGACTATATACGCAATTTTATAACCTTTGAAAAGAGCAAAACTGAGGATAAAAAGACCAA
AATAACAAAAGTGGAAACTGTCAAGAAAATTGCCGCCTATCATCAGTATTACGCTGTTAATAAAGCTGTTAAAAGTACGA
TTGAAGCGGCAAGGGCGGATGGCAGCAAAAAAGCCGGTGTTATATGGCATACTCAAGGATCCGGCAAATCCCTTTCTATG
GTTTTTTATGCTGGAAAACTGGTGCAAAACCTTAACAATCCAACTATAGTTGTTATTACCGACAGAAATGATTTGGACGA
TCAACTCTTTGATACTTTTGCTGGAAACAGCGACCTTCTGCGCCAGCCGCCGAAGCAGGCGGAAAGCTGCGAGCATTTAA
AAGAATTATTAAAGGTAGCATCCGGAGGCATAGTTTTTACCACTATTCAAAAATTCATACCTGATAATGACAGCTCGGTT
TATGAGCTTTTGTCGGAAAGAGACAACATAGTAGTTATAGCCGATGAAGCGCACCGCACGCAGTACGGCTTTAACGCGAA
ACTCAGAGAAATAAAGGACGAAAACAATCAGGTTGTAGGGCAGCGCATAGCTTATGGTTTTGCCAAATATATGCGCGATG
CCTTGCCAAACGCTACTTTTATAGGCTTTACAGGTACACCTGTTGAAAAGCAGGATGCCAACACGCCGGCGGTATTTGGC
AATTACATAGATATTTACGACATTGCACAAGCGGTTGAAGATAAGGTGACAGTTAAAATATATTATGAAAGCCGTCTGGC
TAAAGTCAATCTCACCGAGGAAGGCAAAAAGCTGATTGAGGAGTTCGACAGGGAACTTGAAGAAGTAGATGAAAAAGACG
AAGCAAAAGCCGCAAAAATGAAATGGGCAAAGCTTGAGGCTATTGTCGGCAATAAAGAACGACTTGCCACTCTCGCAAAA
GACATTGTTACGCATTTTGAAGACCGGCAGAAAGTATTTCAGGGTAAAGCGATGATTGTTGCCATGAGCCGTAGAATCGC
CGTTGATTTGTATAATGAAATCATAAAGCTTCGCCCTGAGTGGCATAGTGACGATTTGGACAAGGGTGCTATCAAAGTTG
TTATGACATCTTCCAGTTCTGATGGTCCTGAAATGCAGAAACACCATACTACTAAAGAGCAGCGAAAAATGCTTGCACAG
CGTATGAAAGATGAAAATGATCCACTGAAAATTGTCATCGTGCGCGATATGTGGCTGACTGGCTTTGACGTTCCATGCCT
CAACACCATGTATATCGATAAGCCGATGAAGGACCACAACCTTATGCAGGCAATAGCGCGCGTAAACCGTGTTTTCAAGG
ACAAGCCGGGCGGGCTGATTGTCGATTATATCGGGATTGCCCCAAACCTAAAAAAAGCTTTAAGTTTCTATGCAGAGAGC
GGTGGCAAGGGAGTACCTGCTGAAACTCAGGCAAGGGCTGTAGAAATAATGCTTGAAAAGCTAGAAGTTGTCCGGCAGAT
TTTACACGGATTTGATTATACGAGCTTTTTCAAAGCAGAAGTAAAGGACAAGCTTTCTATTATCCTTCGCGCTGAAGATT
TTATTTTATCTACAGATGATAAAAAAGCGCGTTTTATTAAGGAAGTAACCCTCCTAAGCCAGGCTTATGCACTTGCCAAA
CCTGATAAGGCAACGGTTACACATGCAGAGGAAATAGCGTTTTTCCAAGCAGTTAAGGCAAGACTCACTAAATTTGAAAC
AAGCGGCGAAAGTGGCATAAATTACGATTCTGTTATAAAAAACATTGTTGATTCGGCTATTGTATCAGATGAAGTTGTAG
ATATTTTTGATGCTGCTGGTATTGAAAAGCCGGAATTGTCCATTCTCTCAGATGAATTCCTTATGGAAATTAAAGGGATG
AAACATAAGAATCTGGCTATTGAGCTACTAAAAAAGATTTTGTCCGATGAAATAAAAGTACGTTCAAAATATAACCTGAC
AAAATCAAAATCACTTATGGAAATGCTCACTTTAGCACTAAAACGGTATCAGAACAACCTACTTACAACCGCTGAAATCA
TAGAAGAGCTTATCCGCATTGCTAAGGAAATTAAAAACGCCGACAGGAGGGGCGAGGAACTCGGCTTGTCCGAGGACGAG
CTTGCTTTTTATGATGCGCTTGAAACCAATGATAGTGCGGTTAAAGTCTTGGGTGATGAAACCTTGAGAACAATTGCCCG
CGAACTTGCAGATAAAGTGCGTAAAAATGCCACAATTGACTGGACATTAAAAGAAAGTGTCCGCGCAAAACTGATGGTAT
TGGTTCGCCGGACATTAAATAAATACGGTTATCCGCCTGACAAACAGCAGCGCGCCGTGGAAACGGTTATGAAACAGGCA
GAAAACTTAGCGGATATATGGGTTTCTCAAGAAGTGACTTATGATACGAGATTTTTGTCAGATCTTCCAAAAGTAGCGGA
AGAAGTGAGTAATTAG

Protein sequence :
MPKLCESEIEKMAIEELVKLGYEYFSGPDIAPDAPFAERKGYGDVLLKKRLIDAVIQLNPGLPYDVVIEAVNKVSRISSS
NLIGDNETFHKMLVDGVPVEYRKNGDIVGDYVKLVDFSEDGVDNNEFLVVNQFTVIENNNNKRPDILLFINGIPMVLFEL
KNPVDENATIRKAYDQICTYKAIIPSLFTYNEICVISDGLEAKAGSLTAPFSRFSTWKTKDGLNEASRFEDQLTTLIHGL
CNKKTLLDYIRNFITFEKSKTEDKKTKITKVETVKKIAAYHQYYAVNKAVKSTIEAARADGSKKAGVIWHTQGSGKSLSM
VFYAGKLVQNLNNPTIVVITDRNDLDDQLFDTFAGNSDLLRQPPKQAESCEHLKELLKVASGGIVFTTIQKFIPDNDSSV
YELLSERDNIVVIADEAHRTQYGFNAKLREIKDENNQVVGQRIAYGFAKYMRDALPNATFIGFTGTPVEKQDANTPAVFG
NYIDIYDIAQAVEDKVTVKIYYESRLAKVNLTEEGKKLIEEFDRELEEVDEKDEAKAAKMKWAKLEAIVGNKERLATLAK
DIVTHFEDRQKVFQGKAMIVAMSRRIAVDLYNEIIKLRPEWHSDDLDKGAIKVVMTSSSSDGPEMQKHHTTKEQRKMLAQ
RMKDENDPLKIVIVRDMWLTGFDVPCLNTMYIDKPMKDHNLMQAIARVNRVFKDKPGGLIVDYIGIAPNLKKALSFYAES
GGKGVPAETQARAVEIMLEKLEVVRQILHGFDYTSFFKAEVKDKLSIILRAEDFILSTDDKKARFIKEVTLLSQAYALAK
PDKATVTHAEEIAFFQAVKARLTKFETSGESGINYDSVIKNIVDSAIVSDEVVDIFDAAGIEKPELSILSDEFLMEIKGM
KHKNLAIELLKKILSDEIKVRSKYNLTKSKSLMEMLTLALKRYQNNLLTTAEIIEELIRIAKEIKNADRRGEELGLSEDE
LAFYDALETNDSAVKVLGDETLRTIARELADKVRKNATIDWTLKESVRAKLMVLVRRTLNKYGYPPDKQQRAVETVMKQA
ENLADIWVSQEVTYDTRFLSDLPKVAEEVSN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 50
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 50
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 50
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 48