Gene Information

Name : Clocl_3622 (Clocl_3622)
Accession : YP_005048015.1
Strain : Clostridium clariflavum DSM 19732
Genome accession: NC_016627
Putative virulence/resistance : Unknown
Product : HsdR family type I site-specific deoxyribonuclease
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4246871 - 4250068 bp
Length : 3198 bp
Strand : -
Note : PFAM: Type I restriction enzyme R protein N terminus (HSDR_N); Domain of unknown function (DUF3387); Type III restriction enzyme, res subunit; TIGRFAM: type I site-specific deoxyribonuclease, HsdR family

DNA sequence :
ATGCCGAGCCATAATATATATGAGGATGAATTGGAGCAGGCTGCATTGGAATGGTTTGAGGAACTTGGTTATGAAACCAT
TTTTGCGCCTGATATCTCGCCTGGTGGGGATTATCCTGAACGTTCTGATTATTCAGATGTTATTCTGGAAGAACGTCTTA
AAGATGCTTTAAAGCGCATAAATCCTGATTTACCGCAGGAAGCTCTTGATGACGCTCTGCATCAAATACTTGTACCTCTT
AATCCTGCTCTGATAGACAATAATCATCTTTTTCAGAAGATGGTCACTGATGGAGTGAATGCTACCTTCAGAGCTAATGA
TGGCAGGATAGTAAGTAAACAGGTCAGGATTTTTGATTTTGAGAAAGCAGAGAACAATGATTTTTTAGCTGTTAACCAGT
TTACTGTGATTGAAAACAGAGTTGAAAAGCGTCCTGATGTGGTGGTGTTTGTAAATGGTATCCCCCTTGTAGTTATCGAA
CTGAAAAATTTGGCCAATGAGGATGTAGAAATATCCGATGCATATAATCAGATCCAGAATTATATAAGCACCATTCCATC
TTTATTTACTTATAATGCTTTTGCAGTTATATCCGATGGAGTTAATGCAAGAGCAGGCACAATCACTGCTGATGAAGACC
GGTTCATGATGTGGAGGACGATTGATGGGGATGAAATAGCACCTTTAAGCCGCCCACAGCTTGAAGTCCTGATCAAAGGT
ATGTTTGATAAGGAACGCCTTCTTGATATCATTAAGAACTTTATACTGTTCCAGACTGACGGGAAGGACACATATAAAAT
TCTTGCCGGTTACCACCAGTATCATGCTGTTAACAAGGCGGTTGAGAGTACAATAAGAGCTGCCATCACTGAAGGGGATC
GTAAAATCGGTGTTGTTTGGCATACCCAGGGAAGTGGTAAAAGTCTTTCCATGGTCTTCTATGTTGGCAAGCTTGTTTTG
TCAAAAGAACTTAACAACCCTACAATCGTAGTGATTACTGATCGTAACGACCTTGACGACCAACTGTTTTCCACCTTTGC
TAAATCAAAAGATTTATTGCGCCAAGAACCGGTTCAGGCTCAGGACAGAGCGGATCTTCGAAAGCTCCTGACAAGGGAAT
CTGGAGGCATTATCTTTACAACCATACACAAATTTGCTCCTGAAGAAAAAGGAGACAGCGTGCCAGTTTTAACAGACCGG
GAAAATGTAATTGTAATAGCTGACGAAGCGCATCGAAGCCAATACGGATTCAGGGCGGAAATTGTTAAGGGTGATACAGA
AGCTGATGTAAAATACGGTTATGCAAAATATATGCGTGATGCCCTTCCAAATGCGTCTTTCATTGGATTTACCGGTACAC
CCATATCTCTTGCTGACAAGGATACCAGAGCAGTTTTTGGCGATTATATCGACATTTACGATATGACCAGGGCTGTTGAG
GATGGAACAACGGTACGCATATTCTATGAAAGCCGGATAGCGAAGCTGGAACTGCCCGATGAGCTTAAGCCTGTTATTGA
CGATGAATACGAAGAAATCACCGAATATCAGGAATATACCCAGAAAGAAAAGCTTAAGACCAAATGGTCAAGACTTGAAG
CCATCGTTGGCGCCGAGCAACGGGTTAAAGCCATTGCCAAAGACATCGTGGAGCACTTTGAAAAGCGCCTTGCCGCACAG
GAAACAGAAGTGGGCAAGGGTATGATCGTGGTTATGTCACGTAGAATTGCAATTGACTTATATAAAGCCATTGTTGAACT
GCGTCCCGAATGGCATTCAGACGATATTGACAAAGGTGTTATCAAGGTGGTCATGACCGGAAGTTCTTCCGACCCAAAGG
AATGGCAGCCTTTCATTGGTACCAAAGCTACCAGGGAGCGTATCGCCAAACGAATGAAGGACAACAAGGATGAACTGAAG
CTGGTCATTGTGCGAGACATGTGGCTGACAGGCTTTGATGTGCCCAGCATGCACACCATGTATATCGACAAACCCATGCA
GGGACACAATCTTATGCAGGCTATCGCCCGCGTTAACCGTGTATTCAAAGAGAAGCAGGGAGGATTGATTGTAGACTACA
TCGGCATTGCCGAAAATCTTAAGAATGCACTCAATGATTATACCGAAAGCGACAGGGAAAAGACTGGTGTTGACACTGAA
GTGGCAGCGGCAGTCCTTGTTGAAAAGTATGAACTGATTAAGGAATTACTTCATGGTCATGATTATCAGAAGTTCTTTAC
CGGCTCTGCTTCCGAAAGAATGTCTGCCATTGTTGAAACAATCGACTACATCATCGGATTGCGCGAGGAGCGCAAAAACG
ATTATCTTAAGCTGGTCAGTGAGCTGTCAAGAGCTTACTCCCTTTGCGCGACTACTGATATTGCTGAGAAATTGAATCTT
GAGGTAGGTTTCCATAAAGCGGTTAAATCCGGTATTATCAAGCTTATACCCGAAAATTCCAGGAAGAAAACTGCTGCAGA
AATTGAAGCACAACTTAACCAACTTGTTTCCAAATCAATATCCAGCAATGAGGTTATCGATGTCTTGGATGCCGTTGGGC
TAAACAAGCCGAATATTGCTATACTCTCAGATGAGTTCCTGGAAGAAGTGCGTAACATGAAACAGCGTAACCTTGCTGTC
GAACTTCTTAATAGATTACTTAAGGGCAAAATTAAGACATTCTCAAAACGGAACCTAGTTCAGTCCAGAAAATTTTCAGA
ACTATTGGAGAATGCCATCAGAAAGTACCAGAATCGCACAATAGAAACCACTCAGGTTATTCTGGAGCTTATACAGCTTG
CAAAGGAAATTAATGAAGCACATAAGCGTGGAGAAAACACAGGCCTTACTGAAGATGAACTGGCATTTTACGATGCCCTC
GCTGAAAATGAGTCTGCAAAAGAGGTTATGGGTGACGATATCCTCAAGCAAATAGCCAGGGACTTGACTGAAGCAATACG
TAAGAACATAAGCATTGACTGGTCTATACGTGCCAGCGTTCAAGCAAAGATGAAAATGATTATAAAGAGGCTGCTTAAGC
GCTATGGTTATCCTCCGGACAAAACACCAAAGGCTGTAGAAATCGTAATGGAACAGGCCAAGCTTATGTGTCAAAACGAG
AGTTCCGGGGTTAGGTATCAGTACCAGCTTGAAAAAGAAGATCTGCCAAAAGTAGCGGAGGATAGTTTTGACATATAG

Protein sequence :
MPSHNIYEDELEQAALEWFEELGYETIFAPDISPGGDYPERSDYSDVILEERLKDALKRINPDLPQEALDDALHQILVPL
NPALIDNNHLFQKMVTDGVNATFRANDGRIVSKQVRIFDFEKAENNDFLAVNQFTVIENRVEKRPDVVVFVNGIPLVVIE
LKNLANEDVEISDAYNQIQNYISTIPSLFTYNAFAVISDGVNARAGTITADEDRFMMWRTIDGDEIAPLSRPQLEVLIKG
MFDKERLLDIIKNFILFQTDGKDTYKILAGYHQYHAVNKAVESTIRAAITEGDRKIGVVWHTQGSGKSLSMVFYVGKLVL
SKELNNPTIVVITDRNDLDDQLFSTFAKSKDLLRQEPVQAQDRADLRKLLTRESGGIIFTTIHKFAPEEKGDSVPVLTDR
ENVIVIADEAHRSQYGFRAEIVKGDTEADVKYGYAKYMRDALPNASFIGFTGTPISLADKDTRAVFGDYIDIYDMTRAVE
DGTTVRIFYESRIAKLELPDELKPVIDDEYEEITEYQEYTQKEKLKTKWSRLEAIVGAEQRVKAIAKDIVEHFEKRLAAQ
ETEVGKGMIVVMSRRIAIDLYKAIVELRPEWHSDDIDKGVIKVVMTGSSSDPKEWQPFIGTKATRERIAKRMKDNKDELK
LVIVRDMWLTGFDVPSMHTMYIDKPMQGHNLMQAIARVNRVFKEKQGGLIVDYIGIAENLKNALNDYTESDREKTGVDTE
VAAAVLVEKYELIKELLHGHDYQKFFTGSASERMSAIVETIDYIIGLREERKNDYLKLVSELSRAYSLCATTDIAEKLNL
EVGFHKAVKSGIIKLIPENSRKKTAAEIEAQLNQLVSKSISSNEVIDVLDAVGLNKPNIAILSDEFLEEVRNMKQRNLAV
ELLNRLLKGKIKTFSKRNLVQSRKFSELLENAIRKYQNRTIETTQVILELIQLAKEINEAHKRGENTGLTEDELAFYDAL
AENESAKEVMGDDILKQIARDLTEAIRKNISIDWSIRASVQAKMKMIIKRLLKRYGYPPDKTPKAVEIVMEQAKLMCQNE
SSGVRYQYQLEKEDLPKVAEDSFDI

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 56
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 56
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 56
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 50