Gene Information

Name : CLK_1539 (CLK_1539)
Accession : YP_001787476.1
Strain : Clostridium botulinum Loch Maree
Genome accession: NC_010520
Putative virulence/resistance : Unknown
Product : type I site-specific deoxyribonuclease HsdR family subfamily
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 2345730 - 2348480 bp
Length : 2751 bp
Strand : -
Note : identified by match to protein family HMM PF04313; match to protein family HMM PF04851; match to protein family HMM TIGR00348

DNA sequence :
ATGGCATACCAAAGCGAAGCAGAGCTTGAGAAACAGTTAATAAAGCAGCTTGAAAGTAAAGGTTACAATAAGGTTAAAAT
AAGCACTGAAGAAGAGTTAATCAAAAATTTCAGAGTACAACTTAACAAATATAATGAGGAAAAACTTGCAGGTACACCTT
TAACAGATAAAGAATTTGAACGTGTTATGCGTAAGGTTGAAGGTAAAAGTATATTTGAAGGTGCAAAAATTTTAAGAGAT
AAATGGCCTTTAAAAAGGGATGATGGGTTAGAAGTTTACATAGAATTTTTTAATTCTAAATCTTTGTGTAAAAACATATT
CCAAGTAACTACTCAAACAACAGTAGTAGGAAAATATACTAATAGATATGATGTAACATTATTAGTTAATGGACTTCCAC
TTGTTCAAATAGAATTAAAGAGAAGAGGAGTAGACTTTAAAGAAGCCTTTAATCAAATACAGCGTTATAGAAAACACTCC
TATCAAGGTTTATATAGATACATACAAATATTTATAGTAAGTAATGGTATGGATACAAAATATTTTTCCAATAGCGACAG
AGATATTTTATATACTCATACCTTCTTTTGGACAGATGAAAAAAATCAAAGAATTAGTAAATTAAATGATTTTACAGATA
CATTTTTAGAAAGAAGTTTTATTTCTAAAATTATTGCTCGTTACATGATAATAAATTATACAGAAAAGATACTAATGGTA
ATGAGACCATACCAAATATATGCTGTAGAAGCTCTTATAAATAGAGCATTAGAGACAAATGGCAACGGATATATATGGCA
CACAACAGGAAGCGGTAAAACTTTAACTTCATTTAAAGCAAGTCAGATACTTTCAAAAGAGCCTAAGATAAAGAAAGTAT
TTTTTCTTGTGGATCGTAAAGATTTAGATTCCCAAACTATAAATGAATTTAATAAATTCGAACCAAAATCAGTAGATGTT
ACAGATAAGACAAGCACTTTAATAAAGCAAATAAAAGACGTAAACAAACCTCTTATAGTAACTACTATACAAAAGATGGC
CAATGCTATAAAGTCGCCTAGATACACAAAGATAATGGAGCAATACAAAGATGAAAAAGTTGCATTTATTATAGATGAGT
GTCATAGATCACAGTTTGGAAGTATGCATATAGCTATTGAAAAGCATTTTAAAAAGGCTCAATATTTTGGATTTACAGGA
ACACCTATACTTAAAGAAAATAAAAGTCAAGATGGTCGTACTACAGCAGATTTGTTTGATGAAATGCTTCATAGTTATCT
TATTAAAGATGCTATAAAAGATAATAATGTTTTAGGATTTTCTGTTGAATATATATCCACTTTTAAAGGTCAATTTGATG
AAAATGATGACACTAAAGTAAAGGCTATTGATAAAAAAGAAGCTTTTATGGATGATGAGCGTATATCACAAATTGCACAG
GACATAATTAAAAATCATAATAAGAAAACAAAAGATAGACAATATACAGCTATATTTGCAGTTGAAAGTATTGAAATGCT
TGTAAAATATTATGATAAGTTTAAAAAATTAGATCATAACTTAAAAATAACAGGCATATTTAGTTATGGAGTAAATGAAG
ACGCAGAAGGGAAAGATGAACATTCAAGAGATAGCTTGGAGGAAATAATTAAAGATTATAATGAAATGTTTGATACAAAG
TATTCTACGGATACATTCCAAGGGTATTTTGCTAATGTATCTAAAAAAGTTAAATCAGGACAAATTGATATACTTATAGT
TGTAAATATGTTTTTAACAGGATTTGATAGCAAAACATTAAACACACTATATATAGATAAAAATTTAGCATATCATTCAT
TAATACAAGCTTATTCAAGAACTAACAGAGTTTACAAATCTACAAAGCCTTATGGAAATATTGTGTGTTATAGAAATTTA
AAGAAAAAGACAGATGAGGCTATAAAATTATTTTCATTAACAGATAATGCTAATGAAGTTCTTATGAAAAGTTATAATCA
CTACTTAGAAGCTTTTAAAGAGAGTGTACTAAACTTATATAAGATAGTTCCAAGACCAGAAGATGTGGATTTTATTGAAG
GTGAAAAAGAGAAAAAAGAATTTATAGTAGCTTTTAGAGAGTTATCTAAAATACTTATAAAATTACAAACCTTTGTAGAA
TTTGAATTTGATGAAGATAAACTTTTAATTAGTGAACAAACTTATCAAGATTTTAAAAGTAAGTATTTGGCTATTTATGA
TTCATTTAAAAATGATGAAGAGGGAAAAGCTTCTATTTTAGATGATATTGATTTTGGCATTGAGCTTATGCATACAGATA
AAATAAATGTAGACTATATAATGAATTTAATAAGAAATATAGATTTTTCTGATAAAGAAAATAAAGAAAAAGATATAAAA
CATATTATTAAGGAATTAGATAGGGCAGATAGTGAACATTTAAGATTAAAGGTAGATTTATTAAAATCATTCCTTCAAGA
AGTAGTTCCAAATCTTACTGAAGAAGATTCTATTGATGATGCTTATAGTAGGTTTGAACAAGTTCAAAGAACTGAAGAAA
TAAAAGCATTCTCAGAGCAAGCAGCAGTTAAAGAGGGCAAACTTAAAGATTACATTTCAGAATATGAATACAGCGGAATG
TTAGATCGCAAAGATATGGGTGATACTATAGAGGGATCATTTTTAAAGAGAAAAAAAGTAGTAAACAAAATAACTACTTT
TATTAAAAATCATGTAGAGAAATTTAGCTAG

Protein sequence :
MAYQSEAELEKQLIKQLESKGYNKVKISTEEELIKNFRVQLNKYNEEKLAGTPLTDKEFERVMRKVEGKSIFEGAKILRD
KWPLKRDDGLEVYIEFFNSKSLCKNIFQVTTQTTVVGKYTNRYDVTLLVNGLPLVQIELKRRGVDFKEAFNQIQRYRKHS
YQGLYRYIQIFIVSNGMDTKYFSNSDRDILYTHTFFWTDEKNQRISKLNDFTDTFLERSFISKIIARYMIINYTEKILMV
MRPYQIYAVEALINRALETNGNGYIWHTTGSGKTLTSFKASQILSKEPKIKKVFFLVDRKDLDSQTINEFNKFEPKSVDV
TDKTSTLIKQIKDVNKPLIVTTIQKMANAIKSPRYTKIMEQYKDEKVAFIIDECHRSQFGSMHIAIEKHFKKAQYFGFTG
TPILKENKSQDGRTTADLFDEMLHSYLIKDAIKDNNVLGFSVEYISTFKGQFDENDDTKVKAIDKKEAFMDDERISQIAQ
DIIKNHNKKTKDRQYTAIFAVESIEMLVKYYDKFKKLDHNLKITGIFSYGVNEDAEGKDEHSRDSLEEIIKDYNEMFDTK
YSTDTFQGYFANVSKKVKSGQIDILIVVNMFLTGFDSKTLNTLYIDKNLAYHSLIQAYSRTNRVYKSTKPYGNIVCYRNL
KKKTDEAIKLFSLTDNANEVLMKSYNHYLEAFKESVLNLYKIVPRPEDVDFIEGEKEKKEFIVAFRELSKILIKLQTFVE
FEFDEDKLLISEQTYQDFKSKYLAIYDSFKNDEEGKASILDDIDFGIELMHTDKINVDYIMNLIRNIDFSDKENKEKDIK
HIIKELDRADSEHLRLKVDLLKSFLQEVVPNLTEEDSIDDAYSRFEQVQRTEEIKAFSEQAAVKEGKLKDYISEYEYSGM
LDRKDMGDTIEGSFLKRKKVVNKITTFIKNHVEKFS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
SSP0054 YP_300144.1 type I site-specific restriction-modification system restriction subunit Not tested SCC15305cap Protein 0.0 56