Gene Information

Name : clpC1 (MUL_4178)
Accession : YP_907684.1
Strain : Mycobacterium ulcerans Agy99
Genome accession : NC_008611
Putative virulence/resistance : Virulence
Product : ATP-dependent protease ATP-binding subunit ClpC1
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 4648646 - 4651195 bp
Length : 2550 bp
Strand : -
Note : Also detected in the extracellular matrix by proteomics; cytoplasmic protein; hydrolyses proteins in the presence of ATP. May interact with a ClpP-like protease involved in degradation of denatured proteins.
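
The Position and Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the 2550 bp length reported above; the function names are illustrative and not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


length_bp = feature_length(4648646, 4651195)
print(length_bp)                           # 2550
print(expected_protein_length(length_bp))  # 849
```

The second value matches the number of residues in the protein sequence listed in this record (849).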

DNA sequence :
ATGTTCGAACGCTTTACCGACCGTGCCCGCCGGGTCGTTGTCCTGGCGCAAGAAGAGGCCCGGATGCTGAACCACAACTA
CATCGGCACCGAGCACATTCTGCTGGGGCTAATCCATGAAGGCGAAGGCGTTGCGGCCAAGTCGCTGGAGTCGCTCGGGA
TCTCGCTGGAAGGTGTGCGCAGCCAGGTCGAAGAGATCATCGGCCAGGGCCAGCAGGCACCGTCCGGGCACATCCCGTTC
ACTCCGCGCGCCAAGAAGGTGCTCGAACTCAGCCTGCGCGAGGCGCTGCAGCTCGGCCACAACTACATCGGTACCGAGCA
CATCCTGCTCGGCCTGATTCGCGAGGGCGAGGGTGTGGCCGCCCAGGTGCTGGTCAAGCTGGGCGCCGAACTGACCCGGG
TGCGCCAGCAGGTGATCCAGCTGCTGAGCGGCTACCAGGGCAAGGAGGCCGCCGAGGCAGGCACCGGCGGCCGGGGAGGG
GAGTCCGGCTCTCCCTCCACGTCGCTGGTTCTCGACCAGTTCGGCCGCAACCTGACCGCGGCTGCCATGGAAGGCAAGCT
GGACCCGGTCATCGGCCGCGAGAAGGAAATCGAGCGGGTCATGCAGGTGCTCTCCCGGCGCACCAAGAACAACCCGGTGC
TCATCGGCGAGCCCGGCGTGGGCAAGACCGCTGTGGTCGAGGGCCTGGCGCAGGCCATCGTGCACGGCGAGGTGCCCGAG
ACGCTGAAGGACAAGCAGCTCTACACGCTGGACCTCGGCTCGCTGGTCGCGGGTTCGCGCTACCGTGGTGACTTCGAGGA
ACGCCTGAAGAAGGTGCTCAAGGAGATCAATACCCGCGGCGACATCATCCTGTTCATCGACGAGCTGCACACGCTGGTCG
GGGCGGGTGCCGCCGAGGGCGCGATCGATGCCGCAAGCATCCTCAAGCCCAAGCTGGCTCGCGGCGAGCTGCAGACGATC
GGTGCCACCACCCTCGACGAGTACCGCAAGTACATCGAGAAGGACGCCGCGCTGGAGCGCCGTTTCCAGCCGGTGCAGGT
GGGGGAGCCGACGGTCGAGCACACCATCGAGATCCTCAAGGGCCTGCGGGACCGCTACGAGGCGCACCACCGCGTGTCGA
TCACCGATTCCGCGATGGTGGCCGCCGCGACGCTGGCCGACCGCTACATCAACGACCGGTTCCTGCCGGACAAGGCGATC
GACCTGATCGACGAGGCGGGCGCCCGGATGCGGATCCGTCGCATGACCGCGCCGCCAGACCTGCGCGAGTTCGACGAGAA
GATCGCCGACGCGCGCCGGGAGAAGGAATCGGCGATCGACGCCCAGGACTTCGAGAAGGCGGCGAGCCTGCGCGACCGGG
AGAAGCAACTGGTGGCGCAGCGTGCCGAGCGTGAAAAGCAATGGCGTTCGGGCGATCTCGACGTGGTCGCCGAAGTCGAT
GACGAACAGATCGCCGAAGTGCTGGGCAACTGGACTGGCATCCCGGTGTTCAAGCTCACCGAGGCGGAAACCACCCGCCT
GCTGCGGATGGAAGATGAGCTGCACAAGCGGATCATCGGCCAGGAAGACGCGGTCAAGGCGGTCTCCAAGGCGATCCGCC
GCACCCGCGCCGGGCTGAAGGACCCGAAGCGCCCGTCGGGATCGTTCATCTTCGCCGGCCCGTCCGGTGTCGGTAAGACC
GAGCTGTCCAAGGCGCTGGCCAACTTCTTGTTCGGTGACGACGACGCGCTCATCCAGATCGACATGGGCGAGTTCCACGA
CCGGTTCACCGCGTCCCGGCTCTTCGGTGCTCCGCCCGGATATGTCGGCTACGAAGAGGGCGGCCAGCTCACCGAGAAGG
TGCGGCGCAAGCCGTTCAGCGTGGCGCTCTTCGATGAGATCGAGAAGGCACACCAGGAGATCTCCAACAGTCTGTTGCAG
GTCCTCGAGGACGGCCGGCTCACCGACGGCCAGGGCCGCACGGTCGACTTCAAGAACACCGTGCTGATCTTCACGTCGAA
TCTGGGGACATCCGATATTTCCAAGCCCGTCGGCCTCGGGTTCAGCCAGACTGGCGGCGAGAACGACTACGAGCGGATGA
AGCAGAAGGTCAACGACGAGCTCAAGAAGCACTTCCGGCCGGAGTTCCTCAACCGCATCGACGACATCATCGTCTTCCAC
CAGCTGACTCGCGACGAGATCATCCGGATGGTCGACCTGATGATCGGCCGCGTCGCCAACCAGCTCAAGAGCAGCAAGGA
CATGGCGCTCGAGCTGACCGACAAGGCAAAGTCGCTGCTGGCCAAGCGCGGTTTCGACCCGGTCTTGGGTGCTCGTCCGC
TGCGGCGAACCATTCAGCGTGAGATCGAGGACCAGCTCTCGGAGAAGATCCTCTTCGAGGAGGTCGGTCCGGGACAGGTC
GTCACCGTCGACGTCGACAACTGGGACGGCGAGGGCGCCGGTGAGGACGCGAAATTTACGTTCACCGGTACCCGCAAGCC
GCCGAGTGAGCCGGACCTGGCAAAGGCCGGAGCACACAGCGCCGGGGGACCTGGGCCCACCGAGCAGTAA

Protein sequence :
MFERFTDRARRVVVLAQEEARMLNHNYIGTEHILLGLIHEGEGVAAKSLESLGISLEGVRSQVEEIIGQGQQAPSGHIPF
TPRAKKVLELSLREALQLGHNYIGTEHILLGLIREGEGVAAQVLVKLGAELTRVRQQVIQLLSGYQGKEAAEAGTGGRGG
ESGSPSTSLVLDQFGRNLTAAAMEGKLDPVIGREKEIERVMQVLSRRTKNNPVLIGEPGVGKTAVVEGLAQAIVHGEVPE
TLKDKQLYTLDLGSLVAGSRYRGDFEERLKKVLKEINTRGDIILFIDELHTLVGAGAAEGAIDAASILKPKLARGELQTI
GATTLDEYRKYIEKDAALERRFQPVQVGEPTVEHTIEILKGLRDRYEAHHRVSITDSAMVAAATLADRYINDRFLPDKAI
DLIDEAGARMRIRRMTAPPDLREFDEKIADARREKESAIDAQDFEKAASLRDREKQLVAQRAEREKQWRSGDLDVVAEVD
DEQIAEVLGNWTGIPVFKLTEAETTRLLRMEDELHKRIIGQEDAVKAVSKAIRRTRAGLKDPKRPSGSFIFAGPSGVGKT
ELSKALANFLFGDDDALIQIDMGEFHDRFTASRLFGAPPGYVGYEEGGQLTEKVRRKPFSVALFDEIEKAHQEISNSLLQ
VLEDGRLTDGQGRTVDFKNTVLIFTSNLGTSDISKPVGLGFSQTGGENDYERMKQKVNDELKKHFRPEFLNRIDDIIVFH
QLTRDEIIRMVDLMIGRVANQLKSSKDMALELTDKAKSLLAKRGFDPVLGARPLRRTIQREIEDQLSEKILFEEVGPGQV
VTVDVDNWDGEGAGEDAKFTFTGTRKPPSEPDLAKAGAHSAGGPGPTEQ
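
The protein sequence can be checked against the DNA sequence by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code and that the DNA is listed in coding orientation (it starts with ATG even though the gene lies on the minus strand); the translate helper is hypothetical and written for illustration only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order; '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string in frame 1.

    Whitespace and line breaks are ignored; a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First six and last three codons of the DNA sequence in this record.
print(translate("ATGTTCGAACGCTTTACC"))  # MFERFT
print(translate("GAGCAGTAA"))           # EQ*
```

The outputs agree with the start (MFERFT) and end (TEQ, followed by the stop codon) of the listed protein sequence. Pasting the full 2550 bp sequence into translate should yield the 849 residues plus a terminal stop.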

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
clpC | YP_005163377.1 | ATP-dependent Clp protease ATP-binding subunit | Not tested | Not named | Protein | 0.0 | 81

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
clpC1 | YP_907684.1 | ATP-dependent protease ATP-binding subunit ClpC1 | VFG0079 | Protein | 0.0 | 60
clpC1 | YP_907684.1 | ATP-dependent protease ATP-binding subunit ClpC1 | VFG0080 | Protein | 8e-148 | 51