Gene Information

Name : clpC (CMM_0857)
Accession : YP_001221597.1
Strain : Clavibacter michiganensis NCPPB 382
Genome accession : NC_009480
Putative virulence/resistance : Virulence
Product : ATP-dependent protease, ATPase subunit
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 976162 - 978672 bp
Length : 2511 bp
Strand : +
Note : ATP-dependent protease, ATPase subunit (NP_627581.1| putative Clp-family ATP-binding protease [Streptomyces coelicolor A3(2)]; NP_959395.1| ClpC [Mycobacterium avium subsp. paratuberculosis str. k10]). pfam02861, Clp_N, Clp amino terminal domain (twice).
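
The Position and Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the stated length but is not spelled out in the record; the variable names are illustrative only.

```python
# Coordinates copied from the Position field of this record.
# Assumption: both ends are 1-based and inclusive.
start, end = 976162, 978672

length_bp = end - start + 1   # inclusive span, so add 1
codons = length_bp // 3       # number of complete codons in the ORF
residues = codons - 1         # assumes the final codon is a stop codon

print(length_bp, codons, residues)  # prints: 2511 837 836
```

The result agrees with the Length field (2511 bp) and with the 836-residue protein sequence listed in this record.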

DNA sequence :
ATGTTCGAGAGATTCACCGACCGCGCTCGTCGCGTCGTCGTCCTGGCCCAAGAAGAGGCCAAGATGCTCAACCACAACTA
CATCGGGACCGAGCACATCCTGCTCGGCCTCATCCACGAGGGCGAAGGCGTGGCCGCCAAGGCCCTGGAGTCGCTCGGCA
TCTCCCTCGATGCCGTCCGTGAACAGGTCCAGGACATCATCGGCCAGGGCCAGCAGCAGCCCACGGGGCACATCCCGTTC
ACGCCGCGCGCCAAGAAGGTCCTGGAGCTGTCGCTCCGCGAGGCCCTCCAGCTCGGCCACAACTACATCGGCACCGAGCA
CATCCTCCTCGGCCTGATCCGCGAGGGCGAGGGCGTCGCCGCGCAGGTGCTCGTCAAGCTCGGCGCCGACCTCAACCGCG
TGCGCCAGCAGGTCATCCAGCTCCTGTCCGGCTACCAGGGCAAGGAGGCGGTCGCCGTCGGCGGCGAGGCCCAGCAGAGC
CAGCAGGCGGGCTCCACGGTCCTCGACCAGTTCGGGCGCAACCTCACGCAGGCCGCGCGCGACGGCAAGCTCGACCCCGT
CATCGGGCGCGAGAAGGAGATCGAGCGCGTGATGCAGATCCTGTCGCGCCGCTCCAAGAACAACCCCGTCCTCATCGGCG
AGCCCGGCGTCGGCAAGACCGCCGTCGTCGAGGGCCTGGCGCAGGCCATCGTCAAGGGCGACGTCCCGGAGACGCTGAAG
GACAAGCAGCTCTACACGCTCGACCTCGGCTCGCTCATCGCCGGTTCCCGCTACCGCGGCGACTTCGAGGAGCGCCTCAA
GAAGGTCACCAAGGAGATCCGCACGCGCGGGGACATCATCACCTTCATCGACGAGATCCACACCCTCGTCGGCGCGGGTG
CCGCCGAGGGCGCGATCGACGCGGCCAGCATCCTCAAGCCGCTCCTCGCGCGCGGCGAGCTGCAGACCATCGGCGCCACC
ACGCTCGACGAGTACCGCAAGCACTTCGAGAAGGACGCGGCCCTCGAGCGCCGCTTCCAGCCCATCCAGGTGCAGGAGCC
CTCGCTGCCCCACACCATCAACATCCTCAAGGGCCTGCGCGACCGGTACGAGGCGTTCCACAAGGTGTCCATCACCGACG
GCGCCATCGTGTCCGCGGCGAACCTCGCGGACCGCTACATCGCCGACCGGTTCCTCCCGGACAAGGCCATCGACCTGATC
GACGAGGCCGGCGCCCGCCTGCGCCTCTCGATCCTGTCGGCGCCGCCGGAGCTGCGCGAGTTCGACGAGCGCATCTCCAC
GGTCCGCGTGGCCAAGGAGACCGCCATCGAGGACCAGGACTTCGAGAAGGCCGCGAGCCTGCGCGACGAGGAGAAGAACC
TCCTCGGCGAGCGCCTCCGCCTCGAGAAGCAGTGGCGCTCGGGCGACGTCCGCACCACCGCAGAGGTCGACGAGGGCCTG
ATCGCCGAGGTGCTGGCGCAGGCCACGGGCATCCCGGTCTTCAAGCTCACGGAGGAGGAGTCCTCGCGCCTCGTCTTCAT
GGAGAAGGCCCTGCACCAGCGGGTCATCGGCCAGGAGGAGGCCATCTCGGCCCTGTCCAAGACCATCCGCCGCACCCGCG
CCGGACTCAAGGACCCGCGCCGTCCCTCGGGATCGTTCATCTTCGCCGGCCCCACGGGCGTCGGCAAGACGGAGCTCGCG
AAGGCCCTGGCGGAGTTCCTGTTCGACGACGAGGACGCCCTCATCTCGCTCGACATGAGCGAGTACGGCGAGAAGCACAC
TGTGAGCCGCCTCTTCGGCGCCCCTCCCGGATTCGTCGGCTTCGAGGAGGGCGGGCAGCTCACCGAGAAGGTGCGCCGCA
AGCCGTTCTCCGTGGTGCTCTTCGACGAGATCGAGAAGGCCCACCCGGACATCTTCAACTCGCTCCTCCAGATCCTGGAG
GAGGGACGCCTGACGGATGGCCAGGGCCGCGTGGTCGACTTCAAGAACACGGTCATCATCATGACCACCAACCTCGGCAC
CAAGGACATCACGGGTGCCCCGGTCGGGTTCCAGGTCGAGAACAACGCCGCGAACTCGTACGAGCGCATGAAGGGCAAGG
TCAGCGAGGAGCTGAAGAAGAACTTCAAGCCCGAGTTCCTCAACCGCGTGGACGACACCATCGTCTTCCCGCAGCTGTCG
AAGCCCGAGCTGCTCCAGATCGTCGACCTGTTCGTGAAGCGACTGTCGGACCGCATGATGGACCGCGACCTCACGATCAC
GCTCGAGACCGCCGCGAAGGAGCGACTCATCGAGGTCGGCTTCGACCCGTCGCTCGGCGCCCGGCCGCTGCGCCGCGCGG
TGCAGCACGAGATCGAGGACCGTCTGTCCGAGCGCATCCTGCAGGGCGAGCTCAACGCGGGCGACCACGTGCACGTCGAC
TACGTGGACGGCCAGTTCACGTTCGTCACGACGCAGCGCGAGGGCATCTCGGTCGCGGCCGGCATCGGCACCGGGACCGG
CACGCCGGACCTCGCCATCACGAGCGAGTAG

Protein sequence :
MFERFTDRARRVVVLAQEEAKMLNHNYIGTEHILLGLIHEGEGVAAKALESLGISLDAVREQVQDIIGQGQQQPTGHIPF
TPRAKKVLELSLREALQLGHNYIGTEHILLGLIREGEGVAAQVLVKLGADLNRVRQQVIQLLSGYQGKEAVAVGGEAQQS
QQAGSTVLDQFGRNLTQAARDGKLDPVIGREKEIERVMQILSRRSKNNPVLIGEPGVGKTAVVEGLAQAIVKGDVPETLK
DKQLYTLDLGSLIAGSRYRGDFEERLKKVTKEIRTRGDIITFIDEIHTLVGAGAAEGAIDAASILKPLLARGELQTIGAT
TLDEYRKHFEKDAALERRFQPIQVQEPSLPHTINILKGLRDRYEAFHKVSITDGAIVSAANLADRYIADRFLPDKAIDLI
DEAGARLRLSILSAPPELREFDERISTVRVAKETAIEDQDFEKAASLRDEEKNLLGERLRLEKQWRSGDVRTTAEVDEGL
IAEVLAQATGIPVFKLTEEESSRLVFMEKALHQRVIGQEEAISALSKTIRRTRAGLKDPRRPSGSFIFAGPTGVGKTELA
KALAEFLFDDEDALISLDMSEYGEKHTVSRLFGAPPGFVGFEEGGQLTEKVRRKPFSVVLFDEIEKAHPDIFNSLLQILE
EGRLTDGQGRVVDFKNTVIIMTTNLGTKDITGAPVGFQVENNAANSYERMKGKVSEELKKNFKPEFLNRVDDTIVFPQLS
KPELLQIVDLFVKRLSDRMMDRDLTITLETAAKERLIEVGFDPSLGARPLRRAVQHEIEDRLSERILQGELNAGDHVHVD
YVDGQFTFVTTQREGISVAAGIGTGTGTPDLAITSE
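
The protein sequence corresponds to a direct translation of the DNA sequence on the + strand. The record does not state how the translation was produced, so the following is only a minimal sketch: it assumes the standard codon assignments (which the bacterial translation table shares for amino acids), and the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna, to_stop=True):
    """Translate a DNA string in frame 1; whitespace and line breaks are ignored."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon reached: end of the coding sequence
        protein.append(aa)
    return "".join(protein)


# First six codons of the DNA sequence in this record.
print(translate("ATGTTCGAGAGATTCACC"))  # prints: MFERFT
# Last four codons, ending in the TAG stop codon.
print(translate("ACGAGCGAGTAG"))        # prints: TSE
```

Pasting the full DNA sequence (line breaks included) into `translate` should reproduce the protein sequence above under these assumptions.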

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
clpC | YP_005163377.1 | ATP-dependent Clp protease ATP-binding subunit | Not tested | Not named | Protein | 0.0 | 71

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
clpC | YP_001221597.1 | ATP-dependent protease, ATPase subunit | VFG0079 | Protein | 0.0 | 59
clpC | YP_001221597.1 | ATP-dependent protease, ATPase subunit | VFG0080 | Protein | 4e-154 | 53
clpC | YP_001221597.1 | ATP-dependent protease, ATPase subunit | VFG2084 | Protein | 3e-94 | 41