Gene Information

Name : clpC1 (Rv3596c)
Accession : YP_177995.1
Strain : Mycobacterium tuberculosis H37Rv
Genome accession : NC_000962
Putative virulence/resistance : Virulence
Product : Probable ATP-dependent protease ATP-binding subunit ClpC1
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 4038158 - 4040704 bp
Length : 2547 bp
Strand : -
Note : Rv3596c, (MTCY07H7B.26), len: 848 aa. Probable clpC1, ATP-dependent protease ATP-binding subunit, equivalent to P24428|CLPC_MYCLE probable ATP-dependent CLP protease ATP-binding subunit from Mycobacterium leprae (848 aa) (see Misra et al., 1996), FASTA sco
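
The Position, Length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the final codon is a stop codon; the function name feature_length is illustrative and not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
start, end = 4038158, 4040704

length_bp = feature_length(start, end)  # should match the Length field
codons = length_bp // 3                 # total codons, including the stop codon
protein_len = codons - 1                # residues, assuming one terminal stop codon

print(length_bp, protein_len)  # 2547 848
```

Both values agree with the record: 2547 bp for the gene and 848 aa in the Note field.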

DNA sequence :
ATGTTCGAACGATTTACCGACCGTGCCCGCAGGGTCGTCGTCCTGGCTCAGGAAGAGGCCAGGATGCTCAACCACAACTA
CATCGGCACCGAGCACATTCTTTTAGGCCTGATCCATGAAGGGGAAGGCGTTGCCGCCAAGTCACTGGAGTCGTTGGGGA
TCTCGCTGGAAGGTGTGCGCAGTCAGGTCGAGGAGATCATCGGCCAGGGCCAGCAGGCGCCGTCTGGGCACATTCCGTTT
ACCCCCCGCGCCAAAAAGGTCCTCGAGCTGAGCTTGCGTGAAGCGCTGCAGCTTGGCCACAACTACATCGGGACCGAACA
CATTTTGCTGGGCCTCATCCGAGAGGGTGAAGGCGTGGCCGCCCAGGTGCTGGTCAAGCTGGGCGCCGAGCTGACCCGGG
TGCGCCAGCAGGTGATCCAGCTGCTCTCCGGTTACCAAGGCAAGGAGGCCGCCGAAGCCGGCACCGGCGGCCGCGGGGGA
GAGTCCGGCTCTCCGTCTACGTCCTTGGTGCTCGACCAGTTCGGCCGCAACCTCACGGCGGCGGCGATGGAAGGCAAACT
GGACCCGGTCATCGGCCGCGAGAAGGAAATCGAGCGGGTCATGCAGGTGCTCTCTCGGCGCACCAAGAACAACCCGGTGC
TGATCGGCGAGCCCGGCGTCGGCAAGACCGCGGTCGTCGAAGGACTGGCGCAGGCCATCGTGCACGGCGAGGTGCCCGAG
ACGCTCAAGGACAAGCAGCTCTACACGCTGGATCTGGGATCGCTGGTGGCGGGTAGCCGCTACCGCGGTGACTTCGAGGA
ACGCCTCAAGAAGGTGCTCAAGGAGATCAACACCCGCGGTGACATCATCCTGTTTATCGACGAGCTGCACACCTTGGTCG
GTGCTGGAGCCGCCGAGGGCGCGATCGACGCCGCCTCGATCCTGAAACCGAAGCTCGCTCGCGGTGAACTGCAAACGATC
GGCGCCACCACGCTCGACGAATACCGCAAGTACATCGAGAAGGACGCCGCGCTGGAGCGCCGCTTCCAGCCGGTGCAGGT
GGGTGAGCCGACGGTGGAGCACACCATCGAGATCCTCAAGGGCCTGCGGGACCGGTACGAGGCGCACCACCGGGTGTCGA
TCACCGATGCGGCGATGGTGGCCGCCGCGACCCTGGCCGACCGCTACATCAACGACCGGTTCCTGCCCGACAAGGCGATC
GACCTGATCGACGAGGCGGGTGCTCGGATGCGGATTCGTCGCATGACCGCACCGCCAGACCTACGCGAGTTCGATGAGAA
GATCGCCGAGGCTCGTCGGGAGAAGGAATCGGCTATCGACGCCCAGGACTTCGAGAAGGCCGCCAGCCTGCGCGACCGGG
AGAAGACACTGGTCGCACAGCGTGCTGAGCGCGAAAAGCAGTGGCGTTCAGGCGATCTTGACGTGGTCGCGGAGGTCGAC
GACGAGCAGATCGCCGAGGTGCTGGGCAACTGGACCGGTATCCCGGTGTTCAAGCTCACCGAGGCCGAGACCACCCGGCT
GTTGCGGATGGAAGAAGAGCTGCACAAGCGGATCATCGGGCAAGAGGACGCCGTCAAGGCCGTTTCCAAGGCCATCCGGC
GTACCCGGGCCGGGCTGAAAGACCCCAAGCGCCCGTCGGGCTCGTTCATCTTCGCCGGCCCGTCCGGTGTCGGTAAGACC
GAACTGTCCAAGGCGCTGGCCAACTTCTTGTTCGGTGACGACGACGCGCTTATTCAGATTGACATGGGTGAATTCCACGA
CCGGTTCACCGCGTCGCGGCTATTCGGCGCGCCGCCCGGATACGTCGGCTACGAGGAGGGCGGCCAACTCACCGAGAAGG
TGCGGCGCAAGCCGTTCTCGGTGGTGCTGTTCGACGAGATCGAGAAGGCGCATCAGGAGATCTACAACAGCCTGCTGCAG
GTGCTCGAGGATGGCCGGCTCACCGACGGGCAGGGCCGCACGGTGGACTTCAAGAACACCGTGCTGATCTTTACGTCCAA
TCTGGGCACCTCCGACATCTCTAAGCCGGTCGGTCTGGGCTTTTCCAAGGGCGGCGGTGAGAACGACTACGAGCGGATGA
AACAGAAGGTCAACGACGAGCTGAAGAAACACTTCCGCCCGGAGTTCCTCAACCGCATCGACGACATCATCGTCTTCCAC
CAGCTGACCCGCGAGGAGATCATCCGGATGGTCGACCTGATGATCAGCCGGGTCGCCGGCCAGCTCAAGAGCAAGGACAT
GGCGCTGGTGCTGACCGATGCGGCCAAGGCGCTGCTGGCCAAGCGTGGCTTCGACCCGGTGTTGGGGGCCCGCCCGTTGC
GGCGCACCATCCAGCGTGAGATCGAAGATCAGCTCTCGGAGAAGATCCTCTTCGAGGAGGTCGGGCCGGGTCAGGTGGTC
ACCGTCGACGTGGACAACTGGGACGGTGAAGGTCCCGGCGAGGACGCGGTGTTCACCTTCACCGGTACCCGCAAGCCGCC
GGCCGAGCCGGATCTGGCCAAGGCTGGAGCGCACAGCGCGGGCGGCCCGGAGCCGGCCGCGCGGTAG

Protein sequence :
MFERFTDRARRVVVLAQEEARMLNHNYIGTEHILLGLIHEGEGVAAKSLESLGISLEGVRSQVEEIIGQGQQAPSGHIPF
TPRAKKVLELSLREALQLGHNYIGTEHILLGLIREGEGVAAQVLVKLGAELTRVRQQVIQLLSGYQGKEAAEAGTGGRGG
ESGSPSTSLVLDQFGRNLTAAAMEGKLDPVIGREKEIERVMQVLSRRTKNNPVLIGEPGVGKTAVVEGLAQAIVHGEVPE
TLKDKQLYTLDLGSLVAGSRYRGDFEERLKKVLKEINTRGDIILFIDELHTLVGAGAAEGAIDAASILKPKLARGELQTI
GATTLDEYRKYIEKDAALERRFQPVQVGEPTVEHTIEILKGLRDRYEAHHRVSITDAAMVAAATLADRYINDRFLPDKAI
DLIDEAGARMRIRRMTAPPDLREFDEKIAEARREKESAIDAQDFEKAASLRDREKTLVAQRAEREKQWRSGDLDVVAEVD
DEQIAEVLGNWTGIPVFKLTEAETTRLLRMEEELHKRIIGQEDAVKAVSKAIRRTRAGLKDPKRPSGSFIFAGPSGVGKT
ELSKALANFLFGDDDALIQIDMGEFHDRFTASRLFGAPPGYVGYEEGGQLTEKVRRKPFSVVLFDEIEKAHQEIYNSLLQ
VLEDGRLTDGQGRTVDFKNTVLIFTSNLGTSDISKPVGLGFSKGGGENDYERMKQKVNDELKKHFRPEFLNRIDDIIVFH
QLTREEIIRMVDLMISRVAGQLKSKDMALVLTDAAKALLAKRGFDPVLGARPLRRTIQREIEDQLSEKILFEEVGPGQVV
TVDVDNWDGEGPGEDAVFTFTGTRKPPAEPDLAKAGAHSAGGPEPAAR
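
The relationship between the two sequences above can be illustrated with a small translation routine. This is a minimal sketch, assuming the DNA sequence is shown in coding orientation (it begins with ATG and ends with TAG, even though the gene lies on the minus strand) and that the standard genetic code applies to internal codons; the helper name translate is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    # Step through complete codons only; trailing 1-2 bases are ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last five codons of the DNA sequence in this record.
print(translate("ATGTTCGAACGATTTACCGACCGT"))  # MFERFTDR
print(translate("CCGGCCGCGCGGTAG"))           # PAAR
```

The two outputs match the start (MFERFTDR...) and end (...PAAR) of the protein sequence listed in the record.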

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
clpC | YP_005163377.1 | ATP-dependent Clp protease ATP-binding subunit | Not tested | Not named | Protein | 0.0 | 81

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
clpC1 | YP_177995.1 | Probable ATP-dependent protease ATP-binding subunit ClpC1 | VFG0079 | Protein | 0.0 | 60
clpC1 | YP_177995.1 | Probable ATP-dependent protease ATP-binding subunit ClpC1 | VFG0080 | Protein | 7e-140 | 51