Gene Information

Name : clpB (Rv0384c)
Accession : NP_214898.1
Strain : Mycobacterium tuberculosis H37Rv
Genome accession: NC_000962
Putative virulence/resistance : Virulence
Product : Probable endopeptidase ATP binding protein (chain B) ClpB (ClpB protein) (heat shock protein F84.1)
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 459456 - 462002 bp
Length : 2547 bp
Strand : -
Note : Rv0384c, (MTV036.19c), len: 848 aa. Probable clpB (alternate gene name: htpM), endopeptidase ATP-binding protein, chain B, equivalent to AC32007.1|AL583925 heat shock protein from Mycobacterium leprae (848 aa). Also highly similar to others e.g. P53532|CL

DNA sequence :
GTGGACTCGTTTAACCCGACGACCAAGACGCAGGCGGCGCTAACCGCGGCGTTACAGGCGGCTTCGACCGCCGGCAATCC
CGAGATCCGGCCCGCTCACCTGCTGATGGCGCTGCTGACCCAAAACGACGGTATCGCCGCACCGCTACTGGAGGCTGTCG
GTGTCGAGCCCGCCACCGTCCGCGCCGAAACCCAGCGCCTGCTCGACCGTTTGCCGCAGGCGACTGGAGCCAGCACGCAG
CCGCAGCTGTCCCGCGAGTCGTTAGCGGCGATCACCACCGCGCAGCAGCTGGCCACCGAGCTGGACGACGAGTACGTCTC
CACCGAGCACGTGATGGTCGGGCTGGCCACCGGTGACTCCGACGTCGCCAAGCTGTTGACCGGCCACGGCGCCTCGCCGC
AGGCGCTGCGGGAGGCGTTCGTCAAGGTGCGCGGCAGCGCCCGGGTCACCAGCCCCGAACCGGAGGCGACCTATCAGGCG
CTGCAGAAGTACTCCACCGACCTGACCGCCCGCGCCCGCGAAGGCAAACTCGACCCGGTCATCGGCCGCGACAACGAGAT
CCGCCGCGTGGTGCAGGTGCTGTCCCGTCGCACCAAGAACAACCCGGTGCTGATCGGTGAGCCCGGCGTCGGCAAGACCG
CGATCGTGGAGGGCCTGGCGCAGCGCATCGTGGCCGGCGACGTGCCGGAGAGCTTGCGCGACAAGACCATCGTCGCGCTC
GATCTCGGCTCGATGGTCGCCGGCTCCAAATACCGCGGCGAATTCGAGGAACGGCTCAAGGCCGTCCTCGACGACATCAA
GAACTCGGCCGGCCAAATCATCACGTTCATCGACGAGCTGCACACCATCGTCGGCGCCGGCGCCACCGGCGAGGGGGCGA
TGGACGCCGGCAACATGATCAAGCCGATGCTGGCCCGCGGCGAGTTACGGCTGGTCGGGGCGACCACGCTGGACGAATAC
CGCAAGCACATCGAGAAGGACGCCGCGCTCGAGCGCCGTTTCCAACAGGTGTACGTCGGCGAGCCGTCGGTGGAGGACAC
CATCGGCATCCTGCGCGGGCTCAAAGACCGCTACGAGGTGCACCACGGGGTGCGCATCACCGACTCGGCGCTGGTGGCAG
CTGCCACTTTGAGCGACCGGTATATCACCGCCCGCTTCCTGCCCGACAAGGCCATCGACCTGGTCGACGAGGCGGCCAGC
CGGCTGCGGATGGAGATCGACTCGCGGCCCGTCGAGATCGACGAGGTCGAGCGGCTGGTGCGCCGGCTGGAGATCGAAGA
GATGGCGCTGTCCAAAGAAGAAGACGAGGCGTCGGCGGAGCGGTTGGCCAAGCTGCGCTCCGAGCTGGCCGACCAGAAAG
AGAAGTTGGCCGAGCTCACCACCCGCTGGCAGAACGAGAAGAACGCGATCGAAATCGTCCGCGACCTCAAGGAGCAGCTG
GAAGCCCTGCGCGGGGAATCCGAGCGGGCCGAACGCGACGGCGACCTGGCCAAGGCCGCCGAGCTGCGCTACGGACGCAT
CCCCGAGGTGGAGAAGAAGCTCGACGCGGCGTTGCCGCAGGCGCAGGCCCGGGAGCAGGTGATGCTCAAGGAGGAGGTCG
GTCCCGACGACATCGCCGACGTGGTGTCGGCGTGGACCGGCATCCCGGCCGGTCGGCTGCTGGAAGGCGAGACCGCCAAG
CTGCTGCGCATGGAAGACGAGCTGGGCAAGCGGGTCATCGGGCAGAAGGCCGCGGTTACCGCAGTCTCTGATGCGGTGCG
GCGCAGCCGGGCCGGGGTGTCCGACCCCAACCGGCCCACCGGGGCGTTCATGTTCCTCGGCCCGACCGGTGTCGGCAAGA
CCGAGCTGGCCAAGGCGCTGGCCGACTTCCTGTTCGACGACGAGCGGGCGATGGTCCGCATCGACATGAGCGAGTACGGC
GAGAAGCACACCGTGGCTCGGTTGATCGGCGCCCCGCCCGGCTATGTGGGATACGAGGCGGGCGGTCAGCTGACCGAGGC
GGTGCGCCGGCGTCCCTACACGGTGGTGCTGTTCGACGAGATCGAGAAGGCGCACCCGGACGTGTTCGACGTGCTGCTGC
AGGTCCTCGACGAGGGCCGGCTCACCGACGGGCACGGCCGCACGGTCGACTTCCGCAACACCATCTTGATCCTGACGTCC
AACCTGGGGTCGGGTGGCAGCGCCGAGCAGGTGCTGGCCGCGGTGCGCGCTACGTTCAAGCCGGAGTTCATCAACCGGCT
CGACGACGTGCTCATCTTTGAGGGTCTCAACCCCGAAGAGCTGGTGCGCATCGTCGACATCCAGCTGGCGCAGCTGGGCA
AGCGGCTGGCGCAGCGGCGGCTGCAGCTGCAGGTCTCGCTGCCGGCCAAGCGCTGGTTGGCGCAGCGCGGATTCGACCCG
GTGTACGGGGCGCGGCCGTTGCGCCGGCTGGTGCAGCAGGCCATCGGTGACCAGCTGGCCAAGATGCTGTTGGCCGGCCA
GGTGCACGACGGCGATACCGTGCCGGTCAACGTCAGCCCCGACGCCGACTCGCTGATCCTGGGCTGA

Protein sequence :
MDSFNPTTKTQAALTAALQAASTAGNPEIRPAHLLMALLTQNDGIAAPLLEAVGVEPATVRAETQRLLDRLPQATGASTQ
PQLSRESLAAITTAQQLATELDDEYVSTEHVMVGLATGDSDVAKLLTGHGASPQALREAFVKVRGSARVTSPEPEATYQA
LQKYSTDLTARAREGKLDPVIGRDNEIRRVVQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLRDKTIVAL
DLGSMVAGSKYRGEFEERLKAVLDDIKNSAGQIITFIDELHTIVGAGATGEGAMDAGNMIKPMLARGELRLVGATTLDEY
RKHIEKDAALERRFQQVYVGEPSVEDTIGILRGLKDRYEVHHGVRITDSALVAAATLSDRYITARFLPDKAIDLVDEAAS
RLRMEIDSRPVEIDEVERLVRRLEIEEMALSKEEDEASAERLAKLRSELADQKEKLAELTTRWQNEKNAIEIVRDLKEQL
EALRGESERAERDGDLAKAAELRYGRIPEVEKKLDAALPQAQAREQVMLKEEVGPDDIADVVSAWTGIPAGRLLEGETAK
LLRMEDELGKRVIGQKAAVTAVSDAVRRSRAGVSDPNRPTGAFMFLGPTGVGKTELAKALADFLFDDERAMVRIDMSEYG
EKHTVARLIGAPPGYVGYEAGGQLTEAVRRRPYTVVLFDEIEKAHPDVFDVLLQVLDEGRLTDGHGRTVDFRNTILILTS
NLGSGGSAEQVLAAVRATFKPEFINRLDDVLIFEGLNPEELVRIVDIQLAQLGKRLAQRRLQLQVSLPAKRWLAQRGFDP
VYGARPLRRLVQQAIGDQLAKMLLAGQVHDGDTVPVNVSPDADSLILG

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 1e-104 43
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 2e-105 42
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 2e-105 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clpB NP_214898.1 Probable endopeptidase ATP binding protein (chain B) ClpB (ClpB protein) (heat shock protein F84.1) VFG2076 Protein 4e-112 44
clpB NP_214898.1 Probable endopeptidase ATP binding protein (chain B) ClpB (ClpB protein) (heat shock protein F84.1) VFG2084 Protein 1e-111 41