Gene Information

Name : clpB (BN45_10429)
Accession : YP_007267027.1
Strain : Mycobacterium canettii CIPT 140070017
Genome accession: NC_019952
Putative virulence/resistance : Virulence
Product : Putative endopeptidase ATP binding protein (chain B) ClpB (ClpB protein) (Heat Shock Protein f84.1)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 470809 - 473355 bp
Length : 2547 bp
Strand : -
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; PubMedId : 10493122, 10510226, 11271494, 11385512, 11567012, 12368446, 12657046, 15525680, 17611072; Product type pf : putative factor
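
The Position and Length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for genome annotations); the helper name `feature_length` is illustrative and not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(470809, 473355)

# A complete coding sequence should be a whole number of codons,
# the last of which is the stop codon.
codons, remainder = divmod(length_bp, 3)

print(length_bp)   # length in bp, to compare against the Length field
print(remainder)   # 0 if the feature is a whole number of codons
print(codons - 1)  # expected number of residues, excluding the stop codon
```

Under that assumption the result agrees with the stated Length of 2547 bp, which corresponds to 849 codons, or 848 residues plus a stop codon.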

DNA sequence :
GTGGACTCGTTTAACCCGACGACCAAGACACAAGCCGCCCTGACCGCGGCGTTACAGGCGGCTTCGACCGCCGGCAATCC
CGAGATCCGGCCCGCTCACCTGCTGATGGCGCTGCTGACCCAAAACGATGGGATCGCCGCACCGCTACTGGAGGCTGTCG
GTGTCGAGCCCGCCACCGTCCGCGCCGAAACCCAGCGCCTGCTCGACCGGTTGCCGCGGGCCAGCGGGGCCAGCACGCAG
CCGCAGCTGTCCCGCGAGTCGTTAGCGGCGATCACCACCGCGCAGCAGCTGGCCACCGAGCTGGACGACGAGTACGTCTC
CACCGAGCACGTGATGGTCGGGCTGGCCACCGGTGACTCCGACGTCGCCAAGCTGTTGACCGGCCACGGCGCCTCGCCGC
AGGCGTTGCGGGAGGCGTTCGTCAAGGTGCGCGGCAGCGCCCGGGTCACCAGCCCCGAACCGGAGGCGACGTATCAGGCG
CTGCAGAAGTACTCCACCGACCTGACCGCTCGCGCCCGCGAAGGCAAACTCGACCCGGTCATCGGCCGCGACAACGAGAT
CCGCCGCGTGGTGCAGGTGCTCTCCCGTCGCACCAAGAACAATCCGGTGCTCATCGGCGAACCCGGCGTCGGCAAGACCG
CGATCGTGGAGGGCCTGGCGCAGCGGATCGTGGCCGGCGACGTGCCGGAGAGCCTACGCGACAAGACGATCGTCGCGCTC
GATCTCGGCTCGATGGTCGCCGGTTCGAAGTACCGCGGAGAATTCGAGGAACGGCTCAAGGCCGTGCTCGATGACATCAA
GAACTCGGCCGGCCAAATCATCACGTTCATCGACGAGCTGCACACCATCGTCGGGGCCGGCGCCACCGGCGAGGGGGCGA
TGGACGCCGGCAACATGATCAAGCCGATGCTGGCCCGCGGCGAGTTACGGCTGGTCGGGGCGACCACACTTGACGAGTAT
CGCAAGCACATCGAGAAGGACGCCGCGCTCGAGCGTCGTTTCCAACAGGTGTACGTCGGCGAGCCGTCGGTGGAGGACAC
CATCGGCATCCTGCGCGGACTCAAGGACCGCTACGAGGTGCACCACGGGGTGCGCATCACCGACTCGGCACTGGTGGCAG
CTGCCACTTTGAGTGACCGGTATATCACCGCCCGCTTTCTGCCGGACAAGGCCATCGACCTGGTCGACGAGGCGGCCAGC
CGGCTGCGGATGGAGATCGACTCGCGGCCCGTCGAGATCGACGAGGTCGAGCGGCTGGTGCGCCGGCTGGAGATCGAAGA
GATGGCGCTGTCCAAAGAAGAAGACGAGGCGTCGGCGGAGCGGTTGGCCAAGCTGCGCTCCGAGCTGGCCGATCAGAAAG
AGAAGTTGGCCGAACTTACTACCCGTTGGCAGAACGAGAAGAACGCGATCGAAATCGTCCGCGAGCTCAAAGAGCAGCTG
GAGGCGCTGCGCGGGGAATCCGAGCGGGCCGAACGCGACGGCGACCTGGCCAAGGCCGCCGAGCTGCGCTACGGGCGCAT
CCCCGAGGTGGAGAAGAAACTCGACTCGGCGCTGCCGCAGGCGCAGGCCCGCGAGCAGGTGATGCTCAAGGAGGAGGTCG
GTCCCGACGACATCGCCGACGTGGTGTCGGCGTGGACCGGCATTCCGGCCGGGCGGCTGCTGGAAGGCGAGACCGCCAAG
CTGCTGCGCATGGAAGACGAGCTGGGCAAGCGGGTCATCGGGCAGAAGGCCGCGGTTACCGCAGTCTCTGATGCGGTGCG
GCGCAGCCGGGCCGGGGTGTCCGACCCCAACCGGCCCACCGGGGCGTTCATGTTCCTCGGCCCGACCGGTGTCGGCAAGA
CCGAGCTGGCCAAGGCGCTGGCCGACTTCCTGTTCGACGACGAGCGGGCGATGGTCCGCATCGACATGAGCGAGTACGGC
GAGAAGCACACCGTGGCTCGGTTGATCGGCGCCCCGCCCGGCTATGTGGGTTATGAGGCGGGCGGTCAGCTGACCGAGGC
GGTGCGCCGGCGTCCCTACACTGTGGTGCTGTTCGACGAGATCGAGAAGGCGCACCCGGACGTGTTCGACGTGCTGCTGC
AGGTGCTCGACGAGGGCCGGCTCACCGACGGGCACGGCCGCACGGTCGACTTCCGCAACACCATCTTGATTCTGACGTCC
AACCTGGGGTCGGGTGGCAGCGCCGAGCAGGTGCTGGCCGCGGTGCGCGCTACGTTCAAGCCGGAGTTCATCAACCGGCT
CGACGACGTGCTCATCTTTGAGGGTCTCAACCCCGAAGAGCTGGTGCGCATCGTCGACATCCAGCTGGCGCAGCTGGGCA
AGCGGCTGGCGCAGCGGCGGCTGCAGCTGCAGGTCTCGCTGCCGGCCAAGCGCTGGTTGGCGCAACGCGGCTTCGACCCG
GTGTACGGGGCGCGGCCGTTGCGCCGGCTGGTGCAGCAGGCCATCGGTGACCAGCTGGCCAAGATGCTGTTGGCCGGCCA
GGTGCACGACGGCGATACCGTGCCGGTCAACGTCAGCCCCGGCGCCGACTCGCTGATCCTGGGCTGA

Protein sequence :
MDSFNPTTKTQAALTAALQAASTAGNPEIRPAHLLMALLTQNDGIAAPLLEAVGVEPATVRAETQRLLDRLPRASGASTQ
PQLSRESLAAITTAQQLATELDDEYVSTEHVMVGLATGDSDVAKLLTGHGASPQALREAFVKVRGSARVTSPEPEATYQA
LQKYSTDLTARAREGKLDPVIGRDNEIRRVVQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLRDKTIVAL
DLGSMVAGSKYRGEFEERLKAVLDDIKNSAGQIITFIDELHTIVGAGATGEGAMDAGNMIKPMLARGELRLVGATTLDEY
RKHIEKDAALERRFQQVYVGEPSVEDTIGILRGLKDRYEVHHGVRITDSALVAAATLSDRYITARFLPDKAIDLVDEAAS
RLRMEIDSRPVEIDEVERLVRRLEIEEMALSKEEDEASAERLAKLRSELADQKEKLAELTTRWQNEKNAIEIVRELKEQL
EALRGESERAERDGDLAKAAELRYGRIPEVEKKLDSALPQAQAREQVMLKEEVGPDDIADVVSAWTGIPAGRLLEGETAK
LLRMEDELGKRVIGQKAAVTAVSDAVRRSRAGVSDPNRPTGAFMFLGPTGVGKTELAKALADFLFDDERAMVRIDMSEYG
EKHTVARLIGAPPGYVGYEAGGQLTEAVRRRPYTVVLFDEIEKAHPDVFDVLLQVLDEGRLTDGHGRTVDFRNTILILTS
NLGSGGSAEQVLAAVRATFKPEFINRLDDVLIFEGLNPEELVRIVDIQLAQLGKRLAQRRLQLQVSLPAKRWLAQRGFDP
VYGARPLRRLVQQAIGDQLAKMLLAGQVHDGDTVPVNVSPGADSLILG
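
The DNA and protein sequences above can be checked against each other by translating the DNA. The sketch below is illustrative only: it builds the standard genetic code from the compact base-order string, and it assumes that an alternative start codon (GTG or TTG) in the first position is read as methionine, as is common for bacterial genes. The function name `translate_cds` is hypothetical. Although the record lists the gene on the minus strand, the sequence as printed begins with GTG and translates directly, so no reverse complement is applied here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: these start codons are translated as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and whitespace
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides of the DNA sequence in this record.
print(translate_cds("GTGGACTCGTTTAACCCGACGACCAAGACA"))  # MDSFNPTTKT
```

Passing the full DNA sequence to the same function should reproduce the full protein sequence, provided the assumptions above hold.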

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
STY0294 | NP_454876.1 | ClpB-like protein | Not tested | SPI-6 | Protein | 8e-105 | 43
aec27 | YP_851418.1 | ATPase | Not tested | PAI II APEC-O1 | Protein | 3e-105 | 41
aec27 | AAQ96721.1 | Aec27 | Not tested | AGI-1 | Protein | 2e-105 | 41

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
clpB | YP_007267027.1 | Putative endopeptidase ATP binding protein (chain B) ClpB (ClpB protein) (Heat Shock Protein f84.1) | VFG2076 | Protein | 3e-112 | 44
clpB | YP_007267027.1 | Putative endopeptidase ATP binding protein (chain B) ClpB (ClpB protein) (Heat Shock Protein f84.1) | VFG2084 | Protein | 1e-111 | 41