Gene Information

Name : clpB (BN43_10417)
Accession : YP_007286266.1
Strain : Mycobacterium canettii CIPT 140070008
Genome accession: NC_019965
Putative virulence/resistance : Virulence
Product : Putative endopeptidase ATP binding protein (chain B) ClpB (ClpB protein) (Heat Shock Protein f84.1)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 456754 - 459300 bp
Length : 2547 bp
Strand : -
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; PubMedId : 10493122, 10510226, 11271494, 11385512, 11567012, 12368446, 12657046, 15525680, 17611072; Product type pf : putative factor

DNA sequence :
GTGGACTCGTTTAACCCGACGACCAAGACGCAGGCGGCGCTGACCGCGGCGTTACAGGCGGCTTCGACCGCCGGCAATCC
CGAGATCCGGCCCGCTCACCTGCTGATGGCGCTGCTGACCCAAAACGACGGGATCGCCGCACCGCTGCTGGAGGCTGTCG
GCGTCGAGCCCGCCACCGTCCGCGCAGAAACCCAGCGCCTGCTGGACCGGTTGCCACAGGCCACCGGGGCCAGCACGCAG
CCGCAGCTGTCTCGCGAGTCGCTGGCGGCGATCACCACCGCCCAGCAGCTGGCCACCGAGATAGACGACGAGTACGTCTC
CACCGAGCACGTGATGGTCGGGCTGGCCACCGGTGACTCCGACGTCGCCAAGCTGTTGACCGGCCACGGCGCCTCGCCGC
AGGCGTTGCGGGAGGCGTTCGTCAAGGTGCGCGGCAGCGCCCGGGTCACCAGCCCCGAACCGGAGGCGACGTATCAGGCG
CTGCAGAAGTACTCCACCGACCTGACCGCCCGCGCCCGCGAAGGCAAACTCGACCCGGTCATCGGCCGCGACAACGAGAT
CCGCCGCGTGGTGCAGGTGCTCTCCCGTCGCACCAAGAACAACCCGGTGCTGATCGGTGAGCCCGGCGTCGGCAAGACCG
CGATCGTGGAGGGCCTGGCGCAGCGGATCGTGGCCGGCGACGTGCCGGAGAGCTTGCGCGACAAGACCATCGTCGCGCTC
GATCTCGGCTCGATGGTCGCCGGCTCCAAATACCGCGGCGAATTCGAGGAACGGCTCAAGGCCGTCCTCGACGACATCAA
GAACTCGGCCGGCCAAATCATCACGTTCATCGACGAGCTGCACACCATCGTCGGCGCCGGCGCCACCGGCGAGGGCGCGA
TGGACGCCGGCAACATGATCAAGCCGATGCTGGCCCGCGGCGAGTTACGGCTGGTCGGGGCGACCACACTTGACGAGTAC
CGCAAGCACATCGAGAAGGACGCCGCGCTCGAGCGCCGTTTCCAACAGGTGTACGTCGGCGAGCCGTCGGTGGAGGACAC
CATCGGCATCCTGCGCGGGCTCAAAGACCGCTACGAGGTGCACCACGGGGTGCGCATCACCGACTCGGCGCTGGTGGCAG
CTGCCACTTTGAGCGACCGGTATATCACCGCCCGCTTCCTGCCCGACAAGGCCATCGACCTGGTCGACGAGGCGGCCAGC
CGGCTGCGGATGGAGATCGACTCGCGGCCCGTCGAGATCGACGAGGTCGAGCGGCTGGTGCGCCGGCTGGAGATCGAAGA
GATGGCGCTGTCCAAAGAAGAAGACGATGCGTCGGCGGAGCGGCTGGCCAAGCTGCGCTCCGAGCTGGCCGACCAGAAAG
AGAAGCTGGCCGAGCTCACCACCCGCTGGCAGAACGAGAAGAACGCGATCGAAATCGTCCGCGACCTCAAGGAGCAGCTG
GAAGCCCTGCGCGGGGAATCCGAGCGGGCCGAACGCGACGGCGACCTGGCCAAGGCCGCCGAGCTGCGCTACGGGCGCAT
CCCCGAGGTGGAGAAGAAACTCGACGCGGCGCTGCCGCAGGCGCAGGCCCGTGAGCAGGTGATGCTCAAGGAGGAGGTCG
GTCCCGACGACATCGCCGACGTGGTCTCGGCGTGGACCGGCATCCCGGCCGGTCGGCTGCTGGAAGGCGAGACCGCCAAG
CTGCTGCGCATGGAAGACGAGCTGGGCAAGCGGGTCATCGGGCAGAAGGCCGCGGTTACCGCAGTCTCTGATGCGGTGCG
GCGCAGCCGGGCCGGGGTGTCCGACCCCAACCGGCCCACCGGGGCGTTCATGTTCCTCGGCCCGACCGGTGTCGGCAAGA
CCGAGCTGGCCAAGGCGCTGGCCGACTTCCTGTTCGACGACGAGCGGGCGATGGTCCGCATCGACATGAGCGAGTACGGC
GAGAAGCACACCGTGGCTCGGTTGATCGGCGCCCCGCCCGGCTATGTGGGATACGAGGCGGGCGGTCAGCTGACCGAGGC
GGTGCGCCGGCGTCCCTACACGGTGGTGCTGTTCGACGAGATCGAGAAGGCGCACCCGGACGTGTTCGACGTGCTGCTGC
AGGTCCTCGACGAGGGCCGGCTCACCGACGGGCACGGCCGCACGGTCGACTTCCGCAACACCATCTTGATCCTGACGTCC
AACCTGGGGTCGGGTGGCAGCGCCGAGCAGGTGCTGGCCGCGGTGCGCGCTACGTTCAAGCCGGAGTTCATCAACCGGCT
CGACGACGTGCTCATCTTTGAGGGTCTCAACCCCGAAGAGCTGGTGCGCATCGTCGACATCCAGCTGGCGCAGCTGGGCA
AGCGGCTGGCGCAGCGGCGGCTGCAGCTGCAGGTCTCGCTGCCGGCCAAGCGCTGGTTGGCGCAGCGCGGATTCGACCCG
GTGTACGGGGCGCGGCCGTTGCGCCGGCTGGTGCAGCAGGCCATCGGTGACCAGCTGGCCAAGATGCTGTTGGCCGGCCA
GGTGCACGACGGCGATACCGTGCCGGTCAACGTCAGCCCCGACGCCGACTCGCTGATCCTGGGCTGA

Protein sequence :
MDSFNPTTKTQAALTAALQAASTAGNPEIRPAHLLMALLTQNDGIAAPLLEAVGVEPATVRAETQRLLDRLPQATGASTQ
PQLSRESLAAITTAQQLATEIDDEYVSTEHVMVGLATGDSDVAKLLTGHGASPQALREAFVKVRGSARVTSPEPEATYQA
LQKYSTDLTARAREGKLDPVIGRDNEIRRVVQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLRDKTIVAL
DLGSMVAGSKYRGEFEERLKAVLDDIKNSAGQIITFIDELHTIVGAGATGEGAMDAGNMIKPMLARGELRLVGATTLDEY
RKHIEKDAALERRFQQVYVGEPSVEDTIGILRGLKDRYEVHHGVRITDSALVAAATLSDRYITARFLPDKAIDLVDEAAS
RLRMEIDSRPVEIDEVERLVRRLEIEEMALSKEEDDASAERLAKLRSELADQKEKLAELTTRWQNEKNAIEIVRDLKEQL
EALRGESERAERDGDLAKAAELRYGRIPEVEKKLDAALPQAQAREQVMLKEEVGPDDIADVVSAWTGIPAGRLLEGETAK
LLRMEDELGKRVIGQKAAVTAVSDAVRRSRAGVSDPNRPTGAFMFLGPTGVGKTELAKALADFLFDDERAMVRIDMSEYG
EKHTVARLIGAPPGYVGYEAGGQLTEAVRRRPYTVVLFDEIEKAHPDVFDVLLQVLDEGRLTDGHGRTVDFRNTILILTS
NLGSGGSAEQVLAAVRATFKPEFINRLDDVLIFEGLNPEELVRIVDIQLAQLGKRLAQRRLQLQVSLPAKRWLAQRGFDP
VYGARPLRRLVQQAIGDQLAKMLLAGQVHDGDTVPVNVSPDADSLILG

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 1e-104 43
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 2e-105 42
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 1e-105 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clpB YP_007286266.1 Putative endopeptidase ATP binding protein (chain B) ClpB (ClpB protein) (Heat Shock Protein f84.1) VFG2076 Protein 4e-112 44
clpB YP_007286266.1 Putative endopeptidase ATP binding protein (chain B) ClpB (ClpB protein) (Heat Shock Protein f84.1) VFG2084 Protein 8e-112 41