Gene Information

Name : clpB (BN42_20110)
Accession : YP_007263103.1
Strain : Mycobacterium canettii CIPT 140070010
Genome accession: NC_019951
Putative virulence/resistance : Virulence
Product : Putative endopeptidase ATP binding protein (chain B) ClpB (ClpB protein) (Heat Shock Protein f84.1)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 477497 - 480043 bp
Length : 2547 bp
Strand : -
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; PubMedId : 10493122, 10510226, 11271494, 11385512, 11567012, 12368446, 12657046, 15525680, 17611072; Product type pf : putative factor

DNA sequence :
GTGGACTCGTTTAACCCGACGACCAAGACGCAGGCGGCGCTGACCGCGGCGTTACAGGCGGCTTCGACCGCCGGCAATCC
CGAGATCCGGCCCGCTCACCTGCTGATGGCGCTGCTGACCCAAAACGACGGGATCGCCGCACCGCTGCTGGAGGCTGTCG
GCGTCGAGCCCGCCACCGTCCGCGCAGAAACCCAGCGCCTGCTGGACCGGTTGCCACAGGCCACCGGGGCCAGCACGCAG
CCGCAGCTGTCTCGCGAGTCGCTGGCGGCGATCACCACCGCCCAGCAGCTGGCCACCGAGATAGACGACGAGTACGTCTC
CACCGAGCACGTGATGGTCGGGCTGGCCACCGGTGACTCCGACGTCGCCAAGCTGTTGACCGGCCACGGCGCCTCGCCGC
AGGCGTTGCGGGAGGCGTTCGTCAAGGTGCGCGGCAGCGCCCGGGTCACCAGCCCCGAACCGGAGGCGACGTATCAGGCG
CTGCAGAAGTACTCCACCGACCTGACCGCCCGCGCCCGCGAAGGCAAACTCGACCCGGTCATCGGCCGCGACAACGAGAT
CCGCCGCGTGGTGCAGGTGCTCTCCCGTCGCACCAAGAACAACCCGGTGCTGATCGGTGAGCCCGGCGTCGGCAAGACCG
CGATCGTGGAGGGCCTGGCGCAGCGGATCGTGGCCGGCGACGTGCCGGAGAGTTTGCGGGACAAGACGATCGTCGCGCTC
GATCTCGGCTCGATGGTCGCCGGTTCGAAATACCGCGGCGAATTCGAGGAACGGCTCAAGGCCGTCCTCGACGACATCAA
GAACTCGGCCGGCCAAATCATCACGTTCATCGACGAGCTGCACACCATCGTCGGCGCGGGTGCCACCGGCGAGGGCGCGA
TGGACGCCGGCAACATGATCAAGCCGATGCTGGCCCGCGGCGAGTTACGGCTGGTCGGGGCGACCACACTTGACGAGTAC
CGCAAGCACATCGAGAAGGACGCCGCGCTCGAGCGTCGTTTTCAGCAGGTGTACGTCGGCGAGCCGTCGGTGGAGGACAC
CATCGGCATCCTGCGCGGACTCAAAGACCGCTACGAGGTGCACCACGGGGTGCGCATCACCGACTCGGCACTGGTGGCAG
CTGCCACTTTGAGTGACCGGTATATCACCGCCCGCTTTCTGCCGGACAAGGCCATCGACCTGGTCGACGAGGCGGCCAGC
CGGCTGCGCATGGAGATCGACTCGCGGCCCGTCGAGATCGACGAGGTCGAGCGGCTGGTGCGCCGGCTGGAGATCGAAGA
GATGGCGCTGTCCAAGGAGGAGGACGAGGCGTCGGCGGAGCGGTTGGCCAAGCTGCGCTCCGAGCTGGCCGATCAGAAAG
AGAAGTTGGCCGAACTTACCACCCGCTGGCAGAACGAGAAGAACGCCATCGAAATCGTCCGCGACCTCAAAGAGCAGCTG
GAAGCCCTGCGCGGGGAATCGGAGCGGGCCGAACGCGACGGCGACCTGGCCAAGGCCGCCGAGCTGCGCTACGGACGCAT
CCCCGAGGTGGAGAAGAAACTCGACTCGGCGCTGCCGCAGGCGCAGGCCCGGGAGCAGGTGATGCTCAAGGAGGAGGTCG
GTCCCGACGACATCGCCGACGTGGTGTCGGCGTGGACCGGCATTCCGGCCGGGCGGCTGCTGGAAGGCGAGACCGCCAAG
CTGCTGCGCATGGAAGACGAGCTGGGCAAGCGGGTCATCGGGCAGAAGGCCGCGGTTACCGCAGTCTCTGATGCGGTGCG
GCGCAGCCGGGCCGGGGTGTCCGACCCCAACCGGCCCACCGGGGCGTTCATGTTCCTCGGCCCGACCGGTGTCGGCAAGA
CCGAGCTGGCCAAGGCGCTGGCCGACTTCCTGTTCGACGACGAGCGGGCGATGGTCCGCATCGACATGAGCGAGTACGGC
GAGAAGCACACCGTGGCCCGGTTGATCGGCGCCCCGCCCGGCTATGTGGGATATGAGGCGGGCGGTCAGCTGACCGAGGC
GGTGCGCCGGCGTCCCTACACGGTGGTGCTGTTCGACGAGATCGAGAAGGCGCACCCGGACGTGTTCGACGTGCTGCTGC
AGGTCCTCGACGAGGGCCGGCTCACCGACGGGCACGGCCGCACGGTCGACTTCCGCAACACCATCTTGATCCTGACGTCC
AACCTGGGGTCGGGTGGCAGCGCCGAGCAGGTGCTGGCCGCGGTGCGCGCTACGTTCAAGCCGGAGTTCATCAACCGGCT
CGACGACGTGCTCATCTTTGAGGGTCTCAACCCCGAAGAGCTGGTGCAGATCGTGGACATCCAGCTGGCGCAGCTGGGCA
AGCGGCTGGCGCAGCGGCGGCTGCAGCTGCAGGTCTCGCTGCCGGCCAAGCGCTGGTTGGCGCAGCGCGGATTCGACCCG
GTGTACGGGGCGCGGCCGTTGCGCCGGCTGGTGCAGCAGGCCATCGGTGACCAGCTGGCCAAGATGCTGCTCGCCGGCAC
CGTGCACGACGGCGACACCGTGCCGGTCAACGTCAGCCCCGACGCCGACTCGCTGATCCTGGGCTGA

Protein sequence :
MDSFNPTTKTQAALTAALQAASTAGNPEIRPAHLLMALLTQNDGIAAPLLEAVGVEPATVRAETQRLLDRLPQATGASTQ
PQLSRESLAAITTAQQLATEIDDEYVSTEHVMVGLATGDSDVAKLLTGHGASPQALREAFVKVRGSARVTSPEPEATYQA
LQKYSTDLTARAREGKLDPVIGRDNEIRRVVQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLRDKTIVAL
DLGSMVAGSKYRGEFEERLKAVLDDIKNSAGQIITFIDELHTIVGAGATGEGAMDAGNMIKPMLARGELRLVGATTLDEY
RKHIEKDAALERRFQQVYVGEPSVEDTIGILRGLKDRYEVHHGVRITDSALVAAATLSDRYITARFLPDKAIDLVDEAAS
RLRMEIDSRPVEIDEVERLVRRLEIEEMALSKEEDEASAERLAKLRSELADQKEKLAELTTRWQNEKNAIEIVRDLKEQL
EALRGESERAERDGDLAKAAELRYGRIPEVEKKLDSALPQAQAREQVMLKEEVGPDDIADVVSAWTGIPAGRLLEGETAK
LLRMEDELGKRVIGQKAAVTAVSDAVRRSRAGVSDPNRPTGAFMFLGPTGVGKTELAKALADFLFDDERAMVRIDMSEYG
EKHTVARLIGAPPGYVGYEAGGQLTEAVRRRPYTVVLFDEIEKAHPDVFDVLLQVLDEGRLTDGHGRTVDFRNTILILTS
NLGSGGSAEQVLAAVRATFKPEFINRLDDVLIFEGLNPEELVQIVDIQLAQLGKRLAQRRLQLQVSLPAKRWLAQRGFDP
VYGARPLRRLVQQAIGDQLAKMLLAGTVHDGDTVPVNVSPDADSLILG

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 7e-105 43
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 2e-105 41
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 1e-105 41

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clpB YP_007263103.1 Putative endopeptidase ATP binding protein (chain B) ClpB (ClpB protein) (Heat Shock Protein f84.1) VFG2076 Protein 3e-112 44
clpB YP_007263103.1 Putative endopeptidase ATP binding protein (chain B) ClpB (ClpB protein) (Heat Shock Protein f84.1) VFG2084 Protein 3e-112 42