Gene Information

Name : clpB (ECOK1_0226)
Accession : YP_006099473.1
Strain : Escherichia coli IHE3034
Genome accession : NC_017628
Putative virulence/resistance : Virulence
Product : ATP-dependent chaperone protein ClpB
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 250945 - 253704 bp
Length : 2760 bp
Strand : -
Note : identified by match to protein family HMM PF02861; match to protein family HMM PF07724; match to protein family HMM PF07728; match to protein family HMM TIGR03345
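
The Position and Length fields above are consistent if the coordinates are read as an inclusive, 1-based range. A minimal sketch of that check follows, assuming the gene is a complete coding sequence that ends in a stop codon; the helper names span_length and expected_residues are hypothetical and not part of any database tool.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def expected_residues(length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon.

    Assumes the CDS is complete and terminated by a single stop codon.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Coordinates taken from the Position field of this record.
length_bp = span_length(250945, 253704)
residues = expected_residues(length_bp)
print(length_bp, residues)  # 2760 919
```

The 919 residues agree with the length of the protein sequence listed in this record.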

DNA sequence :
ATGATCCAGATTGATTTAGCCACGCTCGTAAAGCGGCTTAACCCCTTTGCAAAACAGGCGCTGGAAATGGCTGCCTCGGA
GTGTATGAGCCAGCAGGCCAGTGAGATCACCGTTGCCCATGTGCTGTTACAGATGCTGGCAATCCCGCGCAACGACGTAA
GGGTGATTGCCGAACGCACAGGCATCAGTGCGGAGGATTTACGCCAGGCGCTGACCGTGGAGAGTTATCCCGGTGGACGT
TCCGCCGAAGGCTATCCCAGCTTCTCCCCGATGCTGATCGAATGGCTCAAGGAGTCATGGCTGCTGGCCTCAGCCCAGAT
GCAGCACAGCGAACTGCGCAGCGGCGTCCTGCTGCTTACTCTGCTGCATTCACCGCTGCGCTATATTCCACCCGCTGCTG
CCCGTCTGCTGACGGCCATAAACCGCGACCAGTTACAGCAGGATTTCGCAGCATGGACAAAAGAATCCGCCGAATCCGTG
GATCTGGCTGGCGGGCAAACGCCCCGCGCAACGGAAACCGGCGACACCCTGCTTGCCCGCTACGCCAAAAACATGACCGC
AGACGCCCGTAACGGCAGGCTTGACCCGGTACTGTGCCGCAACTATGAAATCGACCTGATGATCGACATTCTCTGCCGCC
GCCGTAAAAACAATCCGGTGGTAGTGGGCGAAGCAGGCGTGGGTAAAAGCGCGCTGATTGAAGGGCTGGCGCTGCGCATC
GTGGCAGGCCAGGTGCCGGACAAGCTGAAAAACACCGATATCATGACCCTTGACCTGGGCGCATTGCAGGCCGGTGCGTC
GGTGAAAGGTGAGTTTGAAAAACGCTTCAAGGGGCTGATGGCGGAGGTCATTTTCTCCCCGGTGCCGGTCATTCTGTTTA
TCGACGAAGCACATACCCTGATTGGCGCGGGCAACCAGCAGGGCGGGCTGGATATATCCAACCTGCTCAAACCAGCGCTG
GCGCGCGGCGAGCTGAAAACCATCGCCGCCACCACCTGGAGCGAGTACAAAAAATATTTCGAGAAAGATGCAGCTTTGTC
GCGCCGCTTCCAGTTAGTGAAGGTCAGCGAACCCAACGCTGCCGAAGCCACCATTATTCTGCGCGGTTTGTCGGCGGTCT
ATGAACAATCTCACGGCGTGCTGATTGATGATGACGCCTTGCAGGCCGCTGCGACACTGAGCGAGCGTTATCTCTCCGGG
CGTCAGTTGCCGGACAAAGCCATTGATGTGCTGGACACCGCCTGCGCCCGTGTGGCCATCAACCTGTCATCGCCGCCAAA
ACAAATCTCGGCACTGACCACCCTGAGCCACCAGCAGGAGGCAGAAATTCGCCAGCTTGAGCGCGAGCTTCGCATCGGGC
TGCGTACCGACACCTCACGAATGACTGAGGTGTTGGAGCAGTATGATGAAACGCTGTCGGCGCTGGACGAACTGGAAGTC
GCCTGGCAGCAGCAGACGCTGGTACAGGAAATTATCGCGCTGCGCCAGCAGTTACTGGGCGTGGCAGAAGACGACGTTGC
GTCGTTGCCGGACGCAGATGCCGTGGAGGATACGCCGCCAGAGGCAGAACAGGATAGTACCGATGCCGAATCGGCTGATG
ATGCAGGCAGCGTACAGCCGGAAGAGACCGCTGAAACAGTTTCCCCGGTACAACGGCTGGCACAACTCACTGCCGAACTG
GACGCCCTGCATAACGACCAGTTGCTGGTTTCCCCGCATGTCGATAAAAAACAGATTGCAGCGGTGATTGCTGAATGGAC
CGGCGTGCCGCTTAACCGCCTGTCACAGAATGAGATGTCGGTCATCACCGACCTGCCGGTATGGCTCGGCGGCACCATCA
AGGGCCAGGACCTGGCGATTGCCAGCCTGCATAAACACCTGCTGACCGCACGCGCCGACCTGCGCCGTCCGGGACGCCCG
CTTGGTGCGTTCCTGCTGGCTGGCCCCAGCGGCGTGGGTAAAACCGAAACCGTCCTGCAACTGGCGGAACTGCTCTATGG
CGGACGTCAATACCTGACCACCATCAATATGTCTGAGTTCCAGGAGAAACACACCGTTTCGCGGCTGATTGGCTCTCCTC
CGGGTTATGTCGGCTACGGTGAAGGCGGCGTACTGACCGAAGCGATTCGCCAGAAACCGTACTCGGTGGTGCTGCTCGAT
GAAGTAGAAAAAGCGCACCCGGATGTACTCAACCTGTTCTACCAGGCGTTTGACAAGGGCGAGATGGCAGACGGTGAAGG
CCGCCTGATTGACTGTAAAAATATCGTTTTCTTCCTGACCTCCAACCTTGGCTACCAGGTGATTGTCGAACACGCCGATG
ACCCGGAAACCATGCAGGAAGCCCTCTATCCGGTGCTGGCGGACTTCTTTAAACCTGCTCTGCTGGCGCGCATGGAAGTG
GTGCCTTACCTACCGCTGTCGAAAGAGACGCTCGCTACCATTATTGCCGGGAAACTGGCCCGCCTGGATAACGTGCTGCG
CAGCCGCTTTGGCGCGGACGTGATTATTGGGCCGGAAGTGACGGACGAAATCATGAGCCGCGTCACCCGCGCGGAAAACG
GCGCGAGGATGCTGGAATCGGTTATCGACGGCGACATGCTGCCGCCGCTCTCGCTGCTGCTGTTGCAGAAAATGGCGGCC
AACACGGCGATTGCCCGGATTCGCCTGTCGGCGGCAGACGGTGCGTTCACGGCAGATGTGGAAGATGCGCTGCCTGATGA
TGCCGTGACACCGCAAACAGAGGACGAAACGGTTTTATGA

Protein sequence :
MIQIDLATLVKRLNPFAKQALEMAASECMSQQASEITVAHVLLQMLAIPRNDVRVIAERTGISAEDLRQALTVESYPGGR
SAEGYPSFSPMLIEWLKESWLLASAQMQHSELRSGVLLLTLLHSPLRYIPPAAARLLTAINRDQLQQDFAAWTKESAESV
DLAGGQTPRATETGDTLLARYAKNMTADARNGRLDPVLCRNYEIDLMIDILCRRRKNNPVVVGEAGVGKSALIEGLALRI
VAGQVPDKLKNTDIMTLDLGALQAGASVKGEFEKRFKGLMAEVIFSPVPVILFIDEAHTLIGAGNQQGGLDISNLLKPAL
ARGELKTIAATTWSEYKKYFEKDAALSRRFQLVKVSEPNAAEATIILRGLSAVYEQSHGVLIDDDALQAAATLSERYLSG
RQLPDKAIDVLDTACARVAINLSSPPKQISALTTLSHQQEAEIRQLERELRIGLRTDTSRMTEVLEQYDETLSALDELEV
AWQQQTLVQEIIALRQQLLGVAEDDVASLPDADAVEDTPPEAEQDSTDAESADDAGSVQPEETAETVSPVQRLAQLTAEL
DALHNDQLLVSPHVDKKQIAAVIAEWTGVPLNRLSQNEMSVITDLPVWLGGTIKGQDLAIASLHKHLLTARADLRRPGRP
LGAFLLAGPSGVGKTETVLQLAELLYGGRQYLTTINMSEFQEKHTVSRLIGSPPGYVGYGEGGVLTEAIRQKPYSVVLLD
EVEKAHPDVLNLFYQAFDKGEMADGEGRLIDCKNIVFFLTSNLGYQVIVEHADDPETMQEALYPVLADFFKPALLARMEV
VPYLPLSKETLATIIAGKLARLDNVLRSRFGADVIIGPEVTDEIMSRVTRAENGARMLESVIDGDMLPPLSLLLLQKMAA
NTAIARIRLSAADGAFTADVEDALPDDAVTPQTEDETVL
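
The protein sequence above corresponds to a direct translation of the DNA sequence, which is already given in the coding orientation even though the gene lies on the minus strand. The sketch below illustrates this with the standard genetic code (the bacterial table uses the same codon assignments); the function name translate is hypothetical, and only the first seven codons of the record are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the DNA sequence in this record.
print(translate("ATGATCCAGATTGATTTAGCC"))  # MIQIDLA
```

Passing the full 2760 bp sequence to the same function should reproduce the protein sequence listed in this record.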

• Homologs from PAI DB

Gene     GenBank Accn  Product            Virulence or Resistance  PAI or REI      Alignment Type  E-value  Identity (%)
aec27    YP_851418.1   ATPase             Not tested               PAI II APEC-O1  Protein         0.0      100
aec27    AAQ96721.1    Aec27              Not tested               AGI-1           Protein         0.0      100
STY0294  NP_454876.1   ClpB-like protein  Not tested               SPI-6           Protein         6e-123   41

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn    Product                               ID of source DB  Alignment Type  E-value  Identity (%)
clpB  YP_006099473.1  ATP-dependent chaperone protein ClpB  VFG2084          Protein         0.0      57
clpB  YP_006099473.1  ATP-dependent chaperone protein ClpB  VFG2076          Protein         8e-137   41