Gene Information

Name : c3392 (c3392)
Accession : NP_755267.1
Strain : Escherichia coli CFT073
Genome accession: NC_004431
Putative virulence/resistance : Virulence
Product : ClpB protein
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 3229545 - 3232181 bp
Length : 2637 bp
Strand : +
Note : Heat shock protein F84.1; Escherichia coli K-12 ortholog: b2592; Escherichia coli O157:H7 ortholog: z0254
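
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length_bp = feature_length(3229545, 3232181)
    print(length_bp)                  # 2637, matching the Length field
    print(protein_length(length_bp))  # 878 residues expected in the protein
```

Under these assumptions, the 2637 bp reported in the Length field correspond to 879 codons, that is, 878 amino acids plus one stop codon.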

DNA sequence :
ATGACAGGAAATCACCCCGCCGCGCTGCTGCGTCGCCTTAACCCATACTGTGCACGGGCGCTGGACGCTGCCGCCTCACT
GTGTCAGACCCGCGCCCATGCGGAAATAACCATTGAACACTGGCTGCTGAAACTGCTGGAGCAGGGAGAAGGCGATATCA
CGGTGATTGCCCGCCGCTATGAATGGGATATCGACACGCTCTGGCAGTCTCTGCTGGCACATCTGGACACCTTACCCCGC
CCGGTCCGCGAACGTCCTCAACTTTCTGAACCGCTGGCAGCGCTTATCCGACAGGCGTGGCTGATAGCGTCACTGGAAGG
CGACGATCCACAAATCCGCAGCCAGCATCTGCTGATGGCGCTGACAGAAAAACCGATGCTGCCCGCCTGTAATGACCTGT
GGGTATTGCTGAGTCTGAGCCGCGTGCAGCTTGAGCGGCTGCGTCCCCTGCTGGATGCGCAGTCGGATGAATGTCCGGCA
CGTCAGCCACAGGTCACCGAACCGCTGACCTCTGCACTGCCGGAGACGGCAACGGCGGACGCACCGGCAAAAACGCTGAC
GGAGAAACAGGATGACGCCCTGCTGGCGGTGCTTAACCGCTTTACCGAAGACGTGACGGAAAAAGCCCGCAGCGGGCGAA
TCGACCCGGTATTCGGGCGCGACACGGAAATTCGCCAGATGGTCGATATCCTCTCCCGTCGCCGCAAAAACAACCCGATT
CTGGTGGGAGAACCGGGGGTGGGCAAAACCGCGCTGGTGGAAGGGCTGGCGCTGCGTATCACCGAAGGCAACGTGCCGGA
CAGCCTGAAAACGGTGCATATCCGCACACTGGACCTCGGTCTGTTACAGGCTGGCGCGGGCGTTAAAGGTGAATTTGAAC
AGCGGCTGAAAAATGTCATCGATGCAGTGCAGAAATCACCGGAGCCGGTACTGCTGTTTATTGATGAAGCCCATACCATT
ATCGGTGCGGGTAATCAGGCAGGCGGCGCGGATGCGGCGAACCTGCTGAAACCGGCACTGGCAAGGGGCGAACTGCGCAC
CATCGCGGCGACCACGTGGAGCGAATACAAACAGTATTTTGAGCGCGACGCCGCGCTGGAGCGCCGCTTCCAGATGGTTA
AGGTTGACGAGCCGGATGATGACACCGCCTGCCTGATGCTCAGGGGGCTGAAAGCCCGCTATGCGCAGCACCACGGCGTA
CATATGCTGGACAGCGCCATTCAGACCGCCGTGCGCCTGTCGCGCCGCTATCTGACCGGACGCCAGTTGCCGGACAAGGC
GGTTGATTTGCTGGATACCGCCGGGGCAAGAGTCCGCATGAGCCTTGACACCCTGCCGGAACCGTTGACGCAGCTTCATG
TGCGACTGGCGGCACTGGATATTGAGCGGGAAGCGATTGAGCAGGACAGCGTATTTTATCCCGAAGCCAGCCCGGAGCGG
CTGGCGGAACTGACCGATTTGCGTGATGAGCTACAGGCAGAAGCCGGGCATCTGGAAGCGCAGTATCAGCAGGAAAAGGC
ACTGGCGCAGCAGATTATGACGTTGCGTCAGGAAGGGACAGACAGCACTGAACTGCAACAGCAACTGCGAACGCATCAGG
GCTTTGCACCGCTGCTGGCGCTGGATGTGGACGCCCGCGCCGTCGCCACGGTGGTGGCGGACTGGACCGGCATCCCGCTA
TCTTCCCTGCTCAAGGACGAGCAGAGCGACCTGCTCAGTATGGAACAGAGTCTTGAAAACCGCGTTGTCGGGCAAAGCCC
GGCGCTCTGCGCCATCGCACAGCGGCTGCGGGCGGCTAAGACCGGCCTCACGCCGGAGAACGGCCCGCAGGGGGTATTCC
TGCTGACCGGCCCCAGCGGCACCGGTAAAACCGAAACTGCGCTCACACTGGCCGACACTCTGTTTGGCGGTGAAAAATCC
CTTATCACCATCAACCTCTCCGAATATCAGGAGCCGCATACCGTCTCCCAGCTTAAGGGGTCTCCTCCGGGCTACGTTGG
CTACGGTCAGGGTGGCGTACTCACCGAAGCCGTTCGCAAACGCCCTTACAGCGTGGTGCTGCTCGACGAAGTGGAAAAGG
CGCACCGCGACGTAATGAACCTGTTCTATCAGGTATTTGACCGGGGCTTTATGCGCGACGGCGAAGGCCGGGAAATTGAC
TTCCGCAACACCGTGATTCTGATGACGGCTAATCTGGGCAGCGACCACATCATGCAACTGCTGGAGGAAAAACCGGACGC
CACGGACGCAGACCTGCATGAACTGCTGTATCCCCTGCTGCGCGAGCACTTCCAGCCAGCACTGATGGCGCGTTTTCAGA
CGGTGATTTACCGCCCGCTGGGACAGGAGGCGATGCGCGCCATTGTGGAAATGAAACTGGCGCAGGTGGCCCGCCGTCTT
CACCAGCACTATGGGCTGGAAACGGAAATCAGTAACAGCCTGTACGACGCCCTGACCGCCGCCTGCCTGCTGCCGGACAC
CGGTGCGCGTAATATCGACAGCCTGCTGAACCAGCAAATCCTGCCGGTCTTAAGCCAGCAGTTGCTGGCGCAGCAGGCCG
TGCATCATAAGCCTGCCCGACTGCGGCTTGACTGGGATGATGAAGACGGGATTGTGCTGGAATTTGATGAGAAATAA

Protein sequence :
MTGNHPAALLRRLNPYCARALDAAASLCQTRAHAEITIEHWLLKLLEQGEGDITVIARRYEWDIDTLWQSLLAHLDTLPR
PVRERPQLSEPLAALIRQAWLIASLEGDDPQIRSQHLLMALTEKPMLPACNDLWVLLSLSRVQLERLRPLLDAQSDECPA
RQPQVTEPLTSALPETATADAPAKTLTEKQDDALLAVLNRFTEDVTEKARSGRIDPVFGRDTEIRQMVDILSRRRKNNPI
LVGEPGVGKTALVEGLALRITEGNVPDSLKTVHIRTLDLGLLQAGAGVKGEFEQRLKNVIDAVQKSPEPVLLFIDEAHTI
IGAGNQAGGADAANLLKPALARGELRTIAATTWSEYKQYFERDAALERRFQMVKVDEPDDDTACLMLRGLKARYAQHHGV
HMLDSAIQTAVRLSRRYLTGRQLPDKAVDLLDTAGARVRMSLDTLPEPLTQLHVRLAALDIEREAIEQDSVFYPEASPER
LAELTDLRDELQAEAGHLEAQYQQEKALAQQIMTLRQEGTDSTELQQQLRTHQGFAPLLALDVDARAVATVVADWTGIPL
SSLLKDEQSDLLSMEQSLENRVVGQSPALCAIAQRLRAAKTGLTPENGPQGVFLLTGPSGTGKTETALTLADTLFGGEKS
LITINLSEYQEPHTVSQLKGSPPGYVGYGQGGVLTEAVRKRPYSVVLLDEVEKAHRDVMNLFYQVFDRGFMRDGEGREID
FRNTVILMTANLGSDHIMQLLEEKPDATDADLHELLYPLLREHFQPALMARFQTVIYRPLGQEAMRAIVEMKLAQVARRL
HQHYGLETEISNSLYDALTAACLLPDTGARNIDSLLNQQILPVLSQQLLAQQAVHHKPARLRLDWDDEDGIVLEFDEK
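
The relationship between the DNA and protein sequences above can be illustrated with a minimal translation sketch. It assumes the standard codon assignments (which the bacterial genetic code shares for sense codons) and reads the sequence in frame from its first base; the `translate` helper is hypothetical and written only for this illustration. Only the first 78 nucleotides of the DNA sequence are used here to keep the example short.

```python
from itertools import product

# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Trailing bases that do not form a complete codon are ignored.
    """
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 78 nt of the c3392 DNA sequence listed above.
    head = ("ATGACAGGAAATCACCCCGCCGCGCTGCTGCGTCGCCTTAACCCATACTG"
            "TGCACGGGCGCTGGACGCTGCCGCCTCA")
    print(translate(head))  # MTGNHPAALLRRLNPYCARALDAAAS
```

The output matches the first 26 residues of the protein sequence listed above. Translating the final 12 nucleotides (`GATGAGAAATAA`) in the same way gives `DEK`, the last three residues, followed by the stop codon.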

• Homologs from PAI DB

Gene     GenBank Accn  Product            Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity (%)
STY0294  NP_454876.1   ClpB-like protein  Not tested               SPI-6       Protein         5e-142  45

• Homologs from VFDB (virulence genes)

Gene   GenBank Accn  Product       ID of source DB  Alignment Type  E-val   Identity (%)
c3392  NP_755267.1   ClpB protein  VFG2076          Protein         2e-149  44
c3392  NP_755267.1   ClpB protein  VFG2084          Protein         8e-132  43
c3392  NP_755267.1   ClpB protein  VFG0079          Protein         6e-101  41