Gene Information

Name : EC55989_3334 (EC55989_3334)
Accession : YP_002404300.1
Strain : Escherichia coli 55989
Genome accession: NC_011748
Putative virulence/resistance : Virulence
Product : chaperone clpB
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 3420014 - 3422680 bp
Length : 2667 bp
Strand : -
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; Product type pf : factor

DNA sequence :
GTGAATAACATGGAAAATTCGGCAGCCCTGTTACGTCGTCTTAATCATTACTGTGCCCGTGCACTGGAAGGCGCAGCCTC
CCTTTGCCAGACCCGGGCCCATGCGGAAATCACCCCGGAGCACTGGTTACTGAAACTGCTGGAGCAGGGGGAAGGAGACC
TTACCGTGCTGGGCAGGCGTTATGACTGGGATATGGACGCGATATGGCAGTCGCTGCTCGGCTGGCTGGATAACCAGCCC
CGTAGCGTACGCAGTCGCCCGCAGCTTGCGCAGTCCCTGAATGCTCTGCTGAAACAGGCCTGGATGGTGGCCTCATTACA
GGGAGAAGAACATATCCGCAGCGTGCATCTGCTGGGTGCCCTGACGGAAAATCCGCACCTGGTTCGCTGTGACGGGCTGT
GGCCTCTGCTGACACTGAGTCAGAGCCAGTTGCAGCGTCTGTCCCCGCTGCTGGATGCGCAGCCTGATGAGTGTCCGGAG
ACGTTACAGGATGCAGAGCCTGTGCTGCCTCAGGGAGACAGTGTGACCTTTATCGGGCGCCCTGTCGGTGCGGATACGGC
AGGTATACCGTCAGGTGACCTGCCGCCGGTGTTACAGGGTGCGCTGGATAAATTCACCCGGGACATCACGGCCAGCGCGA
GAGAGGGGAAAATTGACCCGGTATCGGGACGCGATACGGAAATCCGTCAGATGGTGGATATTCTCTCCCGCCGTCGCAAG
AATAACCCGATTCTGGTGGGGGACCCCGGTGTGGGGAAAACGGCTCTGGTGGAAGGGCTCGCCCTGCGTATTGTGGAAGG
TAACGTACCAGAATCTCTCAGACCCGTCACCCTGCGCACCCTTGACCTCGGCCTGCTGCAGGCCGGTGCCGGCGTGAAAG
GTGAATTTGAACAGCGTCTGAAAAACGTGATTGATGCCGTGCAGCTGTCACCGGCTCCTGTACTGCTGTTTATAGATGAA
GCCCATACCCTTATCGGTGCCGGTAATCAGGCCGGTGGCGCGGATGCGGCCAACCTGCTGAAGCCTGCGCTGGCGCGAGG
CGAACTGCGCACCATTGCTGCCACCACCTGGTCTGAGTACAAACAATACCTGGAACGTGACGCGGCTCTGGAGCGGCGTT
TTCAGATGGTCAAAGTGGACGAGCCGGATGATGAGACGGCCTGTCTGATGCTGCGCTCCCTGAAATCCCGTTATGCGGAA
CATCATAACGTGCATATCACCGATGAGGCAGTACGTGCTGCCGTCACACTCTCGCGCCGTTATCTGACAGAACGTCAGTT
ACCGGACAAGGCCGTTGACCTGCTTGATACTGCTGCTGCCCGTGTGCGGATGAGCCTCGACACGGTGCCGGAACAGCTGA
CCCGAATCCGTTCACAGCTCGCTTCCCTCGGTATGGAAAAACAGGCACTGCTGGAAGATATTGCCGTTGGTCATCAGAAT
CACGGCGAACGCCTGTCTGCCATTGAGCAGGAAGAGAACGTGCTGATGACAGCACGTGATGACCTGGAGCAGCAGTACGC
CCGTGAGTGCGAACTCACCGGTGAGTTACTTGAAAGCCGCAGCGATATTTCCCGCCAGAGTGAGACACATCACCTGCAGC
AGGCACTGCACGACATTCAGCAGAATCAGCCCCTGCTCAGTGTGGATGTTGACGTACGCACCGTTGCCGGTGTGGTCGCG
GACTGGACCGGAGTGCCGTTATCCTCACTGATGAAGGATGAACAGACGGAACTGCTGCATCTGGAAAAGGATATCGGCAG
ACGGGTGGTCGGACAGGACGTGGCACTGGAATCCATTGCACAGCGTCTGCGTGCGGCGAAAACGGGTCTGACATCCGGTA
ACGGCCCCCAGGAGGTGTTCCTGCTGGTCGGCCCCAGTGGTGTGGGAAAAACCGAAACGGCACTGGCGCTGGCAGATGTG
ATGTACGGCGGCGAAAAATCACTGATTACCATCAATCTGTCGGAATATCAGGAGCCCCATACGGTTTCCCAGCTTAAGGG
TTCACCGCCCGGGTACGTGGGATACGGTCAGGGGGGTATTCTGACGGAAGCTGTACGTAAGCGCCCTTACAGCGTGGTGC
TGCTGGATGAAGTGGAAAAGGCCCACCGGGATGTGCTGAATCTGTTCTACCAGGTGTTTGACCGGGGCTTTATGCGCGAC
GGTGAGGGGCGTGAAATTGACTTCCGTAATACGGTCATTCTGATGACCTCCAATCTGGGCAGTGACCTTCTGATGCAGCA
ACTGAGCGAAAAGCCGGAGACAACGGAATCGGAGCTGCATGAGCTTATCCGTCCACTGCTGCGCGACCACTTCCAGCCAG
CACTGCTTGCCCGTTTCCAGACCGTGATTTACCGTCCGCTGACACCGTCTGCTATGCGCACCATTGTGGAGATGAAGCTT
GCACAGGTCTGCGAACGCCTGCACTGCCATTATGGGCTGAGCACATCGGTTGATGAACGTGTGTACGATGCCCTGACATC
CGCCTGCCTGCTGCCGGATACGGGCGCCCGGAATGTGGAGAGTCTGCTGAACCAGCAACTGCTGCCGGTACTGAGCCGGC
AGTTGCTGAGCCATATGGCCGCGAAGCAGAAACCGCAGGCGCTGGCTCTGGCATGGAGTGACGAAGACGGTATGGTGATT
GAGCTGCGGCAGGAATGCGCGTTATGA

Protein sequence :
MNNMENSAALLRRLNHYCARALEGAASLCQTRAHAEITPEHWLLKLLEQGEGDLTVLGRRYDWDMDAIWQSLLGWLDNQP
RSVRSRPQLAQSLNALLKQAWMVASLQGEEHIRSVHLLGALTENPHLVRCDGLWPLLTLSQSQLQRLSPLLDAQPDECPE
TLQDAEPVLPQGDSVTFIGRPVGADTAGIPSGDLPPVLQGALDKFTRDITASAREGKIDPVSGRDTEIRQMVDILSRRRK
NNPILVGDPGVGKTALVEGLALRIVEGNVPESLRPVTLRTLDLGLLQAGAGVKGEFEQRLKNVIDAVQLSPAPVLLFIDE
AHTLIGAGNQAGGADAANLLKPALARGELRTIAATTWSEYKQYLERDAALERRFQMVKVDEPDDETACLMLRSLKSRYAE
HHNVHITDEAVRAAVTLSRRYLTERQLPDKAVDLLDTAAARVRMSLDTVPEQLTRIRSQLASLGMEKQALLEDIAVGHQN
HGERLSAIEQEENVLMTARDDLEQQYARECELTGELLESRSDISRQSETHHLQQALHDIQQNQPLLSVDVDVRTVAGVVA
DWTGVPLSSLMKDEQTELLHLEKDIGRRVVGQDVALESIAQRLRAAKTGLTSGNGPQEVFLLVGPSGVGKTETALALADV
MYGGEKSLITINLSEYQEPHTVSQLKGSPPGYVGYGQGGILTEAVRKRPYSVVLLDEVEKAHRDVLNLFYQVFDRGFMRD
GEGREIDFRNTVILMTSNLGSDLLMQQLSEKPETTESELHELIRPLLRDHFQPALLARFQTVIYRPLTPSAMRTIVEMKL
AQVCERLHCHYGLSTSVDERVYDALTSACLLPDTGARNVESLLNQQLLPVLSRQLLSHMAAKQKPQALALAWSDEDGMVI
ELRQECAL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 2e-141 45
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 7e-108 43
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 1e-107 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
EC55989_3334 YP_002404300.1 chaperone clpB VFG2084 Protein 3e-127 48
EC55989_3334 YP_002404300.1 chaperone clpB VFG2076 Protein 3e-158 44
EC55989_3334 YP_002404300.1 chaperone clpB VFG0079 Protein 2e-113 42