Gene Information

Name : EcE24377A_3129 (EcE24377A_3129)
Accession : YP_001464144.1
Strain : Escherichia coli E24377A
Genome accession : NC_009801
Putative virulence/resistance : Virulence
Product : ClpA/ClpB family protein
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 3134749 - 3137394 bp
Length : 2646 bp
Strand : +
Note : identified by match to protein family HMM PF00004; match to protein family HMM PF02861; match to protein family HMM PF07724; match to protein family HMM PF07728; match to protein family HMM TIGR03345
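The Position, Length and Strand fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene is a single uninterrupted coding sequence ending in one stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the Position field (assumed 1-based, inclusive).
start, end = 3134749, 3137394

# Inclusive coordinates: both endpoints count, hence the +1.
gene_length = end - start + 1

# A complete coding sequence should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# The final codon is assumed to be a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 2646 0 881
```

Under these assumptions the computed 2646 bp agrees with the Length field, and the coding sequence corresponds to a protein of 881 residues.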

DNA sequence :
ATGACCGGAAATCACCCCGCCGCGCTGCTGCGTCGCCTTAACCCATACTGTGCACGGGCGCTGGACGCTGCCGCCTCACT
GTGTCAGACCCGCGCCCATGCGGAGATAACCATTGAACACTGGCTGCTGAAACTGCTGGAGCAGGGAGAAGGCGATATCA
CGGTGATTGCCCGCCGCTATGAATGGGATATCGACACGCTCTGGCAGTCTCTGCTGGCACATCTGGACACCTTACCCCGC
TCGGTCCGCGAACGTCCGCAGCTTTCTGAACCCCTGACGGCGCTTATCAGACAGGCGTGGCTGATTGCTTCGCTGGAAGG
CGACGACCCGCAAATCCGCAGCCAGCACCTGCTGATGGCGCTGACAGAAAAACCGATGCTGCCCGCCTGTAATGACCTGT
GGGTATTGCTGAGCCTGAGCCGTGTGCAGTTTGAGCGGCTGCGTCCCCTGCTGGATGCGCAGTCGGATGAATGCCCGGCA
CGTCAGCCACAGGTCACCGAACCGCTGACATCCGCGCAACCCGGGGTGGCAACGACGGACGCACCGGCAAACACGCTGAC
GGGGAAACAGGATGACGCCCTGCTGGCGGTACTTAACCGCTTTACCGAAGACGTGACGGAAAAAGCCCGCAGCGGGCGCA
TCGACCCGGTATTCGGGCGCGACACCGAAATTCGCCAGATGGTGGATATCCTCTCCCGTCGCCGCAAAAACAACCCGATT
CTGGTGGGAGAACCGGGGGTGGGCAAAACCGCACTGGTGGAAGGGCTGGCGCTGCGTATCGCCGAAGGCAATGTACCGGA
CAGCCTGAAAACGGTGCATATCCGCACGCTGGACCTCGGTCTGTTACAGGCTGGCGCGGGCGTTAAAGGCGAATTTGAGC
AGCGGCTGAAAAACGTGATTGATGCAGTACAGAAATCACCGGAGCCGGTACTACTGTTTATTGATGAAGCCCATACCATT
ATCGGCGCGGGTAATCAGGCGGGCGGCGCGGATGCGGCGAACCTGCTGAAACCGGCGCTGGCAAGGGGCGAACTGCGCAC
CATCGCGGCGACCACATGGAGCGAATACAAACAGTATTTTGAGCGCGACGCCGCGCTGGAACGCCGCTTCCAGATGGTGA
AAGTCGACGAGCCGGATGACGATACCGCCTGCCTGATGCTGCGGGGACTCAAGGCCCGCTATGCACAGCACCACGGCGTG
CATATGCTGGACAGCGCCATTCAGACCGCCGTGCGCCTGTCGCGCCGCTATCTGACCGGGCGCCAGTTGCCGGACAAGGC
GGTTGATTTGCTGGATACCGCCGGGGCAAGAGTCCGCATGAGCCTTGACACCCTGCCGGAACCGTTGACGCAGCTTCATG
CGCGACTGGCGGCACTGGATATTGAGCGGGAAGCGATTGAGCAGGACAGCGTATTTTATCCCGAAGCCAGCCCGGAGCGG
CTGGCGGAACTGATCGATTTGCGTGATGAGTTACAGGCAGAAGCCGGGCATCTGGAAACGCAGTATCAGCAGGAAAAGCG
GCTTGCGCAGCAGATTATGACGTTGCGTCAGGAAGGGACAGACAGCACTGAACTGCAACAGCAACTGCGAACACATCAGG
GCTTTGCACCGCTGCTGGCGCTGGATGTGGACGCCCGCGCCGTCGCCACGGTGGTGGCGGACTGGACCGGCATCCCGCTC
TCTTCCCTGCTCAGGGACGAACAGAGCGACCTGCTCAGTATGGAACAAAGCCTTGAAAACCGCGTTGTCGGACAAAGCCC
GGCGCTCTGCGCCATCGCACAGCGGCTGCGGGCGGCTAAAACCGGCCTCACGCCGGAGAATGGCCCACAGGGGGTATTCC
TGCTGACCGGCCCCAGCGGCACCGGTAAAACCGAAACTGCGCTCACACTGGCCGACACCCTGTTTGGCGGTGAAAAATCC
CTTATCACCATCAACCTCTCCGAATATCAGGAGCCGCATACCGTCTCCCAGCTTAAGGGGGCTCCTCCGGGCTACGTTGG
CTACGGTCAGGGCGGCGTACTCACCGAAGCCGTTCGCAAACGCCCGTACAGCGTGGTGCTGCTCGACGAAGTGGAAAAGG
CGCATCGCGACGTGATGAACCTGTTCTATCAGGTATTTGATCGGGGCTTTATGCGCGACGGCGAAGGCCGGGAAATTGAC
TTCCGCAACACCGTGATTCTGATGACGGCTAATCTGGGCAGTGACCACATCATGCAGCTGCTGGAGGAAAAACCGGACGC
CACGGACGCAGACCTGCATGAACTGCTGTACCCCCTGCTGCGCGAGCACTTCCAGCCTGCACTGATGGCGCGTTTTCAGA
CGGTGATTTACCGCCCTCTGGGACAGGAAGCGATGCGCGCCATTGTGGAAATGAAACTGGCGCAGGTGGTTCGTCGTCTT
CGTCAGCACTATGGGCTGGAAACGGAAATCAACGACAGCCTGTATGACGCGCTGACCGCCGCCTGCCTGCTGCCGGACAC
CGGGGCGCGTAATATCGACAGCCTGCTGAACCAGCAAATCCTGCCGGTCTTAAGCCAGCAGTTGCTGGCGCAACAGGCCG
CGCACCGTAAACCAGCCCAGCTACGGCTTGGCTGGGATGAGGAAGACGGGATTGTACTGGAGTTCGCTACGGAAGAGATG
CAATAA

Protein sequence :
MTGNHPAALLRRLNPYCARALDAAASLCQTRAHAEITIEHWLLKLLEQGEGDITVIARRYEWDIDTLWQSLLAHLDTLPR
SVRERPQLSEPLTALIRQAWLIASLEGDDPQIRSQHLLMALTEKPMLPACNDLWVLLSLSRVQFERLRPLLDAQSDECPA
RQPQVTEPLTSAQPGVATTDAPANTLTGKQDDALLAVLNRFTEDVTEKARSGRIDPVFGRDTEIRQMVDILSRRRKNNPI
LVGEPGVGKTALVEGLALRIAEGNVPDSLKTVHIRTLDLGLLQAGAGVKGEFEQRLKNVIDAVQKSPEPVLLFIDEAHTI
IGAGNQAGGADAANLLKPALARGELRTIAATTWSEYKQYFERDAALERRFQMVKVDEPDDDTACLMLRGLKARYAQHHGV
HMLDSAIQTAVRLSRRYLTGRQLPDKAVDLLDTAGARVRMSLDTLPEPLTQLHARLAALDIEREAIEQDSVFYPEASPER
LAELIDLRDELQAEAGHLETQYQQEKRLAQQIMTLRQEGTDSTELQQQLRTHQGFAPLLALDVDARAVATVVADWTGIPL
SSLLRDEQSDLLSMEQSLENRVVGQSPALCAIAQRLRAAKTGLTPENGPQGVFLLTGPSGTGKTETALTLADTLFGGEKS
LITINLSEYQEPHTVSQLKGAPPGYVGYGQGGVLTEAVRKRPYSVVLLDEVEKAHRDVMNLFYQVFDRGFMRDGEGREID
FRNTVILMTANLGSDHIMQLLEEKPDATDADLHELLYPLLREHFQPALMARFQTVIYRPLGQEAMRAIVEMKLAQVVRRL
RQHYGLETEINDSLYDALTAACLLPDTGARNIDSLLNQQILPVLSQQLLAQQAAHRKPAQLRLGWDEEDGIVLEFATEEM
Q
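The protein sequence corresponds to a codon-by-codon translation of the DNA sequence. The following is a minimal sketch of that translation, assuming the standard genetic code and a plus-strand sequence read from its first base; the function name `translate` and the short test fragments (the first 18 and last 15 bases of the DNA sequence) are chosen for illustration only.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and whitespace
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGACCGGAAATCACCCC"))  # prints: MTGNHP
print(translate("GAAGAGATGCAATAA"))     # prints: EEMQ
```

The two fragments reproduce the first six and last four residues of the protein sequence (MTGNHP... and ...EEMQ). Pasting the full DNA sequence into `translate` should give the full protein under the same assumptions.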

• Homologs from PAI DB

Gene     GenBank Accn  Product            Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity (%)
STY0294  NP_454876.1   ClpB-like protein  Not tested               SPI-6       Protein         6e-143  45

• Homologs from VFDB (virulence genes)

Gene            GenBank Accn    Product                   ID of source DB  Alignment Type  E-val   Identity (%)
EcE24377A_3129  YP_001464144.1  ClpA/ClpB family protein  VFG2076          Protein         4e-151  45
EcE24377A_3129  YP_001464144.1  ClpA/ClpB family protein  VFG2084          Protein         3e-132  44