Gene Information

Name : ECP_0229 (ECP_0229)
Accession : YP_668165.1
Strain : Escherichia coli 536
Genome accession: NC_008253
Putative virulence/resistance : Virulence
Product : Clp ATPase
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 250351 - 253113 bp
Length : 2763 bp
Strand : -
Note : -
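
The Position and Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption about this record's convention, although it reproduces the stated length). The function name `feature_length` is illustrative and not part of any database tooling.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if start > end:
        raise ValueError("start must not exceed end")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(250351, 253113)

# A complete coding sequence is a whole number of codons; the last one is the stop codon.
codons = length_bp // 3
residues = codons - 1

print(length_bp, codons, residues)  # 2763 921 920
```

The result agrees with the Length field (2763 bp) and with a 920-residue protein followed by one stop codon.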

DNA sequence :
ATGATCCAGATTGATTTAGCCACGCTCGTAAAGCGGCTTAACCCCTTTGCAAAACAGGCGCTGGAAATGGCTGCCTCGGA
GTGTATGAGCCAGCAGGCCAGTGAGATCACCGTTGCCCATGTGCTGTTACAGATGCTGGCAATCCCGCGCAACGACGTAA
GGGTGATTGCCGAACGCACAGGCATCAGTGCGGAGGATTTACGCCAGGCGCTGACCGTGGAGAGTTATCCCGGTGGACGT
TCCGCCGAAGGCTATCCCAGCTTCTCCCCGATGCTGATCGAATGGCTCAAGGAGTCATGGCTGCTGGCCTCAGCCCAGAT
GCAGCACAGCGAACTGCGCAGCGGCGTCCTGCTGCTTACTCTGCTGCATTCACCGCTGCGCTATATTCCACCCGCTGCTG
CCCGTCTGCTGACGGCCATAAACCGCGACCAGTTACAGCAGGATTTCGCAGCATGGACAAAAGAATCCGCCGAATCCGTG
GATCTGGCTGGCGGGCAAACGCCCCGCGCAACGGAAACCGGCGACACCCTGCTTGCCCGCTACGCCAAAAACATGACCGC
AGACGCCCGTAACGGCAGGCTTGACCCGGTACTGTGCCGCGACTATGAAATCGACCTGATGATCGACATTCTCTGCCGCC
GCCGTAAAAACAATCCGGTGGTAGTGGGCGAAGCAGGCGTGGGTAAAAGCGCGCTGATTGAAGGGCTGGCGCTGCGCATC
ATGGCAGGCCAGGTGCCGGACAAGCTGAAAAACACCGATATCATGACCCTTGACCTGGGCGCATTGCAGGCCGGTGCGTC
GGTGAAAGGTGAGTTTGAAAAACGCTTCAAGGGGCTGATGGCGGAGGTCATTTTCTCCCCGGTGCCGGTCATTCTGTTTA
TCGACGAAGCACATACCCTGATTGGCGCGGGCAACCAGCAGGGCGGGCTGGATATATCCAACCTGCTCAAACCAGCGCTG
GCGCGCGGCGAGCTGAAAACCATCGCCGCCACCACCTGGAGCGAGTACAAAAAATATTTCGAGAAAGATGCAGCTTTGTC
GCGCCGCTTCCAGTTAGTGAAGGTCAGCGAACCCAACGCTGCCGAAGCCACCATTATTCTGCGCAGTTTGTCGGCGGTCT
ATGAACAATCTCACGGCGTGCTGATTGATGATGACGCCTTGCAGGCCGCTGCGACACTGAGCGAGCGTTATCTCTCCGGG
CGTCAGTTGCCGGACAAAGCCATTGATGTGCTGGACACCGCCTGCGCCCGTGTGGCCATCAACCTGTCATCGCCGCCAAA
ACAAATCTCGGCACTGACCACCCTGAGCCACCAGCAGGAGGCAGAAATTCGCCAGCTTGAGCGCGAGCTTCGCATCGGGC
TGCGTACCGACACCTCACGAATGACTGAGGTGTTGGAGCAGTATGATGAAACGCTGTCGGCGCTGGACGAACTGGAAGTC
GCCTGGCAGCAGCAGCAGACGCTGGTACAGGAAATTATCGCGCTGCGCCAGCAGTTACTGGGCGTGGCAGAAGACGACGT
TGCGTCGTTGCCGGACGCAGATGCCGTGGAGGATACGCCGCCAGAGGCAGAACAGGATAGTACCGATGCCGAATCGGCTG
ATGATGCAGGCAGCGTACAGCCGGAAGAGACCGCTGAAACAGTTTCCCCGGTACAACGGCTGGCACAACTCACTGCCGAA
CTGGACGCCCTGCATAACGACCAGTTGCTGGTTTCCCCGCATGTCGATAAAAAACAGATTGCAGCGGTGATTGCTGAATG
GACCGGCGTGCCGCTTAACCGCCTGTCACAGAATGAGATGTCGGTCATCACCGACCTGCCGGTATGGCTCGGCGGCACCA
TCAAAGGCCAGGACCTGGCGATTGCCAGCCTGCATAAACACCTGCTGACCGCACGCGCCGACCTGCGCCGTCCGGGACGC
CCGCTTGGTGCGTTCCTGCTGGCTGGCCCCAGCGGCGTGGGTAAAACCGAAACCGTCCTGCAACTGGCGGAACTGCTCTA
TGGCGGACGTCAATACCTGACCACCATCAATATGTCTGAGTTCCAGGAGAAACACACCGTTTCGCGGCTGATTGGCTCTC
CTCCGGGTTATGTCGGCTACGGTGAAGGCGGCGTACTGACCGAAGCGATTCGCCAGAAACCGTACTCGGTGGTGCTGCTC
GATGAAGTAGAAAAAGCGCACCCGGATGTACTCAACCTGTTCTACCAGGCGTTTGACAAGGGCGAGATGGCAGACGGTGA
AGGCCGCCTGATTGACTGTAAAAATATCGTTTTCTTCCTGACCTCCAACCTTGGCTACCAGGTGATTGTCGAACACGCCG
ATGACCCGGAAACCATGCAGGAAGCCCTCTATCCGGTGCTGGCGGACTTCTTTAAACCTGCTCTGCTGGCGCGCATGGAA
GTGGTGCCTTACCTACCGCTGTCGAAAGAGACGCTCGCTACCATTATTGCCGGGAAACTGGCCCGCCTGGATAACGTGCT
GCGCAGCCGCTTTGGCGCGGACGTGATTATTGGGCCGGAAGTGACGGACGAAATCATGAGCCGCGTCACCCGCGCGGAAA
ACGGCGCGAGGATGCTGGAATCGGTTATCGACGGCGACATGCAGCCGCCGCTCTCGCTGCTGCTGTTGCAGAAAATGGCG
GCCAACACGGCGATTGCCCGGATTCGCCTGTCGGCGGCAGACGGTGCGTTCACGGCAGATGTGGAAGATGCGCTGCCTGA
TGATGCCGTGACACCGCAAACAGAGGATGAAACGGTTTTATGA

Protein sequence :
MIQIDLATLVKRLNPFAKQALEMAASECMSQQASEITVAHVLLQMLAIPRNDVRVIAERTGISAEDLRQALTVESYPGGR
SAEGYPSFSPMLIEWLKESWLLASAQMQHSELRSGVLLLTLLHSPLRYIPPAAARLLTAINRDQLQQDFAAWTKESAESV
DLAGGQTPRATETGDTLLARYAKNMTADARNGRLDPVLCRDYEIDLMIDILCRRRKNNPVVVGEAGVGKSALIEGLALRI
MAGQVPDKLKNTDIMTLDLGALQAGASVKGEFEKRFKGLMAEVIFSPVPVILFIDEAHTLIGAGNQQGGLDISNLLKPAL
ARGELKTIAATTWSEYKKYFEKDAALSRRFQLVKVSEPNAAEATIILRSLSAVYEQSHGVLIDDDALQAAATLSERYLSG
RQLPDKAIDVLDTACARVAINLSSPPKQISALTTLSHQQEAEIRQLERELRIGLRTDTSRMTEVLEQYDETLSALDELEV
AWQQQQTLVQEIIALRQQLLGVAEDDVASLPDADAVEDTPPEAEQDSTDAESADDAGSVQPEETAETVSPVQRLAQLTAE
LDALHNDQLLVSPHVDKKQIAAVIAEWTGVPLNRLSQNEMSVITDLPVWLGGTIKGQDLAIASLHKHLLTARADLRRPGR
PLGAFLLAGPSGVGKTETVLQLAELLYGGRQYLTTINMSEFQEKHTVSRLIGSPPGYVGYGEGGVLTEAIRQKPYSVVLL
DEVEKAHPDVLNLFYQAFDKGEMADGEGRLIDCKNIVFFLTSNLGYQVIVEHADDPETMQEALYPVLADFFKPALLARME
VVPYLPLSKETLATIIAGKLARLDNVLRSRFGADVIIGPEVTDEIMSRVTRAENGARMLESVIDGDMQPPLSLLLLQKMA
ANTAIARIRLSAADGAFTADVEDALPDDAVTPQTEDETVL
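
The DNA and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch using only the Python standard library and the standard genetic code. It assumes the listed DNA is already in coding orientation (it begins with ATG and ends with TGA, even though the Strand field is "-"). The names `CODON_TABLE` and `translate` are illustrative, and only the first 30 and last 15 nucleotides of the record are used to keep the example short.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, reading frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


first_30 = "ATGATCCAGATTGATTTAGCCACGCTCGTA"  # start of the DNA sequence in this record
last_15 = "GAAACGGTTTTATGA"                  # end of the DNA sequence in this record

print(translate(first_30))  # MIQIDLATLV
print(translate(last_15))   # ETVL*
```

Both fragments match the listed protein, which starts with MIQIDLATLV and ends with ETVL. Passing the full 2763 bp sequence to the same function would be the way to check the whole entry.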

• Homologs from PAI DB

Gene     GenBank Accn  Product            Virulence or Resistance  PAI or REI      Alignment Type  E-value  Identity (%)
aec27    AAQ96721.1    Aec27              Not tested               AGI-1           Protein         0.0      99
aec27    YP_851418.1   ATPase             Not tested               PAI II APEC-O1  Protein         0.0      99
STY0294  NP_454876.1   ClpB-like protein  Not tested               SPI-6           Protein         4e-126   41

• Homologs from VFDB (virulence genes)

Gene      GenBank Accn  Product     ID of source DB  Alignment Type  E-value  Identity (%)
ECP_0229  YP_668165.1   Clp ATPase  VFG2084          Protein         0.0      56
ECP_0229  YP_668165.1   Clp ATPase  VFG2076          Protein         5e-139   41