Gene Information

Name : ECO103_0218 (ECO103_0218)
Accession : YP_003220228.1
Strain : Escherichia coli 12009
Genome accession: NC_013353
Putative virulence/resistance : Virulence
Product : ATP-dependent Clp proteinase ATP-binding chain
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 248011 - 250776 bp
Length : 2766 bp
Strand : -
Note : -
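
The Position and Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is the reading that reproduces the listed length; the function name `feature_length` is hypothetical and not part of any database tooling.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1

# Values copied from the Position field of this record.
start, end = 248011, 250776

length_bp = feature_length(start, end)  # should agree with the Length field
codons = length_bp // 3                 # complete codons, including the stop codon
residues = codons - 1                   # amino acids once the stop codon is excluded

print(length_bp, codons, residues)
```

Under that assumption the coordinates give 2766 bp, that is 922 codons, or 921 residues once the stop codon is excluded.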

DNA sequence :
ATGATCCAGATTGATCTTCCCACGCTGGTAAAACGGCTGAACCTGTTCTCCCGCCAGGCGCTGGAGATGGCCGCCTCTGA
ATGTATGAGTCAGCAGGCAGCGGAAATTACCGTCAGCCATGTGCTCATTCAGATGCTCACCATGCCACGCAGTGACCTGC
GGGTTATTACCCGCCAGGGCGATATTGGCATGGAAGAGTTGCGCCAGGCGCTGACGGTAGAGAACTACACAACCGCCCGT
TCTGCGGACAGCTACCCGGCGTTTTCCCCGATGCTGGTTGAGTGGCTGAAAGAGGGCTGGCTGCTGGCGTCGGCTGAGAT
GCAGCACAGCGAACTGCGCGGCGGCGTGTTGCTGCTGGCCCTGCTGCATTCGCCGCTGCGTTATATACCGCCTGCTGCCG
CCCGGCTGTTGACCGGCATTAACCGTGACCGTCTGCAACAGGACTTTGTGCAGTGGACACAGGAGTCGGCGGAATCAGTC
GTGCCGGATGCAGACGGTAAAGGCGCAGGCACAATGACGGACGCCTCTGACACCCTGCTTGCCCGCTATGCCAAAAACAT
GACCGCAGACGCCCGTAACGGCAGGCTTGACCCGGTACTGTGCCGCGACCACGAAATCGACCTGATGATCGACATTCTCT
GCCGCCGCCGTAAAAACAACCCGGTGGTGGTGGGCGAAGCGGGCGTGGGCAAAAGCGCACTGATTGAAGGGCTGGCGCTG
CGCATCGTGGCAGGCCAGGTGCCGGACAAGCTGAAAAACACCGATATCATGACCCTTGACCTGGGCGCATTGCAGGCCGG
GGCGTCGGTGAAGGGTGAATTCGAAAAACGTTTCAAAGGGCTGATGGCGGAGGTCATTTCCTCCCCGGTGCCGGTCATTC
TGTTTATCGACGAAGCACATACCCTGATTGGCGCGGGCAACCAGCAGGGCGGGCTGGATATCTCCAACCTGCTCAAACCG
GCGCTGGCGCGCGGCGAGCTGAAAACCATCGCCGCCACCACCTGGAGCGAGTACAAAAAATACTTCGAAAAAGATGCCGC
CCTGTCGCGCCGCTTCCAGTTGGTGAAGGTCAGCGAACCCAACGCTGCCGAAGCCACCATTATTCTGCGCGGTCTGTCGG
CGGTCTATGAACAGTCTCACGGCGTGCTGATTGATGATGACGCCTTGCAGGCCGCTGCGACATTAAGCGAGCGTTATCTC
TCCGGGCGTCAGTTACCGGACAAAGCGATTGATGTGCTGGATACCGCCTGCGCCCGTGTGGCCATCAACCTGTCGTCGCC
GCCGAAGCAAATCTCGGCGCTGACCACTCTGAGCCACCAGCAGGAGGCGGAAATTCGCCAGCTTGAGCGCGAGCTTCGCA
TCGGACTGCGTACCGACACATCACGGATGACCGAGGTGCTGGTGCAGTATGATGAAACGCTGACGGCGCTGGATGAACTG
GAAGCGGCCTGGCACCAGCAGCAGACGCTGGTCCGGGAGATTATTGCGCTGCGCCAGCAGTTACTGGGCGTGGCAGAGGA
CGATGCGGCGCCGTTGCCGGACGCAGATACCGTGGAGGATACGCAGCCAGAGTCAGAGTCAGAACAGGATAATACCGGTG
CCGTACCGGCTGATGAGGCCGACAGAGAACAGCCGGAAGAGACCGCTGAAACAGTTTCCCCGGTACAACGTCTGGCACAT
CTCACTGCCGAACTGGACGCCCTGCATAACGACCGGTTGCTGGTTTCCCCGCACGTCGATAAAAAACAGATTGCGGCGGT
GATTGCCGAATGGACCGGCGTGCCGCTTAACCGCCTGTCGCAGAATGAAATGTCGGTCATCACCGACCTGCCAAAATGGC
TCGGCGACACCATCAAAGGCCAGGACCTGGCGATTGCCAGCCTGCACAAACACCTGCTGACCGCACGCGCCGACCTGCGC
CGTCCGGGACGCCCGCTGGGCGCGTTCCTGCTGGCTGGCCCCAGCGGCGTGGGTAAAACTGAAACCGTCCTGCAACTGGC
AGAGCTGCTCTACGGCGGTCGCCAGTACCTGACCACCATCAATATGTCCGAGTTCCAGGAGAAACATACCGTCTCGCGAC
TGATTGGTTCGCCTCCGGGCTACGTTGGCTACGGTGAAGGCGGCGTACTGACCGAAGCGATTCGCCAGAAACCCTACTCG
GTAGTACTGCTCGATGAAGTGGAAAAAGCGCACCCGGATGTGCTCAACCTGTTCTACCAGGCGTTCGATAAGGGCGAAAT
GGCAGACGGTGAAGGCCGCCTGATTGACTGCAAAAATATCGTCTTCTTCCTGACGTCCAACCTCGGCTACCAGGTAATAG
TCGAGCATGCCGATGACCCGGAAACCATGCAGGAAGTACTGTATCCGGTGCTGGCCGACTTCTTCAAACCTGCCCTGCTG
GCGCGTATGGAAGTGGTGCCGTACCTGCCGCTGTCGAAAGAGACGCTCGCCACCATCATTGCCGGGAAACTGGCCCGCCT
GGATAACGTGCTGCGCAGCCGCTTTGGCGCGGAGGTGATTATAGAACCGGAAGTGACGGACGAAATCATGAGCCGCGTCA
CCCGCGCGGAAAACGGCGCGAGGATGCTGGAATCGGTCATCGACGGCAATATGCTGCCGCCGCTCTCCCTGCTGCTGTTG
CAGAAAATGGCGGCGAATACGGCGGTTGCCCGGATTCGGTTGTCGGCAGTGGACGGCGCATTTACGGCAGACGTGGAAGA
TGCTCAGAACGACGAGTCCGTCACAAAGGATGAAACGGTTTTATGA

Protein sequence :
MIQIDLPTLVKRLNLFSRQALEMAASECMSQQAAEITVSHVLIQMLTMPRSDLRVITRQGDIGMEELRQALTVENYTTAR
SADSYPAFSPMLVEWLKEGWLLASAEMQHSELRGGVLLLALLHSPLRYIPPAAARLLTGINRDRLQQDFVQWTQESAESV
VPDADGKGAGTMTDASDTLLARYAKNMTADARNGRLDPVLCRDHEIDLMIDILCRRRKNNPVVVGEAGVGKSALIEGLAL
RIVAGQVPDKLKNTDIMTLDLGALQAGASVKGEFEKRFKGLMAEVISSPVPVILFIDEAHTLIGAGNQQGGLDISNLLKP
ALARGELKTIAATTWSEYKKYFEKDAALSRRFQLVKVSEPNAAEATIILRGLSAVYEQSHGVLIDDDALQAAATLSERYL
SGRQLPDKAIDVLDTACARVAINLSSPPKQISALTTLSHQQEAEIRQLERELRIGLRTDTSRMTEVLVQYDETLTALDEL
EAAWHQQQTLVREIIALRQQLLGVAEDDAAPLPDADTVEDTQPESESEQDNTGAVPADEADREQPEETAETVSPVQRLAH
LTAELDALHNDRLLVSPHVDKKQIAAVIAEWTGVPLNRLSQNEMSVITDLPKWLGDTIKGQDLAIASLHKHLLTARADLR
RPGRPLGAFLLAGPSGVGKTETVLQLAELLYGGRQYLTTINMSEFQEKHTVSRLIGSPPGYVGYGEGGVLTEAIRQKPYS
VVLLDEVEKAHPDVLNLFYQAFDKGEMADGEGRLIDCKNIVFFLTSNLGYQVIVEHADDPETMQEVLYPVLADFFKPALL
ARMEVVPYLPLSKETLATIIAGKLARLDNVLRSRFGAEVIIEPEVTDEIMSRVTRAENGARMLESVIDGNMLPPLSLLLL
QKMAANTAVARIRLSAVDGAFTADVEDAQNDESVTKDETVL
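
The protein sequence above should correspond to a direct translation of the DNA sequence. The following is a minimal sketch of that check, assuming the standard codon assignments (shared by the bacterial translation table) and that the DNA is listed in the coding orientation starting at the ATG. The names `CODON_TABLE` and `translate` are illustrative only, and just the first 30 and last 18 nucleotides of the record are used as sample input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 18 nucleotides of the DNA sequence in this record.
head = "ATGATCCAGATTGATCTTCCCACGCTGGTA"
tail = "GATGAAACGGTTTTATGA"

print(translate(head))  # compare with the start of the protein sequence
print(translate(tail))  # compare with the end of the protein sequence
```

Passing the full 2766-nt sequence to `translate` in the same way would allow the complete protein sequence to be compared.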

• Homologs from PAI DB

Gene   GenBank Accn  Product  Virulence or Resistance  PAI or REI      Alignment Type  E-val  Identity (%)
aec27  YP_851418.1   ATPase   Not tested               PAI II APEC-O1  Protein         0.0    91
aec27  AAQ96721.1    Aec27    Not tested               AGI-1           Protein         0.0    91

• Homologs from VFDB (virulence genes)

Gene         GenBank Accn    Product                                         ID of source DB  Alignment Type  E-val   Identity (%)
ECO103_0218  YP_003220228.1  ATP-dependent Clp proteinase ATP-binding chain  VFG2084          Protein         0.0     58
ECO103_0218  YP_003220228.1  ATP-dependent Clp proteinase ATP-binding chain  VFG2076          Protein         4e-138  42