Gene Information

Name : O3K_20465 (O3K_20465)
Accession : YP_006780757.1
Strain : Escherichia coli 2011C-3493
Genome accession: NC_018658
Putative virulence/resistance : Virulence
Product : ATP-dependent Clp proteinase ATP-binding subunit
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4199657 - 4202416 bp
Length : 2760 bp
Strand : +
Note : COG0542 ATPases with chaperone activity, ATP-binding subunit
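
The Position and Length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption that the record's values are consistent with) and that the final codon is a stop codon. The helper name span_length is hypothetical and not part of any database tool.

```python
# Sketch: check the reported gene length against its genome coordinates.
# Assumption: coordinates are 1-based and inclusive on both ends.

def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate span."""
    return end - start + 1

start, end = 4199657, 4202416      # "Position" field of the record
length_bp = span_length(start, end)

# A complete coding sequence is a whole number of codons; the last
# codon is assumed to be the stop codon and encodes no residue.
codons = length_bp // 3
protein_len = codons - 1

print(length_bp, codons, protein_len)
```

Under these assumptions the span works out to the 2760 bp given in the Length field, which corresponds to 920 codons and a 919-residue protein.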

DNA sequence :
ATGATCCAGATTGATCTTCCCACGCTGGTAAAACGGCTGAACCTGTTCTCCCGCCAGGCGCTGGAGATGGCCGCCTCTGA
ATGTATGAGTCAGCAGGCAGCGGAAATTACCGTCAGCCATGTGCTCATTCAGATGCTCGCCATGCCACGCAGTGACCTGC
GGGTTATTACCCGCCAGGGCGATATTGGCATGGAAGAGTTGCGCCAGGCGCTGACGGTAGAGAACTACACAACCGCCCGT
TCTGCGGACAGCTACCCGGCGTTTTCCCCGATGCTGGTTGAGTGGCTTAAAGAGGGCTGGCTGCTGGCGTCGGCTGAGAT
GCAGCACAGCGAACTGCGCGGCGGCGTGTTGCTGCTGGCCCTGCTGCATTCGCCGCTGCGTTATATACCGCCTGCTGCCG
CCCGGCTGTTGACCGGCATTAACCGTGACCGTCTGCAACAGGACTTTGTGCAGTGGACACAGGAGTCGGCGGAATCAGTC
GTGCCGGATGCAGACGGTAAAGGCGCAGGCACAATGACGGACGCCTCTGACACCCTGCTTGCCCGCTATGCCAAAAACAT
GACCGCAGACGCCCGTAACGGCAGGCTTGACCCGGTACTGTGCCGCGACCACGAAATCGACCTGATGATCGACATTCTCT
GCCGCCGCCGTAAAAACAACCCGGTGGTGGTGGGCGAAGCGGGCGTGGGCAAAAGCGCACTGATTGAAGGGCTGGCGCTG
CGCATCGTGGCAGGCCAGGTGCCGGACAAGCTGAAAAACACCGATATCATGACCCTTGACTTGGGCGCATTGCAGGCCGG
GGCGTCGGTGAAGGGTGAATTCGAAAAACGTTTCAAAGGGCTGATGGCGGAGGTCATTTCCTCCCCGGTGCCGGTCATTC
TGTTTATCGACGAAGCACATACCCTGATTGGCGCGGGCAACCAGCAGGGCGGGCTGGATATCTCCAACCTGCTCAAACCG
GCGCTGGCGCGCGGCGAGCTGAAAACCATCGCCGCCACCACCTGGAGCGAGTACAAAAAATACTTCGAAAAAGATGCCGC
CCTGTCGCGCCGCTTCCAGTTGGTGAAGGTCAGCGAACCCAACGCTGCCGAAGCCACCATTATTCTGCGCGGTCTGTCGG
CGGTCTATGAACAGTCTCACGGCGTGCTGATTGATGATGACGCCTTGCAGGCCGCTGCGACATTAAGCGAGCGTTATCTC
TCCGGGCGTCAGTTACCGGACAAAGCGATTGATGTGCTGGATACCGCCTGCGCCCGTGTGGCCATCAACCTGTCGTCGCC
GCCGAAGCAAATCTCGGCGCTGACCACTCTGAGCCACCAGCAGGAGGCGGAAATTCGCCAGCTTGAGCGCGAGCTTCGCA
TCGGACTGCGTACCGACACATCACGGATGACCGAGGTGCTGGTGCAGTATGATGAAACGCTGACGGCGCTGGATGAACTG
GAAGCGGCCTGGCACCAGCAGCAGACGCTGGTCCGGGAGATTATTGCGCTGCGCCAGCAGTTACTGGGCGTGGCAGAGGA
CGATGCGGCGCCGTTGCCGGACGCAGATACCGTGGAGGATACGCAGCCAGAGTCAGAACAGGATAATACCGGTGCTAAAC
TGGCTGATGAAGCTGGCAGCGAACAGCCGGAAGAGACCGCAGAAACAGTTTCCCCGGTGCAGCGACTGGCACAGCTCACT
GCCGAACTGGACGCCCTGCATAACGACCGGTTGCTGGTCTCCCCGCACGTCGATAAAAAACAGATTGCGGCGGTGATTGC
CGAATGGACCGGCGTACCGCTTAACCGCCTGTCACAGAATGAGATGTCGGTCATCACCGACCTGCCGGTATGGCTGGGTG
ACACCATCAAAGGCCAGGACCTGGCGATTGCCAGCCTGCATAAACACCTGCTGACTGCACGCGCCGACCTGCGTCGTCCG
GGACGCCCACTCGGCGCGTTTCTGCTGGCCGGTCCCAGCGGCGTGGGTAAAACCGAAACCGTCCTGCAACTGGCAGAACT
GCTCTACGGCGGTCGCCAGTACCTGACCACCATCAATATGTCCGAGTTCCAGGAAAAACACACCGTCTCGCGGCTGATTG
GCTCCCCTCCGGGCTATGTCGGCTATGGCGAAGGCGGCGTACTGACCGAAGCGATTCGCCAGAAACCGTACTCGGTGGTG
CTGCTTGATGAAGTGGAAAAAGCGCACCCGGATGTCCTCAACCTGTTCTACCAGGCGTTCGACAAGGGCGAGATGGCAGA
CGGCGAAGGCCGACTGATTGACTGTAAGAATATCGTTTTCTTCCTCACCTCCAACCTTGGTTACCAGGTGATTGTTGAAC
ACGCGGATGACCCGGAAACCATGCAGGAAGTGCTGTATCCGGTGCTGGCGGACTTCTTTAAACCAGCCCTGCTGGCGCGT
ATGGAAGTGGTGCCGTACCTGCCTCTGTCGAAAGAGACGCTCACCACCATTATCGACGGGAAACTGGCCCGCCTGGATAA
CGTGCTGCGCAGCCGCTTTGATGCGGACGTGATTATTGAGTCGGAAGTGACGGACGAGATCATGAGCCGCGTCACCCGCG
CGGAAAACGGCGCAAGGATGCTGGAGTCCGTCATTGACGGCGACATGCTGCCCCCGCTCTCGCTGCTGCTGTTGCAGAAA
ATGGCGGCCAACACGGCGATTGCCCGCATCCGCCTGTCGGCGGCAGACGGCGCGTTCACGGCAGACGTGGAAGATGCTCT
GGACGACGAGTCTGTCACAGAGGATGAAACGGATTTATGA

Protein sequence :
MIQIDLPTLVKRLNLFSRQALEMAASECMSQQAAEITVSHVLIQMLAMPRSDLRVITRQGDIGMEELRQALTVENYTTAR
SADSYPAFSPMLVEWLKEGWLLASAEMQHSELRGGVLLLALLHSPLRYIPPAAARLLTGINRDRLQQDFVQWTQESAESV
VPDADGKGAGTMTDASDTLLARYAKNMTADARNGRLDPVLCRDHEIDLMIDILCRRRKNNPVVVGEAGVGKSALIEGLAL
RIVAGQVPDKLKNTDIMTLDLGALQAGASVKGEFEKRFKGLMAEVISSPVPVILFIDEAHTLIGAGNQQGGLDISNLLKP
ALARGELKTIAATTWSEYKKYFEKDAALSRRFQLVKVSEPNAAEATIILRGLSAVYEQSHGVLIDDDALQAAATLSERYL
SGRQLPDKAIDVLDTACARVAINLSSPPKQISALTTLSHQQEAEIRQLERELRIGLRTDTSRMTEVLVQYDETLTALDEL
EAAWHQQQTLVREIIALRQQLLGVAEDDAAPLPDADTVEDTQPESEQDNTGAKLADEAGSEQPEETAETVSPVQRLAQLT
AELDALHNDRLLVSPHVDKKQIAAVIAEWTGVPLNRLSQNEMSVITDLPVWLGDTIKGQDLAIASLHKHLLTARADLRRP
GRPLGAFLLAGPSGVGKTETVLQLAELLYGGRQYLTTINMSEFQEKHTVSRLIGSPPGYVGYGEGGVLTEAIRQKPYSVV
LLDEVEKAHPDVLNLFYQAFDKGEMADGEGRLIDCKNIVFFLTSNLGYQVIVEHADDPETMQEVLYPVLADFFKPALLAR
MEVVPYLPLSKETLTTIIDGKLARLDNVLRSRFDADVIIESEVTDEIMSRVTRAENGARMLESVIDGDMLPPLSLLLLQK
MAANTAIARIRLSAADGAFTADVEDALDDESVTEDETDL
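
The protein sequence above is the conceptual translation of the DNA sequence. As a hedged illustration, the sketch below translates the first 30 and the last 39 in-frame nucleotides of the DNA sequence using the standard genetic code, so the result can be compared with the two ends of the protein sequence. The names CODON_TABLE and translate are hypothetical helpers written for this example; a library such as Biopython could be used instead.

```python
# Sketch: translate in-frame DNA with the standard genetic code.
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate DNA from its first base; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()      # drop line breaks and spaces
    usable = len(dna) - len(dna) % 3        # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 nt and last 39 nt of the DNA sequence in this record.
first_30 = "ATGATCCAGATTGATCTTCCCACGCTGGTA"
last_39 = "GACGACGAGTCTGTCACAGAGGATGAAACGGATTTATGA"

print(translate(first_30))
print(translate(last_39))
```

The first fragment should reproduce the start of the protein sequence, and the second should reproduce its end followed by the stop marker.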

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
aec27 | AAQ96721.1 | Aec27 | Not tested | AGI-1 | Protein | 0.0 | 92
aec27 | YP_851418.1 | ATPase | Not tested | PAI II APEC-O1 | Protein | 0.0 | 92

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
O3K_20465 | YP_006780757.1 | ATP-dependent Clp proteinase ATP-binding subunit | VFG2084 | Protein | 0.0 | 57
O3K_20465 | YP_006780757.1 | ATP-dependent Clp proteinase ATP-binding subunit | VFG2076 | Protein | 7e-137 | 42