Gene Information

Name : CAP2UW1_3269 (CAP2UW1_3269)
Accession : YP_003168463.1
Strain : Candidatus Accumulibacter phosphatis UW-1
Genome accession: NC_013194
Putative virulence/resistance : Virulence
Product : ATP-dependent chaperone ClpB
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 3759091 - 3761670 bp
Length : 2580 bp
Strand : -
Note : KEGG: dar:Daro_2727 AAA ATPase, central region:Clp, N terminal; TIGRFAM: ATP-dependent chaperone ClpB; PFAM: AAA ATPase central domain protein; Clp domain protein; ATPase associated with various cellular activities AAA_5; ATPase AAA-2 domain protein; SMAR

DNA sequence :
ATGCGCCTGGACAAACTGACCACCAAGTTCCAGCAGGCTCTCGCCGATGCGCAGAGTCTCGCCGTCGGACATGACAACCA
GATCATCGAACCGCAGCACCTGCTGCTCGCCCTGCTCCAGCAGGATGACGGCTCGACGACTTCGCTGCTGGCGCATGCCG
GAGCCAACGTGCCGCCGTTGAAGGCGGCGCTCGTCCAGGCCATCAACCGCCTGCCGAAAGTCGAAGGGCACGGCGGCGAG
GTGCAGGTCGGGCGCGATCTCACCAACCTGCTGAACCTTACCGACAAGGAAGCGCAGAAACGGGGCGACCAGTTCATTGC
CTCGGAAATGTTCCTGCTCGCCTTGTGCGACGACAAGGGCGAGTGCGGTCGCCTGCTCCGGCAGCACGGACTGGTCAAGC
AATCCCTGGAGCAGGCCGTCGCCTCGCTGCGCGGCGGCCAGGCCGTCGATAACCAGGAGGCCGAGGGGCAGCGCCAGTCG
CTGAGCAAGTACTGCATCGACCTCACCGAGCGCGCCCGCCTTGGCAAGCTCGACCCGGTGATCGGGCGCGACGATGAGAT
TCGCCGGACGATCCAGATTCTCCAGCGGCGTACCAAGAACAATCCGGTCCTGATCGGCGAGCCGGGTGTCGGCAAGACGG
CGATCGTCGAAGGCCTGGCGCAGCGCATCGTCAATGGCGAAGTCCCGGAGACGCTGAAGGGCAAGAAGGTGATGAGCCTC
GACATGGCCGCGCTGCTCGCCGGCGCCAAGTATCGCGGCGAATTCGAGGAGCGCCTGAAGGCGGTGCTCAAGGAGATTGC
CCAGGAAGAGGGGCAGATCATCGTCTTCATCGACGAGTTGCATACGATGGTCGGCGCCGGCAAGGCCGAGGGCGCGATCG
ACGCCGGCAACATGCTCAAGCCGGCGCTGGCGCGCGGCGATCTGCACTGCGTCGGCGCGACGACACTCGACGAGTACCGC
AAGTACATCGAAAAGGATGCCGCGCTCGAGCGCCGCTTCCAGAAAGTGCTGGTCGAGGAGCCGACGGTCGAATCGACGAT
CGCCATCCTGCGTGGCCTGCGCGAGCGATACGAGCTGCACCACGGCGTCGACATCACCGACCCGGCCATCGTCGCCGCGG
CCGAACTGTCGCATCGCTACATCACCGACCGCTTCCTGCCGGACAAGGCCATCGACCTGATCGACGAGGCGGCGGCGCGC
ATCAAGATGGAGATCGACTCCAAGCCCGAGGTGATGGACAAGCTCGACCGCCGCATGATCCAGCTCAAGATCGAGAGGGA
GGCCGTGCGCAAGGAGCGCGATGACGCCTCCAAGAAGCGCATGCATCTGATCGAGGAGGAGCTCGTCAAGCTCGAGCGCG
AGTATAACGATCTCGACGAGGTCTGGAAGGCCGAAAAATCGCAGGTGCAGGGCAGTGCGCACATCAAGGAAGAGATCGAC
AAGCTGAAGCTCGAGCTGGCTCAACTGCAGCGCGAGAACAAGTGGGACAAGGTCGCCGAGATCCAGTACGGCAAGCTGCC
GCAACTCGAGGCGCAACTGACCGTTGCCGAGAAATCGGGCGACGGTGGCCAGCACAACAAGCTGCTGCGCACCGAGGTGG
GTACCGAGGAGATCGCCGAAGTCGTCTCGCGGGCGACTGGTATTCCGGTATCGAAGATGATGCAGGGCGAGCGCGAGAAG
CTGTTGCAGATGGAAGAACGGATGCATCAGCGCGTCGTCGGGCAGGACGAGGCGGTGCGTCTGGTCGCCAATGCCATTCG
CCGCTCGCGCGCCGGCCTCGCCGACCCGAACCGCCCCTACGGTTCCTTCCTGTTCCTCGGCCCGACCGGGGTCGGCAAGA
CCGAGCTGTGCAAGGCCCTGGCCGGCTTCCTCTTCGACTCCGAGGAGCACCTGATTCGCGTCGACATGAGCGAGTTCATG
GAGAAGCACTCGGTGTCCCGGTTGATCGGTGCGCCGCCCGGCTATGTCGGTTACGAGGAGGGCGGCTACCTGACCGAAGC
CGTACGCCGCAAGCCCTATTCGGTAATCCTTCTCGACGAGGTGGAGAAGGCGCACCCGGACGTCTTCAACGTGCTGCTGC
AGGTCCTCGACGACGGGCGGATGACCGACGGGCACGGGCGGACCGTCGATTTCAAGAACACCGTCGTCGTCATGACGTCA
AACCTCGGGAGCCAGATGATCCAGCAGATGGCTGGCGACGACTATCAGCTGATCAAGCTGGCTGTAATGGGTGAAGTGAA
AACGTATTTCCGGCCCGAGTTCATCAACCGGATTGACGAGGTGGTTGTATTCCACGCACTGGACGAGCAGCATATCAAGG
CGATCGCCAGAATCCAGCTCGCCTACCTCGAAAAGCGCCTGGCGCAGCTCGAGTTGCGGCTGGAAGTGGCGGATAGCGCG
CTGGCCGAAGTGGCGACGGCCGGCTTCGATCCTGTTTATGGCGCGCGCCCGCTGAAGCGCGCGATCCAGTCGCAACTCGA
GGATTCCCTCGCCATGGCCATCCTCGAAGGCCGTTTCGCCGCAGGCGATACGATTCGCGTCGCCTGCGACAGCGGCATCA
TGCGCTTTGACAAGGCCTAG

Protein sequence :
MRLDKLTTKFQQALADAQSLAVGHDNQIIEPQHLLLALLQQDDGSTTSLLAHAGANVPPLKAALVQAINRLPKVEGHGGE
VQVGRDLTNLLNLTDKEAQKRGDQFIASEMFLLALCDDKGECGRLLRQHGLVKQSLEQAVASLRGGQAVDNQEAEGQRQS
LSKYCIDLTERARLGKLDPVIGRDDEIRRTIQILQRRTKNNPVLIGEPGVGKTAIVEGLAQRIVNGEVPETLKGKKVMSL
DMAALLAGAKYRGEFEERLKAVLKEIAQEEGQIIVFIDELHTMVGAGKAEGAIDAGNMLKPALARGDLHCVGATTLDEYR
KYIEKDAALERRFQKVLVEEPTVESTIAILRGLRERYELHHGVDITDPAIVAAAELSHRYITDRFLPDKAIDLIDEAAAR
IKMEIDSKPEVMDKLDRRMIQLKIEREAVRKERDDASKKRMHLIEEELVKLEREYNDLDEVWKAEKSQVQGSAHIKEEID
KLKLELAQLQRENKWDKVAEIQYGKLPQLEAQLTVAEKSGDGGQHNKLLRTEVGTEEIAEVVSRATGIPVSKMMQGEREK
LLQMEERMHQRVVGQDEAVRLVANAIRRSRAGLADPNRPYGSFLFLGPTGVGKTELCKALAGFLFDSEEHLIRVDMSEFM
EKHSVSRLIGAPPGYVGYEEGGYLTEAVRRKPYSVILLDEVEKAHPDVFNVLLQVLDDGRMTDGHGRTVDFKNTVVVMTS
NLGSQMIQQMAGDDYQLIKLAVMGEVKTYFRPEFINRIDEVVVFHALDEQHIKAIARIQLAYLEKRLAQLELRLEVADSA
LAEVATAGFDPVYGARPLKRAIQSQLEDSLAMAILEGRFAAGDTIRVACDSGIMRFDKA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 1e-101 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
CAP2UW1_3269 YP_003168463.1 ATP-dependent chaperone ClpB VFG0079 Protein 2e-153 49
CAP2UW1_3269 YP_003168463.1 ATP-dependent chaperone ClpB VFG2076 Protein 4e-104 44
CAP2UW1_3269 YP_003168463.1 ATP-dependent chaperone ClpB VFG2084 Protein 1e-105 41