Gene Information

Name : clpB2 (azo3903)
Accession : YP_935405.1
Strain : Azoarcus sp. BH72
Genome accession: NC_008702
Putative virulence/resistance : Virulence
Product : putative ATP-dependent Clp protease, ATP-binding subunit ClpB
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 4280714 - 4283434 bp
Length : 2721 bp
Strand : +
Note : Putative ATP-dependent Clp protease, ATP-binding subunit ClpB. Homology to clpB of E. coli of 38% (sprot|CLPB_ECOLI). The protein is thought to be subunits of ATP-dependent proteases which act as chaperones to target the proteases to substrates. Pfam: ATP

DNA sequence :
ATGAGCGAAATCAGTCGGGTAGCCTCGTTCGGCAAACTCAACAGCCTCGCCTACAAGGCGATGGAAGGCGCCACCGTCTT
CTGCAAGCTGCGCGGCAATCCCTATGTGGAGCTGGAACACTGGCTGCAGCAGCTGCTGCAGAACGAGGACACCGACCTGC
ACCGCATCGTGCGCCACTTCGACGTGGACGCGGCGCGCCTGGCCGCCGACATGACCACCGCGCTCGACCGCCTGCCGCGC
GGCTCGACCTCGATCTCGGGCCTCTCCGAAAACCTCGACACGCTCGTCGAACGCGGCTGGGTGTATGCCTCTCTGATGTT
CGGCGACCGCCAGATCCGCTCCGGCTACCTGCTGGTCGGCATGCTCAAGACGCCCTCGCTGCGCAACGCGCTGATGGCGA
TCTCGCGCCAGTTCGAGCGCGTGCGCCCCGATGTGCTGACCGACGAATTCACCAAGATCGTCGCCGGCTCGGCGGAAGAA
AACCTCACCGCGCGCGATGGCTCGGGCGGCGCGCCGGGCGAGGACAGCGGCGCGATTGCGCCCGCGCAGATGGGCAAGCA
GGAGGCGCTGAAGAAATTCACCGTCGACCTCACCGAGCAGGCCCGCTCCGGCAAGATGGACCCGATCGTCGGCCGCGACG
AAGAAATCCGCCAGGTGGTCGACATCCTGATGCGCCGCCGCCAGAACAACCCCATCCTGGTCGGCGAGGCCGGCGTCGGC
AAGACCGCGGTGGTCGAAGGCTTCGCCCAGCGCATCGCGCGCGGCGACGTGCCGCCGGCGCTCAAGGACGTCTCGCTGCT
GGCGCTGGATGTCGGCCTGCTGCAGGCCGGCGCCAGCATGAAGGGCGAATTCGAGCAGCGACTGCGCTCGGTCATCGACG
AAGTCCAGGCCAGCCCCAAGCCCATCATCCTGTTCGTCGATGAGACCCACACCCTGGTCGGCGCCGGCGGCGCGGCCGGC
ACCGGCGATGCGGCCAACCTGCTCAAGCCGGCGCTCGCGCGCGGCACGCTGCGCACCGTCGGCGCCACCACCTGGGCCGA
GTACAAGAAGTACATCGAGAAGGACCCGGCGCTGACCCGGCGCTTCCAGAACGTGCAGGTCGACGAGCCGGACGAGAAGA
AGGCGGTGCTCATGATGCGCGGCGTTGCCAGCACCATGGAGAAGCACCACCAGGTGCAGATCCTCGACGAGGCGCTGGAG
GCCGCGGTCAAGCTGTCGCACCGCTACATCCCGGCGCGCCAGCTGCCGGACAAGTCCGTCTCGCTGCTCGACACCGCCTG
CGCCCGCGTCGCGGTGAGCCTGCACGCCACCCCGGCCGAGGTGGACGACAGCCGCAAGCGCATCGACGCGCTGAACACCG
AGCTGGAAATCATCGGCCGCGAGAGCAACATCGGCATCGAGGTCGGCGAGCGCCGTGCCCATGCCGAGGCCCTGCTGGCC
GAAGAACAGCAGCGGCTGGCCGAACTCGAGGCGCGCTGGGCGGCCGAGAAGACGCTGGTCGATGAACTCCTCGCGCTGCG
CGCCAAGCTGCGCAGCGGCAGCCGCCCGGTGGAAGGCACCGGCAGCGCGCTGGAAGCCGCGGCCGAAGCGGCTGCGCCGG
AGGCCGCCGCCGAGAGCGCGCCCGAACCCGAGCGCGAGGCGCTGTTCGCGCGCCTGCGCGAAGTGCAGGCCGAACTCGCC
ACGCTGCAGGGCGAAGACCCGCTGATCCTGCCGACAGTGGACTACCAGGCGGTGGCCTCGGTGGTCGCCGACTGGACCGG
CATCCCGGTCGGCCGCATGGCGCGCAACGAAATCGAGAACGTGCTGCGCCTGCCGCAGCTGCTCGGCCAGCGCGTCATCG
GCCAGGACCACGCGATGGAGATGATCGCCAAGCGCATCCAGACTTCGCGCGCCGGGCTCGACAACCCCAACAAGCCGATC
GGCGTGTTCATGCTCGCCGGCACCTCGGGCGTCGGCAAGACCGAAACCGCGCTGGCACTGGCCGAAGCGCTGTACGGCGG
CGAGCAGAACGTCGTCACGATCAACATGAGCGAATTCCAGGAAGCGCACACCGTGTCCACGCTCAAGGGCGCGCCTCCGG
GCTACGTCGGCTACGGCGAAGGCGGCGTGCTGACCGAGGCGGTGCGGCGCAAGCCCTACAGCGTGGTGCTGCTCGACGAG
GTCGAGAAGGCCCACCCCGATGTGCACGAGATGTTCTTCCAGGTCTTCGACAAGGGCTTCATGGAAGACGGCGAAGGCCG
CTTCATCGACTTCAAGAACACCCTGATCCTGCTCACCACCAATGCCGGCACCGACCTCATCGCCAGCATGTGCAAGGACC
CCGACCTGCTGCCCGACCCGGAAGGCCTCGCCAAGGCGCTGCGCGACCCGCTGCTGAAGATCTTCCCGCCGGCGCTGCTC
GGCCGCCTCGTCACCATTCCCTACTACCCGCTGACCGACGCCATGCTGGGCGCCATCGTCCGGCTGCAGCTCGGGCGCAT
CAAGAAGCGCGTGGAAGCGCGCTACAAGATCCCGTTCGAGTATGGCGACGACGTGGTCGAGCTGGTGGTCAGCCGCTGCA
CCGAGAGCGAATCCGGCGGCCGCATGATCGACGCCATCCTGACCAACACCATGCTGCCCGACATCAGCCGCGAGTTCCTG
AACCGGACGATGGAAGGCAAGCCGATCGTGCGCGTGGCGATGGGAGTCGCGAATGCCGACTTCACCTACAACTTCGATTG
A

Protein sequence :
MSEISRVASFGKLNSLAYKAMEGATVFCKLRGNPYVELEHWLQQLLQNEDTDLHRIVRHFDVDAARLAADMTTALDRLPR
GSTSISGLSENLDTLVERGWVYASLMFGDRQIRSGYLLVGMLKTPSLRNALMAISRQFERVRPDVLTDEFTKIVAGSAEE
NLTARDGSGGAPGEDSGAIAPAQMGKQEALKKFTVDLTEQARSGKMDPIVGRDEEIRQVVDILMRRRQNNPILVGEAGVG
KTAVVEGFAQRIARGDVPPALKDVSLLALDVGLLQAGASMKGEFEQRLRSVIDEVQASPKPIILFVDETHTLVGAGGAAG
TGDAANLLKPALARGTLRTVGATTWAEYKKYIEKDPALTRRFQNVQVDEPDEKKAVLMMRGVASTMEKHHQVQILDEALE
AAVKLSHRYIPARQLPDKSVSLLDTACARVAVSLHATPAEVDDSRKRIDALNTELEIIGRESNIGIEVGERRAHAEALLA
EEQQRLAELEARWAAEKTLVDELLALRAKLRSGSRPVEGTGSALEAAAEAAAPEAAAESAPEPEREALFARLREVQAELA
TLQGEDPLILPTVDYQAVASVVADWTGIPVGRMARNEIENVLRLPQLLGQRVIGQDHAMEMIAKRIQTSRAGLDNPNKPI
GVFMLAGTSGVGKTETALALAEALYGGEQNVVTINMSEFQEAHTVSTLKGAPPGYVGYGEGGVLTEAVRRKPYSVVLLDE
VEKAHPDVHEMFFQVFDKGFMEDGEGRFIDFKNTLILLTTNAGTDLIASMCKDPDLLPDPEGLAKALRDPLLKIFPPALL
GRLVTIPYYPLTDAMLGAIVRLQLGRIKKRVEARYKIPFEYGDDVVELVVSRCTESESGGRMIDAILTNTMLPDISREFL
NRTMEGKPIVRVAMGVANADFTYNFD

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 3e-130 41
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 3e-130 41

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clpB2 YP_935405.1 putative ATP-dependent Clp protease, ATP-binding subunit ClpB VFG2076 Protein 0.0 77
clpB2 YP_935405.1 putative ATP-dependent Clp protease, ATP-binding subunit ClpB VFG2084 Protein 1e-141 42