Gene Information

Name : clpB (BF1172)
Accession : YP_210841.1
Strain : Bacteroides fragilis NCTC 9343
Genome accession: NC_003228
Putative virulence/resistance : Virulence
Product : heat shock ClpB protein
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 1427359 - 1429947 bp
Length : 2589 bp
Strand : -
Note : Similar to Synechococcus sp. heat shock ClpB protein SWALL:CLPB_SYNP7 (SWALL:P53533) (883 aa) fasta scores: E(): 6.3e-144, 54.86% id in 873 aa, and to Bacteroides thetaiotaomicron endopeptidase Clp ATP-binding chain B BT4597 SWALL:AAO79702 (EMBL:AE016945)

DNA sequence :
ATGAACTTTAACAATTTTACCATTAAATCTCAGGAAGCTGTACAAGAGGCTATTAACCTGGCACAAAGTCGGGGGCAACA
AGCCATCGAAACGGCTCATATCCTGTATGGAGTGATGAAGGTAGGCGAAAATGTGACTAACTTTATCTTTCAGAAGTTAG
GACTGAACGGACAACAAATCTCCCTCGTACTCGATAAGCAGATCGACTCTTTCCCAAAAGTTTCCGGCGGAGAACCTTAC
TTGAGTAGGGAAGCGAACGAAGTCTTTCAAAAAGCAACGCAGTACTCCAAGGAGATGGGCGATGAGTTTGTTTCATTGGA
ACATCTTTTGCTGGCTTTACTGACAGTAAAGAGCACGGTATCTACCATCCTGAAAGATGCAGGAATGACCGAAAAAGAAT
TGCGTGGTGCCATCAGTGAATTGAGAAAAGGAGAAAAGGTGACCTCTCAGTCCAGTGAAGATAATTACCAGTCACTGGAA
AAATATGCCATTAACTTAAATGAAGCAGCCCGTAGCGGTAAACTCGACCCTGTGATCGGACGTGATGAAGAAATCCGACG
GGTACTTCAGATTTTAAGTCGACGTACAAAAAACAATCCTATACTAATAGGTGAACCGGGTACCGGTAAAACAGCTATTG
TTGAGGGATTGGCACACCGTATTCTTCGGGGTGATGTTCCTGAAAACCTGAAAAATAAACAGGTATACTCACTTGATATG
GGCGCACTCGTTGCAGGAGCTAAATATAAAGGAGAATTTGAGGAACGACTGAAATCGGTAGTGAATGAGGTGAAGAAATC
AGAAGGTAATATCATATTATTCATTGATGAAATCCATACTTTGGTAGGGGCAGGAAAAGGAGAAGGTGCTATGGACGCAG
CTAATATTCTGAAACCTGCACTTGCCCGTGGAGAACTACGCTCTATCGGTGCTACCACTCTCGACGAATATCAGAAATAT
TTTGAAAAAGATAAAGCTTTGGAACGTCGTTTCCAAATAGTACAGGTAGATGAACCAGACAATCTGAGCACAATATCTAT
CTTACGTGGATTAAAAGAACGGTATGAAAATCACCATCACGTACGTATCAAAGATGATGCAATCATTGCTGCCGTAGAAT
TAAGCAGCCGGTACATCACTGACCGTTTTTTACCCGATAAAGCAATTGACCTGATGGACGAAGCTGCCGCAAAACTTCGC
ATGGAGGTGGATTCTGTCCCTGAAGGATTAGATGAAATCTCACGAAAGATTAAACAGCTGGAGATTGAGCGAGAAGCTAT
AAAACGGGAAAATGATGAACCGAAATTACAGACAATCGGCAAAGAATTGGCTGAATTGAAAGAACAGGAAAAGTCATATA
AAGCAAAATGGCAAAGCGAGAAAAGCCTGATGGATATAATCCAGCAGAACAAAGTTGAAATAGAAAATCTTAAATTCGAA
GCTGACAAGGCAGAACGTGAGGGAAACTATGGCAAAGTTGCAGAGATTCGCTATGGCAAATTGCAGGAACTGCATAAGGA
AATTGAAGATACCCAGAAAAAATTGCACGAAATGCAAGGGGATACAGCCATGATAAAAGAAGAGGTGGATGCTGAAGACA
TCGCTGACGTAGTATCCCGCTGGACCGGAATTCCTGTAAGCAAAATGATGCAGAGTGAAAAGGACAAATTGCTCCACCTT
GAAGAAGAATTACATCAGCGTGTTATCGGACAAGACGAGGCTATCGCAGCTGTGTCTGATGCTGTACGCCGCAGCCGTGC
AGGCTTACAGGATCCCAAACGACCTATTGGTTCCTTCATCTTCCTGGGCACTACAGGAGTTGGTAAAACCGAACTTGCCA
AAGCGCTTGCCGAATTTCTGTTTGACGATGAAACGATGATGACCCGTATCGACATGAGCGAATACCAGGAGAAGCACAGC
GTTTCGCGTTTAGTTGGAGCGCCTCCGGGATATGTAGGATATGACGAAGGCGGACAATTGACAGAGGCGATCCGTCGCAA
ACCCTATTCTGTAGTATTGTTTGATGAAATCGAGAAAGCACATCCGGATGTATTTAATATCTTGTTGCAGGTACTCGATG
ACGGACGGTTGACAGATAACAAAGGCCGTGTGGTAAACTTTAAAAATACAATCATCATTATGACCTCTAATATGGGTAGC
AGCTACATACAGAGCCAGATGGAAAAACTGAACGGCGCCAACAACGAGGAAGTAGTGGAAGAAACCAAGAAAGAGGTAAT
GAACATGTTAAAGAAAACCATCCGTCCGGAATTCCTAAACCGTATCGATGAGACTATCATGTTCCTGCCATTAACAGAAA
AAGACATAAAACAGATTGTCTTGTTACAGATTAAGAGCGTACAAAAGATGCTTGCCGGTAATGGAATAGAACTAGAACTG
ACAGATGCGGCTTTGGATTTCCTCTCACAGGTCGGCTATGATCCGGAATTCGGTGCACGTCCTGTAAAAAGGGCTATTCA
GAGATATTTACTCAACGATCTATCGAAAAAATTATTGGCACAGGAAGTAGACCGTAGTAAAGCAATCATTGTAGATGCAC
AAGGAGACGGATTAGTTTTCCGTAACTAA

Protein sequence :
MNFNNFTIKSQEAVQEAINLAQSRGQQAIETAHILYGVMKVGENVTNFIFQKLGLNGQQISLVLDKQIDSFPKVSGGEPY
LSREANEVFQKATQYSKEMGDEFVSLEHLLLALLTVKSTVSTILKDAGMTEKELRGAISELRKGEKVTSQSSEDNYQSLE
KYAINLNEAARSGKLDPVIGRDEEIRRVLQILSRRTKNNPILIGEPGTGKTAIVEGLAHRILRGDVPENLKNKQVYSLDM
GALVAGAKYKGEFEERLKSVVNEVKKSEGNIILFIDEIHTLVGAGKGEGAMDAANILKPALARGELRSIGATTLDEYQKY
FEKDKALERRFQIVQVDEPDNLSTISILRGLKERYENHHHVRIKDDAIIAAVELSSRYITDRFLPDKAIDLMDEAAAKLR
MEVDSVPEGLDEISRKIKQLEIEREAIKRENDEPKLQTIGKELAELKEQEKSYKAKWQSEKSLMDIIQQNKVEIENLKFE
ADKAEREGNYGKVAEIRYGKLQELHKEIEDTQKKLHEMQGDTAMIKEEVDAEDIADVVSRWTGIPVSKMMQSEKDKLLHL
EEELHQRVIGQDEAIAAVSDAVRRSRAGLQDPKRPIGSFIFLGTTGVGKTELAKALAEFLFDDETMMTRIDMSEYQEKHS
VSRLVGAPPGYVGYDEGGQLTEAIRRKPYSVVLFDEIEKAHPDVFNILLQVLDDGRLTDNKGRVVNFKNTIIIMTSNMGS
SYIQSQMEKLNGANNEEVVEETKKEVMNMLKKTIRPEFLNRIDETIMFLPLTEKDIKQIVLLQIKSVQKMLAGNGIELEL
TDAALDFLSQVGYDPEFGARPVKRAIQRYLLNDLSKKLLAQEVDRSKAIIVDAQGDGLVFRN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
clpC YP_005163377.1 ATP-dependent Clp protease ATP-binding subunit Not tested Not named Protein 4e-157 45
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 4e-100 42
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 3e-100 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clpB YP_210841.1 heat shock ClpB protein VFG2084 Protein 8e-109 43