Gene Information

Name : clpB (HMPREF9137_1129)
Accession : YP_004328830.1
Strain : Prevotella denticola F0289
Genome accession: NC_015311
Putative virulence/resistance : Virulence
Product : ATP-dependent chaperone protein ClpB
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 1327568 - 1330156 bp
Length : 2589 bp
Strand : -
Note : identified by match to protein family HMM PF00004; match to protein family HMM PF02861; match to protein family HMM PF07724; match to protein family HMM PF07728; match to protein family HMM TIGR03346

DNA sequence :
ATGACATTTGAGAAATTTACAATTAAAGCACAGGAGGCTGTCCAGAGTGCGGTGAACACCGCACAGCGCAACGGGCAGCA
GACCATCGAGCCGCTACATCTGCTTGCCGGCGTGATGGACAAGGGAAAGGATGTGACGAACTACGTCTTCCAGAAACTGG
GCGTGAACGCACAGACCGTAGAGAATGCCGTCCAGAGCGAAATGAGCCATCTGCCGAAAGTATCGGGCGGTGAGCCTTAT
TTCTCCTCTGAAGCCAATCAGGTGATGCAGCGCACGTTGGACATCTCGCAGAAGATGGGCGACGAGTTCGTCAGTATCGA
GCCGATGCTGCTGGCACTGCTGGCGGTCAACTCAACGGCAAGCCGGATTCTGAAAGATGCCGGATGTGCAGAGAAAGAGA
TGACAGCAGCCATCAACGACCTGCGACAGGGACAGAAGGTACAGACGCAGAGCGGTGACGAGAACTACCAGTCGCTGCAG
AAATTTGCCCGCAACCTTGTCGAGGATGCCCGTGCAGGAAAGCTCGATCCGGTAATCGGCCGTGACGAGGAAATCCGCCG
TGTACTGCAGATACTGTCCCGCCGTACGAAGAACAACCCGATTCTCATCGGTGAACCGGGCACAGGAAAGACGGCTATCG
TCGAGGGACTTGCCGAAAGAATTGTCCGTGGCGATGTACCGGAAAACCTGAAGGACAAGCAGCTCTACTCGCTGGATATG
GGGGCGATGCTCGCCGGAGCAAAATACAAAGGTGAGTTTGAGGAGCGACTGAAGAGCGTCATCAAGGAGGTGACGAAGGC
CGAAGGCAACATCATCCTCTTCATTGATGAAATCCACACGCTGGTCGGTGCAGGCGGCGGCGAGGGTGCCATGGATGCCG
CCAACATTTTGAAGCCGGCTCTGGCACGTGGCGAACTGAGGGCCATCGGTGCGACAACATTGAACGAGTACCAGAAATAC
TTCGAGAAGGACAAGGCCCTGGAGCGTCGTTTCCAGACTGTCATGGTGGATGAGCCGGACGAGCTGGATGCCATCTCCAT
CCTCCGCGGACTGAAAGAACGCTACGAGAACCACCACAAGGTACGTATCCAGGACGATGCCTGCATTGCGGCCGTGAAGC
TCTCAGAACGTTATATCTCCGACCGTTTCCTGCCCGACAAGGCCATCGACCTGATGGACGAGGCAGCTGCCAAGCTGCGC
ATGGAGCGCGATTCCGTACCCGAGGAACTGGACGAGATAACCCGCCGGCTGAAACAGCTCGAGATTGAGCGTGAGGCCAT
CAAGCGGGAAGACGACAGCGAGAAGATCGCACAGCTCGACAAGGAGATTGCCGAACTCAGGGAACAGGAGCACAGCTTCC
GTGCAAAATGGGAAGGCGAACGTGCCCTGGTGAACAAGATTCAGCAGGACAAGCAGGAGATGGAGAACCTCAAGTATGAG
GCCGACCGTGCCGAGCGTGAGGGCAACTACGAGCGTGTCGCTGAGATCCGCTACTCGAGGCTGAAACAGCTGGAAGACGA
CATCAAGAACATCCAGCAGCAGCTGCAGGCTACGCAGGGCGGACAGGCCATGGTGCGTGAGGAGGTTACCGAGGACGACA
TCGCCGAGGTAGTAAGCCGCTGGACAGGCATTCCTGTCACGAAGATGCTGCAGAGCGAAAAGGACAAGCTGCTCCATCTG
GAGGACGAACTCCACAAGCGTGTCATCGGACAGGACGAGGCCATCACGGCCGTGGCAGATGCCGTGCGCCGTTCACGTGC
CGGCCTGCAGGATCCGAAGAAACCTATTGCATCGTTCATCTTCCTCGGAACAACGGGTACAGGTAAGACCGAACTGGCAA
AGGCATTGGCTGACTATCTTTTCAACGACGAGTCGATGATGACCCGGATTGACATGAGCGAGTATCAGGAGAAATTCAGC
GTCACACGACTTATCGGTGCGCCTCCGGGGTACGTAGGTTACGATGAAGGAGGCCAGCTGACCGAGGCCGTACGCCGCAA
ACCTTACTCGGTCGTGCTCTTCGACGAGATAGAAAAGGCGCATCCCGACGTGTTCAACATCCTCTTGCAGGTACTGGACG
ACGGACGACTGACGGACAACAAGGGGCGGACGGTGAACTTCAAGAACACGATTATCATCATGACGTCCAATCTGGGGTCG
CAGTATATCCAGCAGCAGTTCGAGAATCTCACCGATTCCAACCGTGAGGAAGTCATCGAAAAGGCGAGGACTGCCGTGAT
GGAAATGTTGAAGAAGACCATCCGTCCGGAGTTCCTCAACCGTATCGACGAGACCATCATGTTCCTGCCGTTGACAAAGG
AGCAGATCGGAGGTGTCGTACGGCTGCAGCTGGAGCGTGTGAAGGCTATGCTCGAACCACAGGGCATCAACCTCCAGTGG
ACCGACCCTGCCATCAGCTATCTGGCCGGGGTAGGCTACGATCCGGAGTTCGGCGCACGTCCCGTCAAGCGGGCTATCCA
GCGGTATGTGCTGAACGACCTCAGCAAGTCTCTGCTTTCAGGCAGTGTCAACCGCGACAAGCCGGTCATCATCGACTGCT
TCGGTGAGGGACTGACCTTCAGGAACTAA

Protein sequence :
MTFEKFTIKAQEAVQSAVNTAQRNGQQTIEPLHLLAGVMDKGKDVTNYVFQKLGVNAQTVENAVQSEMSHLPKVSGGEPY
FSSEANQVMQRTLDISQKMGDEFVSIEPMLLALLAVNSTASRILKDAGCAEKEMTAAINDLRQGQKVQTQSGDENYQSLQ
KFARNLVEDARAGKLDPVIGRDEEIRRVLQILSRRTKNNPILIGEPGTGKTAIVEGLAERIVRGDVPENLKDKQLYSLDM
GAMLAGAKYKGEFEERLKSVIKEVTKAEGNIILFIDEIHTLVGAGGGEGAMDAANILKPALARGELRAIGATTLNEYQKY
FEKDKALERRFQTVMVDEPDELDAISILRGLKERYENHHKVRIQDDACIAAVKLSERYISDRFLPDKAIDLMDEAAAKLR
MERDSVPEELDEITRRLKQLEIEREAIKREDDSEKIAQLDKEIAELREQEHSFRAKWEGERALVNKIQQDKQEMENLKYE
ADRAEREGNYERVAEIRYSRLKQLEDDIKNIQQQLQATQGGQAMVREEVTEDDIAEVVSRWTGIPVTKMLQSEKDKLLHL
EDELHKRVIGQDEAITAVADAVRRSRAGLQDPKKPIASFIFLGTTGTGKTELAKALADYLFNDESMMTRIDMSEYQEKFS
VTRLIGAPPGYVGYDEGGQLTEAVRRKPYSVVLFDEIEKAHPDVFNILLQVLDDGRLTDNKGRTVNFKNTIIIMTSNLGS
QYIQQQFENLTDSNREEVIEKARTAVMEMLKKTIRPEFLNRIDETIMFLPLTKEQIGGVVRLQLERVKAMLEPQGINLQW
TDPAISYLAGVGYDPEFGARPVKRAIQRYVLNDLSKSLLSGSVNRDKPVIIDCFGEGLTFRN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
clpC YP_005163377.1 ATP-dependent Clp protease ATP-binding subunit Not tested Not named Protein 2e-159 44

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clpB YP_004328830.1 ATP-dependent chaperone protein ClpB VFG2076 Protein 1e-119 45
clpB YP_004328830.1 ATP-dependent chaperone protein ClpB VFG2084 Protein 2e-101 43