Gene Information

Name : clpB (ERWE_CDS_06710)
Accession : YP_197547.1
Strain : Ehrlichia ruminantium Welgevonden
Genome accession: NC_006832
Putative virulence/resistance : Virulence
Product : ClpB protein
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 1053557 - 1056148 bp
Length : 2592 bp
Strand : -
Note : Similar to sp|P44403|CLPB_HAEIN sp|P03815|CLPB_ECOLI sp|O53719|CLPB_MYCTU sp|O83110|CLPB_TREPA sp|Q9RA63|CLPB_THETH sp|P03815|CLPB_ECOLI sp|P53533|CLPB_SYNP7 sp|O53719|CLPB_MYCTU sp|P44403|CLPB_HAEIN rc||clpB; Ortholog to ERGA_CDS_06620

DNA sequence :
ATGAAGTTAGTTATGGATTTAAATAAATTTACTGATATATCAAAGAATTTCATAGTGCAAGCGCAAACTTCAGCTGTTGC
ATTAGGGCATCAGTCTTTAGTACCTGAGCATTTACTTAAGGTAATGTTGGATGATAAAGATGAGATAGTTGAAGTTTTGC
TTACTTCTTGTGGGTGTAATGTAGAGACGTTACGTAATGATGTTATATCGGCTTTAAATAAATTACCAGTTGTAAGTGGT
CCAGGTAGTGGTCATATACATTTATCAAAGGAAATGGCGCAAGTTTTACAAGAGGCTGTTAATCTTGCAAAAAGACATCA
GGACTCTTATGTTACTGTTGAAAGATTACTGCAAGCTCTGACAATAATAAAGGACAGTAATGTTTCTAGGATATTAATTG
CACATGGTGTGACTCCTCAGAAGTTGGAGTCATTAATAGTAAACATGCGTAATGGTGCTAGAGCTGATAGTGTAAATTCT
GAGCAAAAGTTTAATGCACTAAAAAAATATGCTAAAGATGTGACTGAAGTTGCTAGAGCAGGAAAATTAGATCCAGTAAT
TGGAAGAGATGAGGAAATTAGACGTACAATACAGGTATTATTGAGAAGAACAAAAAATAATCCTGTATTAATTGGAGAGC
CTGGTGTTGGTAAAACAGCAATTATTGAAGGGTTAGCACATAAGATAGTGAAGGGAGATGTTCCAATTGGGTTGCGAGAT
ATGAGAATAATGTCATTAGATCTTGGTATGCTTGTTGCTGGGACTAAATATAGAGGTGAATTTGAAGAAAGGTTGAAAGC
TGTAGTTAATGAAATTGTTTCTTCAAATGGTAGTATTATATTATTCATTGATGAGTTACATACATTAGTTGGTGCTGGTG
CAACAGATGGAGCAATGGATGCATCAAATTTGTTGAAGCCAGCATTAGCTAGAGGTGAAATACATTGTATAGGTGCAACA
ACATTGGATGAATATAGAAAGCATATAGAAAAAGATGTAGCACTTGCTAGAAGATTTCAAACTATATTTATTTCTGAGCC
AACTTGTGATGATACAATTTCTATGTTACGTGGGTTAAAGGAAAGATATGAAGGACATCATGGTATAGATATTCCTGACA
GATCGATAATTGCTGCTGTAGCTTTATCGCAGCGTTATATTACGGATAGGTATTTACCAGATAAAGCTATAGACCTTATT
GATGAAGCAGCGAGTCGTGCGAGAATGGAGATTGATAGTAAACCTGAGGTTATTGATAAGTTAGATAGAAAGATAATGCA
GCTAAAAATCGAGATAGGAGTATTAGAAAAAGAAAGTGATGAATCCTCAAAACAGAGGTTAATGAAGTTAAAAGATGAAC
TAGAAAAACTAAATGTTCAGTCTGCTGAGCTAAGTAGTAAATGGCAAGCGGAAAAAATGAAAATGTCAAAGATGAAAGCA
TGTAAGGAAAAGCTTGATATTGCTAGAAGTGATTTAGAAAGAGCACAAAGATCTGGTGATTTGGCAAAAGCTGGTGAGTT
AATGTATGGTGTAATACCAGAAATTGAGAAAGAGTTAAAAGAACATGAAAAATTTACAAGTAGCCTTTTTAAGAAGGAAA
TTACAGAACATGACATAGCAAGTATTGTATCAAAATGGACTGGTATTCCTATTGAGAACATAATGAGTAGTGAAAGAGAA
AAACTACTGCGTATGGAGGAGGAGATAGGCAAAACAGTTATTGGTCAGGATAGTGCTGTAAAAGCAGTAAGTGATGCTGT
CAGGAGATCACGTGCAGGGGTACAGGATGCACAGAAACCATTGGGGTCTTTTTTATTTCTTGGGCCAACTGGAGTAGGTA
AAACTGAGTTGGTTAAAACATTAGCTGAGTTTTTATTTTGTGATAAGTCTGCACTTTTAAGATTTGACATGTCAGAATTT
ATGGAAAAGCATGCTGTTTCACGATTAATAGGAGCTCCTCCAGGATATGTTGGATATGACCAAGGTGGTGCATTAACTGA
AGCTGTGAGGAGAAGGCCTTATCAAGTAATATTATTTGATGAAATTGAAAAAGCACATGGAGATATTTTCAATATTTTAT
TGCAAGTATTAGATGAAGGAAGATTGACTGATAATCATGGTAAGTTAGTGGATTTCCGTAATACAATACTGGTATTAACT
TCAAATTTAGGGCAAGAAATATTAATGAACAATGAATCTGGAAATATCAATGAAGAGTCAGTTAAAGAGTCTGTTACTAA
TGTGTTGCGTAGTCATTTTCGGCCAGAATTTTTAAATAGATTGGATGAAATTATTATATTTCATAGGTTAACTAAAGAAC
ATATTGAAAGAATTATTGATGTGCAATTTTCTATATTACAAAAAATTGTTGCTCAAAGAAAATTAGAGATTACTTTATCT
TCAGATGCAAAAACATGGTTGATAAATAATGGCTATGATCCTTTATATGGGGCAAGACCTTTAAAGAGGTTAATACAACA
GCAAATACAGAATAACTTGGCAAAATTAATACTTGCTAATCAGGTAGCTGAAGGTAATAAATTAAGGGTAGATTTATTAG
ATGATAATCTTGTTATTCATAAGATTAGTTAA

Protein sequence :
MKLVMDLNKFTDISKNFIVQAQTSAVALGHQSLVPEHLLKVMLDDKDEIVEVLLTSCGCNVETLRNDVISALNKLPVVSG
PGSGHIHLSKEMAQVLQEAVNLAKRHQDSYVTVERLLQALTIIKDSNVSRILIAHGVTPQKLESLIVNMRNGARADSVNS
EQKFNALKKYAKDVTEVARAGKLDPVIGRDEEIRRTIQVLLRRTKNNPVLIGEPGVGKTAIIEGLAHKIVKGDVPIGLRD
MRIMSLDLGMLVAGTKYRGEFEERLKAVVNEIVSSNGSIILFIDELHTLVGAGATDGAMDASNLLKPALARGEIHCIGAT
TLDEYRKHIEKDVALARRFQTIFISEPTCDDTISMLRGLKERYEGHHGIDIPDRSIIAAVALSQRYITDRYLPDKAIDLI
DEAASRARMEIDSKPEVIDKLDRKIMQLKIEIGVLEKESDESSKQRLMKLKDELEKLNVQSAELSSKWQAEKMKMSKMKA
CKEKLDIARSDLERAQRSGDLAKAGELMYGVIPEIEKELKEHEKFTSSLFKKEITEHDIASIVSKWTGIPIENIMSSERE
KLLRMEEEIGKTVIGQDSAVKAVSDAVRRSRAGVQDAQKPLGSFLFLGPTGVGKTELVKTLAEFLFCDKSALLRFDMSEF
MEKHAVSRLIGAPPGYVGYDQGGALTEAVRRRPYQVILFDEIEKAHGDIFNILLQVLDEGRLTDNHGKLVDFRNTILVLT
SNLGQEILMNNESGNINEESVKESVTNVLRSHFRPEFLNRLDEIIIFHRLTKEHIERIIDVQFSILQKIVAQRKLEITLS
SDAKTWLINNGYDPLYGARPLKRLIQQQIQNNLAKLILANQVAEGNKLRVDLLDDNLVIHKIS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
clpC YP_005163377.1 ATP-dependent Clp protease ATP-binding subunit Not tested Not named Protein 1e-172 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clpB YP_197547.1 ClpB protein VFG2076 Protein 3e-112 41