Gene Information

Name : clpB (WD0224)
Accession : NP_966034.1
Strain : Wolbachia endosymbiont of Drosophila melanogaster wMel
Genome accession: NC_002978
Putative virulence/resistance : Unknown
Product : ATP-dependent Clp protease, ATP-binding subunit ClpB
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 207285 - 209846 bp
Length : 2562 bp
Strand : -
Note : Identified by similarity to EGAD:20010; match to protein family HMM PF02861; match to protein family HMM PF00004
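
The Position and Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention, and the reading that reproduces the stated length). The variable names are illustrative and not part of the record.

```python
# Coordinates as given in the record (assumed 1-based, inclusive).
start, end = 207285, 209846

# Inclusive interval length: end - start + 1.
length_bp = end - start + 1

# A complete CDS is a whole number of codons; the last one is the stop codon.
codons, remainder = divmod(length_bp, 3)
protein_length = codons - 1  # exclude the stop codon

print(length_bp, remainder, protein_length)
```

This gives 2562 bp, matching the Length field, with no leftover bases and an expected product of 853 residues, which agrees with the protein sequence listed in this record.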

DNA sequence :
ATGGACTTAAATAAATTTACCGAAAAAGCAAAAAGTCTAATTCAAAGTGCTCAAATGAAAGCGTTGGGAGCTGGGCATCA
GATTTTTATGCCTGAGCATTTACTAAAAGTGATGCTTGAAGATGAATCAGGTTTAGTTCAGGATTTGATAAACGCTTGTG
GTGGAAATGTGCAGAGTATTTCCGATGCTGTGGATAGTGCAATAAAAAAGTTGCCAGTAATTGAAGGTCCAGGAAGTGGT
GGCCTTCAACTTTCAAGAGAAATAGCAAAAGTTTTTGAAGATTCAATTGGTATTGCAAAGAGAAATAAAGATTCATTTGT
TGCTGTCGAGCGTTTACTGCAGGGCCTCACTGCACAAAAAGATGATACTGTAGGCAAAATTCTGGCAGAGAATGGTGTGA
CTCCACAAAAACTGAACTCAGTTGTTGCAGAAATGAGAAAGGGCAGCAGTGCGGACTCGCCAAACAGTGAGGAAAAATTA
AATGCGGCAAAGAAATATACAAAAGATGTTACAGAGCTTGCTATGCAAGGAAAGCTTGATCCTGTAATTGGTCGTGATGA
AGAGATCAGAAGAACTATGCAGGTATCACTGAGGCGGACAAAAAATAATCCAGTGCTGATCGGTGAACCTGGTGTTGGAA
AAACTGCAATAGTTGAGGGGCTTGCAAATAGAATTGTTGCAAACGATGTACCACTTGGTTTGCATGATGCAAAGGTCTTA
GCTTTGGATCTTGGAGCACTAATTGCTGGCACTAAGTTTAGAGGAGAATTTGAAGAAAGGTTAAAAGCAGTTATTAATGA
GCTTTCGAGAGCGGAGGGAAAAGTTATCTTGTTTATAGATGAGCTTCATACTTTAGTTGGAGCAGGAGCAACAAGTGGTG
CAATGGATGCCTCGAATCTGCTGAAGCCTGCTCTTGCGCGTGGAGAAGTTCGCTGTATAGGAGCCACAACTTTAGATGAA
TATCGTCAGCATATAGAAAAAGATCCTGCACTTGCAAGGCGTTTTCAGCCTGTGTTTATCTCTCAACCAACTGAAACTGA
TACCATTTCAATACTCAGGGGTTTGAAGGAAAGATATGAGGTGCATCATGGTATAAGAATTACAGACGGTGCAATAATTG
CTGCTGCAACGTTATCCAATAGATATATAACAGATAGGTTTTTACCTGATAAAGCAATTGATTTGATTGATGAAGCAGCA
AGTAGGGTGAGAATTGAAATGGATAGCAAGCCTGAAGTTATTGATGAACTTGAGAGAAAGGTTATACAGCTAAAGATTGA
AGCAGAGGCTCTAAAAAAAGAAAGTGATGAAAATTCTAAACAGCGTTTAAAAAAGATAAATGAAGAGATCGAAAATCTAA
ACAGTAAATTTGCTGACTTGAACAGCAAATGGCAGATGGAAAAGAATAAAATAGCTAGAATACAAGAAACAGCCGAAAAA
TTAGATAACGCTAGAAAAGAGTTGGAATTAGTTCAGCGTAGTGGGAATTTGGGAAGGGCAGGAGAGCTGATGTATGGTGT
GATCCCTCAGCTTGAGAATGAATTAAAAAATCAGGAAAAGGTTACTGATAGTTTCTTAAAGAAAGAAGTGACTGGAGATG
ATATTGCAAACATTGTTTCAAAGTGGACGGGAATTCCAGTTGATAATATGATGCATAGTGAAAAGGAAAAACTCCTTAAC
ATGGAGAATGAAATAGGAAGAAGAGTAATAGGGCAGAAAGATGCAATAGAAGCGATAAGTAATGCAGTTAGGCGCTCCCG
CTCTGGGGTGCAAGATACCAATAAGCCTTTTGGTTCATTTTTATTTTTAGGTCCAACTGGAGTTGGTAAAACCGAACTTG
CGAAAGCCTTAGCTGAATTTCTGTTTGACGATCAATCAGCGCTTTTGCGTTTTGATATGTCAGAATATATGGAAAAACAT
TCTGTTTCAAAGTTAATTGGCGCGCCTCCAGGGTATGTTGGCTATGAACAAGGTGGCAGATTAACTGAAGCGGTAAGAAG
AAGGCCATATCAGGTTATTCTGTTTGATGAAATCGAGAAAGCAAATCCAGATATATTTAATCTATTGCTGCAAATCTTGG
ATGAGGGTAGACTCACTGATAGTCATGGTAAATTAATTGATTTTCGCAACACCATACTGATTTTAACTTCTAATCTTGGC
GCTGAGATAATGCTAAAAGGGAATGTTGATTCTGTAAGAAATGAAGTGATGCAAATAGTTAAATCAGCATTTCGTCCAGA
ATTTTTAAACAGGCTGGATGAAATTATTATATTCCACAGTTTAACCAGAGATGACATCTATAAAATTATAGATGTTCAAT
TTTCTTATTTACAAAAAACACTTGCTAAGCGTAAACTGAGTATAAGTTTGTTGCAGGAAGCTAAAGAACTAATAGCACAA
ACTGGCTATGACCCTGAATATGGAGCAAGGCCCTTAAAAAGGGTTATACAAGAGTGTATTCAAAATAATTTAGCTAAATT
AGTTTTATCAGGGGAAGTAGTAGAAAATGATGAGCTAATAGTGTACGCTCTTGATAATGAAATCTTAGTTAAAAAGGTTT
AA

Protein sequence :
MDLNKFTEKAKSLIQSAQMKALGAGHQIFMPEHLLKVMLEDESGLVQDLINACGGNVQSISDAVDSAIKKLPVIEGPGSG
GLQLSREIAKVFEDSIGIAKRNKDSFVAVERLLQGLTAQKDDTVGKILAENGVTPQKLNSVVAEMRKGSSADSPNSEEKL
NAAKKYTKDVTELAMQGKLDPVIGRDEEIRRTMQVSLRRTKNNPVLIGEPGVGKTAIVEGLANRIVANDVPLGLHDAKVL
ALDLGALIAGTKFRGEFEERLKAVINELSRAEGKVILFIDELHTLVGAGATSGAMDASNLLKPALARGEVRCIGATTLDE
YRQHIEKDPALARRFQPVFISQPTETDTISILRGLKERYEVHHGIRITDGAIIAAATLSNRYITDRFLPDKAIDLIDEAA
SRVRIEMDSKPEVIDELERKVIQLKIEAEALKKESDENSKQRLKKINEEIENLNSKFADLNSKWQMEKNKIARIQETAEK
LDNARKELELVQRSGNLGRAGELMYGVIPQLENELKNQEKVTDSFLKKEVTGDDIANIVSKWTGIPVDNMMHSEKEKLLN
MENEIGRRVIGQKDAIEAISNAVRRSRSGVQDTNKPFGSFLFLGPTGVGKTELAKALAEFLFDDQSALLRFDMSEYMEKH
SVSKLIGAPPGYVGYEQGGRLTEAVRRRPYQVILFDEIEKANPDIFNLLLQILDEGRLTDSHGKLIDFRNTILILTSNLG
AEIMLKGNVDSVRNEVMQIVKSAFRPEFLNRLDEIIIFHSLTRDDIYKIIDVQFSYLQKTLAKRKLSISLLQEAKELIAQ
TGYDPEYGARPLKRVIQECIQNNLAKLVLSGEVVENDELIVYALDNEILVKKV
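
The protein sequence can be checked against the DNA sequence by translating it codon by codon. This is a minimal sketch, assuming the standard codon assignments (bacterial translation table 11 shares them) and that the DNA shown is already the coding strand, as its leading ATG suggests. `translate` is a hypothetical helper, and only the first 30 and last 12 nucleotides of the record are used here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, keeping '*' for stop codons."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


head = "ATGGACTTAAATAAATTTACCGAAAAAGCA"  # first 30 nt of the DNA sequence
tail = "AAAAAGGTTTAA"                    # last 12 nt of the DNA sequence

print(translate(head))  # MDLNKFTEKA
print(translate(tail))  # KKV*
```

The two fragments translate to the start (MDLNKFTEKA) and end (KKV, followed by a stop codon) of the protein sequence above. Passing the full 2562 bp string to the same function would check the remaining residues.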

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
clpC | YP_005163377.1 | ATP-dependent Clp protease ATP-binding subunit | Not tested | Not named | Protein | 3e-163 | 42
STY0294 | NP_454876.1 | ClpB-like protein | Not tested | SPI-6 | Protein | 3e-104 | 42