Gene Information

Name : clpB; SCH10.39c; SCH44.01c (SCO3661)
Accession : NP_733613.1
Strain : Streptomyces coelicolor A3(2)
Genome accession: NC_003888
Putative virulence/resistance : Virulence
Product : ATP-dependent protease ATP-binding subunit
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 4040450 - 4043047 bp
Length : 2598 bp
Strand : -
Note : clpB, SCH10.39c, probable clp-family ATP-binding subunit, partial CDS, len: >853 aa; similar to many e.g. SW:CLPB_ECOLI (EMBL:M29364) Escherichia coli ATP-dependent protease regulatory subunit (857 aa), fasta scores; opt: 3039 z-score: 2963.3 E(): 0, 56.5

DNA sequence :
GTGGACGCCGAGCTGACCAACCGGAGCCGGGACGCGATCAACGCGGCCAGCAACCGGGCCGTGACCGAGGGAAACGCCGA
CCTCACCCCCGCCCACCTGCTCCTCGCCCTGCTCCAGGGGCAGGACAACGAGAACATCACCGACCTGCTCGCCGCCGTCG
AGGCCGACCTCGCCGCCGTGCGCACCGGCGCCGAGCGCATCGTCGCCGGGCTGCCCAGCGTGACCGGGTCCACCGTCGCA
CCCCCGCAACCCAGCCGTGAGATGCTCGCCGTGGTCGCCGACGCCCAGGCACGCGCCAAGGAACTCGGGGACGAGTACCT
CTCCACCGAGCACCTGCTCCTCGGCATCGCCGCGAAGGGCGGCGCGGCCGGAGAGGTACTGGAGGGGCAGGGAGCCAGTG
CGAAGAAGCTGCAGGAGGCCTTCCGCAAGGCCAGGGGAGGGCGTCGCGTGACCACCGCCGACCCCGAGGGCCAGTACAAG
GCCCTGGAGAAGTTCGGTACCGACCTCACCGCCGCAGCCCGGGACGGGAAGCTCGACCCCGTCATCGGACGGGACCAGGA
GATCCGGCGGGTGGTCCAGGTGCTCAGCCGGCGGACCAAGAACAACCCCGTCCTCATCGGCGAGCCCGGCGTCGGCAAGA
CCGCCGTCGTCGAAGGGCTCGCCCAGCGGATCGTCAAGGGGGACGTGCCCGAGTCGCTGAAGGACAAGCGGCTGGTCGCG
CTGGACCTCGGCGCCATGGTCGCCGGGGCCAAGTACCGCGGTGAGTTCGAGGAGCGGCTGAAGACCGTGCTCGCCGAGAT
CAAGGACTCCGACGGGCAGGTCGTCACCTTCATCGACGAGCTGCACACCGTCGTGGGCGCCGGCGCCGGCGGGGACTCCG
CCATGGACGCCGGGAACATGCTCAAGCCCATGCTCGCGCGCGGCGAGCTGCGCATGGTGGGCGCCACCACCCTCGACGAG
TACCGAGAGCGGATCGAGAAGGACCCCGCCCTGGAGCGGCGCTTCCAGCAGGTGCTGGTCGCCGAGCCGACCGTCGAGGA
CTCCATCGCCATCCTGCGCGGACTCAAGGGACGCTACGAGGCCCACCACAAGGTGCAGATCGCGGACAGCGCGCTGGTGG
CCGCCGCGAGCCTCTCCGACCGGTACATCACCTCGCGCTTCCTGCCCGACAAGGCCATCGACCTGGTCGACGAGGCCGCC
TCCCGGCTGCGCATGGAGATCGACTCCTCTCCCGTCGAGATCGACGAACTCCAGCGCTCCGTCGACCGGCTGAAGATGGA
GGAGCTGGCGATCGGCAAGGAGACCGACGCCGCCTCCCTGGAGCGCCTGGAGCGGCTGCGGCGCGACCTCGCCGACAAGG
AGGAGGAGCTGCGCGGCCTCACCGCCCGCTGGGAGAAGGAGAAGCAGTCCCTCAACCGGGTCGGCGAGCTGAAGGAGAAG
CTCGACGAACTGCGCGGGCAGGCCGAGCGCGCCCAGCGCGACGGCGACTTCGACACCGCCAGCAAGCTGCTCTACGGCGA
GATCCCGGACCTGGAACGGGACCTGGAGGCCGCCTCCGAGGCCGAGGAAGAGGTCGCCCGGGACACCATGGTCAAGGAGG
AGGTCGGCGCCGACGACATCGCCGACGTCGTCGCCTCCTGGACCGGCATCCCCGCCGGGCGCCTGCTCGAAGGGGAGACG
CAGAAGCTGCTGCGCATGGAGGACGAGCTGGGCAAGCGGCTCATCGGCCAGACCCAGGCCGTGCGGGCCGTCTCCGACGC
CGTACGGCGCAGCCGGGCCGGGATCGCCGACCCGGACCGCCCGACGGGTTCCTTCCTCTTCCTCGGCCCCACCGGCGTCG
GCAAGACCGAGCTGGCCAAGGCGCTCGCCGACTTCCTCTTCGACGACGAGCGGGCCATGGTCCGCATCGACATGTCGGAG
TACAGCGAGAAGCACAGCGTGGCCCGGCTGGTCGGCGCCCCGCCCGGGTACGTCGGCTACGAGGAGGGCGGCCAGCTCAC
CGAGGCCGTGCGGCGGCGGCCGTACACCGTCGTACTGCTCGACGAGGTGGAGAAGGCGCACCCCGAGGTCTTCGACATCC
TGCTCCAGGTGCTGGACGACGGCCGGCTCACGGACGGCCAGGGCCGCACCGTCGACTTCCGCAACACCATCCTGGTGCTC
ACCTCCAACCTGGGCAGCCAGTACCTGGTCGACCCGACGACCGGCGAGGCGGAGAAGAAGCAGCAGGTCCTGGAGGTGGT
CCGGTCCTCCTTCAAGCCGGAGTTCCTCAACCGGCTCGACGACCTGGTGGTCTTCTCGGCCCTCAGCCAGGAGGAGCTGA
GCCGGATCGCACGGCTCCAGATCAACGGCCTGGCCCGGCGGCTCGCCGAACGCCGCCTCACCCTGGAGGTCACCGACGAG
GCCCTCGCCTGGCTGGCCGAGGAGGGCAACGATCCGGCCTACGGCGCCCGCCCGCTGCGCCGCCTCGTGCAGACCGCGAT
CGGCGACCGGCTCGCCCGGGAGATCCTCTCCGGCGAGATCAAGGACGGCGACACGGTCCGCGTGGACCGCTTCGGCGACG
AGCTGATCGTGGGGCCGGCGAGCGGCAAGACGCTGTAG

Protein sequence :
MDAELTNRSRDAINAASNRAVTEGNADLTPAHLLLALLQGQDNENITDLLAAVEADLAAVRTGAERIVAGLPSVTGSTVA
PPQPSREMLAVVADAQARAKELGDEYLSTEHLLLGIAAKGGAAGEVLEGQGASAKKLQEAFRKARGGRRVTTADPEGQYK
ALEKFGTDLTAAARDGKLDPVIGRDQEIRRVVQVLSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVKGDVPESLKDKRLVA
LDLGAMVAGAKYRGEFEERLKTVLAEIKDSDGQVVTFIDELHTVVGAGAGGDSAMDAGNMLKPMLARGELRMVGATTLDE
YRERIEKDPALERRFQQVLVAEPTVEDSIAILRGLKGRYEAHHKVQIADSALVAAASLSDRYITSRFLPDKAIDLVDEAA
SRLRMEIDSSPVEIDELQRSVDRLKMEELAIGKETDAASLERLERLRRDLADKEEELRGLTARWEKEKQSLNRVGELKEK
LDELRGQAERAQRDGDFDTASKLLYGEIPDLERDLEAASEAEEEVARDTMVKEEVGADDIADVVASWTGIPAGRLLEGET
QKLLRMEDELGKRLIGQTQAVRAVSDAVRRSRAGIADPDRPTGSFLFLGPTGVGKTELAKALADFLFDDERAMVRIDMSE
YSEKHSVARLVGAPPGYVGYEEGGQLTEAVRRRPYTVVLLDEVEKAHPEVFDILLQVLDDGRLTDGQGRTVDFRNTILVL
TSNLGSQYLVDPTTGEAEKKQQVLEVVRSSFKPEFLNRLDDLVVFSALSQEELSRIARLQINGLARRLAERRLTLEVTDE
ALAWLAEEGNDPAYGARPLRRLVQTAIGDRLAREILSGEIKDGDTVRVDRFGDELIVGPASGKTL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 5e-91 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clpB; SCH10.39c; SCH44.01c NP_733613.1 ATP-dependent protease ATP-binding subunit VFG2076 Protein 2e-107 43
clpB; SCH10.39c; SCH44.01c NP_733613.1 ATP-dependent protease ATP-binding subunit VFG2084 Protein 5e-96 42