Gene Information

Name : clpA (SCO6408)
Accession : NP_630495.1
Strain : Streptomyces coelicolor A3(2)
Genome accession: NC_003888
Putative virulence/resistance : Virulence
Product : Clp protease ATP binding subunit
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 7073756 - 7076284 bp
Length : 2529 bp
Strand : -
Note : SC3C8.27c, clpA, probable clp protease ATP binding subunit, len: 842 aa; highly similar to many e.g. CLPB_ECOLI clpB protein (857 aa), fasta scores; opt: 975 z-score: 1914.6 E(): 0, 46.8% identity in 857 aa overlap anCLAB_LYCES ATP-dependent clp protease

DNA sequence :
ATGACCAGCGGCTACATGGGCCCGGAGGGCGACCCGTTCGCGGAGTTCCTGGCACGCTTCTTCGGCGGGCCCAGGCCCCG
GCAGATCGACATCGGCCGGCTGCTCAGCCAGCCCGCCCGGGAGCTGGTGCGCGGCGCCGCCCAGTACGCCGCCGAGCACG
GCAGCCGGGACCTGGACACCGAGCACCTGCTGCGGGCCGCGCTCGCCACCGAGCCGACCCGGGGGCTGCTCAGCCGGGCC
GGTGCCGACCCCGACTCGCTGGCCTCGCAGATCGACGAGCGGACCGGACCGGTGCAGCACCCGCCGGGCGAGGTCCCGCC
GCCCACGTCGCTCTCGCTCACCCCGGCCGTCAAGCGTGCCCTGCTGGACGCGCACGAGCTGGCCCGCTCCACCGGCACCG
GGTACATCGGCCCGGAGCACGTGCTCAGCGCCCTGGCCGCCAACCCCGACTCGGCCGCCGGGCACATCCTGAACGCGGCC
CGCTTCGCGCCCTCGAACCTGCCCACGGAGACGCCGGAGGCCGCGAAGGGCCGTACCGAGAGCGCCCGGACCACGAACAC
GCCGACCCTCGACAAGTACGGCCGCGATCTCACCGATCTGGCGCAGCAGGGCCGGATCGACCCGGTGATCGGGCGTGAGG
AGGAGATCGAGCAGACCGTCGAGGTGCTCTCCCGGCGCGGCAAGAACAACCCCGTCCTGATCGGGGACGCCGGCGTCGGC
AAGACCGCGATCGTGGAGGGGCTCGCCCAGCGCATCACCGACGGCGACGTGCCGGACGTGCTGATCGGGCGCCGGGTGGT
CGCGCTCGACCTGACGGGCGTCGTCGCGGGCACCCGCTACCGGGGCGACTTCGAGGAGCGGATGAACAACATCGTGGGCG
AGATCCGCGCCCACTCCGACGAGCTGATCATCTTCATCGACGAGCTGCACACCGTCGTCGGCGCGGGCGGGGGCGGCGAG
AGCGGGTCGATGGACGCCGGGAACATCCTCAAGCCGGCCCTGGCCCGCGGCGAGCTGCACATCGTGGGCGCCACCACGCT
GGAGGAGTACCGCAGGATCGAGAAGGACGCGGCCCTCGCCCGCCGCTTCCAGCCGATCCTGGTGCCGGAGCCCACCACCG
CCGACGCGATCGAGATCCTGCGCGGCCTGCGCGACCGCTACGAGGCCCACCACCAGGTCCGCTACACCGACGAGGCGCTG
GTCGCGGCCGTGGAGATGTCCGACCGCTACCTCACCGACCGGCGCCTGCCCGACAAGGCCATCGACCTGATCGACCAGGC
GGGCGCCCGCGTGCGGCTGCGGGCGCGCACCAAGGGCACCGACGTACGGGCCCTGGAGCGCGAGGTCGACCAGCTGGTGA
GGGACAAGGACCAGGCGGTCGCGGACGAGCAGTACGAGCAGGCGACCCAGCTGCGGGACCGGATCGTCGGGCTGAAGCAG
CGCATCACCGAGGCCACCGGCGACGGCCAGGCCGACGAGGGCCTCGACCTGGTCGTGGACACCGAGTCCATCGCCGAGGT
GGTCTCCCGGCAGACCGGCATCCCGGTCAGCAGCCTCACCCAGGAGGAGAAGGACCGCCTGCTGGGCCTGGAGTCCCACC
TGCACGAGCGGGTCGTCGGCCAGGACGAGGCGGTACGGGTCGTCTCCGACGCCGTGCTGCGCTCCCGCGCCGGGCTGTCC
AGTGCGGACCGGCCCATCGGCAGCTTCCTCTTCCTCGGCCCGACCGGCGTCGGCAAGACCGAGCTGGCCCGGGCACTGGC
CGAGGCGCTGTTCGGCAGCGAGGACCGGATGGTGCGCCTCGACATGAGCGAGTACCAGGAGCGCCACACCGTCAGCCGCC
TGGTCGGCGCCCCGCCCGGCTACGTCGGGCACGAGGAGGCCGGTCAGCTCACCGAGGTGGTGCGCCGGCACCCGTACTCG
CTGCTCCTGCTGGACGAGGTGGAGAAGGCGCACCCCGACGTCTTCAACATCCTGCTCCAGGTCCTGGACGACGGGCGGCT
GACCGACTCGCAGGGCCGGACGGTGGACTTCTCCAACACGGTCGTCGTGATGACCAGCAACCTGGGCTCCGACGTGATCA
CCCGGCGCGGTGCCGGCATCGGCTTCGGCGCGGGCGGCGCGGAGGCGGACGAGGAAGCCCGGCGCGAGCAGGTGCTGCGC
CCGCTGCGGGAGCACTTCCGGCCCGAGTTCCTCAACCGCGTCGACGAGATCGTGGTCTTCCGCCAGCTCAGCGGCGAGCA
GCTGCGGCAGATCACCAGCCTGCTCCTGGAGCAGACCCGTCGCATGGTGCACGCGCAGGGCGTCACCGTCGACTTCACCG
ACGCGGCCGTGGACTGGCTCGCCGAGCGCGGCTACCAGCCCGAGTACGGCGCCCGACCGCTGCGCCGCACCATCCAGCGC
GAGGTGGACAACGAGCTGTCCCGGCTGCTGCTCGACGGCCGGGTGGCGGAGGGCGGCCGGGTGACGGTCGACGTCGAGGA
CGGGCGGCTGGCGTTCCGCACGCCGGAGCGACCCGTCCCCGAACTGTGA

Protein sequence :
MTSGYMGPEGDPFAEFLARFFGGPRPRQIDIGRLLSQPARELVRGAAQYAAEHGSRDLDTEHLLRAALATEPTRGLLSRA
GADPDSLASQIDERTGPVQHPPGEVPPPTSLSLTPAVKRALLDAHELARSTGTGYIGPEHVLSALAANPDSAAGHILNAA
RFAPSNLPTETPEAAKGRTESARTTNTPTLDKYGRDLTDLAQQGRIDPVIGREEEIEQTVEVLSRRGKNNPVLIGDAGVG
KTAIVEGLAQRITDGDVPDVLIGRRVVALDLTGVVAGTRYRGDFEERMNNIVGEIRAHSDELIIFIDELHTVVGAGGGGE
SGSMDAGNILKPALARGELHIVGATTLEEYRRIEKDAALARRFQPILVPEPTTADAIEILRGLRDRYEAHHQVRYTDEAL
VAAVEMSDRYLTDRRLPDKAIDLIDQAGARVRLRARTKGTDVRALEREVDQLVRDKDQAVADEQYEQATQLRDRIVGLKQ
RITEATGDGQADEGLDLVVDTESIAEVVSRQTGIPVSSLTQEEKDRLLGLESHLHERVVGQDEAVRVVSDAVLRSRAGLS
SADRPIGSFLFLGPTGVGKTELARALAEALFGSEDRMVRLDMSEYQERHTVSRLVGAPPGYVGHEEAGQLTEVVRRHPYS
LLLLDEVEKAHPDVFNILLQVLDDGRLTDSQGRTVDFSNTVVVMTSNLGSDVITRRGAGIGFGAGGAEADEEARREQVLR
PLREHFRPEFLNRVDEIVVFRQLSGEQLRQITSLLLEQTRRMVHAQGVTVDFTDAAVDWLAERGYQPEYGARPLRRTIQR
EVDNELSRLLLDGRVAEGGRVTVDVEDGRLAFRTPERPVPEL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
clpC YP_005163377.1 ATP-dependent Clp protease ATP-binding subunit Not tested Not named Protein 3e-159 51

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clpA NP_630495.1 Clp protease ATP binding subunit VFG0079 Protein 3e-180 53
clpA NP_630495.1 Clp protease ATP binding subunit VFG0080 Protein 9e-153 50