Gene Information

Name : SPD_2022 (SPD_2022)
Accession : YP_817407.1
Strain : Streptococcus pneumoniae D39
Genome accession: NC_008533
Putative virulence/resistance : Virulence
Product : ATP-dependent Clp protease, ATP-binding subunit
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 2000461 - 2002893 bp
Length : 2433 bp
Strand : -
Note : equivalent gene in S. pneumoniae TIGR4 = SP2194; equivalent gene in S. pneumoniae R6 = spr2000; identified by match to protein family HMMs PF00004, PF02861, PF07724, and PF07728
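
The Position and Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this, but the listed length is consistent with it); the helper name `span_length` is hypothetical and used only for illustration.

```python
# Coordinates copied from the Position field of this record.
START, END = 2000461, 2002893

def span_length(start: int, end: int) -> int:
    """Length in bp of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1

length_bp = span_length(START, END)
codons = length_bp // 3      # codon count, including the terminal stop codon
protein_len = codons - 1     # residues, with the stop codon excluded

print(length_bp, length_bp % 3, protein_len)  # 2433 0 810
```

Under that assumption, the span is a whole number of codons and corresponds to an 810-residue product, which agrees with the protein sequence listed in this record.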

DNA sequence :
ATGAACTATTCAAAAGCATTGAATGAATGTATCGAAAGTGCCTACATGGTTGCTGGACATTTTGGAGCTCGTTATCTAGA
GTCTTGGCACTTGTTGATTGCCATGTCTAATCACAGTTATAGTGTAGCAGGGGCAACTTTAAATGATTATCCGTATGAGA
TGGACCGTTTAGAAGAGGTGGCTTTGGAACTGACTGAAACGGACTATAGCCAGGATGAAACTTTTACGGAATTGCCGTTC
TCCCGTCGTTTGCAGATTCTTTTTGACGAAGCAGAGTATGTAGCGTCAGTGGTCCATGCTAAGGTGCTAGGTACAGAGCA
CGTCCTCTATGCGATTTTGCATGATGGCAATGCCTTGGCGACTCGTATCTTGGAGAGGGCTGGTTTTTCTTATGAAGACA
AGAAAGATCAGGTCAAGATTGCTGCTCTTCGTCGAAATTTAGAAGAACGGGCAGGCTGGACTCGTGAAGATCTCAAGGCT
TTACGCCAACGCCATCGTACAGTAGCTGACAAGCAAAATTCTATGGCCAATATGATGGGCATGCCGCAGACTCCTAGTGG
TGGTCTCGAGGACTATACGCATGATTTGACAGAGCAAGCGCGTTCTGGCAAGTTAGAACCAGTCATCGGTCGGGACAAGG
AAATCTCACGTATGATTCAAATCTTAAGCCGGAAGACTAAGAACAACCCTGTCTTGGTTGGGGATGCTGGTGTCGGGAAA
ACAGCTCTGGCGCTTGGTCTTGCTCAGCGTATTGCCAGTGGTGACGTGCCTGCGGAAATGGCTAAGATGCGCGTGTTAGA
ACTTGATTTGATGAATGTCGTTGCAGGGACACGCTTCCGTGGTGACTTTGAAGAACGCATGAATAATATCATCAAGGATA
TTGAAGAAGATGGCCAAGTCATCCTCTTTATCGATGAACTCCACACCATCATGGGTTCTGGTAGCGGGATTGATTCGACT
CTGGATGCGGCCAATATCTTGAAACCAGCCTTGGCGCGTGGAACTTTGAGAACGGTTGGTGCCACCACTCAGGAAGAATA
TCAAAAACATATCGAAAAAGATGCGGCACTTTCTCGTCGTTTCGCTAAAGTGACGATTGAAGAACCAAGTGTGGCAGATA
GTATGACTATTTTACAAGGTTTGAAGGCGACTTATGAGAAACATCACCGTGTACAAATCACAGATGAAGCGGTTGAAACA
GCGGTTAAGATGGCTCATCGTTATTTAACCAGTCGTCACTTGCCAGACTCTGCTATCGACCTCTTGGACGAAGCAGCAGC
AACAGTGCAAAATAAGGCAAAGCATGTAAAAGCAGACGATTCAGATTTGAGTCCAGCTGACAAGGCCCTGATGGATGGCA
AGTGGAAACAGGCAGCCCAGCTAATCGCAAAAGAAGAGGAAGTACCTGTCTACAAAGACTTGGTGACAGAGTCTGATATT
TTGACCACCTTGAGTCGCTTGTCAGGAATCCCAGTTCAAAAACTGACTCAGACTGACGCTAAGAAATATCTAAATCTTGA
AGCAGAACTCCATAAACGGGTTATCGGTCAAGATCAAGCTGTTTCAAGTATTAGTCGTGCGATTCGCCGTAATCAGTCAG
GAATTCGCAGTCACAAGCGTCCGATCGGTTCCTTTATGTTCCTAGGACCCACAGGTGTCGGTAAGACCGAGTTGGCCAAG
GCTTTGGCAGAAGTTCTCTTTGATGACGAATCAGCCCTTATCCGCTTTGATATGAGTGAGTATATGGAGAAATTCGCAGC
CAGCCGTCTAAATGGAGCTCCTCCAGGCTATGTGGGTTACGAAGAAGGTGGGGAGTTGACAGAGAAGGTTCGCAATAAAC
CCTATTCCGTTCTCCTCTTTGATGAGGTAGAGAAGGCCCACCCAGATATCTTTAATGTTCTCTTGCAGGTTCTGGATGAC
GGTGTCTTGACAGATAGCAAGGGACGCAAGGTCGATTTTTCAAATACCATTATCATTATGACATCGAATCTAGGTGCGAC
TGCCCTTCGTGATGATAAGACTGTTGGTTTTGGGGCTAAGGATATTCGTTTTGACCAGGAAAATATGGAAAAACGCATGT
TTGAAGAACTGAAAAAAGCTTATAGACCGGAATTCATCAACCGTATTGATGAGAAGGTGGTCTTCCATAGCCTATCTAGT
GACCATATGCAGGAAGTGGTGAAGATTATGGTCAAGCCTTTAGTGGCAAGTTTGGCTGAAAAAGGCATTGACTTGAAATT
ACAAGCTTCAGCTCTGAAATTGTTAGCAAATCAAGGATATGACCCAGAGATGGGAGCTCGCCCACTTCGCAGAACCCTGC
AAACAGAAGTGGAGGACAAGTTGGCAGAACTTCTTCTTAAGGGAGATTTAGTGGCAGGCAGCATACTTAAGATTGGTGTC
AAAGCAGGCCAGTTAAAATTTGATATTGCATAA

Protein sequence :
MNYSKALNECIESAYMVAGHFGARYLESWHLLIAMSNHSYSVAGATLNDYPYEMDRLEEVALELTETDYSQDETFTELPF
SRRLQILFDEAEYVASVVHAKVLGTEHVLYAILHDGNALATRILERAGFSYEDKKDQVKIAALRRNLEERAGWTREDLKA
LRQRHRTVADKQNSMANMMGMPQTPSGGLEDYTHDLTEQARSGKLEPVIGRDKEISRMIQILSRKTKNNPVLVGDAGVGK
TALALGLAQRIASGDVPAEMAKMRVLELDLMNVVAGTRFRGDFEERMNNIIKDIEEDGQVILFIDELHTIMGSGSGIDST
LDAANILKPALARGTLRTVGATTQEEYQKHIEKDAALSRRFAKVTIEEPSVADSMTILQGLKATYEKHHRVQITDEAVET
AVKMAHRYLTSRHLPDSAIDLLDEAAATVQNKAKHVKADDSDLSPADKALMDGKWKQAAQLIAKEEEVPVYKDLVTESDI
LTTLSRLSGIPVQKLTQTDAKKYLNLEAELHKRVIGQDQAVSSISRAIRRNQSGIRSHKRPIGSFMFLGPTGVGKTELAK
ALAEVLFDDESALIRFDMSEYMEKFAASRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLFDEVEKAHPDIFNVLLQVLDD
GVLTDSKGRKVDFSNTIIIMTSNLGATALRDDKTVGFGAKDIRFDQENMEKRMFEELKKAYRPEFINRIDEKVVFHSLSS
DHMQEVVKIMVKPLVASLAEKGIDLKLQASALKLLANQGYDPEMGARPLRRTLQTEVEDKLAELLLKGDLVAGSILKIGV
KAGQLKFDIA
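
The relationship between the DNA and protein sequences above can be illustrated with a short translation sketch. It assumes the listed DNA is already the coding strand (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments; the function name `translate` is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Standard codon table, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":        # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 33 nt of the DNA sequence in this record.
print(translate("ATGAACTATTCAAAAGCATTGAATGAATGT"))      # MNYSKALNEC
print(translate("AAAGCAGGCCAGTTAAAATTTGATATTGCATAA"))   # KAGQLKFDIA
```

Both fragments reproduce the corresponding ends of the listed protein sequence, and the final TAA codon accounts for the one-codon difference between the 2433 bp gene and the 810-residue product.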

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
clpC | YP_005163377.1 | ATP-dependent Clp protease ATP-binding subunit | Not tested | Not named | Protein | 2e-141 | 49

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
SPD_2022 | YP_817407.1 | ATP-dependent Clp protease, ATP-binding subunit | VFG0079 | Protein | 8e-162 | 47
SPD_2022 | YP_817407.1 | ATP-dependent Clp protease, ATP-binding subunit | VFG0080 | Protein | 9e-126 | 44