Gene Information

Name : STY0294
Accession : NP_454876.1
Strain : Salmonella enterica CT18
Genome accession: NC_003198
Putative virulence/resistance : Virulence
Product : ClpB-like protein
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 309016 - 311679 bp
Length : 2664 bp
Strand : +
Note : Similar to Escherichia coli ClpB heat shock protease SW:CLPB_ECOLI (P03815) (857 aa) fasta scores: E(): 0, 37.9% id in 892 aa; Paralogue of E. coli clpB (CLPB_ECOLI); Fasta hit to CLPB_ECOLI (857 aa), 38% identity in 892 aa overlap

DNA sequence :
ATGGAAACTCCTGTTTCACGCAGTGCGTTATATGGAAAACTGGCCGGCCCACTATTCCGGTCGCTGGAATCGGCAACGGC
ATTTTGCAAACTACGCTCTAATCCCTGGGGTGAGCTGACTCACTGGCTGCACCAGTTAACACAGCAGCCCGATAACGATA
TTCTCCACGTTCTTCGGCATTACCAGATCCCTCTTTCTGATGTGGAGAAAGCGTTACTCCGGCAACTGGATATGCTGCCC
GCCGGGGCCAGCGCCATTAGTGATTTTTCTCACCATATCGATCTCAGCGTTGAAAAGGCCTGGATGCTGGCGAGCGTCCG
TTACGGCGATAACAAAATTCGCAGCGGCTGGTTGCTGCTGGCCTTGTTGACCACGCCAGAACTGCGTCGGGTACTGAGCA
GTATCTGCGCGCCGCTGGCCACGCTTCCGGTTGATGAACTGACGGAAATACTGCCCTCGTTGATCGAAACATCGCCGGAA
GCGCAGGAGCGCCCTTACGACGGCTCCGGTCTGGCATCAGCCATTCCCGGTGAAAGCAGCCAGGCGATTCCCAACGGCGT
GCAGGACGGTAAATCCGCGCTGGCAAAATACTGTCAGGACATGACGGCACAGGCGCGCGACGGCAAAATCGACCCGGTGA
CGGGGCGTGAGCATGAAATCCGCACCATGACGGATATTCTGCTGCGCCGTCGCCAGAATAATCCCCTACTGACTGGTGAG
GCGGGCGTCGGAAAGACGGCGGTCGTCGAAGGTTTTGCCCTCGCGATTGCGCAGGGGGAAGTGCCGCCCGCGCTGCGGGA
AGTACGGCTGCTGGCGCTGGACGTTGGCGCTCTGTTGGCCGGAGCCAGCATGAAAGGCGAGTTTGAATCGCGTCTGAAAG
GGTTACTGGAAGAGGCCGGGCGCTCGCCGCAGCCGGTTATTCTGTTTGTCGATGAAGTTCACACTCTGGTGGGCGCGGGC
GGCGCATCCGGCACGGGCGATGCCGCTAACCTGCTGAAACCGGCGCTGGCGTGCGGCACCCTGCGGACTATCGGCGCCAC
CACCTGGAGCGAATACAAGCGCCATATTGAGAAAGATCCGGCGCTGACCCGTCGTTTTCAGGTGTTGCAGATTGCCGAGC
CGGAAGAGATCCCCGCAATGGAAATGGTGCGTGGTCTGGTGGATACGCTGGAAAAACACCATAACGTACTGATTCTGGAT
GAGGCGGTACGTGCGGCGGTACAGCTTTCTCACCGCTACATTCCCGCCCGGCAGTTGCCGGATAAGGCCATCAGCCTACT
GGATACCGCCGCGGTCCGCGTGGCGCTGACGCTGCACACGCCGCCTGCCAGCGTACAGTTCCTGCGTCAGCAGCTAAAAG
CGGCGGAAATGGAACGGTCGCTGTTGCAGAAGCAGGAAAAAATGGGGATTCAGTCAGATGAGCGGTGCGATGCGCTGACG
GCGCGAATTTTCTCGCTCAACGATGAACTGACTGCATCCGAATCCCGCTGGCAGCGGGAGCTGGAACTGGTACATACGTT
GCAGGAACTGCGTGTCGCAGAGTCTGATGCTGATGACAAAACCACGCTGCAACAGGCCGAAACAGCGCTAAGGGAGTGGC
AGGGCGACGCGCCGGTGGTGTTCCCGGAAGTCAGCGCGGCGATTGTCGCGGCGATTGTCGCCGACTGGACCGGTATTCCT
GCCGGGCGCATGGTGAAAGATGAGGCCAGCCAGGTGCTGGAACTGCCTGCCCGACTGGCGCAACGCGTTACCGGGCAAGA
CGGCGCGCTGGCGCAGATTGGTGAACGTATTCAGACCGCCAGGGCGGGACTGGGCGATCCACGCAAACCGGTGGGCGTGT
TTATGCTGGCCGGGCCGTCCGGTGTCGGTAAAACCGAAACCGCGCTGGCGCTGGCGGAGGCTATCTACGGCGGTGAGCAG
AACCTGGTAACCATCAATATGAGCGAGTTCCAGGAGGCTCACACCGTTTCCACACTGAAAGGCGCGCCGCCCGGCTATGT
GGGCTATGGCGAGGGTGGTGTGCTGACGGAAGCGGTGCGTCGTCACCCCTGGAGCGTAGTGCTGCTCGACGAGATCGAAA
AAGCGCACCATGACGTCCATGAACTCTTCTATCAGGTGTTTGACAAGGGCGGGATGGAGGACGGTGAGGGAACACATGTC
GATTTCAAAAACACCACGCTACTACTCACCACCAATGTAGGTTCCGACCTCATCAGCCAGATGTGTGAAGATCCGGCCTT
AATGCCCGATGCTACGGGGCTTAAAGAGGCGCTAATGCCGGAATTGCGCAAGCATTTCCCGGCGGCATTTCTGGGCCGCG
TGACGGTGATCCCTTACCTGCCGCTGGATGAAACGTCGCGTGGCGTGATTGCCCGTCTGCATCTTGACCGGCTGGTGGCG
CGGATGGGTGAACAGCACGGGGTGACGCTGACGTATAGTGAGGAACTGGTCGCACATATTGTGGCGTGCTGTCCAATGCA
TGAAACGGGCGCGCGGTTGCTGATTGGCTACATCGAACAGCACATCCTGCCACAACTGTCGCGCTACTGGTTGCAGGCCA
TGACGGAAAAAGCCGCTATCAGGCAGATTGATATCGGCGTTAATGGTGATGAGCAGATTGTTTTTGAGACAACCTCGCAG
GAGGGAATATGCCAAAAGAGTTAA

Protein sequence :
METPVSRSALYGKLAGPLFRSLESATAFCKLRSNPWGELTHWLHQLTQQPDNDILHVLRHYQIPLSDVEKALLRQLDMLP
AGASAISDFSHHIDLSVEKAWMLASVRYGDNKIRSGWLLLALLTTPELRRVLSSICAPLATLPVDELTEILPSLIETSPE
AQERPYDGSGLASAIPGESSQAIPNGVQDGKSALAKYCQDMTAQARDGKIDPVTGREHEIRTMTDILLRRRQNNPLLTGE
AGVGKTAVVEGFALAIAQGEVPPALREVRLLALDVGALLAGASMKGEFESRLKGLLEEAGRSPQPVILFVDEVHTLVGAG
GASGTGDAANLLKPALACGTLRTIGATTWSEYKRHIEKDPALTRRFQVLQIAEPEEIPAMEMVRGLVDTLEKHHNVLILD
EAVRAAVQLSHRYIPARQLPDKAISLLDTAAVRVALTLHTPPASVQFLRQQLKAAEMERSLLQKQEKMGIQSDERCDALT
ARIFSLNDELTASESRWQRELELVHTLQELRVAESDADDKTTLQQAETALREWQGDAPVVFPEVSAAIVAAIVADWTGIP
AGRMVKDEASQVLELPARLAQRVTGQDGALAQIGERIQTARAGLGDPRKPVGVFMLAGPSGVGKTETALALAEAIYGGEQ
NLVTINMSEFQEAHTVSTLKGAPPGYVGYGEGGVLTEAVRRHPWSVVLLDEIEKAHHDVHELFYQVFDKGGMEDGEGTHV
DFKNTTLLLTTNVGSDLISQMCEDPALMPDATGLKEALMPELRKHFPAAFLGRVTVIPYLPLDETSRGVIARLHLDRLVA
RMGEQHGVTLTYSEELVAHIVACCPMHETGARLLIGYIEQHILPQLSRYWLQAMTEKAAIRQIDIGVNGDEQIVFETTSQ
EGICQKS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 0.0 100
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 2e-111 45
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 2e-111 45
clpC YP_005163377.1 ATP-dependent Clp protease ATP-binding subunit Not tested Not named Protein 3e-90 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein VFG2076 Protein 0.0 52
STY0294 NP_454876.1 ClpB-like protein VFG2084 Protein 9e-125 41
STY0294 NP_454876.1 ClpB-like protein VFG0079 Protein 4e-89 41