Gene Information

Name : SARI_02728 (SARI_02728)
Accession : YP_001571726.1
Strain : Salmonella enterica RSK2980
Genome accession: NC_010067
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 2640439 - 2643033 bp
Length : 2595 bp
Strand : -
Note : 'KEGG: eci:UTI89_C3197 0. ClpB protein K01358; COG: COG0542 ATPases with chaperone activity, ATP-binding subunit; Psort location: cytoplasmic, score: 23'

DNA sequence :
ATGGAAGGGGCGGCCTCGCTCTGCCAGACCCGCGCCCATGCGGAAATTCTGCCTGAGCACTGGCTGCTGAAACTGCTCGA
ACAGGGAGAAGGTGACCTGACGGTGCTGGCGCGCCGTTATGAGTGGGATATAGATGCGCTGTGGCAGGATTTGCTCGGCT
GGCTGGATTCACTTCCCCGCTCTGTTCGCAGTCGCCCGCAGCTTTCAGACAGCATTCAGACGCTGATGCAGGAAGCCTGG
CTGATTGCTTCGCTCAACAGTGAAGAGCAGATCCGCAGTCATCATCTGCTGATGGCCTTGGTCGGGAAACAAAATCTGGT
GCGCTGTGATGGTCTGTGGCCGCTGCTGACGCTCGGTCAGAGCCAACTGGGGCGTCTGCGACCACTGCTCGATGTGCAGT
CAGATGAGCGTCCGGAAGTGCAACTGGAGAAGGAACTGGCGCAGAGCCACGGTGGAGAGGTGGTGTTTGTTGGCCGCCCT
GCAGGTGCAGAACTTAAAGACGGTGAGCTGAACCCGGCGCTACAGAACGCGCTGGATAAGTTCACTCTCGACGTTACCGC
CAAAGCGAAGGAAGGCAAAATCGACCCGGTATTTGGTCGCGATACTGAGATCCGTCAGATGGTGGATATCCTCTCCCGCC
GCCGCAAGAACAACCCGATTCTGGTCGGTGAGCCGGGTGTCGGCAAAACGGCGCTGGTGGAAGGGCTGGCATTACGAATT
GCCGAGGGCAACGTGCCGGAATCCCTCAGACCTGTTGTTCTGCGTACCCTCGACCTCGGTCTGCTGCAGGCGGGGGCGGG
CGTGAAAGGGGAATTCGAACAGCGCCTTAAAAACGTGATTGATGCCGTACAGCACTCGCCGGCCCCCATTCTGCTGTTTA
TCGACGAAGCACACACCATTATCGGTGCGGGAAATTCAGCTGGTGGCGCGGATGCGGCCAACCTGCTGAAACCTGCCCTG
GCCCGTGGTGAACTGCGCACTATTGCCGCGACCACCTGGTCCGAATATAAGCAGTATTTTGAGCGGGATGCCGCGCTGGA
GCGCCGCTTCCAGATGGTGAAGGTCGACGAACCGGATGACGACACCGCCTGCCTGATGCTCAGAGGCCTGAAATCACGCT
ACGCCGAACACCATAACGTGCATATCACTGACGATGCGGTCAAAGCCGCTGTCACCCTGTCGCGCCGCTACCTGACGGGC
CGCCAGCTGCCGGATAAGGCCGTCGATTTACTCGACACCGCCGCTACCCGCGTGCGTATGAGCCTCGATACCGTACCCGA
ACAGTTGACCCGCATCCGCTCGCAGATTACCTCCCTTGAAATGGAGAAGCAGGCGCTGCTGGAAGATATTGCCGTTGGCA
ATCAGATCCACGGCGAACGCCTGAGCGGGATTGAACAGGAGGAGGTACGCCTGATAGTCGAACTCGATGATCTGGAATCC
CGGTATGGTCAGGAGCTGAAACTTACTGAACAGTTACTGGAATGCCGCCAGGACATCTCTCGCCAGAGTGAGACTCACGC
GCTGCAACAGGAGCTGAACGGCATGCAGGACGGCAACCCATTGCTCTCCGTTGATGTGGACGTGCGCACCGTCGCCACTG
TCATTGCCGACTGGACGGGCGTGCCACTGTCTTCCCTGATGAAAGATGAACAGACCGAACTGCTGATTCTGGAAAACGAA
ATCGGCAAACGGGTTGTGGGTCAGGAAGTGGCGCTTGCAGCCATTGCTCAGCGTCTGCGTGCGGCGAAAACCGGCCTCAC
TTCCGAGAACGGTCCGCAGGGCGTGTTCCTGCTGGTTGGCCCGAGCGGCGTGGGTAAAACCAAGACCGCGCTGGCGCTGG
CCGATGTAATGTACGGTGGTGAAAAATCCCTTATCACGATTAACCTCTCGGAATACCAGGAGCCGCACACGGTGTCGCAG
CTGAAAGGTTCACCGCCGGGTTACGTTGGTTATGGCCAGGGCGGCATTCTCACCGAAGCGGTACGCAAGCGTCCGTACAG
CGTGGTGCTGCTCGATGAAGTGGAGAAAGCGCACCGCGACGTCATGAACCTGTTCTATCAGGTGTTCGACCGCGGCTTTA
TGCGCGACGGCGAAGGACGTGAAATCGACTTCCGTAACACCGTTATTCTGATGACCTCCAACCTCGGCAGCGACCACCTG
ATGCAGTTGCTCGCTGAGCAGCCGGAAGCCACTGAAGCCGACCTGCACGAACTGCTGCGCCCGATTTTACGCGACCACTT
CCAGCCGGCGCTGCTGGCCCGTTTCCAGACCGTGATTTATCGTCCGTTAGCAGAGACCGCCATGCGCACCATCGTGGAAA
TGAAGCTCGCTCAGGTGAGTAAGCGTCTGCACCGTCACTACGGCCTGACCACCAAAATTGACGAAAGCCTGTATGACGCG
CTGACTGCCGCCTGCCTGCTGCCGGATACCGGTGCGCGTAACGTTGACAGCCTGCTCAATCAGCAGATTCTACCGGTGCT
GAGTCAGCAGCTTTTAACGTACATGGTGGCGAAACAGAAACCGACTTTACTGACGCTGGGGTGGAGTGATGAAGAAGGGA
TTGGGCTTGAATTTTCAGGTGCGGCTGAACGTTAA

Protein sequence :
MEGAASLCQTRAHAEILPEHWLLKLLEQGEGDLTVLARRYEWDIDALWQDLLGWLDSLPRSVRSRPQLSDSIQTLMQEAW
LIASLNSEEQIRSHHLLMALVGKQNLVRCDGLWPLLTLGQSQLGRLRPLLDVQSDERPEVQLEKELAQSHGGEVVFVGRP
AGAELKDGELNPALQNALDKFTLDVTAKAKEGKIDPVFGRDTEIRQMVDILSRRRKNNPILVGEPGVGKTALVEGLALRI
AEGNVPESLRPVVLRTLDLGLLQAGAGVKGEFEQRLKNVIDAVQHSPAPILLFIDEAHTIIGAGNSAGGADAANLLKPAL
ARGELRTIAATTWSEYKQYFERDAALERRFQMVKVDEPDDDTACLMLRGLKSRYAEHHNVHITDDAVKAAVTLSRRYLTG
RQLPDKAVDLLDTAATRVRMSLDTVPEQLTRIRSQITSLEMEKQALLEDIAVGNQIHGERLSGIEQEEVRLIVELDDLES
RYGQELKLTEQLLECRQDISRQSETHALQQELNGMQDGNPLLSVDVDVRTVATVIADWTGVPLSSLMKDEQTELLILENE
IGKRVVGQEVALAAIAQRLRAAKTGLTSENGPQGVFLLVGPSGVGKTKTALALADVMYGGEKSLITINLSEYQEPHTVSQ
LKGSPPGYVGYGQGGILTEAVRKRPYSVVLLDEVEKAHRDVMNLFYQVFDRGFMRDGEGREIDFRNTVILMTSNLGSDHL
MQLLAEQPEATEADLHELLRPILRDHFQPALLARFQTVIYRPLAETAMRTIVEMKLAQVSKRLHRHYGLTTKIDESLYDA
LTAACLLPDTGARNVDSLLNQQILPVLSQQLLTYMVAKQKPTLLTLGWSDEEGIGLEFSGAAER

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 9e-144 45
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 3e-124 42
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 3e-124 42
clpC YP_005163377.1 ATP-dependent Clp protease ATP-binding subunit Not tested Not named Protein 2e-109 41

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
SARI_02728 YP_001571726.1 hypothetical protein VFG2076 Protein 9e-163 45
SARI_02728 YP_001571726.1 hypothetical protein VFG2084 Protein 9e-136 44
SARI_02728 YP_001571726.1 hypothetical protein VFG0079 Protein 2e-110 41