Gene Information

Name : SARI_02627 (SARI_02627)
Accession : YP_001571626.1
Strain : Salmonella enterica RSK2980
Genome accession: NC_010067
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 2549741 - 2552413 bp
Length : 2673 bp
Strand : -
Note : 'KEGG: eci:UTI89_C3197 0. ClpB protein K01358; COG: COG0542 ATPases with chaperone activity, ATP-binding subunit; Psort location: cytoplasmic, score: 23'

DNA sequence :
ATGGAACATCATTCAGCAGTCCTGCTACGACGACTTAACCCTTATTGCGCCAGAGCACTGGAGGGGGCGGCTTCTTTATG
TCAGGCCCGTGCGCATGCAGAAATCACGCCTGAGCACTGGTTGCTGAAACTGCTGGAGCAGGGGGAAGGTGACCTGACCG
TTCTTGCCCGGCGCTATGAATGGGATATGGATGCCGTCTGGCAGTCGCTCCTGAGCTGGCTGGATGCACAGCCTCGTTCA
GTTCGCACACGCCCGGAGCTTTCTGCTTCCCTCCAGACCTTAGTGAAACAGGCGTGGCTTGCCGCCACGCTCGCGGGAGA
TGAGCAGATCCGCAGCATACATTTGCTCACGGCCATGATTGAAACGTCAGGGCTGACACGCTGCGATGGCCTCTGGCCTT
TGATGACGCTGACCACCAGCCAGCTGGAAAGGTTACGTCCCTTGCTGGAAGCCCAGTCGGATGAGCGGCATGACGTTACG
CTCAGTGAGCAGCCTGGCAATGTCAGCATTATTGGCCGGGCCGCACCGTTACAGCATGAAGCACAACACCAGTCTGAAGG
CGGAAGTGTGCAGCCTGCCGTTTCACAGGAAGAGTCCGTTCTCAATCGCTTTACGGTCGATGTGACCGCCCGTGCCCGGG
AAGGAAAGATTGACCCGGTCTTTGGACGCGATAACGAAATCCGTCAGATGGTGGATATTCTCTCCAGACGCCGTAAGAAC
AACCCGATCCTCGTGGGTGAACCCGGCGTGGGCAAAACGGCCCTGGTTGAAGGCCTGGCGCTGCGTATTGCTGAGGGCAA
TGTGCCCGAAAGTCTTAAGCCCGTCATCGTCAGGACACTGGATCTGGGTTTGCTTCAGGCGGGTGCCGGTGTAAAGGGTG
AATTCGAACAGCGTCTGAAAAATATCATCGACGCCGTACAGCACTTCCCGGTGCCGGTGCTGCTGTTTATTGATGAGGCA
CACACCATTATCGGGGCGGGTAATCAGGCAGGCGGGGCAGATGCGGCTAACCTGCTGAAACCAGCGCTGGCTCGCGGCGA
ACTGCGCACCATTGCGGCCACCACATGGAGTGAATACAAGCAGTATTTCGAACGTGACGCCGCCCTGGAGCGCCGCTTCC
AGATGGTCAAGGTTGATGAGCCCGATGATGATACGGCCTGCCTGATGCTGCGCGGTCTAAAATCCCGCTATGCCGAACAT
CATGGCGTACATATTACCGGCGAGGCCGTCCGGGCAGCGGTGACGCTATCCCGTCGTTACCTGACCGGACGCCAGTTGCC
AGATAAAGCCGTGGACCTGCTGGATACCGCAGCGGCCCGGGTGCGGATGAGTCTGGACACGCTGCCAGAACAGCTTACCC
GGTTGCAGGCTGAACTGACTGCGCTTGCCATGGAGCAGCAGGAGTTGCTGGAAGATATTTCTCTGGGTAACGCTGTTGAT
GCCAGCCGTCTGCCTCAGATCGAGCAACTTTCGCTGGAGCTCAATCAGCAGAAAGCTGCCCTTCAGTCACAATATGAGAC
CGAAAAGCAGCTCACCGACTCGCTGAAAGCCTGCCGTGAAGACATCAGCCGCCAGGAGGAGCTCTCCGCCCTTCAGCACG
AGCTTTCTCAGATTCAGAATAACAGTCCATTGCTGGGTCTGGATGTTGATGTCCGCACCGTTGCGACCGTTATCGCCGAC
TGGACAGGGGTTCCTCTCTCGTCGCTGATGAAGGATGAGCAGACCGAACTGCTGACGCTGGAAGAACAGCTGGCAACCCG
CGTAGTCGGTCAGGGTCCCGCCCTTAATGCCATTGCCGAGCGAATGAGGGCGTCAAAAACCGGCCTGACGCCGGAAAACG
GACAGCAGGGGGTGTTTTTACTGGTGGGTCCAAGCGGAGTGGGCAAAACAGAAACCGCGCTTGCGCTTGCTGACGTTATG
TATGGTGGTGAGAAATCGCTCATTACCATCAACCTGTCTGAATACCAGGAGCCACACACGGTCTCGCAGTTGAAAGGCTC
CCCTCCGGGCTATGTAGGATATGGTCAGGGCGGCATTCTCACCGAGGCAGTCCGTAAGCGGCCTTACAGCGTGGTGCTGC
TCGATGAAGTGGAGAAGGCCCATCGGGATGTAATGAATCTGTTTTATCAGGTCTTTGACCGGGGCTTTATGCGGGACGGA
GAAGGGCGTGAAATTGATTTTCGCAATACCGTGATCCTGATGACGTCCAACCTGGGAAGTGACCCGCTGATGCAGTTCCT
GGAAGAGCAACCAGAAGCCACTGAGAGCGATCTGCATGAGTTGTTACGCCCGATCCTGCGCGATCATTTCCAGCCAGCGC
TGCTGGCCCGCTTCCAGACGGTGATATACCGGCCGCTGGAAATGGATGCCATGCGCACCATCGTCGGCATAAAGCTCGCA
CAGGTGAGTACGCGTCTGCAACGCCACTACGGCATTTCCACGCACATTGGCGAAAGTCTTTTCGACACGCTGACGAAGGC
GTGTTTGCTGACGGATACCGGTGCCCGCAACGTGGACAGCCTGCTTAATCAGCAGATCCTGCCGGTTCTGAGCCAGCAGC
TATTAAGCCATATGGCGGCAAAACAAAAGCCATCCTCGCTACAGCTGACCTGGGATGATGAAGAGGGCATTGTGCTGGCG
TTTGATGCACAGGTAGAAGGAGAGCCGTCATGA

Protein sequence :
MEHHSAVLLRRLNPYCARALEGAASLCQARAHAEITPEHWLLKLLEQGEGDLTVLARRYEWDMDAVWQSLLSWLDAQPRS
VRTRPELSASLQTLVKQAWLAATLAGDEQIRSIHLLTAMIETSGLTRCDGLWPLMTLTTSQLERLRPLLEAQSDERHDVT
LSEQPGNVSIIGRAAPLQHEAQHQSEGGSVQPAVSQEESVLNRFTVDVTARAREGKIDPVFGRDNEIRQMVDILSRRRKN
NPILVGEPGVGKTALVEGLALRIAEGNVPESLKPVIVRTLDLGLLQAGAGVKGEFEQRLKNIIDAVQHFPVPVLLFIDEA
HTIIGAGNQAGGADAANLLKPALARGELRTIAATTWSEYKQYFERDAALERRFQMVKVDEPDDDTACLMLRGLKSRYAEH
HGVHITGEAVRAAVTLSRRYLTGRQLPDKAVDLLDTAAARVRMSLDTLPEQLTRLQAELTALAMEQQELLEDISLGNAVD
ASRLPQIEQLSLELNQQKAALQSQYETEKQLTDSLKACREDISRQEELSALQHELSQIQNNSPLLGLDVDVRTVATVIAD
WTGVPLSSLMKDEQTELLTLEEQLATRVVGQGPALNAIAERMRASKTGLTPENGQQGVFLLVGPSGVGKTETALALADVM
YGGEKSLITINLSEYQEPHTVSQLKGSPPGYVGYGQGGILTEAVRKRPYSVVLLDEVEKAHRDVMNLFYQVFDRGFMRDG
EGREIDFRNTVILMTSNLGSDPLMQFLEEQPEATESDLHELLRPILRDHFQPALLARFQTVIYRPLEMDAMRTIVGIKLA
QVSTRLQRHYGISTHIGESLFDTLTKACLLTDTGARNVDSLLNQQILPVLSQQLLSHMAAKQKPSSLQLTWDDEEGIVLA
FDAQVEGEPS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 3e-127 47
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 3e-127 47
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 1e-142 44

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
SARI_02627 YP_001571626.1 hypothetical protein VFG2084 Protein 1e-137 43
SARI_02627 YP_001571626.1 hypothetical protein VFG2076 Protein 4e-160 43
SARI_02627 YP_001571626.1 hypothetical protein VFG0079 Protein 1e-109 41