Gene Information

Name : TW075 (TW075)
Accession : NP_789025.1
Strain : Tropheryma whipplei TW08/27
Genome accession: NC_004551
Putative virulence/resistance : Virulence
Product : Clp-family ATP-binding protease/regulator
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 67380 - 69902 bp
Length : 2523 bp
Strand : +
Note : Similar to Bacillus subtilis negative regulator of genetic competence ClpC/MecB SWALL:CLPC_BACSU (SWALL:P37571) (810 aa) fasta scores: E(): 3e-152, 56.66% id in 810 aa, and to Streptomyces coelicolor Clp-family ATP-binding protease SCO3373 or SCE94.24c SW
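
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it matches the stated length of 2523 bp) and that the final codon is a stop codon. The function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(67380, 69902)

# A complete coding sequence is a whole number of codons.
codons, remainder = divmod(length_bp, 3)

# Assumption: the last codon is a stop codon and contributes no residue.
residues = codons - 1

print(length_bp, codons, remainder, residues)  # 2523 841 0 840
```

The resulting 840 residues agree with the length of the protein sequence listed in this record.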

DNA sequence :
TTGGGTGAGGATATGTTTGAAAGGTTTACAGATAAAGCCCGCCGGGTTATAGTACTGGCGCAGGAGGAAGCGCGCACGCT
TAGCCACAATTACATTGGCACCGAGCATGTTCTTCTTGGCCTTATTAGCGAGGGTGATGGTATTGCCGCCCAGGCGCTTG
AGAGTCTTGATATTACCCTTGAGCGTGCGCGCGAGGGCGTGGCGGAACTAATAGGTCGGGGTCAGAACGCAACATCCGGG
CATATCCCGTTTACGCCTCGTGCGAAAAAGGTTCTAGAGCTTTCGCTCCGCGAGGCTCTTCAGCTTGGTCACAATTACAT
TGGCACTGAGCATATACTCCTTGGGATTCTTCATGAGGGTGAGGGTATCGCTGCACAGGTACTGGTGAATATGGGGGCTG
AACTGCCTGCAATTCAGCAGAGGGTTATGCACCTTTTAGAAGATGGAAGAGAACAAGAGCCCGTTTCGGTTGGCCCATCT
GAGTCAGGAAAAATTTCAGGCAGTCAGATACTTGATCAATTCGGGCGACACCTCACGCGTGCCGCGAAAGAGGGCAAGTT
GGACCCAGTTATTGGCCGTGAGAAGGAAATAGAGCGCGTGATGCAGGTGCTCTCGCGCAGAACGAAGAATAACCCAATCC
TAATCGGTGAGCCCGGTGTTGGAAAAACTGCTGTCGTCGAGGCTTTGGCTCAAGCGATAGTCAATGGTGATGTTCCGGTC
AATTTGCGCAACAAGCAAGTGTACTCGCTTGATTTGGGTTCTCTTATCGCTGGTAGCAGGTACAGGGGTGATTTTGAGGA
ACGCCTCAAGAAGGTTACGAAGGAAATACGTTCACGAGGGGACATTATCGTTTTTATAGATGAGATCCATTCCCTTGTTG
GGGCCGGGTCTGCTGAGGGTGCTATCGATGCAGCAACTATCCTGAAACCCCTTCTGGCCCGTGGTGAGCTTCAAACAATT
GGCGCAACAACCCTTGATGAGTATCGCAAGAATATTGAAAAGGATTCTGCTCTGGAAAGGCGCTTCCAGCCCGTCAACGT
GTCAGAGCCCAGCATACCAATGTGTATCCAGATACTGAAAGGGTTGCGTGATCGCTATGAGGCCCATCATAAGGTGAAAA
TCACCGATGAGGCAATTTATGCTGCAGTTACCCTTTCGAGTCGTTATATAAATGATAGGTTTTTACCAGACAAAGCCATA
GACTTGATTGATGAGGCCGGCGCCAGGCTGAGACTGTCTGTTTTGTCTAATCCGAGCCAACTGCGCGCTGTCGAGAAAAA
AATTCTCGCTGTTGTTGCCAGGAAAGACAAGGCGGTTGAAAAACAGGATTTCGATAAAGTTGGCGAGCTAAAACGGAAAG
AAAAGGCCCTAAGAGCTGAGTTACGGAAGATAAAGCGTGACTATGAGAATGGCAATATTGCAAGTGCCGGCACTGTAGAT
GAGGGCTTGATCGCTGAAGTGCTTGCGTCAGCGACTGGGGTTCCCGTTTTTCGACTAACCGAAGACGAGTCCGTGCGTCT
GATGATGATGGAAAGAAGTTTGCATCAGCGTGTGATTGGTCAGGACGAGGCGATTTCGTCTCTGTCAAGGGCGATGCGCC
GAACGCGCGCTGGATTGAAGGATCCAAACCGACCTTCCGGATCGTTTATTTTTGCCGGCCCGACCGGTGTCGGAAAAACC
GAGCTGGCCAAGGCTCTTGCCGAGTTTCTGTTTGATAATGAAGACGCCCTTGTAAGTCTTGACATGTCAGAGTACGGAGA
GCGGCACACTGTATCCAGGCTTTTTGGAGCCCCGCCCGGATTTGTTGGTTTCGAAGAAGGCGGACAACTGACAGAGAAAA
TACGCCGCAAGCCCTTTAGCGTTGTTTTATTTGATGAAATTGAAAAAGCCCATCCGGATGTTTTTAACTCGCTTTTGCAG
ATTTTGGAAGAAGGTCGTCTGAGTGACGCCCAGGGCCGTATGGTTGATTTTAGAAATACGATAATTGTTATGACAACTAA
CCTTGGCAGTCGCGATATAGCATCAGGCCCTGTCGGGTTTCAGTCGGGTGATGGTAGTTTTTTGGCCTATGAGGCTATGA
AGGCCAAGGTTAATGAGGAACTGAGGCGCAGTTTAAAGCCGGAGTTTCTCAATCGTATTGATGAAGTTATAGTTTTCCCG
CCCCTCAATAGAGACGAGCTTCTGCAAATACTGAAGATTTTTATCAAAAAGCTCGATGACCGTTTGCGCGATAGGACCAT
GCGTCTTTCTGTGACAGATGCTGCCCTAGAACAGTTGGTGCAAATTGGCTACGAACCCACCATGGGAGCAAGGCCCTTGC
GCCGAGCGGTTCAGCGCGAGGTTGAGGATAGGATTTCTGAAAAGATTCTGCTGGGCGAGATAAAGCCAAATCAAGAGATA
GAAATGGACTTTGTGAATGATAATTTCACGATTGTATCAAGGGATATGGATTACCCCCTATCTCCTCTTGTGCCGTCAGT
GACAACGGGTCTCGTTACTCCGGATCTTGCAAGTCACGTCTAG

Protein sequence :
MGEDMFERFTDKARRVIVLAQEEARTLSHNYIGTEHVLLGLISEGDGIAAQALESLDITLERAREGVAELIGRGQNATSG
HIPFTPRAKKVLELSLREALQLGHNYIGTEHILLGILHEGEGIAAQVLVNMGAELPAIQQRVMHLLEDGREQEPVSVGPS
ESGKISGSQILDQFGRHLTRAAKEGKLDPVIGREKEIERVMQVLSRRTKNNPILIGEPGVGKTAVVEALAQAIVNGDVPV
NLRNKQVYSLDLGSLIAGSRYRGDFEERLKKVTKEIRSRGDIIVFIDEIHSLVGAGSAEGAIDAATILKPLLARGELQTI
GATTLDEYRKNIEKDSALERRFQPVNVSEPSIPMCIQILKGLRDRYEAHHKVKITDEAIYAAVTLSSRYINDRFLPDKAI
DLIDEAGARLRLSVLSNPSQLRAVEKKILAVVARKDKAVEKQDFDKVGELKRKEKALRAELRKIKRDYENGNIASAGTVD
EGLIAEVLASATGVPVFRLTEDESVRLMMMERSLHQRVIGQDEAISSLSRAMRRTRAGLKDPNRPSGSFIFAGPTGVGKT
ELAKALAEFLFDNEDALVSLDMSEYGERHTVSRLFGAPPGFVGFEEGGQLTEKIRRKPFSVVLFDEIEKAHPDVFNSLLQ
ILEEGRLSDAQGRMVDFRNTIIVMTTNLGSRDIASGPVGFQSGDGSFLAYEAMKAKVNEELRRSLKPEFLNRIDEVIVFP
PLNRDELLQILKIFIKKLDDRLRDRTMRLSVTDAALEQLVQIGYEPTMGARPLRRAVQREVEDRISEKILLGEIKPNQEI
EMDFVNDNFTIVSRDMDYPLSPLVPSVTTGLVTPDLASHV
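
The DNA sequence begins with TTG while the protein begins with M, which is consistent with TTG acting as an alternative start codon. The following is a minimal sketch of how the protein could be derived from the DNA, assuming the standard codon assignments and treating ATG, GTG and TTG as initiators that are read as methionine. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical and not part of any database tool.

```python
from itertools import product

# Standard codon assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initiator codon as M
    and dropping a trailing stop codon if present."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any incomplete trailing codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases and last 15 bases of the DNA sequence in this record.
print(translate_cds("TTGGGTGAGGATATGTTTGAAAGGTTTACA"))  # MGEDMFERFT
print(translate_cds("GCAAGTCACGTCTAG"))                 # ASHV
```

Both fragments match the corresponding ends of the protein sequence given above (MGEDMFERFT at the start, ASHV at the end).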

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
clpC | YP_005163377.1 | ATP-dependent Clp protease ATP-binding subunit | Not tested | Not named | Protein | 0.0 | 63

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
TW075 | NP_789025.1 | Clp-family ATP-binding protease/regulator | VFG0079 | Protein | 0.0 | 55
TW075 | NP_789025.1 | Clp-family ATP-binding protease/regulator | VFG0080 | Protein | 4e-150 | 49
TW075 | NP_789025.1 | Clp-family ATP-binding protease/regulator | VFG2084 | Protein | 9e-95 | 42