Gene Information

Name : TP0071 (TP0071)
Accession : NP_218511.1
Strain : Treponema pallidum Nichols
Genome accession: NC_000919
Putative virulence/resistance : Virulence
Product : ATP-dependent Clp protease subunit B (clpB)
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 75906 - 78542 bp
Length : 2637 bp
Strand : -
Note : similar to PID:1163118 SP:P53532 percent identity: 59.57; identified by sequence similarity

DNA sequence :
ATGAACACAGACAGGTACACAGTCAAAGCAAGCGAAGCGCTCAATGACGCCATATCTCTGGCAGAAGCGGAGAACCACGG
TCAAGTTGAGGAGGAACATCTACTCCACGCCCTACTTTCCCAGAAAGACGGGATTATCTCTCCGCTCATTGAAAAAATTG
GGGCAAAACCGGACTTCCTGTACGATGAATTACTCCAATGCCTGCGCCGCAAACCACGTGTTACCGGTCCTGCCGCCCAA
ACGCGCTGTGCACCAACGCTGAGCAAAGCCTGTGCACGTGCAGAACGCCTCGCGCTCAAGAACCAAGATGAATATGTCTC
CTGCGAACATCTCCTGCTTGCCATAAGTGAGACAGATAGCAACACTGCACGTCTCCTTCACAGTCAGGGCATTACCAGTA
AAACTATCAGTGCCGCCCTCAAAGATATACGCGGCAGCAAGCGCGTTACGAGCCAGGATCCAGAATCAACATTCCAGTGC
TTGGAGAAATACTGCCGAGATCTTACTACCCTCGCCCGAGAAGAAAAAATAGATCCGGTTATTGGACGTGATGAAGAGAT
CCGGCGCGTTATGCAGGTACTCTCACGTCGTACAAAAAATAACCCAGTGCTTATTGGAGAACCCGGCGTAGGGAAAACCG
CTATTGTCGAGGGACTTGCACGCCGTATCGTTTCAGGAGACGTACCAGAAAGCCTCAAGGGAAAGCGTTTGCTTTCCCTT
GACCTCGGCGCATTGGTTGCCGGTGCAAAGTTCCGCGGGGAATTTGAAGAACGACTAAAAGCGGTAATTGAAGCGGTACA
GAAAAGCGACGGTGGCGTTATTTTATTCATTGATGAACTACACACGCTCGTAGGCGCCGGCGCAAGTGAGGGATCTATGG
ATGCGTCGAACCTTCTGAAACCTGCGCTTGCGCGCGGTGAATTGCGTTCAATCGGCGCAACCACGCTCAACGAATATCGC
AAATATATCGAAAAGGACGCAGCGCTCGAACGCCGCTTTCAGCAAGTGTACTGCGTACAGCCTACGGTGGAGGACACCAT
TGCTATCCTGCGCGGTTTGCAAGAAAAGTACGAAGTGCATCACGGGGTACGTATCAAAGATGAAGCGCTTGTTGCAGCAA
CCGTTTTGTCTGACCGTTACATCACCAACCGCTTTTTACCAGATAAGGCGATTGATCTGGTGGATGAAGCAGCAAGCCGC
CTGAAAATGGAAATTGAAAGTCAGCCTGTTGAGCTAGACCAGGTGGAGCGCAAGATATTACAGCTGAATATCGAAAAGGC
CTCTCTCCTTAAAGAAAGTGATCCGGCTTCAAAGGAACGTTTGGAAAAGTTAGAAAAAGAGCTCGCAGGCTTCCTAGAGC
GCCGTGCTGCAATGCAGGTCCAATGGCAAAATGAGAAAGGGAGGATAGAAGAGTCACGCCGCTACAAAGAGGAGCTTGAG
CGTCTCCGCATTGAGGAAACCATGTTTTCACGTGAAGGGGACCTGAACAAGGCTGCAGAACTTCGGTATGGCAAAATTCC
AGAACTTGAAAAAAAAATCATGCTTCTTACTGCAGAAGTAGAGAAAAAATCCGGTCTAGAAGGACAGCTCTTGCGCGAGG
AAGTGTGTGAAGAGGACATTGCGAAAATTATTTCTATGTGGACCGGAATTCCGGTATCCAAAATGATGGCAAGCGAGCAA
CAGAAATATCTGCAGCTTGAGTCAGTACTCATGCAACGTGTGGTAGGGCAGGACGAAGCAGTGCGGGTAATTTCCGACGC
GATTCGTCGTAATAAGGCAGGACTTTCTGATACGCGCCGTCCTCTTGGCAGTTTCTTATGTGTCGGTCCCACGGGGGTAG
GAAAGACAGAACTTGCACGTACGTTAGCTGATTTTCTTTTCAACGATGAGCGTGCACTGACGCGTATCGATATGAGTGAA
TACATGGAAAAACACGCGATCAGCCGACTCATTGGCGCGCCCCCGGGGTATGTGGGCTATGACGAGGGGGGACAATTGAC
AGAAGCGGTACGACGTAGACCCTACAGCGTACTTCTTTTTGATGAAGTAGAGAAAGCGCACCAGGATGTGTTTAATATAT
TCCTGCAAATACTCGACGATGGGCGCTTGACTGACGGCCAAGGAAGGGTGGTGGATTTCCGCAACACGATCATCATCATG
ACCAGCAATATCGGATCAGAGCATATTCTTTCTGCACGCGAGTCGCGCACACACACGTCGGACTTGCCTGTACCCGAGAC
ACAATCTACAGAAGAACAAACTCTACCAGAGCAGATACGGGGATTACTGCACACATACTTTCGCCCAGAATTCTTAAACC
GGATTGACGAAGTGTTAATTTTTAAGCGTCTCACACGGAAACATATTCGCCTCATCACAGACATCCAACTGCAGATGGTA
GTGGAGCGTTTGGAAAGTCGACATATAAAACTTCGTGTGCGTGACGCGGCGAAAGCCTATCTTGCGGAGCGCGGATACGA
CGACACTTTCGGAGCACGACCACTGAAGCGTGCAATCCAAACGGAATTGGAAAATGCCCTAGCGCGTGAGATTCTCAGTG
GCCGATTCAGAGGCGGCTCAACTATCGTGGTGGATATGTGTAAAGATGCTCTGTGTTTCACTGAACAAACATCCTGA

Protein sequence :
MNTDRYTVKASEALNDAISLAEAENHGQVEEEHLLHALLSQKDGIISPLIEKIGAKPDFLYDELLQCLRRKPRVTGPAAQ
TRCAPTLSKACARAERLALKNQDEYVSCEHLLLAISETDSNTARLLHSQGITSKTISAALKDIRGSKRVTSQDPESTFQC
LEKYCRDLTTLAREEKIDPVIGRDEEIRRVMQVLSRRTKNNPVLIGEPGVGKTAIVEGLARRIVSGDVPESLKGKRLLSL
DLGALVAGAKFRGEFEERLKAVIEAVQKSDGGVILFIDELHTLVGAGASEGSMDASNLLKPALARGELRSIGATTLNEYR
KYIEKDAALERRFQQVYCVQPTVEDTIAILRGLQEKYEVHHGVRIKDEALVAATVLSDRYITNRFLPDKAIDLVDEAASR
LKMEIESQPVELDQVERKILQLNIEKASLLKESDPASKERLEKLEKELAGFLERRAAMQVQWQNEKGRIEESRRYKEELE
RLRIEETMFSREGDLNKAAELRYGKIPELEKKIMLLTAEVEKKSGLEGQLLREEVCEEDIAKIISMWTGIPVSKMMASEQ
QKYLQLESVLMQRVVGQDEAVRVISDAIRRNKAGLSDTRRPLGSFLCVGPTGVGKTELARTLADFLFNDERALTRIDMSE
YMEKHAISRLIGAPPGYVGYDEGGQLTEAVRRRPYSVLLFDEVEKAHQDVFNIFLQILDDGRLTDGQGRVVDFRNTIIIM
TSNIGSEHILSARESRTHTSDLPVPETQSTEEQTLPEQIRGLLHTYFRPEFLNRIDEVLIFKRLTRKHIRLITDIQLQMV
VERLESRHIKLRVRDAAKAYLAERGYDDTFGARPLKRAIQTELENALAREILSGRFRGGSTIVVDMCKDALCFTEQTS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 4e-106 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
TP0071 NP_218511.1 ATP-dependent Clp protease subunit B (clpB) VFG2084 Protein 8e-103 43
TP0071 NP_218511.1 ATP-dependent Clp protease subunit B (clpB) VFG2076 Protein 7e-112 41