Gene Information

Name : clpA (TREPR_2813)
Accession : YP_004530291.1
Strain : Treponema primitia ZAS-2
Genome accession: NC_015578
Putative virulence/resistance : Virulence
Product : ATP-dependent Clp protease ATP-binding subunit ClpA
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 1358874 - 1361420 bp
Length : 2547 bp
Strand : -
Note : identified by match to protein family HMM PF00004; match to protein family HMM PF02861; match to protein family HMM PF07724; match to protein family HMM PF07728; match to protein family HMM TIGR02639

DNA sequence :
ATGAAAATCAGCCGCCATGTGCAGGCTATCATCAATGCCGCCTATAATGAGGCCAAGGTACGTAACCACGAATACCTGAC
CCCCGAACATATACTCTATGCCGCCCTTGCATTCAATGAGGTACAGAGCATACTTTCATCCTGCGGGGCTAACCTGGATC
AACTTAAACATGGGATGGAAAACTACTTTGAACAGCAAGTGCCCCCCACCCAGGATAACACCGAGCCCACCCAAACTGTG
AGTTTCCAGAGCGTCCTGGAACGAGCAGTGCTTCAAAGCCAGTCATCCCAGAAGGAAACCCTGGATATTGCGGACATTCT
GGTCTCCCTCTACGATGAGGAACGGAATTATTGCGCCTATTTCCTCCGCAAGTCGGGAATCAGGAATCGGCTGGAGCTTT
TGGAGAGTCTTTCCCGGGAATATGAGGATGAAGGTGAGCAGGGTTTCAATTTTTCCCGTGCCTATACTCCGAAATTCACC
CGTATGAACCCCCACATGAATGACGGAGACGGCCATGAGGAAATTCCCTTTGAGGGTGAGGCTTCCGATGCGGCCCAGAA
AACTAAGGCCAATAAAAAGAGCGCCCTGGAGCGCTACGCCACGGAACTCACCGCCCTGGCAAAGCTGGGCCGGCTGGAAC
CGGTCATAGGCCGGCAGACCGAACTGGACCGGACCGTGCAGGTACTTTGCCGCCGGCTCAAAAACAACCCGGTCCATGTG
GGCGATTCCGGAGTAGGAAAAACCGCCATCACCGAAGGGCTGGCCCAGCGCATAGTCGCCGGGAACGTGCCCCCCACTTT
GAAAGATTTCGCCATCTATTCCCTGGATATGGGCGCCCTGGTAGCGGGAACCAAGTACCGGGGGGACTTTGAGGAACGGA
TCAAGCGGGTCATTGAAGAAATCCTGAAAAAAGAAAAGGCCATCCTCTTTATCGATGAAATCCACACCCTGGTAGGGGCC
GGCTCAGTCTCCGGCAGCGCTTTAGATGCCTCAAACCTGCTCAAACCTGCCCTTGCATCGGGAAAAATCCGCTGCATCGG
GTCCACCACCCACGAGGAATACGGCAAATTCTTTGACAAAGACCGGGCCCTGTCCCGGCGTTTCCAGAAAATTGACATTG
ACGAGCCCAGCCAGGAAGACGCCATCGCGATCCTCAAGGGGCTCAGGTCCAAATACGAAGAATACCATCGGGTTAAGTAC
AGTGATGAGGCCGTGGAAGGGGCGGTGCGTCTTTCCGCCCAGTTTATCACCGAACGGCGGCTGCCGGATAAAGCCATCGA
TGTGATCGACGAAGCCGGCGCCTTTGCCCGGATCCAGGCTTTCAAGGACAACGGCAAGACCGAAAACCCCGCCGATACTC
TTGCCGGCGCTTCTACCGCTGCAGGAAACTCCTCCGGCGCGGATACTCCGGCTACCGGTGACCCTGGGGAGCCCACACTT
ACCAGAGGCCCAACCCTTAGCAGCGGCCCGGTGGGCGCCGAAACCCCAAGGGAGGAACCCATTTCTGTGCAGGAAGGGAC
CGTTGCGGTGATAGACATCGGCCTCCCCCTGATCGAGACCGTGGTGTCCAAAATAGCCCGGATCCCCGAGCGGTCAGTGG
GTGAAAACGAAAAAGACAAGCTCCGGTTCCTGGAACAAAAGCTCCGGGAGCGTATTTTCGGCCAGGACGAGGCGATCCTG
GCGGCGGTCAAGGCAGTCAAACGATCCCGGGCAGGCTTCCGGGCGGACAACAAGCCCGTGGCAAACTTCCTCTTTGTGGG
CCCCACCGGGGTGGGGAAAACCGAACTGGCCCGCCAACTGGCGGATATCCTGGGTGTGCCCATGCACCGCTTCGACATGA
GCGAATACCAGGAAAAGCACACCGTCTCCCGGCTCATCGGATCCCCCCCGGGTTATGTGGGCTATGAGGAGGGGGGCCTC
CTTACGGACGTCATACGGAAACAGCCCCATGCGGTCGTTTTATTGGACGAAATCGAAAAAGCCCATCCGGATATTTACAA
TATACTTCTCCAGATCATGGACTACGCCACCCTGACGGACAACAATGGCCGCAAGGCGGATTTCCGCAATGTGGTTTTCA
TCATGACCAGCAACGCCGGAGCCCGGGAAGTCGGAAAAAGCCTCATCGGCTTTGGGGAGCGTATAGGCGGGGAAGCGGAA
CTGGATAGCGCAGTAGAAAAACTGTTCACCCCGGAATTCCGCAACCGCCTGGACGCGGTGGTCCGCTTCGGCCACCTGTC
CAAGGAAGTGATGGAGTCCATAGTCCGCAAGGAACTGGAAATTTTCCGGGACCAGCTGGCAGAAAAAAAGATAAGCCTTT
CAATAAGCGATGCCTGCGTGGAACACCTGGCCGAAGAAGGCTACAGCCAGGAATTCGGCGCCCGCAATGTGGGCCGTGTT
ATCGAAGATAAGATCAAGTCCTTCTTTGTGGACGAGGTCCTCTTCGGCCGGCTCGTCTCAGGCGGCGCCGCCCAGGCAGA
CTGGCGGGATGGGGAGTACCGGATCGAGGTGGCAGGCGCCAGAGAGGCGGTGACAGCGGAAGCCTGA

Protein sequence :
MKISRHVQAIINAAYNEAKVRNHEYLTPEHILYAALAFNEVQSILSSCGANLDQLKHGMENYFEQQVPPTQDNTEPTQTV
SFQSVLERAVLQSQSSQKETLDIADILVSLYDEERNYCAYFLRKSGIRNRLELLESLSREYEDEGEQGFNFSRAYTPKFT
RMNPHMNDGDGHEEIPFEGEASDAAQKTKANKKSALERYATELTALAKLGRLEPVIGRQTELDRTVQVLCRRLKNNPVHV
GDSGVGKTAITEGLAQRIVAGNVPPTLKDFAIYSLDMGALVAGTKYRGDFEERIKRVIEEILKKEKAILFIDEIHTLVGA
GSVSGSALDASNLLKPALASGKIRCIGSTTHEEYGKFFDKDRALSRRFQKIDIDEPSQEDAIAILKGLRSKYEEYHRVKY
SDEAVEGAVRLSAQFITERRLPDKAIDVIDEAGAFARIQAFKDNGKTENPADTLAGASTAAGNSSGADTPATGDPGEPTL
TRGPTLSSGPVGAETPREEPISVQEGTVAVIDIGLPLIETVVSKIARIPERSVGENEKDKLRFLEQKLRERIFGQDEAIL
AAVKAVKRSRAGFRADNKPVANFLFVGPTGVGKTELARQLADILGVPMHRFDMSEYQEKHTVSRLIGSPPGYVGYEEGGL
LTDVIRKQPHAVVLLDEIEKAHPDIYNILLQIMDYATLTDNNGRKADFRNVVFIMTSNAGAREVGKSLIGFGERIGGEAE
LDSAVEKLFTPEFRNRLDAVVRFGHLSKEVMESIVRKELEIFRDQLAEKKISLSISDACVEHLAEEGYSQEFGARNVGRV
IEDKIKSFFVDEVLFGRLVSGGAAQADWRDGEYRIEVAGAREAVTAEA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
clpC YP_005163377.1 ATP-dependent Clp protease ATP-binding subunit Not tested Not named Protein 1e-121 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
clpA YP_004530291.1 ATP-dependent Clp protease ATP-binding subunit ClpA VFG0079 Protein 1e-131 44
clpA YP_004530291.1 ATP-dependent Clp protease ATP-binding subunit ClpA VFG0080 Protein 8e-119 41