Gene Information

Name : BC1003_1750 (BC1003_1750)
Accession : YP_003907008.1
Strain :
Genome accession: NC_014539
Putative virulence/resistance : Virulence
Product : ATPase AAA-2 domain-containing protein
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 2017893 - 2020781 bp
Length : 2889 bp
Strand : +
Note : KEGG: bxe:Bxe_A1887 putative ATP-dependent Clp protease, ATP-binding subunit; PFAM: ATPase AAA-2 domain protein; AAA ATPase central domain protein; Clp domain protein; Clp ATPase-like; SMART: AAA ATPase

DNA sequence :
ATGGCGGAGCACCTGTGTGAACTATGCGGCGTCAGGCCGGCGACGATCCGCGTCGCCGTCGTTCAAGGCGGACGCAGAAG
GCAACTGGACGTGTGCGATTACCACTACGCACAACTCACGCGGCATCAGCGGCAAATATCACCGCTCGAATCGCTGTTTC
GGGGCGGACTGTTCGATGGCCTGCTCGACGCGGGCGTGAACGAAACACGCGCTGGCCGGGCAGACTCGCAACGCGACGCG
CCCGTCAGTAAGGGCATCGACCTGCAACAACATTTCAGCGAACAGGCCAAGGAGATTCTGCAACGTGCCGCTGAGCGCGC
CGTGCAATTCGGCCGACGCGAAGTGGACACCGAGCATGTACTTTACGAACTCGCGCAGACTGACACTGTCCAGCGGATTC
TGTCGTCTCTGAAGATTTCGGCGAGCGACCTTCAGCACTACATCGACACGCATGCACCCAAAGGCGAAGCCACCCAGAGC
CCTCCGGAAGGAGCCGATATTTCGGTCTCGCCACGGCTGAAAAGTGCGCTGGAACGGGCCGTGATCGCTTCGCGCGAGTT
CGGGCAGAACTATGTCGGCCCGGAGCACATTCTGACCGGGCTCGCCGAGGTTACCGACAGCTTCGCCGGCGACCTGCTGA
TGAAGTACGGCCTGTCGTCTCATCAATTGCGTCAGCAGACGTTGCAAGCGGCCGGTCAGGAAGACAAGGGACACGCCCCG
GGCAACACGCCGCAACTCGACAAGTACAGCCGGGACGTGACGCGGCTTGCGCGCGAAGGCAAGCTCGATCCGGTGATCGG
CAGATCGAAGGAAACCGAAACGCTCGCCGAAGTGCTCGCACGGCGCAAGAAGAACAACCCCGTGCTGATCGGCGAGCCGG
GCGTCGGCAAGACGGCGATTGTCGAAGGACTGGCCCTGCGCATGATCAGCGGCGACGTGCCGGAAACGCTGCGCGACAAA
CGGCTCGTTGAGCTCAATGTCAATTCTCTGGTCGCGGGCTCCAAATATCGCGGCGAGTTCGAGGAACGCGTCAAGCAGAT
CATGGACGAAATCGCCGCGCACAAGGACGAACTGGTGCTCTTCGTCGATGAGGTGCACACCATTGTGGGAGCAGGCCAGG
GCGGCGGCGAAGGCGGCCTCGACATCGCCAACGTGTTCAAGCCGCCGATGGCGCGCGGCGAACTGAACCTGATTGGCGCG
ACCACACTCGCCGAGTATCAGAAGTACATCGAGAAAGACGCGGCGCTGGAGCGCCGTTTCCAGCCGGTGCTCGTAGGCGA
GCCGAGTGTCGAGCAGACCATCGGCATTCTGCGCGGGCTTCGTGACCGCCTCGAAGCGCACCACAAGGTCACGATTCTCG
ACGAAGCCTGCGTGGCCGCCGCCGAACTGTCCGATCGCTACATCTCGGGGCGCTTTCTGCCGGACAAGGCCATCGACCTG
ATCGATCAGGCTGCCGCGCGAGAACATCTTTCGTCCACGTCACGGCCCGCCGAGGTGCTCGAACTCGAAGCCGAGATCGC
GCAGATGGACCGCGAACAGGAATACGCGGCGTCGCACAAGCAGTTCGAGCGCGCCAAGGCGCTCGCTGAGCAGATCAAGG
GAAAACAGAGTCAGCTCAACGACGCCACCCTCGCGTGGAAACGACGCGTGAGCACCAGCAGCGCGGAAGTGACCCGCACG
CTGGTCGCCGAAATCGTTGCAAAGATGACCGGCATTCCGGTTGCGGATCTCACGCTGGAGGAAAAAGAAAAGCTGCTGCA
AATGGAGTCGCGTCTGCACAAGCGGGTGATCGGTCAGGAAGAGGCAATTGGCGCGGTCAGCGACGCCGTGCGCCGCAGCC
GCGCAGGCTTGCAGAGCCGCCGTCAGCCGCTTGCGGTGTTCCTGTTCCTCGGCCCGACGGGCGTCGGCAAAACCGAACTG
GCGAAGGCGCTCGCCGAAGTCGTGTTCGGCGACGAGGACGCCATCGTGCGCATCGACATGAGCGAATACATGGAGCGGCA
TGCGGTGGCTCGCCTTATCGGCGCCCCGCCCGGCTATGTCGGTTACGACGAAGGCGGCCAATTGACCGAGCGCGTGCGGC
GCCGCCCGCATAGCGTGATTCTGCTCGACGAAATCGAGAAGGCGCACCCCGACGTCTATAACGTGCTGTTGCAGGTATTC
GACGACGGGCGACTGACAGACGGCAAAGGACGCGTGGTCGACTTCACCAATACGCTTATCATCGCCACCAGCAATCTGGC
CTCCGAAGTCATCATGGGCACGTCGCGCAGCCGGCCGGGTTTCATGCAGGGGGAAACCGGCCACGCGCCGGGCACACGGA
CTGCCAAAGAGCGGCAGTTGGAGCCCGTGCGCGAGGGCGTGATGGCGGTGTTGCGCTCGCACTTCCGGCCGGAGTTTCTG
AATCGCATCGATGAAATCATCATCTTCGAATCGTTGAGTGCCGAGCAGATCCGCTCCATCGTCCGTCTGCAGCTCGATCA
GGTGGCGCGCGTCGCGAGGAGCCAGGATATCGACATCGAGTTCGACGAGGGCGTCGTCGATCATCTGGCGACCGAGGCTT
ACCGGCCGGAGTACGGCGCACGCGAATTGCGACGGCGGATCCGCCAGGTCATCGAAAATCCGCTCGCGAAGCAGATGCTC
GACGGCGCGGTTCGCGAAGGAAATCGGGTGGTATGCCGTTTCGATACGAGTGATAAAGTCGCGGTTTTTGAAGTGCGCGA
AGCGCTGCCGCAGGAAGAGGCTCGAGAAACGCTGCCGCAAAAGCGCGAGCCGGCCTCCGCTGCGGGCGACGCCGGCCCCG
CTGGTAAGCCTGCCAGCAAGCCGCTGCGTCAGTCACGCAAACGTCGCAATGCGGACGGTGACGACCGCGGCAATCAACCC
GCGGCTTGA

Protein sequence :
MAEHLCELCGVRPATIRVAVVQGGRRRQLDVCDYHYAQLTRHQRQISPLESLFRGGLFDGLLDAGVNETRAGRADSQRDA
PVSKGIDLQQHFSEQAKEILQRAAERAVQFGRREVDTEHVLYELAQTDTVQRILSSLKISASDLQHYIDTHAPKGEATQS
PPEGADISVSPRLKSALERAVIASREFGQNYVGPEHILTGLAEVTDSFAGDLLMKYGLSSHQLRQQTLQAAGQEDKGHAP
GNTPQLDKYSRDVTRLAREGKLDPVIGRSKETETLAEVLARRKKNNPVLIGEPGVGKTAIVEGLALRMISGDVPETLRDK
RLVELNVNSLVAGSKYRGEFEERVKQIMDEIAAHKDELVLFVDEVHTIVGAGQGGGEGGLDIANVFKPPMARGELNLIGA
TTLAEYQKYIEKDAALERRFQPVLVGEPSVEQTIGILRGLRDRLEAHHKVTILDEACVAAAELSDRYISGRFLPDKAIDL
IDQAAAREHLSSTSRPAEVLELEAEIAQMDREQEYAASHKQFERAKALAEQIKGKQSQLNDATLAWKRRVSTSSAEVTRT
LVAEIVAKMTGIPVADLTLEEKEKLLQMESRLHKRVIGQEEAIGAVSDAVRRSRAGLQSRRQPLAVFLFLGPTGVGKTEL
AKALAEVVFGDEDAIVRIDMSEYMERHAVARLIGAPPGYVGYDEGGQLTERVRRRPHSVILLDEIEKAHPDVYNVLLQVF
DDGRLTDGKGRVVDFTNTLIIATSNLASEVIMGTSRSRPGFMQGETGHAPGTRTAKERQLEPVREGVMAVLRSHFRPEFL
NRIDEIIIFESLSAEQIRSIVRLQLDQVARVARSQDIDIEFDEGVVDHLATEAYRPEYGARELRRRIRQVIENPLAKQML
DGAVREGNRVVCRFDTSDKVAVFEVREALPQEEARETLPQKREPASAAGDAGPAGKPASKPLRQSRKRRNADGDDRGNQP
AA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
clpC YP_005163377.1 ATP-dependent Clp protease ATP-binding subunit Not tested Not named Protein 5e-168 46

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
BC1003_1750 YP_003907008.1 ATPase AAA-2 domain-containing protein VFG0079 Protein 0.0 50
BC1003_1750 YP_003907008.1 ATPase AAA-2 domain-containing protein VFG0080 Protein 5e-134 44