Gene Information

Name : Acid_2333 (Acid_2333)
Accession : YP_823607.1
Strain : Candidatus Solibacter usitatus Ellin6076
Genome accession: NC_008536
Putative virulence/resistance : Virulence
Product : ATPase
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 2976043 - 2978694 bp
Length : 2652 bp
Strand : +
Note : PFAM: AAA ATPase, central domain protein; Clp N terminal domain protein; ATPase associated with various cellular activities, AAA_5; ATPase AAA-2 domain protein; SMART: AAA ATPase; KEGG: aba:Acid345_4765 ATPase AAA-2

DNA sequence :
ATGTTTCGATTGGACAAACTGACTCAAAAGGCGCAGGATGCCCTGCAGCAAACGCAGGCCCTGGCCGAGAAAAATGACAG
CCAGATCATGTTCCCACTGCACCTGTTGATCGCCCTGGCTGAAGAACAGGAAGGCATCGTGCGCCCCGTGCTCGAAAAGT
GCGGCGTGCAGCCCGAAGCCGTCATCACCGAGGCGCAGCACCTCGCGGTATCCCTGCCGAAGACCAACGGCATGCAGCCT
GGCATGTACCTTGCCCCGCAGCTCAACAAAATCATGGAGCGCGCCTTCGATGAGGCCATCCGCTTCAAAGACGAATTTGT
CTCTACCGAGCACCTGCTGCTCTCGATCGCCGAGGAGACCAGCGATCCGGCCGGACAGCTTCTGAAACAGGCCGGCGCCA
GCCATGATGCGATTTTGAAGGCCCTGGTCTCCGTCCGCGGCACGCAGCGCATCACGGACCAGAATCCGGAGACCAAATAC
CAGGCGTTGGAGCGCTATGCGCACGACCTGACCGAATCCGCGCGTCTCGGGAAGCTCGATCCCGTGATCGGGCGCGACGA
TGAGATTCGCCGCGTCATGCAGGTTCTCAGCCGCCGCACCAAGAACAACCCGGTGCTGATCGGCGAACCCGGCGTCGGCA
AGACCGCCATCGTCGAAGGACTTGCGCAGCGGATCATCCGCGGCGACGTGCCGGATCAGCTCAAGAACAAGAAGCTGGTC
GCGATCGATCTCGGCAGCATGATCGCCGGCACCAAGTTCCGCGGCGAATTCGAAGATCGCCTTAAAGCCGTGGTCAAGGA
AATCATCGATTCCAACGGCGAGATCATCTGCTTCATCGACGAGCTGCACACGCTCGTGGGCGCCGGCTCCGCAGAAGGCG
CGATCGATGCCGCCAACCTGCTGAAACCGGCGCTGGCTCGCGGCGAGCTGCGCTGCATCGGCGCTACTACGCTGAACGAG
TACAAGAAGCACGTCGAGAAGGACGCCGCCCTGGCGCGCCGCTTCCAGACCGTCATGGTCGGCGAGCCGAGCGTGGATGA
TACGATTGCGATTCTGCGCGGGCTCAAGGAGAAGTTCGAGATCCACCATGGCGTGCGCATCAAGGACTCGGCAATCCTCG
CGGCCGCCGTGCTTTCGCAGCGCTACATTGCCGACCGCTTCCTGCCCGATAAGGCCATCGATCTGATCGACGAGGCCGGC
AGCGCGCTGCGTCTCCAGATCGGCTCCATGCCGATCGAGATCGACAACCTGGAGCGCCGCATCTCGCATCTCGAGATCGA
GCGCCAGGCCTTGAGCAAGGAGCGCGATCACGCGTCCCACGACCGCCTGCGCCATGTCGAGAACGAACTGGAATCGCTGC
GTGGCGAAGCCGCCGGTTTGAAAGAGCGGTGGAGCCGCGAAAAAGGCGCGATCGATCGGGTCCGCACCCTCAAAGAGCAG
CAGGATTCCCTGCGGCAGGAAGAGGAAAAGGCCAGCCGCGCGGGCGATTGGGAGAAGGCTGCCCAGCTTCGCTACGGCCA
GCTCTCGCAGATCGAGAAGGACATTCAGGCCGCCGAGCAGGAGATGGAGTCCGTCAAATCGCACGCTCTTCTGAAGGAAG
AGATCGATGAAGACGATATCGCCGCCATCGTTGCCAAGTGGACCGGAATCCCCGTGACCCGCATGCTCGAAGGCGAAGTC
CAGAAGCTGGTTCAGATGCCCGACCGGCTCAAAGACCGGGTTATCGGCCAGGACGAAGCCGTCCGGCTGGTGGCCAACGC
CATCCTGCGCAATCGCGCCGGACTCGGCGATCCGCAGCGGCCGATCGGCAGCTTCATTTTCCTCGGGCCCACCGGCGTCG
GTAAAACCGAGCTGGTGCGGGCTCTCGCTCAATACCTGTTTGACGACAGCAAAGCCATGATCCGCGTCGATATGAGCGAG
TACATGGAGAAGCATGCCGTCGCCCGGATGGTTGGCTCGCCTCCCGGATACGTCGGCTACGATGAGGGCGGACAGCTCAC
CGAACAGGTGCGCCGCCGGCCTTATTCGGTGGTCCTTTTCGATGAAATCGAGAAGGCGCACCCCGATGTCTTCCACATGC
TCCTGCAGATACTCGACGACGGCCGCCTGACCGATGGCCAGGGCCGTACCGTGAGTTTCAAGAACACGGTCATCGTCATG
ACCAGCAACGTCGGCACCGGAATGGTCGAGCGCAATACCATCGGCTTCTCGGTGCATGCCAAGAACACTCGCAACGCGGA
CACTCAGAAGCGCCTGCTCGAAGCTCTGCGCGCCCAGTTCCGGCCCGAGTTCCTCAACCGCGTTGACGATATCATTGTCT
TTAACTCGCTGACTCGCGATCATCTCGCTCGCATCGTGGATATCCAGTTGGCCAACGTGGGCAAACTCTTGAAAGATCGC
AAGCTGAATATCGAAATCACTCCCGCGGCCAAAGACCGGATCATCTCCGAAGGTTACGATCCCGATTACGGCGCACGGCC
CATGCGTCGCGCCATACAGCGCCTCGTCCAGGACCCGCTCGCCCTCAAGGTGATCAGCGGCGAATTCGAGGAAGGTGACA
CCATCCTGGTGGACGCCGCTCCGGACGGAGCCGAACTCACGTTCGCAAAGCTGGTTCCGATAACGGAACCAGCGGTTCCC
ATCTCCAGGTAA

Protein sequence :
MFRLDKLTQKAQDALQQTQALAEKNDSQIMFPLHLLIALAEEQEGIVRPVLEKCGVQPEAVITEAQHLAVSLPKTNGMQP
GMYLAPQLNKIMERAFDEAIRFKDEFVSTEHLLLSIAEETSDPAGQLLKQAGASHDAILKALVSVRGTQRITDQNPETKY
QALERYAHDLTESARLGKLDPVIGRDDEIRRVMQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIIRGDVPDQLKNKKLV
AIDLGSMIAGTKFRGEFEDRLKAVVKEIIDSNGEIICFIDELHTLVGAGSAEGAIDAANLLKPALARGELRCIGATTLNE
YKKHVEKDAALARRFQTVMVGEPSVDDTIAILRGLKEKFEIHHGVRIKDSAILAAAVLSQRYIADRFLPDKAIDLIDEAG
SALRLQIGSMPIEIDNLERRISHLEIERQALSKERDHASHDRLRHVENELESLRGEAAGLKERWSREKGAIDRVRTLKEQ
QDSLRQEEEKASRAGDWEKAAQLRYGQLSQIEKDIQAAEQEMESVKSHALLKEEIDEDDIAAIVAKWTGIPVTRMLEGEV
QKLVQMPDRLKDRVIGQDEAVRLVANAILRNRAGLGDPQRPIGSFIFLGPTGVGKTELVRALAQYLFDDSKAMIRVDMSE
YMEKHAVARMVGSPPGYVGYDEGGQLTEQVRRRPYSVVLFDEIEKAHPDVFHMLLQILDDGRLTDGQGRTVSFKNTVIVM
TSNVGTGMVERNTIGFSVHAKNTRNADTQKRLLEALRAQFRPEFLNRVDDIIVFNSLTRDHLARIVDIQLANVGKLLKDR
KLNIEITPAAKDRIISEGYDPDYGARPMRRAIQRLVQDPLALKVISGEFEEGDTILVDAAPDGAELTFAKLVPITEPAVP
ISR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
clpC YP_005163377.1 ATP-dependent Clp protease ATP-binding subunit Not tested Not named Protein 1e-172 45
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 1e-106 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Acid_2333 YP_823607.1 ATPase VFG2076 Protein 1e-118 43
Acid_2333 YP_823607.1 ATPase VFG2084 Protein 7e-113 41