Gene Information

Name : FraEuI1c_0153 (FraEuI1c_0153)
Accession : YP_004014111.1
Strain : Frankia sp. EuI1c
Genome accession: NC_014666
Putative virulence/resistance : Virulence
Product : ATPase AAA-2 domain-containing protein
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 198616 - 201126 bp
Length : 2511 bp
Strand : -
Note : KEGG: fal:FRAAL6680 ATP-dependent protease, HSP 100, part of multi-chaperone system with DnaK, DnaJ, and GrpE; PFAM: ATPase AAA-2 domain protein; AAA ATPase central domain protein; UvrB/UvrC protein; Clp domain protein; Clp ATPase-like; SMART: AAA ATPase
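
The Position and Length fields above can be cross-checked with a short calculation. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for such records); the function name `gene_length_bp` is hypothetical and not part of any database tool.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1

# Values taken from the Position field of this record.
length_bp = gene_length_bp(198616, 201126)

# A complete coding sequence is a whole number of codons; the last one is the stop codon.
codons, remainder = divmod(length_bp, 3)
protein_length_aa = codons - 1

print(length_bp, remainder, protein_length_aa)
```

Under that assumption the result agrees with the Length field (2511 bp) and with the 836 residues listed under Protein sequence.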

DNA sequence :
ATGTTCGAGAGATTCACCGACCGGGCACGTCGGGTCGTCGTCCTGGCTCAAGAAGAAGCCAGGATGCTCAACCACAACTA
CATCGGCACCGAGCACATCCTTCTCGGCCTGATCCACGAGGGCGAGGGTGTCGCGGCGAAGGCGCTGGAGTCGCTGGGGA
TCTCGCTCGAGGGTGTGCGGTCGCAGGTTGAAGAGATCATCGGTCAGGGCCAGCAGGCCCCGAGCGGGCACATCCCCTTC
ACCCCGCGCGCGAAGAAGGTCCTCGAGCTCTCCCTGCGCGAGGCGCTGCAGCTCGGTCACAACTACATCGGCACCGAGCA
CATCCTCCTCGGCCTGATCCGCGAGGGTGAGGGCGTCGCCGCCCAGGTGCTGGTCAAGCTCGGCGCCGACCTCAACCGGG
TCCGCCAGCAGGTCATCCAGCTGCTCTCCGGCTACCAGGGCAAGGGCGACCCGGCCACCGCCGGCGCTCCCTCCGAGGGC
ACGCCGTCGACGTCGCTGGTGCTCGACCAGTTCGGCCGCAACCTGACCGCGGCCGCCCGCGAGGCCAAACTCGACCCGGT
CATCGGGCGCGAGAAGGAAATCGAGCGGGTCATGCAGGTCCTGTCGCGGCGGACCAAGAACAACCCGGTCCTGATCGGCG
AGCCCGGCGTCGGCAAGACCGCCGTCGTCGAGGGCCTGGCGCAGGCGATCGTCAAGGGCGAGGTGCCCGAGACGCTCAAG
GACAAGCAGCTCTACACGCTTGACCTGGGCGCCCTGGTCGCCGGCTCCCGCTACCGCGGTGACTTCGAGGAGCGCCTGAA
GAAGGTCCTCAAGGAGATCCGCACCCGCGGCGACATCATCCTGTTCATCGACGAGCTGCACACGCTCGTCGGCGCGGGTG
CCGCCGAGGGCGCGATCGACGCCGCGTCGATCCTCAAGCCGATGCTGGCCCGTGGCGAGCTGCAGACGATCGGTGCGACC
ACGCTCGACGAGTACCGCAAGCACCTGGAGAAGGACGCCGCGCTGGAGCGCCGTTTCCAGCCGATCCAGGTCGCGGAGCC
GTCGGTGGCGCACACCATCGAGATCCTCAAGGGCCTGCGCGACCGGTACGAGGCGCACCACCGCGTCTCGATCACCGACG
CCGCCCTGGTCGCCGCCGCGTCACTGGCCGACCGGTACATCTCGGACCGCTTCCTGCCGGACAAGGCGATCGACCTGATC
GACGAGGCCGGTTCCCGGATGCGCATCCGCCGGATGACCGCTCCGCCGGATCTGCGCGAGTTCGACGAGCGCATCGCGAA
CGTCCGCCGGGACAAGGAGTCCGCGATCGACGCGCAGGACTTCGAGAAGGCGGCCTCGCTGCGCGACAAGGAAAAGAACC
TGATCGCGGACAAGGCCAAGCGCGAGAAGGAGTGGAAGGCCGGCGACATGGACGTCGTCGCCGAGGTGGGCGACGAGGAG
ATCGCCGAGGTGCTCGCCATCTGGACGGGCATCCCGGTCTTCAAGCTCACCGAGGAGGAGACGGCCCGCCTCCTGCGCAT
GGAGGACGAGCTGCACCGGCGCGTCATCGGCCAGCAGCAGGCCATCAAGGCCGTCTCCCAGGCGATCCGGCGTACTCGGG
CCGGCCTGAAGGACCCGAAGCGCCCAGGCGGCTCGTTCATCTTCGCCGGCCCGTCCGGTGTCGGTAAGACCGAGCTGTCC
AAGACGCTGGCCGAGTTCCTGTTCGGCGACGAGGACGCACTGATCCAGCTCGACATGTCCGAGTACATGGAGAAGCACAC
CGTCTCGCGGCTGGTGGGCTCGCCGCCCGGCTATGTCGGCTACGAGGAGGGCGGCCAGCTCACCGAGCGGGTGCGCCGCA
AGCCGTTCTCCGTGGTCCTCTTCGACGAGGTCGAGAAGGCCCACCCGGACGTCTTCAACACGCTCCTGCAGATCCTGGAG
GACGGTCGCCTGACCGACTCCCAGGGCCGGCTGGTCGACTTCAAGAACACCGTCCTGATCATGACGTCGAACCTGGGCAC
CCGCGACATCTCCAAGGGCCCCGGCATCGGCTTCGCGACCGGTCAGGGCGCGGTCGACTACGAGCGCATGAAGGCCAAGG
TCCAGGACGAGCTCAAGCAGCACTTCCGGCCCGAGTTCCTGAACCGCATCGACGACATCATCGTCTTCCACCAGCTGTCC
CAGGACGAGATCATCCAGATCGTCGACCTGATGCTGGCGCGGGTCGACGCGCAGCTGAAGAACAAGGACATGGCGCTGGA
GCTCACGCCGGCCGCCAAGCAGCTGCTCGCCGTCCGTGGCTACGACCCCGTGCTGGGCGCCCGGCCGCTGCGCCGGACGA
TCCAGCGCGAGATCGAGGACGTGCTGTCCGAGAAGATCCTGTACGGCACCCTGAAGCCGGGCGAGATCGTGATCGGCGAC
GTCGAGGGCGACCCCACCGCCGAGGACGCGAAGTTCGTCTTCCGCGGTGAGACCAAGCCCAGCTCGGTCCCGGACACGCC
CCCGATCGACCTGGCCAAGTCCGGCGAATAG

Protein sequence :
MFERFTDRARRVVVLAQEEARMLNHNYIGTEHILLGLIHEGEGVAAKALESLGISLEGVRSQVEEIIGQGQQAPSGHIPF
TPRAKKVLELSLREALQLGHNYIGTEHILLGLIREGEGVAAQVLVKLGADLNRVRQQVIQLLSGYQGKGDPATAGAPSEG
TPSTSLVLDQFGRNLTAAAREAKLDPVIGREKEIERVMQVLSRRTKNNPVLIGEPGVGKTAVVEGLAQAIVKGEVPETLK
DKQLYTLDLGALVAGSRYRGDFEERLKKVLKEIRTRGDIILFIDELHTLVGAGAAEGAIDAASILKPMLARGELQTIGAT
TLDEYRKHLEKDAALERRFQPIQVAEPSVAHTIEILKGLRDRYEAHHRVSITDAALVAAASLADRYISDRFLPDKAIDLI
DEAGSRMRIRRMTAPPDLREFDERIANVRRDKESAIDAQDFEKAASLRDKEKNLIADKAKREKEWKAGDMDVVAEVGDEE
IAEVLAIWTGIPVFKLTEEETARLLRMEDELHRRVIGQQQAIKAVSQAIRRTRAGLKDPKRPGGSFIFAGPSGVGKTELS
KTLAEFLFGDEDALIQLDMSEYMEKHTVSRLVGSPPGYVGYEEGGQLTERVRRKPFSVVLFDEVEKAHPDVFNTLLQILE
DGRLTDSQGRLVDFKNTVLIMTSNLGTRDISKGPGIGFATGQGAVDYERMKAKVQDELKQHFRPEFLNRIDDIIVFHQLS
QDEIIQIVDLMLARVDAQLKNKDMALELTPAAKQLLAVRGYDPVLGARPLRRTIQREIEDVLSEKILYGTLKPGEIVIGD
VEGDPTAEDAKFVFRGETKPSSVPDTPPIDLAKSGE
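
The relationship between the DNA and protein sequences above can be illustrated with a small translation routine. This is a sketch only, assuming the DNA sequence is shown in the coding (sense) orientation and that the standard codon assignments apply; the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database API.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard codon assignments; '*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon; trailing partial codons are ignored."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - len(dna) % 3, 3)
    )

# First 24 bases and last 9 bases of the DNA sequence in this record.
print(translate("ATGTTCGAGAGATTCACCGACCGG"))
print(translate("GGCGAATAG"))
```

The first call reproduces the opening residues of the protein sequence (MFERFTDR), and the second ends in a stop codon after the final residues G and E.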

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
clpC | YP_005163377.1 | ATP-dependent Clp protease ATP-binding subunit | Not tested | Not named | Protein | 0.0 | 75
STY0294 | NP_454876.1 | ClpB-like protein | Not tested | SPI-6 | Protein | 5e-97 | 41

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
FraEuI1c_0153 | YP_004014111.1 | ATPase AAA-2 domain-containing protein | VFG0079 | Protein | 0.0 | 63
FraEuI1c_0153 | YP_004014111.1 | ATPase AAA-2 domain-containing protein | VFG0080 | Protein | 1e-151 | 52