Gene Information

Name : Franean1_0229 (Franean1_0229)
Accession : YP_001504602.1
Strain : Frankia sp. EAN1.pec
Genome accession: NC_009921
Putative virulence/resistance : Virulence
Product : ATPase
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 279463 - 282075 bp
Length : 2613 bp
Strand : -
Note : PFAM: AAA ATPase central domain protein; Clp domain protein; ATPase associated with various cellular activities AAA_5; ATPase AAA-2 domain protein; SMART: AAA ATPase; KEGG: fal:FRAAL6643 ATP-dependent protease, HSP 100, part of multi-chaperone system with

DNA sequence :
ATGAACGCTGACCGTCTCACCGCCCGCTCGCAGGAGGCTCTGTCCTCCGCGATCAGCCGTGCCACCGGCGACGGATCGCC
GCTCGTCGACCCGCTGCACCTGCTGACCGCCCTGCTCGAGGCGCCCGACGGTGTCGGTGCCGCCCTGCTGGAGGCCGTCG
GCACCCCGGCGGCGGACATCCGCTCCCGGGCGGAGGCCGCGGTGGGCCGGCTCCCCCGCGCCGCCGGGGCCAACGTGGCG
CCGCCGCAACTGTCCCGGCAGCTCGTCGCCGTCCTGAACAACGCCGAGCGCCAGGCCGCCCGGCTCGGCGATGAGTACAC
CTCGGTCGAGCACCTGGTCGTGGCGCTCGCGGAGGAGGGCGGGGAGGCGTCCCGCATCCTCGCCGAGGCGGGCGCGACCC
CGGACGCCCTGCGCGGCGCGTTCGACCGCGTCCGCGGCGGCGCCCGCCGCGTCACCAGCCGGGATCCGGAGGGGGCCTAC
CGGGCGCTCGAGAAGTACTCCATCGACCTCACCGCGCGGGCCCGCGACGGCAAGCTCGACCCGGTGATCGGCCGCGACAC
CGAGATCCGCCGGGTCGTGCAGGTTCTCTCCCGGCGCACGAAGAACAACCCGGTCCTGATCGGCGAGCCCGGCGTCGGCA
AGACGGCGATCGTCGAGGGGCTCGCGCTGCGGGTGGCCGCGGGTGACGTCCCGGAGTCGCTGCGCGGGCGGCGCATCGTC
TCGCTCGACCTCGGCTCGATGGTCGCCGGCTCCAAGCTGCGCGGCGAGTTCGAGGAACGGCTGACCTCGGTGCTCACCGA
GATCCGCGAGGCCGAGGGCCAGATCATCACCTTCATCGACGAGCTGCACACCGTCGTCGGCGCCGGCGCGGCCGAGGGCG
CGATGGACGCCGGCAACATGCTCAAGCCGATGCTCGCCCGCGGTGAGCTGCGCATGATCGGCGCGACGACGCTGGACGAG
TACCGCACCCGCATCGAGAAGGACCCGGCGCTGGAGCGCCGCTTCCAGCCCGTGATGGTCGGGGAGCCGTCCGTGGAGGA
CACGATCGGCATCCTGCGCGGGCTCAAGGAGCGTTACGAGGTCCACCACGGGGTGCGGATCACCGACTCGGCGCTGGTGG
CCGCGGCCACCCTGTCCGACCGGTACGTCACCGCCCGGTTCCTCCCCGACAAGGCGATCGACCTGATGGATGAGGCGGCG
TCCCGGCTACGGATGGAGATCGACAGCCGGCCGGTCGCCGTCGACGAGCTCGAGCGGGCCGTGCGCCGTCTCGAGATCGA
GGACATGGCGCTGTCGAAGGAGAACGACGACGCGTCCCGGGAACGGCGCGACCGGCTGCAGCGCGAGCTGGCGGAGAAGC
GCGAGGAGCTCTCCGCGCTGACCGCGCGGTGGCAGCGGGAGAAGAACTCCATCTCCGAGGTCCAGAAGATCAAGGAGGAG
CTGGAGAACGCCCGCCGCGCCGCCGAGATGGCCGAGCGCGACCTCGACCTCGCCAAGGCCGGTGAGCTGCGGTACGGCAC
GATCCCGACGCTGGAGAAGCGGCTCGCCGAGGCGACCGGCGCGCTCGCCGGATCGGACTCGCCCGGCGGGGCGATGCTCA
GCGAGGAGGTCGGTCCCGACGACGTCGCCGAGGTCGTCGCCTCGTGGACGGGCATCCCCGCCGGCCGCATGCTCGAGGGC
GAGACGAGCAAGCTCCTGCGCATGGAGACGGAGCTGCACCGTCGCGTGATCGGGCAGGACGAGGCCGTGCGCACCGTGGC
GGACGCCGTCCGCCGCGCGCGGGCCGGCATCGCCGACCCGGACCGGCCGACCGGGTCGTTCCTCTTCCTCGGGCCGACGG
GTGTGGGCAAGACGGAGCTGGCCAAGGCGCTCGCCGACTTCCTGTTCGACGACGAGCGGGCGGTCGTGCGCATCGACATG
AGCGAGTACGCCGAGAAGCACTCGGTGGCGCGGTTGATCGGCGCGCCTCCCGGCTACGTCGGCTTCGAGTCCGGCGGCCA
GCTCACCGAGGCGATCCGGCGCCGCCCGTACAGCGTGATCCTGCTCGACGAGGTCGAGAAGGCGCACCCGGACGTCTTCG
ACGTGCTGCTCGCCGTACTCGACGACGGCCGGCTGACCGACGGCCAGGGCCGCACGGTCGACTTCCGGAACACCATCCTG
ATCCTGACCTCGAACCTGGGGTCGGTCTACATCGCCGACCCGACCCTGCCCCCGCAGGTCCGCCACGATTCGGTGATGGT
CGCCGTGCGCGACGCCTTCAAGCCGGAGTTCCTGAACCGGCTCGACGACGTGCTGGTCTTCGAGCAGCTCGGCCGGGACG
ATCTGACGAAGATCGTCGACATCCAGATCGACCGGCTGCGCAGGCGGCTGGCCGACCGCCGGATCTCCCTCGAGGTGACC
GACGCCGCCAAGGTCTGGCTCGCGGACGCCGGCTACGACCCGGTGTACGGGGCGCGGCCGCTGCGCCGCCTGGTGCAGAC
CTCGATCGGCGACCAGCTCGCCCGCGAGCTGCTGGCCGGCCAGATCAGGGACGGCGACGGGGTCGTGGTCGACGTGGACG
GGCAGCGCTCGGCGCTGAGCGTCCACTCCGCGGCCCGCGCGCAGGCCATCTGA

Protein sequence :
MNADRLTARSQEALSSAISRATGDGSPLVDPLHLLTALLEAPDGVGAALLEAVGTPAADIRSRAEAAVGRLPRAAGANVA
PPQLSRQLVAVLNNAERQAARLGDEYTSVEHLVVALAEEGGEASRILAEAGATPDALRGAFDRVRGGARRVTSRDPEGAY
RALEKYSIDLTARARDGKLDPVIGRDTEIRRVVQVLSRRTKNNPVLIGEPGVGKTAIVEGLALRVAAGDVPESLRGRRIV
SLDLGSMVAGSKLRGEFEERLTSVLTEIREAEGQIITFIDELHTVVGAGAAEGAMDAGNMLKPMLARGELRMIGATTLDE
YRTRIEKDPALERRFQPVMVGEPSVEDTIGILRGLKERYEVHHGVRITDSALVAAATLSDRYVTARFLPDKAIDLMDEAA
SRLRMEIDSRPVAVDELERAVRRLEIEDMALSKENDDASRERRDRLQRELAEKREELSALTARWQREKNSISEVQKIKEE
LENARRAAEMAERDLDLAKAGELRYGTIPTLEKRLAEATGALAGSDSPGGAMLSEEVGPDDVAEVVASWTGIPAGRMLEG
ETSKLLRMETELHRRVIGQDEAVRTVADAVRRARAGIADPDRPTGSFLFLGPTGVGKTELAKALADFLFDDERAVVRIDM
SEYAEKHSVARLIGAPPGYVGFESGGQLTEAIRRRPYSVILLDEVEKAHPDVFDVLLAVLDDGRLTDGQGRTVDFRNTIL
ILTSNLGSVYIADPTLPPQVRHDSVMVAVRDAFKPEFLNRLDDVLVFEQLGRDDLTKIVDIQIDRLRRRLADRRISLEVT
DAAKVWLADAGYDPVYGARPLRRLVQTSIGDQLARELLAGQIRDGDGVVVDVDGQRSALSVHSAARAQAI

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 3e-98 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Franean1_0229 YP_001504602.1 ATPase VFG2084 Protein 6e-101 43