Gene Information

Name : DICTH_0749 (DICTH_0749)
Accession : YP_002250615.1
Strain : Dictyoglomus thermophilum H-6-12
Genome accession: NC_011297
Putative virulence/resistance : Virulence
Product : ATPase, AAA family
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 759630 - 762314 bp
Length : 2685 bp
Strand : +
Note : identified by match to protein family HMM PF00004; match to protein family HMM PF02861; match to protein family HMM PF07724; match to protein family HMM PF07728

DNA sequence :
ATGAGTAGAAAATTATGCGATATATGTGGAGTAAACCCTGCAACCATTAGGGTTTATACCATAAAAAATGGCCAAAGACG
GATTCTTGATATATGCGAAGAATGTTATGCAAAACAAAGGAGACAAGAAAGATCTTATTCTCCTTTAGAATCTCTCTTCT
TTGGGGATCTTTTTAGAGACTTTTTTGGAGAAGATTTTGGTCTTCCCTTTGAAGGATCCTTAGGAAGAGAGAGAGAATCT
ATAGATTTGAATGATTATTTGAGTAATGTATCAAAGGATCTTTTACAGCAAGCTGCCAAAAAAGCAATTGATTTTGGTAA
GAAGCAGGTAGGTACTGAACATTTACTCTATGTTCTCTTGGATAATGACGTAGTGAATGAGATTTTAAAGCAGTTTAAGA
TTTCTCCACAAGAGATTAAGGCTTATATAGACAATAATGCTCCTAAGGGTGGATTTAAGCCTGAGGGAGAGGAGGTAGAA
GTAGATATATCTCCAAGACTGAAAAGCGCTTTAGAGAGAGCTTTTATGGCTGCAAGGGATCTGGAGCATAATTATATAGG
ACCAGAGCATCTTTTAATAGGGCTCGCTGAGGAAGAAGAAGGATTTGCTGCTAATGTTCTAAGAAAATATGGTTTGACTC
CACAAGCTTTAAGACAAGCAACTATAAGGGTTGTAGGTAGAGGTAAAGAGGCTGGTAAAGGAGTTAGAAGGTCTAATACT
CCAACTCTTGATAAGTTCTCTCGTGATTTAACCGAATTGGCAAGACAAGGTAAGTTAGATCCTGTGATAGGAAGAGCAAA
GGAAATTGAGACTGTAATTGAAGTTCTTGCAAGAAGAAAGAAGAACAACCCTGTGCTTATAGGAGAGCCAGGTGTAGGTA
AGACAGCAATTGTAGAGGGGCTTGCTCAAAGAATTGTTAAAGGAGAAGTTCCAGAAGTTTTGAGAGATAAAAGATTAGTC
GAATTAAACATAAATACTTTAGTTGCAGGAACAAAGTATAGAGGAGAGTTGGAAGAGAGAGTTAAGCAGATCTTGGATGA
GATAATAGCCAACCAAGATAGCCTTATAATATTTATCGATGAAATTCATACTATTGTGGGAGCAGGTGCCGCTGGAGAAG
GAAGTTTAGATATTGCTAATGCCTTTAAACCTGCATTAGCAAGGGGAGAACTACACTTGATTGGTGCTACAACATTAAAT
GAATATCAAAAATATATTGAAAAGGATCCTGCCCTTGAGAGAAGATTTCAACCTATATTTGTTTCTGAGCCTACAGTAGA
GCAAACAATAATGATATTGAGAGGTTTGAGAGATAGGTTTGAGGCTCATCACAAGGTAAAAATTACTGATGAGGCGATAA
TTGCTGCTGCTGAGCTTTCAGATAAGTATATCACTAACAGATACTTGCCTGATAAGGCTATAGATCTCATAGATCAGGCG
GCTGCTAGGGTAAGAATTATGATGACTTCAAGACCTGCCGAAATACAGGAACTTGAAGCTAAGATTCAGTCTTTGAAGAG
GGAGCAAGAATATGCATCTTCGAGAAAACAGTATGATAGAGCAAAGGAGCTTGAAGAACAGATCCAGAAATTAGAGAAGG
AATTACAAGAAAAAACTGAGGCTTGGAAAAAAGAAATAGCATCTGATGTTCCAGAGGTAAGAGCTAAACATATAGCTGAA
ATAGTTTCTTCACTTACTGGTATTCCTGTAACAGAGCTTACTACCGATGAAAAAGAGAGATTATTAAAGTTAGAGGAAAA
ACTTCATGAAAGGGTGGTAGGACAGGATGAAGCTATAAAGGCAGTAAGTGATGCTATAAGGTTGGCAAGAGCAGGTCTTA
GGGAAAAGAATAGACCTATTGCTACTTTTCTGTTCCTTGGTCCTACTGGAGTAGGTAAGACAGAGCTTGCAAGGGCGCTT
GCATGGGCAGTGTTTGGGGATGAAGATGCTATTATTCGTATTGACATGAGTGAATATATGGAGAGACATACTGTTTCAAG
GTTGATTGGTGCACCACCAGGATATGTAGGATATGAAGAGGGAGGACAACTTACTGAGAAGGTACGTAGAAGACCATATA
GTGTCATTCTCTTAGATGAAATAGAGAAAGCGCATCCCGATGTACATAATATATTGTTACAGGTATTTGACGCAGGAAGA
CTTACTGATGGAAAAGGAAGGGTTGTGGACTTTACAAATACCATAATTATAATGACAAGTAATATAGGATCTGACATTAT
TCAGGCCAATTTAACTGCTACAGGAAGAGATAAGTTAAGCTATGAGCAGCTTAAAGAAAAGCTTATGGATATACTTAAGA
GATACTTTAGACCAGAATTTCTTAATAGAATTGATGAAATAATAGTATTCCATGCGTTAACTAAGGAGCAGGTCAGAGAT
ATTGTCAAGTTACAACTTGAGAGAGTAAGGAGAACGGCTAGGGCTCAAAATATAGAGCTCGTGTTTGATGAGTCAGTGGT
AGATTTCTTTGCTGAGATAGGATATAGCCCTGAATTTGGGGCAAGAGAGCTCAAGAGAAAGATTAGAAATGAACTTGAGA
CAAAGCTTGCTAAGGCTATGCTTGAAGGAGCAATTCAAGAAGGAGATAAGATTAGGGTAGTTTATAATAAAGAGAAAAAT
ACAATAGAGTTTGAGAAGATAACTGAAGAAGCGAAGATAAACTAA

Protein sequence :
MSRKLCDICGVNPATIRVYTIKNGQRRILDICEECYAKQRRQERSYSPLESLFFGDLFRDFFGEDFGLPFEGSLGRERES
IDLNDYLSNVSKDLLQQAAKKAIDFGKKQVGTEHLLYVLLDNDVVNEILKQFKISPQEIKAYIDNNAPKGGFKPEGEEVE
VDISPRLKSALERAFMAARDLEHNYIGPEHLLIGLAEEEEGFAANVLRKYGLTPQALRQATIRVVGRGKEAGKGVRRSNT
PTLDKFSRDLTELARQGKLDPVIGRAKEIETVIEVLARRKKNNPVLIGEPGVGKTAIVEGLAQRIVKGEVPEVLRDKRLV
ELNINTLVAGTKYRGELEERVKQILDEIIANQDSLIIFIDEIHTIVGAGAAGEGSLDIANAFKPALARGELHLIGATTLN
EYQKYIEKDPALERRFQPIFVSEPTVEQTIMILRGLRDRFEAHHKVKITDEAIIAAAELSDKYITNRYLPDKAIDLIDQA
AARVRIMMTSRPAEIQELEAKIQSLKREQEYASSRKQYDRAKELEEQIQKLEKELQEKTEAWKKEIASDVPEVRAKHIAE
IVSSLTGIPVTELTTDEKERLLKLEEKLHERVVGQDEAIKAVSDAIRLARAGLREKNRPIATFLFLGPTGVGKTELARAL
AWAVFGDEDAIIRIDMSEYMERHTVSRLIGAPPGYVGYEEGGQLTEKVRRRPYSVILLDEIEKAHPDVHNILLQVFDAGR
LTDGKGRVVDFTNTIIIMTSNIGSDIIQANLTATGRDKLSYEQLKEKLMDILKRYFRPEFLNRIDEIIVFHALTKEQVRD
IVKLQLERVRRTARAQNIELVFDESVVDFFAEIGYSPEFGARELKRKIRNELETKLAKAMLEGAIQEGDKIRVVYNKEKN
TIEFEKITEEAKIN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
clpC YP_005163377.1 ATP-dependent Clp protease ATP-binding subunit Not tested Not named Protein 6e-167 49

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
DICTH_0749 YP_002250615.1 ATPase, AAA family VFG0079 Protein 0.0 56
DICTH_0749 YP_002250615.1 ATPase, AAA family VFG0080 Protein 8e-141 48