Gene Information

Name : Marky_1987 (Marky_1987)
Accession : YP_004368826.1
Strain : Marinithermus hydrothermalis DSM 14884
Genome accession: NC_015387
Putative virulence/resistance : Virulence
Product : ATP-dependent chaperone ClpB
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 1999424 - 2001991 bp
Length : 2568 bp
Strand : +
Note : COGs: COG0542 ATPase with chaperone activity ATP-binding subunit; InterPro: IPR003593:IPR017730:IPR004176:IPR003959:IPR013093:IPR019489; KEGG: opr:Ocepr_0433 ATP-dependent chaperone ClpB; PFAM: ATPase, AAA-2; ATPase, AAA-type, core; Clp, N-terminal; Clp AT
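
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is consistent with the stated length of 2568 bp); the function name `feature_length` is purely illustrative.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
start, end = 1999424, 2001991

length_bp = feature_length(start, end)
codons = length_bp // 3   # number of codons, including the stop codon
residues = codons - 1     # the stop codon does not encode a residue

print(length_bp, codons, residues)  # prints: 2568 856 855
```

Under that assumption the gene spans 856 codons, which agrees with the 855-residue protein sequence listed in this record.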

DNA sequence :
GTGAACCTAGAGAAATGGACCGAAGCTTCCCGCCAAGCCCTGGCCCAAGCCCAGGTGCTGGCGCGCGAGATGGGGCACCA
GCAGATCGACCTGCCCCACCTCATCGCCGTGCTACTGCGCGAGCCCAACGGGCTGCCCTCCCGCGTCCTCGAGCGGGCGG
GCCAGGACCCCAAGGCCGCCCTCGAGGCCGCCCAGGCGGCGCTCGCTCGAGCCCCGCGCGTGGAGGGCGCCCAACCCGGC
CAGTACCTCTCGGGCCAGCTCGCAAAAGCCATTGAGCGGGCCGAAAAGCTCGCCGAGGAGTGGGGCGACCGGTTCGTCGC
CGTGGACCTGCTGCTGCTCGCCGCGGCCGAGGCCGGCCACCCCGGCCTCCCGCCGGCGGACCAGCTCAAGCAAGCCATCC
AAGCCATACGGGGGGGGAGAACCGTGGAAAGCGAACACACCGAAGGCACCTTCCAAGCCCTGGAACAGTACGGGGTTGAC
CTTACGCGCCTCGCGCAAGAAGGCAAGCTCGACCCGGTGATCGGGCGGGACGAGGAGATCCGCCGCACGGTCCAGATCCT
CCTGCGCCGCACCAAGAACAACCCCGTCCTCATCGGGGAGCCCGGCGTGGGGAAGACCGCGATCGTCGAGGGCCTCGCCC
AGCGCATCGTCAAAGGCGACGTGCCCGAAGGACTCAAGAACAAACGCATCGTCGCGCTGCAGATGGGCAGCCTCCTCGCC
GGGGCCAAGTACCGCGGCGAGTTCGAGGAACGCCTCAAGGCCGTTATCCAGGAGACCATCCAGTCCCAAGGGGAGGTCAT
CCTCTTCATCGACGAGCTGCACACCATCGTCGGCGCGGGCAAAGCCGAAGGCGCGGTGGACGCGGGCAACATGCTCAAAC
CCGCCCTCGCCCGCGGCGAGCTCCGCCTGATCGGCGCCACCACCCTCGACGAGTACCGCGAGATCGAGAAGGACGCCGCC
CTCGAGCGCCGCTTCCAACCGGTCCTCGTGGACGAGCCTAGCGTTGAGGACACGATCTCGATCCTGCGGGGGATCAAGGA
GAAGTACGAGGTGCACCACGGCGTGCGCATCGCGGACCCCGCGATCGTCGCCGCGGCCGTCCTCTCCCACCGCTACATCA
CCGACCGACGGCTGCCGGACAAGGCCATCGACCTGGTGGACGAGGCCGCGAGCCGCCTCCGCATGCAGCTCGAGTCCAGC
CCCGAAGCCATCGACACCCTCGAGCGCAAGAAGCTCCAGCTCGAGATCGAGCGGGAGGCCCTAAAGAAAGAGAAAGACCC
CGACTCCCGCGCGCAGCTCGAGGCGATCGAGAACGAGCTCGAGCAGCTCAACGCCCAGATCGCCCAGATGCGCGCCGAGT
GGGAGGCGGAACGCGAGGCCTTGCAGAAGCTCCGCGAAGCGCAGAAGAAGCTGGACGAAACCCGCACCGCGATCGAACAG
GCCGAGCGCTCCTACGACCTGAACAAAGCCGCCGAGCTGCGCTACGGCGTGCTGCCTAAGCTCGAGCACGAGGTCGAGGC
CCTCTCCGAAAAACTCAAGCACGCCCGCTTCGTGCGCCTCGAGGTCACCGAGGAGGACATCGCCGAGGTCGTCTCCCGCT
GGACCGGGATCCCCGTCGCGAAGCTCTTGGAGGGCGAACGGGAGAAGCTCGTGCGCCTCGAGGACGAGCTGCGCAAGCGG
GTGGTGGGGCAGGACGAGGCGATCGTCGCGGTGGCCGACGCGATCCGCCGCGCCCGCGCGGGCCTCAAGGACCCCAACCG
CCCGATCGGGAGCTTCCTGTTCCTGGGCCCCACCGGGGTGGGTAAGACCGAGCTCGCCAAGACCCTGGCCGCCACGCTCT
TCGACAGCGAGGAGGCCATGGTGCGCATCGACATGACCGAGTACATGGAGAAGCACGCCGTCGCCCGGTTGATCGGCGCG
CCCCCCGGGTACGTGGGGTACGAGGAGGGCGGCCAGCTCACCGAGGTCGTTCGGCGCAAGCCGTACACGGTCATCCTCTT
CGACGAGATCGAGAAAGCCCACCCCGACGTGTTCAACATCCTCCTGCAGATCCTCGACGACGGCCGGCTCACCGACTCGC
ACGGGCGGGTGGTGGACTTCCGGAACACGGTCATCATCCTGACCTCGAACCTCGGCAGCCCCCTCATCCTCGAGGGAATC
CAGCAAGGCTCGAGCTACGAGGGCATCCGCGAACGGGTCTTCCGCGTGCTCCAGGAGCACTTCCGGCCCGAGTTCCTGAA
CCGCCTGGACGAGATCATCGTCTTCCGGCCGCTCACCAAGGAGCAGATCGTGCGGATCGTGGACCTGCAGCTCCAGCGCC
TCCAGGCCCGGCTCCAGGAGAAGCGCGTCACCCTCGAGCTCACGCCCGAGGCCAAGACCTGGCTCGCCGAGCGGGGGTAC
GACCCGGCCTTCGGCGCGCGGCCCCTCAGGCGCGTGATCCAGCGCGAGGTGGAGACGCCCCTGGCGCGGATGATCCTGGA
GGGCCGCATCCCCGAAGGCGCCCGCGTCGTGGCGCGCCCCGGCGAGGCCGGCCTCCGCTTCGAAGCCCAAACCCCCGCCC
AGGCCTAA

Protein sequence :
MNLEKWTEASRQALAQAQVLAREMGHQQIDLPHLIAVLLREPNGLPSRVLERAGQDPKAALEAAQAALARAPRVEGAQPG
QYLSGQLAKAIERAEKLAEEWGDRFVAVDLLLLAAAEAGHPGLPPADQLKQAIQAIRGGRTVESEHTEGTFQALEQYGVD
LTRLAQEGKLDPVIGRDEEIRRTVQILLRRTKNNPVLIGEPGVGKTAIVEGLAQRIVKGDVPEGLKNKRIVALQMGSLLA
GAKYRGEFEERLKAVIQETIQSQGEVILFIDELHTIVGAGKAEGAVDAGNMLKPALARGELRLIGATTLDEYREIEKDAA
LERRFQPVLVDEPSVEDTISILRGIKEKYEVHHGVRIADPAIVAAAVLSHRYITDRRLPDKAIDLVDEAASRLRMQLESS
PEAIDTLERKKLQLEIEREALKKEKDPDSRAQLEAIENELEQLNAQIAQMRAEWEAEREALQKLREAQKKLDETRTAIEQ
AERSYDLNKAAELRYGVLPKLEHEVEALSEKLKHARFVRLEVTEEDIAEVVSRWTGIPVAKLLEGEREKLVRLEDELRKR
VVGQDEAIVAVADAIRRARAGLKDPNRPIGSFLFLGPTGVGKTELAKTLAATLFDSEEAMVRIDMTEYMEKHAVARLIGA
PPGYVGYEEGGQLTEVVRRKPYTVILFDEIEKAHPDVFNILLQILDDGRLTDSHGRVVDFRNTVIILTSNLGSPLILEGI
QQGSSYEGIRERVFRVLQEHFRPEFLNRLDEIIVFRPLTKEQIVRIVDLQLQRLQARLQEKRVTLELTPEAKTWLAERGY
DPAFGARPLRRVIQREVETPLARMILEGRIPEGARVVARPGEAGLRFEAQTPAQA
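
The relationship between the DNA and protein sequences above can be illustrated with a minimal translation sketch. It assumes the bacterial genetic code (NCBI translation table 11), in which an alternative start codon such as GTG is read as methionine at the first position; only three start codons are listed here, which is a subset of those the table allows. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical helpers, not part of any database tooling.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a subset of the start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the DNA sequence in this record.
prefix = "GTGAACCTAGAGAAATGGACCGAAGCTTCC"
print(translate_cds(prefix))  # prints: MNLEKWTEAS
```

The printed residues match the start of the listed protein sequence (MNLEKWTEAS...). Passing the full DNA sequence, with its line breaks, to the same function is handled by the whitespace stripping.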

• Homologs from PAI DB

Gene   GenBank Accn    Product                                         Virulence or Resistance  PAI or REI      Alignment Type  E-val   Identity (%)
clpC   YP_005163377.1  ATP-dependent Clp protease ATP-binding subunit  Not tested               Not named       Protein         4e-161  53
aec27  AAQ96721.1      Aec27                                           Not tested               AGI-1           Protein         2e-102  42
aec27  YP_851418.1     ATPase                                          Not tested               PAI II APEC-O1  Protein         3e-102  42

• Homologs from VFDB (virulence genes)

Gene        GenBank Accn    Product                       ID of source DB  Alignment Type  E-val   Identity (%)
Marky_1987  YP_004368826.1  ATP-dependent chaperone ClpB  VFG2084          Protein         2e-104  42
Marky_1987  YP_004368826.1  ATP-dependent chaperone ClpB  VFG2076          Protein         2e-109  41