Gene Information

Name : Chro_2788 (Chro_2788)
Accession : YP_007092126.1
Strain : Chroococcidiopsis thermalis PCC 7203
Genome accession: NC_019695
Putative virulence/resistance : Virulence
Product : ATP-dependent chaperone ClpB
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3102046 - 3104703 bp
Length : 2658 bp
Strand : +
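
The Position and Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon; the function and variable names are illustrative and not part of the record.

```python
# Coordinates copied from the Position field (assumed 1-based, inclusive).
start, end = 3102046, 3104703


def feature_length(start: int, end: int) -> int:
    """Return the length in bp of a feature with inclusive coordinates."""
    return end - start + 1


length_bp = feature_length(start, end)
codons = length_bp // 3
# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(length_bp, length_bp % 3, protein_length)  # 2658 0 885
```

The result agrees with the stated Length of 2658 bp, and the length is a multiple of three, as expected for a complete coding sequence.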
Note : PFAM: AAA domain (Cdc48 subfamily); C-terminal, D2-small domain, of ClpB protein; Clp amino terminal domain; ATPase family associated with various cellular activities (AAA); TIGRFAM: ATP-dependent chaperone ClpB; COGs: COG0542 ATPase with chaperone activi

DNA sequence :
ATGCAGCCCACCGACCCTAACAAGTTCACCGAAGAAGCCTGGGAAGCGATCGTCAATTCTCAGGATTTGATGGTTCAACG
TTTTCAGCAACAACAACTGGAGGTCGAACACCTCGCGATCGCACTTTTAGAACAACCAAAAGGGCTGGCTAATCGGATTC
TGACTAAAGCTGGTGTCGATCCCGCAGCAATGAATCAGCAGCTTGAAGCTTTTGCCAAGCGACAGCCGAAAGTGGGTAAA
AGCAGTCAGCTTTATTTGGGTCGAAATTTGGATCTGATGCTCGACCGGACAGATGCGATTAGAACGAACTGGCAAGACCA
ACACGTTTCAGTCGAACACCTGCTGATAGCGTTTGGAGAGGACGAACGGATCGGGCGTAGGGTGTGCAAAGGCTTCAATC
TCGATACGGCAAAGCTAGAAGCGGCAATCAAGGCGATTCGTGGCAATCAAAAGGTCATAGACCGAACCCCCGAAGCACGC
TACGAAGCTTTAGAAAAATTTGGGCGCGACCTGACAGAACAGGCAAAGGTTGGCAAGCTCGATCCCGTCATTGGTAGAGA
CGATGAGGTGCGGCGCGTGATCCAGGTATTGTCTCGCCGCACGAAGAATAATCCTGTTTTAATTGGCGAACCTGGTGTGG
GGAAAACCGCGATCGCGGAAGGGTTGGCGCAAAGAATTATCAAAGGCGATGTCCCTGAATCCCTCAAAAACAGGCAGCTC
ATTACTCTCGACATGGGTAGCTTGATCGCTGGGGCAAAATATCGCGGCGAATTTGAAGACCGCTTGCGGGCTGTATTACG
GGAAGTTACCGAATCTAACGGACAAATCGTCTTATTTATTGACGAATTACACACCGTTGTCGGTGCGGGTGGCACGACTC
AAGGCGCAATGGACGCGGGAAACTTACTCAAACCAATGCTGGCGCGGGGAGAGTTACGCTGTATTGGCGCGACGACTTTA
GATGAGTACCGCAAGTATATTGAAAAAGACGCAGCCTTAGAACGCCGCTTCCAACAAGTCTTTGTCGATCAACCCTCGGT
AGAAACGACAATTTCGATTCTCCGAGGCTTGAAAGAACGATACGAAGCACATCACAGCGTCAAAATTACTGATTCTGCTT
TGGTTGCAGCAGCAACTCTTTCCCACCGCTACATCACCGATCGCTTTTTACCCGATAAGGCGATCGATCTTGTCGATGAA
GCTGCTGCCCAACTGAAAATGGAGATCACTTCTAAGCCCACAGAACTAGAAATCATCGATCGCCGTTTAATGCAGCTAGA
GATGGAAAAACTCTCGATTGCTGGTGAAGACCAACGCGCCGCCATTACCAAAGAACGCTTGGAACGGATCGAGCAAGAAA
TTTCTACATTAACGCAAAAACAACAGGAATTAAACTCTCAGTGGCAGGGTGAAAAACAGATCCTCGACGCGATCGGGGCT
TTGAAAAAAGAAGAAGAATCTTTGCGGGTACAGATCGAACAGGCAGAACGCGCCTACGATCTCAACACGGCAGCGCGGCT
GAAATACGGGCAGCTAGAAGGAGTACAGCGCGATCGCGAAGCGAAAGAAACTCTACTAATAGAAATTCAAAGCCAGGGTT
CTACACTGTTGCGAGAAGAAGTCTCCGAAGCTGATATTGCTGAGATCGTTGCTAAGTGGACGGGTATTCCAATCAATCGC
CTGTTAGAGTCAGAACGCCAGAAATTACTCCAACTCGAAAGCCACCTCCACGCTAGGGTAGTCGGTCAATCTGAGGCTGT
ATCGGCTGTAGCAGCAGCTATTCGCCGCGCTCGGGCAGGGATGAAAGATCCCGGTCGTCCCATCGGTTCTTTTTTGTTCA
TGGGTCCGACGGGAGTAGGTAAAACAGAATTAGCGCGAGCATTGGCACAGTTTTTATTTGATGCCGACGATGCTTTAGTC
CGCTTGGACATGTCTGAATACATGGAAAAACATTCGGTTTCCCGCCTAGTGGGTGCGCCTCCTGGTTACGTTGGTTACGA
AGAAGGAGGGCAACTCTCAGAAGCGATCCGTCGTCGTCCCTATTCTGTGGTGTTGCTAGATGAGGTGGAAAAAGCCCACC
CAGATGTATTTAATATTCTGCTTCAAGTGTTGGATGACGGACGAATTACGGATTCTCAAGGTAGAGTCATTGACTGCCGC
AACACGGTGATCGTCATGACGAGTAATATTGGTAGCGATCGCATTCTCGATTTATCTGGAGATGATACCGACTACGAACA
GGTACAACGGCAAGTTTTAGAGGCACTGCGATCGCACTTCCGCCCCGAATTTCTCAACCGCGTTGATGACCTGATTATTT
TCCACCCCCTCGATCGCAGCCAGTTACGGCAAATTGTCAGCATTCAGCTCAAACGAGTCCAAAGACTGCTCGACGAGCAA
AAAATTGGCATCGTGCTATCGCCAGCAGCTCAAGATTACTTGGTAGATATCGGCTATGACCCCGTATATGGCGCTCGTCC
GCTCAAACGAGCCATCCAACGCTACCTAGAAAATCCTCTGGCAACTAAATTATTAGAGGGGACTTTTACCGAAGGCGACA
CCATTCAAGTCGATTGTCAGGATGGCGCTCTTTCTTTCCAGCGTCAACGCTCAGTTGTTACCTATGCGCCAGCTCTTTCT
AAGCCTGATACAAATTAA

Protein sequence :
MQPTDPNKFTEEAWEAIVNSQDLMVQRFQQQQLEVEHLAIALLEQPKGLANRILTKAGVDPAAMNQQLEAFAKRQPKVGK
SSQLYLGRNLDLMLDRTDAIRTNWQDQHVSVEHLLIAFGEDERIGRRVCKGFNLDTAKLEAAIKAIRGNQKVIDRTPEAR
YEALEKFGRDLTEQAKVGKLDPVIGRDDEVRRVIQVLSRRTKNNPVLIGEPGVGKTAIAEGLAQRIIKGDVPESLKNRQL
ITLDMGSLIAGAKYRGEFEDRLRAVLREVTESNGQIVLFIDELHTVVGAGGTTQGAMDAGNLLKPMLARGELRCIGATTL
DEYRKYIEKDAALERRFQQVFVDQPSVETTISILRGLKERYEAHHSVKITDSALVAAATLSHRYITDRFLPDKAIDLVDE
AAAQLKMEITSKPTELEIIDRRLMQLEMEKLSIAGEDQRAAITKERLERIEQEISTLTQKQQELNSQWQGEKQILDAIGA
LKKEEESLRVQIEQAERAYDLNTAARLKYGQLEGVQRDREAKETLLIEIQSQGSTLLREEVSEADIAEIVAKWTGIPINR
LLESERQKLLQLESHLHARVVGQSEAVSAVAAAIRRARAGMKDPGRPIGSFLFMGPTGVGKTELARALAQFLFDADDALV
RLDMSEYMEKHSVSRLVGAPPGYVGYEEGGQLSEAIRRRPYSVVLLDEVEKAHPDVFNILLQVLDDGRITDSQGRVIDCR
NTVIVMTSNIGSDRILDLSGDDTDYEQVQRQVLEALRSHFRPEFLNRVDDLIIFHPLDRSQLRQIVSIQLKRVQRLLDEQ
KIGIVLSPAAQDYLVDIGYDPVYGARPLKRAIQRYLENPLATKLLEGTFTEGDTIQVDCQDGALSFQRQRSVVTYAPALS
KPDTN
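
The relationship between the DNA and protein sequences above can be spot-checked by translating the first and last codons. This is a minimal sketch that assumes the standard codon assignments (shared by the bacterial translation table) and builds the table by hand instead of relying on a bioinformatics library; `translate` and the variable names are illustrative.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 18 nt of the DNA sequence in this record.
first_codons = "ATGCAGCCCACCGACCCTAACAAGTTCACC"
last_codons = "AAGCCTGATACAAATTAA"

print(translate(first_codons))  # MQPTDPNKFT
print(translate(last_codons))   # KPDTN
```

Both fragments match the start and end of the listed protein sequence, and the final TAA codon is the stop.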

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
STY0294 | NP_454876.1 | ClpB-like protein | Not tested | SPI-6 | Protein | 7e-94 | 41
aec27 | YP_851418.1 | ATPase | Not tested | PAI II APEC-O1 | Protein | 5e-98 | 41
aec27 | AAQ96721.1 | Aec27 | Not tested | AGI-1 | Protein | 4e-98 | 41

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
Chro_2788 | YP_007092126.1 | ATP-dependent chaperone ClpB | VFG0079 | Protein | 2e-137 | 43
Chro_2788 | YP_007092126.1 | ATP-dependent chaperone ClpB | VFG2084 | Protein | 4e-108 | 42
Chro_2788 | YP_007092126.1 | ATP-dependent chaperone ClpB | VFG2076 | Protein | 1e-110 | 41