Gene Information

Name : Cha6605_4300 (Cha6605_4300)
Accession : YP_007098767.1
Strain : Chamaesiphon minutus PCC 6605
Genome accession: NC_019697
Putative virulence/resistance : Virulence
Product : ATP-dependent chaperone ClpB
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4526837 - 4529458 bp
Length : 2622 bp
Strand : -
Note : PFAM: AAA domain (Cdc48 subfamily); Clp amino terminal domain; C-terminal, D2-small domain, of ClpB protein; ATPase family associated with various cellular activities (AAA); TIGRFAM: ATP-dependent chaperone ClpB
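
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only; it assumes the coordinates are 1-based and inclusive (the record does not say so explicitly) and that the reported length includes the stop codon. The variable names are hypothetical.

```python
# Values copied from the record above.
start, end = 4526837, 4529458   # assumed 1-based, inclusive genome coordinates
reported_length = 2622          # bp, from the Length field

# With inclusive coordinates, the span covers end - start + 1 bases.
span = end - start + 1

# A complete coding sequence is a whole number of codons;
# assuming the final codon is the stop, the protein is one residue shorter.
codons, remainder = divmod(span, 3)
protein_length = codons - 1

print(span, remainder, protein_length)
```

Under these assumptions the span matches the Length field, and the implied protein length can be compared against the Protein sequence listed in the record.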

DNA sequence :
ATGCAACCTAATAACCCCCAACAATTTACGGAAAAGGCTTGGCAGGCGGTTACCAATACCCTAGATATTGCCAAAGCTTC
GCATCACCAGCAGATGGAGTCGGAGCATTTGCTCAAAGCATTACTCGAACAAGATGGGTTAGCAACGAGCATCTTGAGCA
AGGCTGGTGTGAATTTAAGTCAATTTCGGCAAAGCTTAGAGAGTTTTATTCAAAAACAGCCCCGGATATCGGGTGAAGTA
ACTTCGGTTTATCTCGGACGCAGCATCGATACATTACTCGATCGAGCAGAGAAATATCGTAAAGAGTATGGCGATGAATT
TATTTCGATCGAGCATCTATTGCTAGCATATCCCCAGGACGATCGATTTGGCAAGCAATTTTTTGCCGATTTCAAACTCG
AAGAAAGTAAGTTAAAAACGATCGTCTCCCAAATTCGAGGCAGTCAAAAAGTAATGGATCAAAATCCCGAAAACAAATAC
GAGTCTCTTTCTAAATACGGGCGAGATCTAACTGATTTTGCTCGTCGGGGTAAACTCGATCCGGTGATCGGACGCGATGA
TGAAATTCGGCGGACGATTCAGATCTTATCCCGCCGGACTAAAAATAATCCTGTCCTGATTGGCGAACCGGGTGTCGGGA
AAACGGCAATTGCCGAAGGATTAGCCCAGCGAATTATTAGCGGCGACGTACCCCAATCGCTCAAAGATCGGCAGCTAATT
TCGCTCGATATGGGTGCTTTAGTTGCTGGCGCAAAATATCGCGGTGAGTTTGAAGAAAGACTCAAAGCCGTGCTTAAAGA
GGTCACAGAATCTCAAGGTCAAATCATCTTATTTATCGATGAGATCCATACTGTCGTTGGTGCGGGTGCCACTCAAGGCG
CAATGGATGCAGGCAACTTGCTCAAGCCCATGATGGCGCGGGGCGAGCTGCGGTGTATCGGTGCGACTACGCTCGATGAA
TATCGCAAATATATCGAAAAAGATGCCGCTCTCGAACGTCGCTTTCAACAAGTATATGTCGATCAACCCAGCGTCGAAGA
TACGATCTCGATTCTGCGGGGTTTAAAAGAACGCTATGAAGTGCATCACGGGGTAAAAATCTCTGATAATGCCTTAGTAG
CTGCGGCCACCTTATCTACCCGGTATATTAGCGACAGGTTTTTACCAGATAAAGCGATCGATTTAATGGACGAAGCCGCC
GCCCGCCTGAAGATGGAAATTACATCTAAGCCTGAAGAATTAGATGAGATCGATCGCAAGGTACTCCAATTAGAAATGGA
GCGGTTGTCAGTCAATAAAGATACGTCGAATACTGCTCGCGAACGTTTGCAAAAGATCGAAAAGGAACTGGGAGATCTCA
AGGAAGAACAACGCGCCCTCACCGCCCAATGGCAGTCGGAAAAAGATGTGATTACCGACATCCAAACCATCAAAGAGGAA
ATCGATCGAGTCAATATCGAAATCCAACAAGCCGAACGCAATTACGATCTGGCGCAGGCTTCGGCACTTAAATTTGGTAA
ATCGATCGAACTCCAAGGTAAACTCGAAGCCGCCGAAGTCAAACTCTCCCAATCCCAAACTACCGGAAAATCTCTGCTCC
GCGAAGAAGTCACCGAAGCCGATATCGCCGAGATCATCTCTAAATGGACGGGAATTCCGCTCACCAAACTCGTCGAAACC
GAACGCGAAAAACTCCTCTACCTCGAAGACGAACTACACCGTCGCGTCATTGGCCAAGACGAAGCCGTCACTGCCGTCGC
CGAAGCCATCCAGCGTTCCCGTGCGGGCTTATCCGACCCCAATCGCCCCACAGCTAGCTTTATCTTCCTCGGCCCCACAG
GTGTTGGTAAAACCGAATTAGCCAAAACGCTAGCCAACTACCTCTTCGACGCTGAAGATGCCCTAATTCGGATTGATATG
TCCGAATATATGGAAAAACATGCTGTTTCGCGCCTCATTGGTGCGCCTCCAGGCTACGTCGGCTATGAAGAAGGTGGACA
ACTCACTGAAAGTATTCGCCGTCGCCCCTACGCGGTAATTCTGTTCGACGAAATCGAGAAAGCACACCCCGATGTCTTCA
ACGTGCTGTTGCAGGTACTCGATGACGGACGGGTCACCGATTCCCAAGGTCGGACGGTAGACTTCAAGAATACGGTAATT
ATCATGACCAGTAATATTGGTTCGCAATATATTTTGGATCTAGCTGGCGATGATAAATATGATGAGATGAAAGCTCGCGT
CTTAGAAGCTCTAAGCCATAATTTCCGTCCCGAATTTCTCAACCGGATCGACGATACAATTATCTTCCACAGTTTGCAAA
AATCCGAACTGCGGAATATCGTCAAAATCCAAGTTGGCCGTCTCGAAAAACGGTTGGCCGATCGCAAGTTATCGCTAAAA
TTAGCAGAATCTGCGCTCGACTTCTTAGTAAATGTCGGTTACGACCCCGTATATGGCGCACGACCGCTCAAACGGACGAT
TCAAAAAGAGTTAGAAACAACCGTCGCCAAAGGCATTTTGCGGGGTGATTTTAAGGAAGGAGATACGATCTTTGTAGAGG
TACAGAACGAACACTTAGCTTTCAGTCGGCTACCTGCAACTATTGCCGTCCAAGAAACTTAA

Protein sequence :
MQPNNPQQFTEKAWQAVTNTLDIAKASHHQQMESEHLLKALLEQDGLATSILSKAGVNLSQFRQSLESFIQKQPRISGEV
TSVYLGRSIDTLLDRAEKYRKEYGDEFISIEHLLLAYPQDDRFGKQFFADFKLEESKLKTIVSQIRGSQKVMDQNPENKY
ESLSKYGRDLTDFARRGKLDPVIGRDDEIRRTIQILSRRTKNNPVLIGEPGVGKTAIAEGLAQRIISGDVPQSLKDRQLI
SLDMGALVAGAKYRGEFEERLKAVLKEVTESQGQIILFIDEIHTVVGAGATQGAMDAGNLLKPMMARGELRCIGATTLDE
YRKYIEKDAALERRFQQVYVDQPSVEDTISILRGLKERYEVHHGVKISDNALVAAATLSTRYISDRFLPDKAIDLMDEAA
ARLKMEITSKPEELDEIDRKVLQLEMERLSVNKDTSNTARERLQKIEKELGDLKEEQRALTAQWQSEKDVITDIQTIKEE
IDRVNIEIQQAERNYDLAQASALKFGKSIELQGKLEAAEVKLSQSQTTGKSLLREEVTEADIAEIISKWTGIPLTKLVET
EREKLLYLEDELHRRVIGQDEAVTAVAEAIQRSRAGLSDPNRPTASFIFLGPTGVGKTELAKTLANYLFDAEDALIRIDM
SEYMEKHAVSRLIGAPPGYVGYEEGGQLTESIRRRPYAVILFDEIEKAHPDVFNVLLQVLDDGRVTDSQGRTVDFKNTVI
IMTSNIGSQYILDLAGDDKYDEMKARVLEALSHNFRPEFLNRIDDTIIFHSLQKSELRNIVKIQVGRLEKRLADRKLSLK
LAESALDFLVNVGYDPVYGARPLKRTIQKELETTVAKGILRGDFKEGDTIFVEVQNEHLAFSRLPATIAVQET
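
The relationship between the DNA and protein sequences above can be spot-checked by translating a few codons. This is a minimal sketch, assuming the DNA is listed in coding orientation (it begins with ATG even though the Strand field is "-") and that the standard codon assignments apply; `translate` is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 12 nt of the DNA sequence in this record.
head = "ATGCAACCTAATAACCCCCAACAATTTACG"
tail = "CAAGAAACTTAA"

print(translate(head))
print(translate(tail))
```

The two outputs can be compared with the first ten and last three residues of the protein sequence listed above.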

• Homologs from PAI DB

Gene     GenBank Accn  Product            Virulence or Resistance  PAI or REI      Alignment Type  E-val   Identity (%)
aec27    YP_851418.1   ATPase             Not tested               PAI II APEC-O1  Protein         2e-109  42
aec27    AAQ96721.1    Aec27              Not tested               AGI-1           Protein         1e-109  42
STY0294  NP_454876.1   ClpB-like protein  Not tested               SPI-6           Protein         2e-106  41

• Homologs from VFDB (virulence genes)

Gene          GenBank Accn    Product                       ID of source DB  Alignment Type  E-val   Identity (%)
Cha6605_4300  YP_007098767.1  ATP-dependent chaperone ClpB  VFG2076          Protein         6e-125  42
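
To compare the hits across both homolog tables, the E-values can be parsed as floating-point numbers and sorted, with smaller values indicating more significant alignments. The sketch below hard-codes the rows from this record; the tuple layout and variable names are assumptions made for illustration.

```python
# (gene, GenBank accession, E-value as printed, identity) copied from the tables above.
hits = [
    ("aec27", "YP_851418.1", "2e-109", 42),
    ("aec27", "AAQ96721.1", "1e-109", 42),
    ("STY0294", "NP_454876.1", "2e-106", 41),
    ("Cha6605_4300", "YP_007098767.1", "6e-125", 42),
]

# float() accepts scientific notation such as "2e-109"; sort ascending by E-value.
ranked = sorted(hits, key=lambda hit: float(hit[2]))

for gene, accession, e_value, identity in ranked:
    print(f"{gene:<14}{accession:<16}{e_value:<8}{identity}")
```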