Gene Information

Name : clpB2 (NIES39_F00620)
Accession : YP_005069293.1
Strain : Arthrospira platensis NIES-39
Genome accession : NC_016640
Putative virulence/resistance : Virulence
Product : ClpB protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2939387 - 2942173 bp
Length : 2787 bp
Strand : -
Note : -
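
The Length field can be cross-checked against the Position field. The sketch below assumes the coordinates are 1-based and inclusive (the convention that makes the two fields agree); the helper name `feature_length` is illustrative and not part of any database tooling.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


# Values copied from the Position field of this record.
start, end = 2939387, 2942173

length_bp = feature_length(start, end)
codons = length_bp // 3          # includes the terminal stop codon
residues = codons - 1            # amino acids encoded before the stop

print(length_bp)   # 2787
print(residues)    # 928
```

Under that assumption the span is 2787 bp, matching the Length field, and corresponds to 928 amino acids plus a stop codon.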

DNA sequence :
ATGCAACCGACCGATCCGAGTAAATTTACAGATAAAGCCTGGGAAGCGATAGTAAAATCCCAAGATGTTGCTCGTCGCTT
CCAAAATCAGCATTTAGAAGTTGAGCACGTGGCGATCGCCGCCTTAGAACAAAATGGACTCGCTAATAATATATTAGGAC
GAGCTGGTTTTAACCCCGAACAGGTCCAACAACAATTAGAAGCCTTCACCAAACGACAACCCCGAGTCGGAACTATTGAC
CATCTGTACCTAGGTCGTGGGTTAGAATTAATGCTAGATACCGCTGAAGCGACGCGGTCCGCATGGCAAGACCGCTACAT
TGCTGTTGAACACCTGATTATTGCCCTAGCAGAAGATGATCGCGTCGGTCGTAGGATTTTGGGAAATGGAAACCCCACCA
GAGCCACACCAGGAAGACCAGGGTTCGATCGCCCTACCAATGACTCCAAAGCTACAGTCCGGGAGAAACTCGAAGAAGCC
ATTAAATCCATGCGCGGCAGTGCTAAAGTTGACAACCAGAACCCGGAAAACAGTTATGATGCCCTGAGTAAGTATGGGCG
AGATTTAACCGAATTGGCTAAATCTGGGAAACTGGACCCAGTGATCGGACGGGATGAAGAAATTCGCCGGGTGGTGAGTT
GTCTGTCCAGGCGAACCAAAAATAATCCGGTGTTAATTGGAGAACCGGGAGTCGGTAAAACAGCGATCGCCGAAGGCCTG
GCCCAACGCATTGTTAACGGAGATGTGCCAGAGTCCCTAAAAAACCGCCAGTTGATTTCCCTGGATATGGGTAGTTTAAT
CGCAGGAGCCAAATATAGGGGGGAATTTGAAGAAAGACTGCGATCGGTATTGCGGGAAGTAACCCACTCTGACGGTCAAA
TAGTCCTATTTATTGACGAACTCCATACCGTGGTGGGGGCCGGAGCCGGTTCTGGAGGTTCGGGAATGGATGCCGGAAAC
CTGCTTAAACCTATGTTAGCCCGGGGAGAATTGCGCTGTATCGGTGCTTCTACCGTTGATGAATACCGTAAGCATATCGA
AAAAGACCCAGCCCTAGAAAGGCGATTTCAACAGGTCTATGTTGATCAACCTAGTCCAGAGAATACAGTATCAATTCTGC
GGGGGTTGAAAGACCGTTACGAACGCCATCACGGGGTCAAAATTACGGATTCGGCATTGGTGGCGGCGGCTATGCTGTCA
GCCCGATATATTAGCGATCGCTTCCTTCCTGATAAGGCGATCGACCTAGTAGACGAAGCCGCCGCCCAGTTGAAAATGGA
AATCACCTCCAAACCCGTAGAGTTAGAACAAATTGAGCGCCGCCTCATGCAGTTGGAGATGGAAAAACTCTCCGTGGAGG
GGGAAAGCCCAAGAAGTACACCTCTACGGCCAGGGGAACAAGATCGAAGTGCTGATATCGGTTTAAATCTGCGTATCCAA
TCCTTACTCGAAGAAATTAACACCCTCAAAGACAAGCAGAAAACTTTATCATCCCAGTGGCAGGGTGAAAAGGAACTCCT
CGAAGCTATTAACCGCCTCAAAGAAGAAGAAGAAAAACTGCGGGTGCAAATTGAACAAGCCGAACGTGCCTATGATCTCA
ATAAAGCAGCCCAGTTAAAATACGGCCGCTTAGAGACCGTTCACCACGACCGAGAGGCGCGGGAGGCAGAATTGTTAAAA
TTGCAGGCTCAGGGGTCTTCCTTGCTGCGAGAGCAGGTAACAGAAGCCGATATTGCTGCCATTGTCGCTAAATGGACGGG
GATTCCCGTTAATCGCCTGTTGGAGTCCGAACGACAAAAATTATTGCAATTGGAATCCCATTTACATCGCCGGGTAATTG
GTCAGCAGCAAGCCGTGGAGGCTGTATCTGCCGCTATCCGCCGCGCTAGGGCGGGGATGAAAGACCCCGGACGGCCTATC
GGTTCGTTTCTGTTTATGGGACCCACAGGGGTGGGTAAAACTGAGTTGGCTAGGGCTTTGGCGGAGTTCCTCTTTGACAG
CGAGGAAGCTATGATTCGCATTGATATGTCGGAATATATGGAAAAACACGCTGTCTCTCGGTTGGTGGGTGCGCCTCCGG
GATATGTGGGCTATGACGAAGGCGGACAACTATCAGAAGCGGTCCGCCGCCATCCCTATTCAGTGATTCTATTTGATGAG
GTAGAAAAAGCTCATCCTGATGTCTTTAATATCCTGTTACAAGTTTTGGATGATGGTAGAATTACTGACTCCCAAGGACG
GTTGGTCGATTTCCGCAATACTGTCATTGTCATGACTAGCAATATTGGCGGTGAGTATATTCTGGGGGTGGCGGGGGATG
ACTCTCGCTATGGGGAAATGTCTGCTTTGGTTATGCAGGCTTTGCGATCGCATTTCCGACCGGAATTTTTGAATCGGGTA
GATGAAATTATCCTCTTCCATACCCTCAGTAAAGCTGAATTGCGCGATATTGTCGCCATTCAAATGCAGCGCCTCCAAAG
ACTTTTGGCTGATCAAAAAATTGCTCTGGAGTTATCACCTGCTGCTATTGATCATGTGGCTGATGTGGGTTATGATCCTG
TTTATGGGGCTAGACCTCTGAAACGGGCGATTCAACGGGAGTTGGAAAATCCGATCGCTAACCTGATTTTAGAGCAGAAG
TTTGTCACGGGCGATACTATTGAAATTAACATGAAAGATGGAACACTGACTTTTGATCATCCCTCCAACACGCCAGAAGC
CCAATTAAATCAGGGTGAGTTGGCTATGTTACCAGTGGCAGTCAACAACCCAGGTGATGATAATTAA

Protein sequence :
MQPTDPSKFTDKAWEAIVKSQDVARRFQNQHLEVEHVAIAALEQNGLANNILGRAGFNPEQVQQQLEAFTKRQPRVGTID
HLYLGRGLELMLDTAEATRSAWQDRYIAVEHLIIALAEDDRVGRRILGNGNPTRATPGRPGFDRPTNDSKATVREKLEEA
IKSMRGSAKVDNQNPENSYDALSKYGRDLTELAKSGKLDPVIGRDEEIRRVVSCLSRRTKNNPVLIGEPGVGKTAIAEGL
AQRIVNGDVPESLKNRQLISLDMGSLIAGAKYRGEFEERLRSVLREVTHSDGQIVLFIDELHTVVGAGAGSGGSGMDAGN
LLKPMLARGELRCIGASTVDEYRKHIEKDPALERRFQQVYVDQPSPENTVSILRGLKDRYERHHGVKITDSALVAAAMLS
ARYISDRFLPDKAIDLVDEAAAQLKMEITSKPVELEQIERRLMQLEMEKLSVEGESPRSTPLRPGEQDRSADIGLNLRIQ
SLLEEINTLKDKQKTLSSQWQGEKELLEAINRLKEEEEKLRVQIEQAERAYDLNKAAQLKYGRLETVHHDREAREAELLK
LQAQGSSLLREQVTEADIAAIVAKWTGIPVNRLLESERQKLLQLESHLHRRVIGQQQAVEAVSAAIRRARAGMKDPGRPI
GSFLFMGPTGVGKTELARALAEFLFDSEEAMIRIDMSEYMEKHAVSRLVGAPPGYVGYDEGGQLSEAVRRHPYSVILFDE
VEKAHPDVFNILLQVLDDGRITDSQGRLVDFRNTVIVMTSNIGGEYILGVAGDDSRYGEMSALVMQALRSHFRPEFLNRV
DEIILFHTLSKAELRDIVAIQMQRLQRLLADQKIALELSPAAIDHVADVGYDPVYGARPLKRAIQRELENPIANLILEQK
FVTGDTIEINMKDGTLTFDHPSNTPEAQLNQGELAMLPVAVNNPGDDN
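
The DNA sequence above is listed in coding orientation (it begins with ATG and ends with TAA) even though the gene lies on the minus strand, so it can be translated directly. The following is a minimal sketch, assuming the standard codon assignments; `translate` and `CODON_TABLE` are illustrative names, and only the first 30 nucleotides of the record are used to keep the example short.

```python
from itertools import product

# Standard codon assignments, enumerated in T, C, A, G order (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":            # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the DNA sequence in this record.
prefix = "ATGCAACCGACCGATCCGAGTAAATTTACA"
print(translate(prefix))  # MQPTDPSKFT
```

The result matches the first ten residues of the protein sequence listed in this record.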

• Homologs from PAI DB

Gene     GenBank Accn  Product            Virulence or Resistance  PAI or REI      Alignment Type  E-val   Identity (%)
STY0294  NP_454876.1   ClpB-like protein  Not tested               SPI-6           Protein         1e-101  41
aec27    AAQ96721.1    Aec27              Not tested               AGI-1           Protein         1e-99   41
aec27    YP_851418.1   ATPase             Not tested               PAI II APEC-O1  Protein         2e-99   41

• Homologs from VFDB (virulence genes)

Gene   GenBank Accn    Product       ID of source DB  Alignment Type  E-val   Identity (%)
clpB2  YP_005069293.1  ClpB protein  VFG2084          Protein         1e-107  42
clpB2  YP_005069293.1  ClpB protein  VFG2076          Protein         1e-118  42