Gene Information

Name : Nos7107_1767 (Nos7107_1767)
Accession : YP_007049553.1
Strain : Nostoc sp. PCC 7107
Genome accession : NC_019676
Putative virulence/resistance : Virulence
Product : ATP-dependent chaperone ClpB
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2033079 - 2035721 bp
Length : 2643 bp
Strand : +
Note : PFAM: AAA domain (Cdc48 subfamily); C-terminal, D2-small domain, of ClpB protein; Clp amino terminal domain; ATPase family associated with various cellular activities (AAA); TIGRFAM: ATP-dependent chaperone ClpB; COGs: COG0542 ATPase with chaperone activi
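
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive at both ends (an assumption, though it is consistent with the stated length); the function name feature_length is illustrative only and not part of any database API.

```python
# Minimal sketch: check that the reported gene length matches its coordinates.
# Assumption: Position uses 1-based, inclusive start and end coordinates.

def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates."""
    return end - start + 1

start, end = 2033079, 2035721          # Position field of Nos7107_1767
length = feature_length(start, end)    # compare with the Length field (2643 bp)

# A complete coding sequence should be a whole number of codons.
codons = length // 3
residues = codons - 1                  # the final codon is the stop codon

print(length, length % 3, codons, residues)
```

Under that assumption the coordinates give 2643 bp, which is 881 codons: 880 residues plus one stop codon, in agreement with the DNA and protein sequences listed in this record.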

DNA sequence :
ATGCAGCCTACAGATCCTAATAAATTTACAGATAAAGCCTGGGATGCAATTGTTAAATCTCAGGATATAGTTCGTGGTTA
TAATCAACAACAATTAGACGTTGAACATTTAATTATTGCCCTTTTAGAAGAACCCACAAGTTTAGCAATTCGCATCTTAG
TACGGGCAGAAATCGACCCCATTCGCTTGCAACAGCAACTAGAAGCTTATACCCAACGTCAGCCCAAAGTTGCTAATAAT
GACCAGCTTTACCTCGGTCGCACTTTAGATGTGCTGCTTGACCACGCCGAGGAAGCAAGAGTGAAAATGAAAGATACCTA
CATATCTGTAGAACATATTCTTTTAGGCTTTGCGGAAGACGAACGTGTTGGTCGGCGAATTCTCAAAGGTTTTAATGCTG
ATAGTGGTACTCTAGAGGCCGCAATTAAAGCTGTGCGTGGTAGCCAAAAAGTGACAGATCAAAATCCAGAATCTCGCTAT
GAAGCACTGCAAAAATTTGGTAGAGACTTGACAGAACAAGCCAAAGCGGGAAAACTCGACCCAGTTATTGGCCGGGATGA
TGAAATTCGGCGCGTGATTCAAGTATTATCCCGCCGGAGTAAAAATAATCCGGTTTTGATTGGTGAACCAGGAGTAGGAA
AAACTGCGATCGCCGAAGCTTTGGCACAGCGTATCATTAACGGTGATGTCCCGGAATCTCTCAAAAATCGCCAATTAATC
TCTTTAGATATTGGGAGTTTAATTGCTGGGGCGAAATATCGGGGTGAATTTGAAGACCGACTCAAAGCCGTCCTGAAGGA
AGTTATCGATTCCAACGGACAAATTGTCTTGTTTATTGATGAACTGCATACGGTTGTGGGTACAGGTTCCAATCAACAAG
GGGCGATGGATGCGGGAAATTTACTCAAACCGATGTTGGCGAGGGGTGAACTGCGGTGTATTGGCGCAACCACTTTAGAT
GAGTACCGCAAATATATTGAGAAAGACGCAGCCTTAGAACGCCGTTTTCAACAAGTATTTGTCGATCAGCCGAACGTAGA
AAATACTATCTCGATTTTGCGGGGTTTGAAAGAACGCTACGAAGTTCATCACAACGTCAAAATTTCTGATTCAGCATTAG
TAGCGGCGGCTACTTTATCAGCACGATACATTTCTGACCGCTTTTTACCCGATAAAGCCATTGACTTGGTAGATGAAGCC
GCCGCCCAGTTGAAAATGGAGATTACCTCCAAACCTTCAGAATTGGAAACCATCGACCGCCGCTTAATGCAGCTAGAAAT
GGAAAAGCTGTCACTAGCGGGTGAAGAAAAAGTTCCTACCCAAACACGGGAACGTTTGCAGCGCATTGAAGAAGAAATTG
ACACCTTAAAGGTAAAACAGCAAGAATTTAACGAACAATGGCAAGGTGAAAAGCAGTTATTAGAAGCGATTAGTGTTTTA
AAGAAAGAAGAAGAAGCCTTGCGGGTGCAAATTGATCAGGCAGAACGGGCGTATGATTTAAATAAAGCTGCCCAATTGAA
GTATGGCAAATTAGAAGGCGTACAGCGCGATCGCGAAGCTAAAGAAGCAAAACTTCTAGAACTGCAAAGCCAAGGTTCAA
CTTTGTTGCGCGAACAAGTCACCGAAGCCGACATCGCCGAAATCGTCGCCAAGTGGACGGGAATTCCAGTTAACCGTTTA
TTGGAATCAGAACGGCAAAAATTACTGCAACTCGAAAGCCATTTACATCAACGCGTCATTGGACAACACGAAGCCGTCGA
AGCCGTAGCCGCCGCCATTCGTCGCGCCCGTGCAGGAATGAAAGACCCCAGCCGTCCCATTGGTTCATTTTTGTTTATGG
GGCCGACTGGAGTAGGTAAAACTGAACTAGCCCGCGCATTAGCACAGTTTCTCTTTGATTCTGATGATGCTTTGGTACGC
CTAGATATGTCCGAGTATATGGAAAAACACTCGGTTTCCCGGTTAGTTGGTGCGCCTCCGGGATATGTTGGTTACGAAGA
AGGCGGTCAACTTTCCGAGGCGGTGCGCCGTCATCCTTACTCAGTGGTGCTGTTGGATGAAGTCGAAAAAGCTCATCCCG
ATGTGTTTAATATTTTGTTGCAAGTACTGGATGATGGGAGAATTACTGATTCTCAAGGTAGAACAGTAGATTTTCGCAAT
ACTGTAATTGTCATGACCAGTAATATTGGCAGTGAACACATTTTAGATGTGTCTGGTGATGATTCTAAATATGACATGAT
GCGGAACCGAGTAACAGATGCACTGCGATCGCACTTCCGCCCCGAATTTCTCAACCGTGTCGATGATATTATCCTGTTCC
ATACCCTCAGCCGTTCCGAAATGCGTCACATCATCCGCATTCAACTCAAGCGTGTTGAGAACTTGCTAAAAGAGCAGAAA
ATCTCCTTTGAAATCACCCAAGCAGCTTGTGATTACTTAGTAGAAATGGGTTATGATCCAATTTATGGCGCACGTCCCTT
AAAACGAGCAATTCAGCGAGAAGTCGAAAATCCCATCGCCACTAAATTATTAGAAAATACATTTATTGCTGGCGATACCA
TCCTGATTGATAAAGATGAAGATAATTTGACTTTTAACAAAAAAGTTACTGTGAAAGTCTCAGTTCCACAGGTAATCACA
TAA

Protein sequence :
MQPTDPNKFTDKAWDAIVKSQDIVRGYNQQQLDVEHLIIALLEEPTSLAIRILVRAEIDPIRLQQQLEAYTQRQPKVANN
DQLYLGRTLDVLLDHAEEARVKMKDTYISVEHILLGFAEDERVGRRILKGFNADSGTLEAAIKAVRGSQKVTDQNPESRY
EALQKFGRDLTEQAKAGKLDPVIGRDDEIRRVIQVLSRRSKNNPVLIGEPGVGKTAIAEALAQRIINGDVPESLKNRQLI
SLDIGSLIAGAKYRGEFEDRLKAVLKEVIDSNGQIVLFIDELHTVVGTGSNQQGAMDAGNLLKPMLARGELRCIGATTLD
EYRKYIEKDAALERRFQQVFVDQPNVENTISILRGLKERYEVHHNVKISDSALVAAATLSARYISDRFLPDKAIDLVDEA
AAQLKMEITSKPSELETIDRRLMQLEMEKLSLAGEEKVPTQTRERLQRIEEEIDTLKVKQQEFNEQWQGEKQLLEAISVL
KKEEEALRVQIDQAERAYDLNKAAQLKYGKLEGVQRDREAKEAKLLELQSQGSTLLREQVTEADIAEIVAKWTGIPVNRL
LESERQKLLQLESHLHQRVIGQHEAVEAVAAAIRRARAGMKDPSRPIGSFLFMGPTGVGKTELARALAQFLFDSDDALVR
LDMSEYMEKHSVSRLVGAPPGYVGYEEGGQLSEAVRRHPYSVVLLDEVEKAHPDVFNILLQVLDDGRITDSQGRTVDFRN
TVIVMTSNIGSEHILDVSGDDSKYDMMRNRVTDALRSHFRPEFLNRVDDIILFHTLSRSEMRHIIRIQLKRVENLLKEQK
ISFEITQAACDYLVEMGYDPIYGARPLKRAIQREVENPIATKLLENTFIAGDTILIDKDEDNLTFNKKVTVKVSVPQVIT
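
The protein sequence above is the conceptual translation of the DNA sequence on the + strand. As a hedged illustration, the sketch below translates the first and last codons of the DNA sequence with a hand-built codon table. It assumes the standard codon-to-amino-acid assignments (which the bacterial table shares); the translate helper is hypothetical and written only for this example.

```python
# Minimal sketch: translate short stretches of the coding sequence and compare
# them with the start and end of the listed protein sequence.
# Assumption: standard codon assignments; no external libraries are used.
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1; a trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

first_30_nt = "ATGCAGCCTACAGATCCTAATAAATTTACA"   # start of the DNA sequence
last_21_nt = "GTTCCACAGGTAATCACATAA"             # end of the DNA sequence

print(translate(first_30_nt))
print(translate(last_21_nt))                     # '*' marks the stop codon
```

The first fragment should correspond to the N-terminal residues MQPTDPNKFT and the second to the C-terminal residues VPQVIT followed by the stop codon.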

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
STY0294 | NP_454876.1 | ClpB-like protein | Not tested | SPI-6 | Protein | 3e-106 | 42
aec27 | YP_851418.1 | ATPase | Not tested | PAI II APEC-O1 | Protein | 2e-106 | 41
aec27 | AAQ96721.1 | Aec27 | Not tested | AGI-1 | Protein | 1e-106 | 41

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
Nos7107_1767 | YP_007049553.1 | ATP-dependent chaperone ClpB | VFG2084 | Protein | 3e-114 | 42
Nos7107_1767 | YP_007049553.1 | ATP-dependent chaperone ClpB | VFG2076 | Protein | 2e-118 | 42