Gene Information

Name : Nos7107_4122 (Nos7107_4122)
Accession : YP_007051823.1
Strain : Nostoc sp. PCC 7107
Genome accession: NC_019676
Putative virulence/resistance : Virulence
Product : ATP-dependent chaperone ClpB
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4816288 - 4818927 bp
Length : 2640 bp
Strand : +
Note : TIGRFAM: ATP-dependent chaperone ClpB; COGs: COG0542 ATPase with chaperone activity ATP-binding subunit; InterPro IPR017730:IPR004176:IPR003959:IPR013093:IPR019489:IPR003593; KEGG: npu:Npun_F4427 ATPase; PFAM: ATPase, AAA-2; ATPase, AAA-type, core; Clp, N
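The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the feature is a complete coding sequence ending in a stop codon; the function names are hypothetical and not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Values taken from the Position field of this record.
start, end = 4816288, 4818927
length = feature_length(start, end)
print(length)                  # 2640, matching the Length field
print(protein_length(length))  # 879 residues expected in the translation
```

Under these assumptions the 2640 bp gene corresponds to 880 codons, that is, 879 amino acids plus the terminal TAA stop codon seen at the end of the DNA sequence.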

DNA sequence :
ATGCAACCTACTAATCCTAACCAATTTACCGAAAAAGCCTGGGAAGCGATCGCCCACACGCCAGATATTGCTAAACAATA
TCACCAACAACAAATCGAAAGCGAACACCTGCTCAAAGCACTGCTAGAACAAGAAGGTTTAGCCAGCAGTATTTTAACCA
AAGCAGGTGCGAATCTGCAAAAAATCCGCGATCGCACTGATCAATTTCTCCAACGTCAGCCAAGGGTATCTGGTACAAGT
AGTTCTGTGTTTTTGGGACGCAGCTTAGATACCCTATTAGATAGGGCTGATAACTATCGCAAAGATTTTAAAGACGAATA
TATTTCTATTGAACATTTATTGCTGGGTTATGCCAAAGATGACCGTTTTGGCAAAGGTCTACTCCAAGAATTCGGTTTAG
ACGAAGGCAAGCTGAAAAACATTATTAAACAAATTCGTGGGAGCCAGAAAGTGACCGACCAAAATCCAGAAGGTAAATAC
GAAGCACTGGAAAAATATGGACGTGACCTCACCGAAGCCGCCCGCAAAGGTCAACTTGATCCGGTGATTGGGCGAGATGA
CGAGATTCGCCGGACTGTGCAGATTCTCTCCCGTCGTACCAAAAATAACCCAGTGTTGATTGGGGAACCAGGAGTGGGTA
AAACTGCGATCGCGGAAGGACTCGCCCAACGCATCATCGCGGGTGATGTTCCCCAATCCCTCAAAGACCGCAAACTCATT
GCTTTGGATATGGGTGCTTTGATTGCGGGGGCAAAATTCCGGGGTGAATTTGAAGAACGCCTGAAAGCAGTCTTAAAAGA
AGTTACCGAATCTGGCGGGAATATAGTTCTATTTATTGATGAAATTCATACCGTTGTCGGTGCGGGTGCAACCCAAGGGG
CGATGGATGCAGGTAACTTGTTAAAACCGATGTTGGCGCGGGGTGAGTTGCGTTGTATTGGGGCGACAACTTTAGATGAA
TATCGCAAATATATCGAAAAAGATGCCGCTTTAGAAAGACGCTTCCAGCAAGTTTATGTCGATCAGCCCAGCGTTGAAGA
TACGATTTCCATATTGCGCGGTTTGCGGGAACGCTACGAAAACCACCACGGGGTGAAGATTTCTGATAGTGCTTTAGTAG
CAGCGGCGCAACTTTCGGGTAGATATATTAGCGATCGCTTTTTACCAGATAAAGCCATTGATTTGGTAGACGAAGCCGCC
GCCCGGTTGAAAATGGAAATTACCTCCAAACCAGAAGAACTCGACGAAATTGATCGCAAAATTCTTCAATTAGAAATGGA
GAAACTTTCTTTGAAAAAAGAAAGTGATGCAGCATCTCGTGAACGTCTAGAAAGACTAGAAAAAGAAATTGCTGATTTCA
AAGAAGAACAAAGAACACTTAACACTCAATGGCAGTCAGAAAAAGATATTATTGAAAAAATTCAGTCTGTTAAAAAAGAA
ATTGAACGGGTTAACTTAGAAATTCAGCAAGCTGAAAGAAACTACGACCTCAACCGCGCGGCGGAATTGAAATATGGCAA
TTTAACTGATTTACATCGCCAACTACAAGTAGCGGAAAGTGAATTAGCTAACGCCCAAAGAAGTGGTAAATCTCTACTGC
GGGAAGAAGTCACCGAATCTGATATTGCGGAAGTTATTTCTAAATGGACAGGAATTCCCATCAGCAAGTTAGTGGAATCA
GAAAAAGAAAAACTGCTGCATTTAGAAGACGAACTACATCAGCGGGTAATTGGCCAAGATGAAGCTGTGACAGCCGTCGC
CGATGCAATTCAGCGATCGCGTGCGGGATTGGCTGACCCCAATCGCCCCATCGCCAGCTTTGTGTTCCTCGGCCCTACAG
GCGTTGGTAAAACTGAGTTAGCCAAAGCCTTAGCGGGATATATGTTCGACACTGAAGATGCACTGGTGCGAATTGATATG
TCTGAGTACATGGAAAAACACGCCGTCTCGCGGTTAATCGGTGCGCCTCCTGGATATGTGGGTTATGAAGAAGGCGGACA
ACTCACCGAAGCAATTCGCCGTCGTCCCTACTCAGTGATTTTGTTCGACGAAATCGAAAAAGCCCACCCCGATGTCTTTA
ATATCTTCCTGCAAATTCTGGATGATGGACGCGTCACCGATGCCCAAGGTCATACTGTAGACTTCAAGAATTCAATTATT
ATTATGACCAGTAACATTGGTTCCCAGTTCATTCTTGATATTGCTGGGGACAATTCTCGCTACGACGAAATGCGTCATCG
AGTTATGGAAGCGATGCGGAATAGCTTCCGGCCAGAATTCCTCAACCGCATTGATGAACTGATCATCTTCCACAGCTTAG
ATAAAAAAGAACTGCGCCATATTGTGCAGTTGCAAGTAGGAAGATTAAGAGAAAGATTGAGTGATACCTCCAGTGGGCTT
CGCCAACGCAAAATATCTCTCAAACTCGCCGATGCGGCGCTGGACTTCTTGGCTGAAGTTGGTTATGACCCTGTATTTGG
CGCACGTCCACTCAAACGCGCAATCCAGCGTGAGTTAGAAACGCAAATTGCTAAAGCCATCCTGCGCGGCGAATTCCACG
ACGGCGACACCATTTTTGTCGATGTGCAGAATGAACGCCTAGCTTTTAGCCGTTTACCTGTGGAAGTATTTAGCAGTTAA

Protein sequence :
MQPTNPNQFTEKAWEAIAHTPDIAKQYHQQQIESEHLLKALLEQEGLASSILTKAGANLQKIRDRTDQFLQRQPRVSGTS
SSVFLGRSLDTLLDRADNYRKDFKDEYISIEHLLLGYAKDDRFGKGLLQEFGLDEGKLKNIIKQIRGSQKVTDQNPEGKY
EALEKYGRDLTEAARKGQLDPVIGRDDEIRRTVQILSRRTKNNPVLIGEPGVGKTAIAEGLAQRIIAGDVPQSLKDRKLI
ALDMGALIAGAKFRGEFEERLKAVLKEVTESGGNIVLFIDEIHTVVGAGATQGAMDAGNLLKPMLARGELRCIGATTLDE
YRKYIEKDAALERRFQQVYVDQPSVEDTISILRGLRERYENHHGVKISDSALVAAAQLSGRYISDRFLPDKAIDLVDEAA
ARLKMEITSKPEELDEIDRKILQLEMEKLSLKKESDAASRERLERLEKEIADFKEEQRTLNTQWQSEKDIIEKIQSVKKE
IERVNLEIQQAERNYDLNRAAELKYGNLTDLHRQLQVAESELANAQRSGKSLLREEVTESDIAEVISKWTGIPISKLVES
EKEKLLHLEDELHQRVIGQDEAVTAVADAIQRSRAGLADPNRPIASFVFLGPTGVGKTELAKALAGYMFDTEDALVRIDM
SEYMEKHAVSRLIGAPPGYVGYEEGGQLTEAIRRRPYSVILFDEIEKAHPDVFNIFLQILDDGRVTDAQGHTVDFKNSII
IMTSNIGSQFILDIAGDNSRYDEMRHRVMEAMRNSFRPEFLNRIDELIIFHSLDKKELRHIVQLQVGRLRERLSDTSSGL
RQRKISLKLADAALDFLAEVGYDPVFGARPLKRAIQRELETQIAKAILRGEFHDGDTIFVDVQNERLAFSRLPVEVFSS

• Homologs from PAI DB

Gene     GenBank Accn  Product            Virulence or Resistance  PAI or REI      Alignment Type  E-val   Identity (%)
STY0294  NP_454876.1   ClpB-like protein  Not tested               SPI-6           Protein         7e-108  42
aec27    AAQ96721.1    Aec27              Not tested               AGI-1           Protein         3e-105  41
aec27    YP_851418.1   ATPase             Not tested               PAI II APEC-O1  Protein         4e-105  41

• Homologs from VFDB (virulence genes)

Gene          GenBank Accn    Product                       ID of source DB  Alignment Type  E-val   Identity (%)
Nos7107_4122  YP_007051823.1  ATP-dependent chaperone ClpB  VFG2076          Protein         2e-118  45
Nos7107_4122  YP_007051823.1  ATP-dependent chaperone ClpB  VFG2084          Protein         1e-112  44