Gene Information

Name : Nwat_2227 (Nwat_2227)
Accession : YP_003761380.1
Strain : Nitrosococcus watsonii C-113
Genome accession: NC_014315
Putative virulence/resistance : Virulence
Product : ATP-dependent chaperone ClpB
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 2427026 - 2429623 bp
Length : 2598 bp
Strand : -
Note : TIGRFAM: ATP-dependent chaperone ClpB; PFAM: ATPase AAA-2 domain protein; AAA ATPase central domain protein; Clp domain protein; Clp ATPase-like; KEGG: noc:Noc_2381 ATP-dependent Clp protease; SMART: AAA ATPase
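
The Position and Length fields above can be cross-checked with a little arithmetic. The minimal Python sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon. The variable names are illustrative and not part of the database record.

```python
# Coordinates copied from the Position field (assumed 1-based, inclusive).
start, end = 2427026, 2429623

# An inclusive span contains end - start + 1 bases.
length_bp = end - start + 1

# Split the gene into codons; a non-zero remainder would indicate a frame problem.
codons, remainder = divmod(length_bp, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_len = codons - 1

print(length_bp, codons, remainder, protein_len)  # 2598 866 0 865
```

Under these assumptions the result agrees with the Length field (2598 bp) and with an 865-residue protein.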

DNA sequence :
ATGCGACAGGATAAATTAACGACCAAGTTCCAGGAAGCTTTGGCCGATGCTCAAAGTGTGGCGGTAGGCCAGGATCATCA
ATTTTTAGAACCACTTCATATCATATTCGCCCTGCTAGAGCAGCAAGGGAGTAGTGTCCCTCCGCTGCTAATGCAGGCTG
GGGTGAATGTCCGTAACCTCCATACCCAGCTTAGGGAAGCCCTGAAGCAGCTTCCCCAGGTACAGGGAACGCCTGGAGAG
ATTCATCTTTCCCAGGATTTAGCCCGCTTGCTTAATGTGACTGATAAGCTGGCCCAGCAGCGCCAGGATCAATATATTTC
CAGTGAATTATTTGTGCTTGCCGCGGTGGAAGATAAAGGCCAGGTGGGCGAATTGCTGCGTAAAAATGGCGCAACCAAGG
GTGGTATAGAGGCGGCTATCCAGGCGATTCGGGGCGGCCAGCAGGTGAATGAACCTGGCGCTGAGGATCAACGTCAAGCT
TTGGAACGTTATACCATTGATCTTACGGAGCGAGCAGAGCAGGGTAAACTGGACCCTGTCATTGGCCGCGACGATGAGAT
TCGCCGCACTATTCAGGTGTTGCAACGTCGGACTAAAAATAATCCTGTGCTCATTGGTGAGCCAGGGGTCGGTAAAACCG
CGATTGCTGAAGGGCTAGCCCAGCGTATTATTAATAGTGAAGTACCGGAAGGGCTTAAAAACAGGCGTCTGCTTGCTTTG
GATATGGGCGCGCTGATTGCTGGGGCCAAGTTCCGGGGCGAATTCGAAGAGCGCCTTAAGGCCGTGCTCAAGGATATTAG
TAAGGCGGAAGGAAATATTATCTTATTTATTGATGAACTCCATACGGTGGTGGGGGCCGGTAAGGCGGAAGGAGCCATGG
ACGCGAGTAATATGCTTAAACCTGCCTTGGCCCGGGGCGAACTTCACTGTATTGGAGCCACAACTCTGGATGAATACCGC
CGCTACATCGAAAAAGATGCGGCTCTGGAGCGGCGCTTCCAAAAGGTGCTGGTTGATGAGCCTAATGTGGAGGATACGAT
CGCTATTCTGCGGGGATTAAGTGAACGCTATGAGGTTCATCATGGAGTAGAAATTACTGATCCTGCTATTGTGGCGGCGG
CGACTTTATCTCACCGTTACATCACCGATCGCAAGCTGCCGGATAAGGCTATTGATCTCATTGATGAAGCCGCCAGTCGC
ATTCGGATGGAAATCGACTCCAAACCAGAGCCCATGGACCGATTGGAGAGGCGGTTGATTCAACTGAAGATTGAACGGGA
AGCTCTCCGTCGGGAGACTGACGAAGCTTCCAAGAGGCGTCTGGAAACCCTGGAAACCGAACTGAACCAGTTGGAAAAGG
AATATGCGGATTTGGAGGAAATCTGGAAGGCGGAGAAAGCTACTTTAAGTGGCGCCCAAGGAATCAAGGAGCAACTGGAG
CAGGCTCGCCTGGATTTGGAATCCGCCCATCGTGCCGGAGATTTAGGGCGGATGTCGGAGTTGCAATATGGCCGTATCCC
AGAGTTGCAAAAGCAGTTGGATGCCGCCACGGCGGTGGAGCAGCATGATTTTAAGTTGCTGCGCAATAAAGTCTCGGAAG
AGGAAATTGCGGAGGTCGTTTCCAAATGGACGGGAATTCCAGTCTCTAAAATGCTGGAAGGCGAGCGGGAGAAATTGTTA
AAAATGGAAGCGGCGCTTCACCAGCGTGTGGTGGGTCAAGATGAGGCTATTGCCGTGGTGAGCAACGCGATTCGTCGCTC
CCGAGCGGGGCTTGCTGATCCCAACCGGCCCAATGGGTCCTTTCTCTTTTTAGGGCCTACAGGGGTAGGTAAAACCGAGC
TTTGCAAGGCCCTGGCGATGTTCCTGTTCGATACGGAGGAGGCCATGGTCCGTATTGATATGTCCGAATTTATGGAGCGG
CATTCCGTGGCCCGTTTGATCGGTGCTCCACCGGGTTATGTGGGTTTTGAGGAAGGAGGTTACCTGACCGAGGCGGTACG
CCGTAAGCCCTATTCGGTAATTTTGCTCGATGAGGTAGAAAAGGCCCATCCGGATGTGTTTAATATCTTGCTTCAGGTTC
TGGACGACGGGCGGCTAACGGATGGTCATGGCCGCACGGTGGACTTTCGCAATACGGTGGTGGTCATGACTTCTAATCTC
GGCTCCCATGTTATCCAGGAAATGGCCGGAGAAGATCGTTACCAGGAAATGAAAAGTGCGGTAATGGAAATTGTAGGACA
ACATTTTCGGCCAGAATTTATTAACCGGGTTGATGATGTGGTGGTCTTTCATCCCTTACAGAAAGAGCAGCTTCAGGCTA
TTGCCCAGATCCAGATGGGTTATCTCCGGCAGCGATTGGCGCAACGGGATATGGTGTTAACTATTCCCGAGGATGCGTTA
AGCAAGCTGGCGGAGGCGGGCTTCGATCCAGTTTACGGTGCCAGGCCCCTCAAGCGGGCGATTCAGCAACGGCTCGAAAA
TCCCTTAGCCCAGGAAATTCTCGCTGGCAAATTTGAGTCTGGGGATTTCATTGAGGTGGGGGTTGAAGGGGATCAATTTA
CTTTTAAGAGAGAGACGAGAGCAGCATCCGTAGCCTAG

Protein sequence :
MRQDKLTTKFQEALADAQSVAVGQDHQFLEPLHIIFALLEQQGSSVPPLLMQAGVNVRNLHTQLREALKQLPQVQGTPGE
IHLSQDLARLLNVTDKLAQQRQDQYISSELFVLAAVEDKGQVGELLRKNGATKGGIEAAIQAIRGGQQVNEPGAEDQRQA
LERYTIDLTERAEQGKLDPVIGRDDEIRRTIQVLQRRTKNNPVLIGEPGVGKTAIAEGLAQRIINSEVPEGLKNRRLLAL
DMGALIAGAKFRGEFEERLKAVLKDISKAEGNIILFIDELHTVVGAGKAEGAMDASNMLKPALARGELHCIGATTLDEYR
RYIEKDAALERRFQKVLVDEPNVEDTIAILRGLSERYEVHHGVEITDPAIVAAATLSHRYITDRKLPDKAIDLIDEAASR
IRMEIDSKPEPMDRLERRLIQLKIEREALRRETDEASKRRLETLETELNQLEKEYADLEEIWKAEKATLSGAQGIKEQLE
QARLDLESAHRAGDLGRMSELQYGRIPELQKQLDAATAVEQHDFKLLRNKVSEEEIAEVVSKWTGIPVSKMLEGEREKLL
KMEAALHQRVVGQDEAIAVVSNAIRRSRAGLADPNRPNGSFLFLGPTGVGKTELCKALAMFLFDTEEAMVRIDMSEFMER
HSVARLIGAPPGYVGFEEGGYLTEAVRRKPYSVILLDEVEKAHPDVFNILLQVLDDGRLTDGHGRTVDFRNTVVVMTSNL
GSHVIQEMAGEDRYQEMKSAVMEIVGQHFRPEFINRVDDVVVFHPLQKEQLQAIAQIQMGYLRQRLAQRDMVLTIPEDAL
SKLAEAGFDPVYGARPLKRAIQQRLENPLAQEILAGKFESGDFIEVGVEGDQFTFKRETRAASVA
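
The DNA sequence above begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. As a rough sketch of how the protein sequence relates to it, the Python below translates the first 24 and last 18 bases of the DNA sequence. It assumes the standard codon assignments, and `translate` is a hypothetical helper written for this illustration, not a database tool.

```python
# Standard codon table built from the conventional TCAG ordering (assumed to apply here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

head = "ATGCGACAGGATAAATTAACGACC"  # first 24 bases of the DNA sequence
tail = "GCAGCATCCGTAGCCTAG"        # last 18 bases, including the stop codon

print(translate(head))  # MRQDKLTT
print(translate(tail))  # AASVA
```

Both fragments match the start (MRQDKLTT) and end (AASVA) of the protein sequence listed above. Passing the full DNA sequence to the same function should give the complete protein under the same assumptions.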

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
aec27 | YP_851418.1 | ATPase | Not tested | PAI II APEC-O1 | Protein | 3e-105 | 41
aec27 | AAQ96721.1 | Aec27 | Not tested | AGI-1 | Protein | 2e-105 | 41

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-value | Identity (%)
Nwat_2227 | YP_003761380.1 | ATP-dependent chaperone ClpB | VFG2076 | Protein | 8e-120 | 42