Gene Information

Name : Npun_F3510 (Npun_F3510)
Accession : YP_001866859.1
Strain : Nostoc punctiforme PCC 73102
Genome accession: NC_010628
Putative virulence/resistance : Virulence
Product : ATPase
Function : -
COG functional category : O : Posttranslational modification, protein turnover, chaperones
COG ID : COG0542
EC number : -
Position : 4410262 - 4412901 bp
Length : 2640 bp
Strand : +
Note : PFAM: ATPase AAA, central domain protein; Clp N terminal domain protein; ATPase associated with various cellular activities, AAA_5; ATPase AAA; SMART: ATPase AAA; KEGG: ava:Ava_0140 ATPase

DNA sequence :
ATGCAACCTACAGATCCGAATAAATTTACTGATAAAGCCTGGGAAGCAATTGTTAAATCTCAGGATATAGTCCGTGCTTA
TCAACAACAGCAACTAGATGTTGAACATTTAATTATTGCCCTGTTAGAAGAACCCACTAGTCTAGCAATACGTATCCTGG
CTCGATCTGAGGTCGATCCAATCCGCTTTCAACAGCAGCTAGAAGCCTTTATCCAACGTCAGCCCAAAGTTGGTAAAAGC
GATCAGCTTTATCTTAGCCGTAGTTTAGATGTTTTACTAGATAAAGCTGAGGAAGCTAGAGTCAGGATGAAAGACTCCTA
CATTTCCGTAGAACATATACTTTTGGCTTTTGCTGAAGACGATCGCATTGGACGAAAGATCCTCAAAAGCTTTAGTGCGG
ACGCAGCTAAACTAGAAGCTACTATCAAAGCCGTTCGCGGTAGCCAAAAAGTAACAGATCAAAGCCCAGAATCGCGCTAT
GAAGCTTTACAAAAATTTGGCAGAGATTTGACAGAACAGGCAAAAGCTGGAAAACTCGACCCGGTAATTGGGCGAGATGA
CGAAATTCGGCGGGTAATTCAAGTATTGTCTCGTCGGAGCAAAAATAACCCCGTCTTGATTGGTGAACCTGGGGTAGGTA
AAACTGCGATCGCAGAAGCTTTGGCACAACGGATGGTAAACGGTGACGTTCCCGAATCTCTGAAAAACCGCCAACTGATC
TCTTTAGACATCGGTAGTTTAATTGCTGGGGCAAAATTGCGAGGAGAATTTGAAGAACGTTTGAAAGCTGTCCTCAAAGA
AGTTATGGACTCTAACGGGCAAATTGTCCTGTTTATCGACGAACTACATACCGTAGTCGGTACGGGTTCCAGCCAACAAG
GGGCAATGGATGCCGGAAATTTGCTCAAACCAATGCTGGCGCGGGGAGAACTGCGTTGTATTGGTGCAACTACCCTCGAC
GAGTTCCGCAAACACATTGAGAAAGACGCTGCCCTAGAACGCCGCTTTCAGCAAGTATTTGTCGATCAGCCAAGTGTGGA
AAATACTATTTCCATTCTGCGGGGGTTGAAAGAACGCTATGAAGTGCATCACAACGTCAAAATTTCTGATTCGGCTTTGG
TAGCAGCAGCAACCCTGTCAGCACGTTATATTAGCGATCGCTTCTTACCAGATAAAGCAATAGATTTGGTGGATGAAGCC
GCAGCACAGTTGAAAATGGAGATTACCTCCAAACCAGCAGAATTGGAAACCATCGATCGCCGCCTCATGCAGTTAGAAAT
GGAAAAGCTGTCATTAGCTGGCGAAGAAAAGGGTACTCCCCAAACAAAAGAGCGTTTGGAGCGCATTGAGCAAGAAATCG
CCAATTTAACTGAAAAACAGCAAATATTTAATGAGCAATGGCAAGGTGAAAAGCAGATATTGGAGGCGATAAGCGCCTTA
AAGAAAGAAGAAGATGCGCTGCGAGTGCAAATTGAACAGGCGGAACGCGCTTATGACCTAAATAAAGCTGCCCAACTGAA
ATATGGCAAATTGGAAGGAGTACAGCACGAGCGCGAAGCTAAAGAAGCCAGCCTTTTAGAAATTCAAAACCAAGGTTCCA
CGTTGCTGCGAGAACAAGTCACTGAAGCTGATATTGCGGAAATCGTCGCCAAATGGACAGGAATTCCCGTTAATCGCCTG
TTGGAATCGGAACGGCAAAAATTACTGCAACTAGAAAGTCATTTGCATCAACGAGTCATTGGGCAAGAAGAGGCTGTAGA
AGCGGTAGCAGCAGCCATTCGCCGCGCCCGTGCGGGGATGAAAGACCCCTCCCGTCCCATTGGTTCATTTTTGTTCATGG
GCCCCACAGGTGTGGGCAAAACCGAACTCGCCCGTGCTTTAGCTCAGTTTCTCTTTGATTCTGATGATGCCTTGGTGCGC
TTAGATATGTCTGAGTATATGGAAAAACACTCAGTTTCTCGGTTAGTGGGAGCGCCTCCAGGATACGTAGGCTATGAAGA
AGGCGGTCAACTTTCCGAGGCGGTTCGCCGCCGTCCCTACTCAGTGGTGCTGCTGGATGAAGTGGAAAAAGCGCACCCCG
ATGTGTTCAATATTTTGTTGCAGGTGTTAGATGATGGGAGAATTACTGACTCTCAGGGGAGAACGGTAGATTTTCGTAAC
AGCGTTATTGTAATGACCAGTAACATAGGTAGCGAACACATTTTAGATGTGTCTGGTGATTCTCAGTATGAAACGATGCG
GAAGCGGGTAATGGAAGGTTTGCGATCGCATTTCCGCCCAGAATTTCTCAACCGCGTTGATGATATTATTCTCTTCCATA
CCCTAAATCGCACGGAAATGCGGCAAATCATCCGTATTCAACTCAAGCGAGTAGAAAATCTCCTACGAGAGCAAAAAATC
TTCTTTGAGATATCCCAAGCAGCCTGCGATCACCTTGTCGAATCAGGCTATGACCCAGTTTATGGTGCGCGTCCACTCAA
ACGCGCAATTCAGCGAGAAGTAGAAAACCCCCTCGCCACCAAGTTATTGGAAAATACTTTTATCTCTGGAGACACGATTC
TCATTGACAAAAATGAAAATGGTCTGTCTTTTAGTAAAAAAGTGCTGGTGAAGGTGTCAGTACCACAGATTGCTACATAG

Protein sequence :
MQPTDPNKFTDKAWEAIVKSQDIVRAYQQQQLDVEHLIIALLEEPTSLAIRILARSEVDPIRFQQQLEAFIQRQPKVGKS
DQLYLSRSLDVLLDKAEEARVRMKDSYISVEHILLAFAEDDRIGRKILKSFSADAAKLEATIKAVRGSQKVTDQSPESRY
EALQKFGRDLTEQAKAGKLDPVIGRDDEIRRVIQVLSRRSKNNPVLIGEPGVGKTAIAEALAQRMVNGDVPESLKNRQLI
SLDIGSLIAGAKLRGEFEERLKAVLKEVMDSNGQIVLFIDELHTVVGTGSSQQGAMDAGNLLKPMLARGELRCIGATTLD
EFRKHIEKDAALERRFQQVFVDQPSVENTISILRGLKERYEVHHNVKISDSALVAAATLSARYISDRFLPDKAIDLVDEA
AAQLKMEITSKPAELETIDRRLMQLEMEKLSLAGEEKGTPQTKERLERIEQEIANLTEKQQIFNEQWQGEKQILEAISAL
KKEEDALRVQIEQAERAYDLNKAAQLKYGKLEGVQHEREAKEASLLEIQNQGSTLLREQVTEADIAEIVAKWTGIPVNRL
LESERQKLLQLESHLHQRVIGQEEAVEAVAAAIRRARAGMKDPSRPIGSFLFMGPTGVGKTELARALAQFLFDSDDALVR
LDMSEYMEKHSVSRLVGAPPGYVGYEEGGQLSEAVRRRPYSVVLLDEVEKAHPDVFNILLQVLDDGRITDSQGRTVDFRN
SVIVMTSNIGSEHILDVSGDSQYETMRKRVMEGLRSHFRPEFLNRVDDIILFHTLNRTEMRQIIRIQLKRVENLLREQKI
FFEISQAACDHLVESGYDPVYGARPLKRAIQREVENPLATKLLENTFISGDTILIDKNENGLSFSKKVLVKVSVPQIAT

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
STY0294 NP_454876.1 ClpB-like protein Not tested SPI-6 Protein 3e-105 42
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 1e-105 41
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 8e-106 41

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Npun_F3510 YP_001866859.1 ATPase VFG2084 Protein 3e-114 42
Npun_F3510 YP_001866859.1 ATPase VFG2076 Protein 2e-118 42