Gene Information

Name : Nwat_1885 (Nwat_1885)
Accession : YP_003761055.1
Strain : Nitrosococcus watsonii C-113
Genome accession: NC_014315
Putative virulence/resistance : Unknown
Product : SNF2-like protein
Function : -
COG functional category : K : Transcription
COG ID : COG0553
EC number : -
Position : 2071841 - 2075278 bp
Length : 3438 bp
Strand : -
Note : KEGG: vcm:VCM66_1699 putative helicase; PFAM: SNF2-related protein; helicase domain protein; restriction endonuclease; SMART: DEAD-like helicase ; helicase domain protein

DNA sequence :
ATGTCCCTGAAGGAATGGATCACGGGCCTTGTTGGCGCCAAGGCGACGGGTTGCGCTTGGAATTACGATTCCGAGGGCAT
TAATTTTATCTTCACTGAAAAACAAGAAGCGCAAATTCATGGTGGTGGCGCAAGCGATCTGCTGATACATCAATATATTG
CCCTAAGCATGCTAGTCGAACAGGGGCGGGCAGAAGAGTTGCCTAACGGTGTGCTCGTCCCTTCGGACGTTGTGGTACAA
CTAGATGAAAATACCCGCCTACTGTTGGGGCTGCCCGACCGATGGAAGGGCGTCCTCAATGCAAACATCAAGGGTAAGAC
GGGCAGCGCATCTTTTCAGGTAGAGCTGGCCGCTGCGGATGCGGGGGGCGGATTTACGCATGGCTTTGCAGTAGAGGGGC
CAGTCATTCGCTTTTCGGCGGAAAAACGTTATGTGCTGTCTCCTGCTCATTTGCTCATATTTGAGGCATATGAGAAGCAT
GCACATTCCGCGAAGATGGAGTATGACGACTTACGTCTCGTTCTCTCGTTGCAAAAGGCTCAAGAGCTAGGCGCTTCGAT
CTCTCTGGGCCATTTTGGAAAGCTTAATATTCGCGCGCCAGCATCAATTACGGTTGCAGCCGAGCTAGACGAGCACGGCC
GCCTCGTGCTTACGCCACGGATGGGTCAGGCGGCGAGTCATGAACGTATTCAACGTGTTCTTGGGCAATTGAGGGCAGAA
CACGCGACTTCTCTCCGCGTAGACGGCGAAATTATTCTTTTCGATGAAGAAAAATTAGCGGCCGTTCGGGAAATTCTTCG
GAATCGCGTGATACCGAAGGATAAAGTGAGCGCTTTTCTCAAAAACCCAACTGCATTTATCAATGCCAGCTTCGTAGACT
TAGACCTTGGTTTTTCGGCTAGAGTGAAGGGAGCGACGAAATTCAGGCATGCGTATTTTGGTGAGACCGATGAATCTGGC
ATCGATTGGTTCGGTAAAAAATTCTCTCCTACGGAGGTATTTGCGCCGGCAAAGTTAGTGGATGTCATATCAGATGGAGA
TCAATTAGACGACGTTCGGCAGAAGCTAGCGAATGTCGCGGAAACAGGCGCTCAGGAGATGGAATTTGGGGGCCGGATCT
TCGATGTTTCTAACCCAAAGCTGGTCAATTCCATCTTGGAGAAGCTTGAACGTAAACTGATTGAAGAAAAAAGGGAAAAT
GTTTCGGCTGTTGAAGCCGATAGCGCCAGCAAGAAAGAGCAAGCGGAACCCACCGTGGTCGATATTGGGCTCAACGATGT
AGATTTAGAAACGCCTTCAGCTTCACTTAATAGAGCGATTAATGACGTCCTCATCGCCGAGTCTGCGCTCGACTGGGGCA
CTTTCAAGCGATTGCCGTTTCCTCACCAGGTATTGGGTACGCGCTGGATCTTGGGGTTGGCATTAGATCGCGACGGACGC
GGTGGAGGTCTTCTTGCGGATGACATGGGACTAGGAAAAACATTCATGTCCCTTGCAGCTATGGAGCATCTATACAAATC
GTATCGAGACCGAAAGCTTACCGAGAAGCCCTGCCTTGTCGTTGCACCCTTGAGCCTGTTGCAGAACTGGAAAGATGAGG
TCGGTCAAACTTTTAAGGAGTCGCCTTTCAACGATATCGTTATCCTGCAGTCGGATGGGCGGCTTTCCGAGTTCCGAGCA
GGAGGCGTCGAGACGAAAAACCAGAATATTACTGAGGATGAAACCGCTCCTATCCGGTATTCGCTGAAGATCGGTAGCGG
TTTTGCGCAGGAGCGACTTGACCTACCGAAGCGCTTGGTGATTACCACCTACCAAATGCTGCGAGACTATCAATTCTCTC
TTTGCTCAATTGACTGGGGGATAGCGGTATTCGATGAAGCTCAGAACATAAAAAATCCAAATGCCCTGCAAACACGGGCA
GCAAAGGGAGTCAAAGCAGATTTCAGGCTGATTGCCACGGGAACGCCGGTTGAAAACAGCCTTACAGATTTTTGGTGCCT
AATGGATACAGCCTGTCCTGGGCTCTTGGATAGCTATCAAGCGTTTCGTAGTGCTTACATCACGCCGATACTGCGTGCGG
CTGGAGATGAAATTGAACATGTTAGGGGCATTGTTGGCCGGCGGCTCCGAGAACATGTTGGACCTTGGATGTTGCGCCGT
GTGAAAGAGGATAATCTGGAAGGTCTGCCGGATAAGTCCGTTTTCGTGGGTATGAAGGATGATGTTTGGGCGTATTGCCC
AGCGCTGCATTCCACCCTGGAAGGGGGTCAGTTTGAATCCTATAACACAATTCTTATGCGTGCCTCCAGTTCAGATAGCA
ATGCCGCATTGGCTGCGTTGCAGAAGCTCAGAGATGTTTCTTTGCATCCACGACTCGTATTTGGCGGGGGGCTAGACACG
CCGCGCCCATTAGCAGATTTGGTAAATTTGACAGACGAATCCGGGAAAATTCGTAGTTTACTGCCTATTTTGGATCAAAT
TCGGGATCGGGGCGAAAAATGCATCATTTTTGTAATCAATAAGAAGCTCCAAGCATTTCTCGCATTGACCTTAGCAAAGA
TATATTTGCTGCCCCCTGTTTCAGTCATAAATGGTGATGCGAAAGCGGTGGCAAAGAGGGCAGCAAGCCCCACTCGACAA
AGCATGATAAGAGCTTTTGAGGAGAGGGACGGATTTAATGTTATCATCATGTCGCCGATAGCCGCGGGCGTTGGACTTAC
CGTGGTCGGCGCCAATAACGTTATTCATCTGGAACGCCATTGGAACCCAGCGAAAGAGGCTCAGGCAACCGACCGTGTTT
ACCGCATCGGACAGAAGCGGAAGGTAAATGTGTTTATTCCTTTGATTCACCATCCTGAATACCAGTCTTTTGATGTCAAT
TTGCATCAACTTCTGTCGAGAAAAGGGCAGTTGAAAGACGCAGTAGTTACTCCCGTGCAGGTAATGCCTTCACCGGAAGG
TTTACCTCAGGAGAGCAGGTCTTCAAGTCAGCGCATTTCGTTCGATGAGTTGAGAAATTTGAGTTGGCAACAATTTGAGG
CGCTGTGCGCAGAATTGCTGTACAAAGAATATCAAGCGGATAACTGTTGGTTGACACACAGTGGCGCGGATTTTGGTGCT
GATGTTGTTTTGACGAAAGATGGAATTGGGTTATTGATTCAGTGCAAGCATACGAGCGGTAGTGCTTATGATGGTTATAA
GGCTATTCTTGAAGTTCACTCTGCCACAGTCAAATACGAAGCGGAACTCGGTAAGCGGATCGATTCTTGGATACTTGCTA
CGAATGCCGTGTGTTTAGGATCAGCAACTCAAAAGGCCGCAAAACAGTACTCGGTTGAAATTTTAGATGGAAAGACGCTC
TCCCGTCTGCTAGGCAGACACGCGGTGACATACGCAGATGTGCTTAATCGGCTCGAAAAAAAGCGTCTTCGTGTGTAA

Protein sequence :
MSLKEWITGLVGAKATGCAWNYDSEGINFIFTEKQEAQIHGGGASDLLIHQYIALSMLVEQGRAEELPNGVLVPSDVVVQ
LDENTRLLLGLPDRWKGVLNANIKGKTGSASFQVELAAADAGGGFTHGFAVEGPVIRFSAEKRYVLSPAHLLIFEAYEKH
AHSAKMEYDDLRLVLSLQKAQELGASISLGHFGKLNIRAPASITVAAELDEHGRLVLTPRMGQAASHERIQRVLGQLRAE
HATSLRVDGEIILFDEEKLAAVREILRNRVIPKDKVSAFLKNPTAFINASFVDLDLGFSARVKGATKFRHAYFGETDESG
IDWFGKKFSPTEVFAPAKLVDVISDGDQLDDVRQKLANVAETGAQEMEFGGRIFDVSNPKLVNSILEKLERKLIEEKREN
VSAVEADSASKKEQAEPTVVDIGLNDVDLETPSASLNRAINDVLIAESALDWGTFKRLPFPHQVLGTRWILGLALDRDGR
GGGLLADDMGLGKTFMSLAAMEHLYKSYRDRKLTEKPCLVVAPLSLLQNWKDEVGQTFKESPFNDIVILQSDGRLSEFRA
GGVETKNQNITEDETAPIRYSLKIGSGFAQERLDLPKRLVITTYQMLRDYQFSLCSIDWGIAVFDEAQNIKNPNALQTRA
AKGVKADFRLIATGTPVENSLTDFWCLMDTACPGLLDSYQAFRSAYITPILRAAGDEIEHVRGIVGRRLREHVGPWMLRR
VKEDNLEGLPDKSVFVGMKDDVWAYCPALHSTLEGGQFESYNTILMRASSSDSNAALAALQKLRDVSLHPRLVFGGGLDT
PRPLADLVNLTDESGKIRSLLPILDQIRDRGEKCIIFVINKKLQAFLALTLAKIYLLPPVSVINGDAKAVAKRAASPTRQ
SMIRAFEERDGFNVIIMSPIAAGVGLTVVGANNVIHLERHWNPAKEAQATDRVYRIGQKRKVNVFIPLIHHPEYQSFDVN
LHQLLSRKGQLKDAVVTPVQVMPSPEGLPQESRSSSQRISFDELRNLSWQQFEALCAELLYKEYQADNCWLTHSGADFGA
DVVLTKDGIGLLIQCKHTSGSAYDGYKAILEVHSATVKYEAELGKRIDSWILATNAVCLGSATQKAAKQYSVEILDGKTL
SRLLGRHAVTYADVLNRLEKKRLRV

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
VPI2_0006c ACA01823.1 hypothetical protein Not tested VPI-2 Protein 0.0 54
VC0395_A1359 YP_001217302.1 hypothetical protein Not tested VPI-2 Protein 0.0 53