Gene Information

Name : Nwat_0200 (Nwat_0200)
Accession : YP_003759493.1
Strain : Nitrosococcus watsonii C-113
Genome accession: NC_014315
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 206329 - 209322 bp
Length : 2994 bp
Strand : +
Note : KEGG: dde:Dde_1859 DEAD/DEAH box helicase-like; PFAM: protein of unknown function DUF450; type III restriction protein res subunit; SMART: DEAD-like helicase
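
Consistency check (illustrative sketch, not part of the original record): the Position and Length fields can be cross-checked by treating the coordinates as 1-based and inclusive, and the expected protein length follows if the final codon is assumed to be a stop codon. The function names below are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(nt_length: int) -> int:
    """Expected residue count, assuming the last codon is a stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return nt_length // 3 - 1


# Coordinates taken from the Position field of this record.
length_bp = gene_length(206329, 209322)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2994 997
```

Both values agree with the record: 2994 bp in the Length field and 997 residues in the protein sequence.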

DNA sequence :
ATGAAGTCGACTGACACCAGCGAAAAGGGCCTGGAGTCCACCATCGTGGCCTCTTTGGTGGAGGAGGCAGGCTACGTCCA
GGGTGATCCGCAGGATTTCGACCGGGAACACGCGGTTGATCGGGCCAAGCTAGTGCAGTTCCTTGCCGCTACCCAGCCCG
ATACCTTTGAAAATCTCGGCATCGAGCAGGACAGCCCCAAGCGCACCCAGTTTCTGCACCGGCTACAAGGGGAGATCGCC
AAGCGCGGCGTGATCGACGTGCTGCGCGGTGGCCTTAAGCATGGCCCGGCCCATATAGATCTCTTCTACGGCACCCCAAC
GCCGGGCAATGTGAAGGCCGCCGAGCGGTTCGCGGCCAATCTCTTCAGCGTCACCCGTCAGCTCCGCTACAGCCGCGTTG
GTACCGCGCTCTCCCTCGACCTGGCGGTGTTCATCAACGGTCTGCCTATCGCCACCTTCGAACTTAAGAACAAGCTCACC
AAGCAGACGGTCCTTGATGCCGTTCAGCAGTACCAGCGGGACCGGGACCCGAAGGAACCGCTCTTTCAATTTGGTCGCTG
CGTTGTGCATTTTGCCATGGACGACCACGAGGTACGGATGTGTACCCACCTCAAGGGCAAGGGCTCGTGGTTTCTGCCCT
TCAACAAGGGCTACAACGACGGTGCGGGCAACCCGCCCAACCCTCATGGGCTGGCCACCGATTACCTGTGGAAGGAAATT
CTCACCAAGGAGGGCTTGGTGGACATCCTAGAAAACTATGCCCAGGTGGTGGAGGAAAAGGACGAAAAAACCGGCAAAAA
AAAGGACAAGCAGATTTTTCCCCGCTACCACCAATTGAAGGTGGTGCGGAGGCTGCTGGCCCATGCTCGGAAGAGCGGTG
TCGGCAAACGGTATCTGATCCAGCACTCGGCAGGCAGCGGCAAGAGCAACTCCATCGCCTGGCTGGCGCATCAGCTTGTG
GGGCTGGAGCAGGGGGGCAGGGCGTTGTTCGATTCGGTCATCGTGGTCACCGACCGGCGGGTGCTGGACAAGCAAATCCG
CGACACTATCAAGCAGTTCGTCCAGGTCTCGGCCACGGTGGGCCACGCGGAACACTCCGGCGATTTGCGCAAATTCCTCA
AGGTGGGGAAGAAAATCATCATTACCACGGTCCAGAAGTTTCCGTTCATCCTCGATGAGATCGGCGACGAGCACCGCGGA
GCCAAGTTCGCCATCGTCATTGACGAAGCCCACTCCAGTCAGGGCGGCAAGACCACCGCTGCTATGAACCGCGTGCTGGA
GGAGACCGCACCCTACGGTGATGAAATGGCTTCAGTCGAGGACAAGATCAACCAGATCATGGAAAGCCGGAAAATGGTGA
CCAACGCCAGCTATTTCGCCTTTACCGCGACCCCCAAGAACAAGACCCTGGAGATTTTCGGCGAGCCGGACCCCCAGCCC
GACGGCACGGTGAAGCACTCCCCGTTCCACAGTTACACCATGAAGCAGGCCATCCAGGAGGGGTTCATTCTGGATGTATT
GAAGCATTACACGCCGGTGGAGAGTTACTATCAGCTGGTCAAAGCGGTGGAGGACGATCCATTGTTCGATGCCAACAAGG
CCCAGAAGGAGCTGCGCCGCTATGTGGAATCCCATGCGCATGCTGTTCGTGCAAAGGCCGAAATTATGGTGGACCATTTC
CACGCCCAAGTGATTGGCCACCGCAAGATCGGTGGCCAGGCTCGGGCCATGGTAGTCACCCACGGCATTGAGCGGGCTAT
ACAGTATTTCCACGCCTTCAAGGACTACCTGAAGGAGCGCAAGAGCCCTTATGCGCCTATCGTGGCTTTTTCTGGTGAGC
ATGACGACCCCGCCTGGAGCGGGGGCGGACGCCTGCAAGGCGCTCCTCAATACGGCGGCAAGAAAGTGACCGAAGCGGCC
TTGAACGGCTTCCCCGGCAGCCAGATTCCAGACAAAATTCAACAGGACCCCTATCGCTTCTTGATCGTCGCTGATAAATA
TCAGACCGGTTATGATGAGCCGCTGCTTCACACCCTGTACGTGGACAAGGCGCTCTCCGGCATCAAGGCGGTGCAGACGC
TCTCGCGCCTGAACCGCGCCCACCCGCAAAAGCATGACACCTTTGTCCTGGATTTTTGCAATGATTCGGATACTATTCAG
CAGTCGTTCGAGCCCTACTATCGCGCCACGCTCCTCAGCGATGAGACCGATCCCAACAAGCTCCACGACCTCAAGTCCGA
TCTAGATGACTATCAGGTTTATTCCCAGGCGCAAATCGACGATTTGGTAGGGCGCTATCTGAGCGGCGCAGACCGGGACC
AGCTTGATCCGATCCTGGACGTTTGTGCGACCACTTACAATGCGGATCTGGATGAAGACGGCCAGGTGGATTTCAAGGGC
AAGGCCAAATCCTTCGTGCGTACCTACGGTTTTTTGGCCTCCATCCTGCCGTACTCCAATGCCGGCTGGGAAAAGCTGTC
GATATTGCTGAATTTCCTAATCCCCAAACTCCCTGCGCCCAGGGAGGAAGATCTGTCTTGGGGTATTTTGGAAACCGTGG
ATATGGACAGCTACCGGGTCGAGGCGAGATCCAGTTTAAAAATTGGTCTTGCGGATCAGGAGGTAGAGATCGGTCCCGTG
CCGACCAGCGGCGGTGGCCGCAAGTCGGAACCTGAACTGGATCAACTCAGTAACATCATCAAGGCGTTCAACGACCAGTT
CGGCAATATTGAATGGAAGGATGCCGACAAGATCCGCAAGGTCATCGCGGAAGAAATTCTGATCAAAGTCAGCATGGACC
TGGCCTACCAAAATGCCATGAAGAATAACGACAAGAAAACAGCGCGGATCGAACACGACGCTGCCTTGCAGCGGGTCATG
ATCGATCTGTTGGCCGATCACACCGAGCTGTACAAGCAGTTCAGCGACAACCCCTCGTTCAAGAAATGGCTGGGTGATAC
CATCTTCGGCGCGACCTACCAAAACGCACCTTGA

Protein sequence :
MKSTDTSEKGLESTIVASLVEEAGYVQGDPQDFDREHAVDRAKLVQFLAATQPDTFENLGIEQDSPKRTQFLHRLQGEIA
KRGVIDVLRGGLKHGPAHIDLFYGTPTPGNVKAAERFAANLFSVTRQLRYSRVGTALSLDLAVFINGLPIATFELKNKLT
KQTVLDAVQQYQRDRDPKEPLFQFGRCVVHFAMDDHEVRMCTHLKGKGSWFLPFNKGYNDGAGNPPNPHGLATDYLWKEI
LTKEGLVDILENYAQVVEEKDEKTGKKKDKQIFPRYHQLKVVRRLLAHARKSGVGKRYLIQHSAGSGKSNSIAWLAHQLV
GLEQGGRALFDSVIVVTDRRVLDKQIRDTIKQFVQVSATVGHAEHSGDLRKFLKVGKKIIITTVQKFPFILDEIGDEHRG
AKFAIVIDEAHSSQGGKTTAAMNRVLEETAPYGDEMASVEDKINQIMESRKMVTNASYFAFTATPKNKTLEIFGEPDPQP
DGTVKHSPFHSYTMKQAIQEGFILDVLKHYTPVESYYQLVKAVEDDPLFDANKAQKELRRYVESHAHAVRAKAEIMVDHF
HAQVIGHRKIGGQARAMVVTHGIERAIQYFHAFKDYLKERKSPYAPIVAFSGEHDDPAWSGGGRLQGAPQYGGKKVTEAA
LNGFPGSQIPDKIQQDPYRFLIVADKYQTGYDEPLLHTLYVDKALSGIKAVQTLSRLNRAHPQKHDTFVLDFCNDSDTIQ
QSFEPYYRATLLSDETDPNKLHDLKSDLDDYQVYSQAQIDDLVGRYLSGADRDQLDPILDVCATTYNADLDEDGQVDFKG
KAKSFVRTYGFLASILPYSNAGWEKLSILLNFLIPKLPAPREEDLSWGILETVDMDSYRVEARSSLKIGLADQEVEIGPV
PTSGGGRKSEPELDQLSNIIKAFNDQFGNIEWKDADKIRKVIAEEILIKVSMDLAYQNAMKNNDKKTARIEHDAALQRVM
IDLLADHTELYKQFSDNPSFKKWLGDTIFGATYQNAP
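
Translation check (illustrative sketch, not part of the original record): the protein sequence can be compared against a translation of the DNA sequence. The example below assumes the standard amino acid assignments, reading frame 1, and the + strand, and translates only the first 18 and last 9 nucleotides of the DNA sequence above. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard amino acid assignments, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First six codons and last three codons of the DNA sequence in this record.
print(translate("ATGAAGTCGACTGACACC"))  # MKSTDT
print(translate("GCACCTTGA"))           # AP*
```

The outputs match the start (MKSTDT) and the end (AP, followed by the stop codon) of the protein sequence listed above.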

• Homologs from PAI DB

Gene          GenBank Accn    Product                         Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity (%)
VC1765        NP_231400.1     type I restriction enzyme HsdR  Not tested               VPI-2       Protein         3e-177  44
VC0395_A1363  YP_001217306.1  type I restriction enzyme HsdR  Not tested               VPI-2       Protein         3e-177  44
VPI2_0013c    ACA01830.1      type I restriction enzyme HsdR  Not tested               VPI-2       Protein         8e-177  44

• Homologs from VFDB (virulence genes)

Gene          GenBank Accn    Product               ID of source DB  Alignment Type  E-val   Identity (%)
Nwat_0200     YP_003759493.1  hypothetical protein  VFG1098          Protein         1e-177  44