Gene Information

Name : Nhal_0754 (Nhal_0754)
Accession : YP_003526321.1
Strain : Nitrosococcus halophilus Nc 4
Genome accession: NC_013960
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 765755 - 768787 bp
Length : 3033 bp
Strand : +
Note : PFAM: protein of unknown function DUF450; type III restriction protein res subunit; SMART: DEAD-like helicase; KEGG: psp:PSPPH_0104 type I restriction-modification system restriction subunit
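
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state, and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Position and Length fields of this record.
# Assumption: coordinates are 1-based and inclusive.
start, end = 765755, 768787
reported_length_bp = 3033

# Length of an inclusive interval is end - start + 1.
gene_length_bp = end - start + 1

# A complete coding sequence is a whole number of codons;
# the last codon is assumed to be the stop codon and encodes no residue.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed gene length agrees with the Length field, and the expected protein length can be compared against the protein sequence listed further down.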

DNA sequence :
ATGGTCAGCCAAACCAACGAACAGGCCCTAGAGGCTACCATAGAGAAATATCTGACCGGCACCTGTCTGGAAGAACAAAA
AGCCATCAGGGAGGGGATAAGGGAGGTGCCCTTCAGCCCGAACCAGGGTTACAAGTTGGGGCAGGCTGCGGATTTCAATG
CCCACTATGCGGTCGATACCCGACTTTTCTGGCAGTTTTTGGAGAGCGCCCAGGCCGAGGAACTGGACAAGCTCAAAAAG
CACCACCCTGACTGGCGGAGCAAGATTCTGGAGCGGTTCGACCGGCTGGTGAAAAAGCACGGGCTGCTGCACCTGCTCAA
GCGGGGCCTGAGCATTGATGACGCCCATTTCCATCTGATGTTTCCTCCGCCCCTGGCCAGCAGCTCGGATAAGGTCAAGG
AGAACTTCAGCGCCAACCTCTTCAGCAGCACCCGTCAGGTGCGCTATTCCCTGACCCGTACGCTGGAAGAAATCGACCTG
GTGCTGTTCATCAACGGCCTGCCCTTCGCCACCCTGGAGCTGAAAAACCCCTGGACGGGCCAGACCGCCCGCTACCACGG
CCAGAGGCAATATCGCCTGGAGCGGGACATCACCCAGCCCTTGCTGCAATTCGGGCGCTGTCTGGTGCATATGGCGGTGG
ATACCGACGAAGTGTACATGACCACCAAGTTGGCCGGGAAAAGCACTTTCTTTCTGCCCTTCAATAAGGGGCATCACTTT
GGCGCCGGCAATCCGCCCAATCCCGGCGGCCATAAAAGCGCCTATCTGTGGCGGGAGGTGTTCAGCAAGGAGAGCATCGC
CAACATCATCCAGCATTTTGTGCGCCTCGACGGCGGCAGCAAAACCTCCCTGGCCAAGCGCACCCTGTTTTTTCCCCGCT
ATCACCAGTTGGATGTGGTCCGCAAGCTGGTGGCCCACGCCAGCGAACGGGGAGTGGGCCAGACTTACCTGATTCAGCAC
TCCGCCGGTTCCGGGAAATCCCACTCCATTACCTGGGCCGCCTACCAACTAATCGAGACCTACCCGGTCTCTGCCCATGT
CCGGGGAGCCAAGGGGCTGGATGCGCCCCTGTTCGATTCGGTGATTGTGGTCACCGACCGGCGTCTGCTGGACAAGCAGC
TGCGGGAGAACATCCGCGAGTTCTCGGAGGTGAAGAACATTATCGCCCCGGCCCTCCGCTCCTCAGACCTTAAACAGGCC
CTGGAGCAGGGCAAGAAGATCATTATCACCACCCTCCAGAAATTCCCGTTTATCATTGATGGCATCGAGGACATGAGCGA
TAAGTGCTTCGCCGTCATCATCGATGAGGCCCACGGCTCCCAGGATGGCAGCGCCCACGGCAAGATGAATCAGGCCATGG
GGCGGGATGCCGATGACGAAGACGATCAGAGCGACCCTCAGGATAAGGTTCTGAACGCCATGCGTTCGCGCAAGATGCGG
GGCAATGCCTCCTACTTTGCGTTCACCGCCACCCCCAAGAACAGCACCCTGGAGAAATTCGGGCAGCAACAGCCCGACGG
CAGCTTCAAGCCGTTCCATCTCTACTCCATGAAGCAGGCCATCGAGGAGGGCTTTATTCTCGATGTGCTGGCCAACTACA
CCACCTACAAAAGCTATTACGAGATCCAAAAATCCATCGAGGACAACCCCCTGTTCGATACCGCCAAGGCCCAGAAGAAG
TTGCGGGCCTATGTGGAGCGCAATCCCCAGACCATCAACGCCAAGGCCGAGATCATTTTGGAGCACTTTATCCCCCAGGT
GGTGAACGCCAAAAAGCTCAGGGGCAGGGCCAAGGGCATGGTGATTACCCAGAATATCGAAACCGCCATCCGCTATCACC
AGGCGATTGGGTGCATCCTTCGGGACCGGGGCAACCCTTTCAAGGCGCTAGTGGCTTTCTCCGGCGCCAAAGAAGTAGAC
GGCATTGAATACACCGAGGCCGAGATGAATGGCTTTCCTGAAGCGGACACCCGAGATAAGTTCGATAAGGACGAGTACCG
CCTGCTGGTGGTGGCCAACAAGTACCTCACCGGCTTCGATCAGCCCAAGTTGACCGCCATGTATGTGGATAAAAAACTGC
AAGGGGTCCTGGCCGTGCAGGCCCTGTCCCGCCTCAACCGGGCGGCTCCCCAGTGGGGCAAGAAGACGGAAGACCTGTTC
GTGCTGGATTTCTTCAACACGGTGGAGGACATCAAGGCCGCCTTTGATCCCTTCTACACCGCCACCAGCCTGTCACAGGC
AACGGATGTCAATGTGCTGCACGAACTCAAGGATGCCCTGGGGGATGTGGGGGTCTACGAATGGCGCGAGGTGGAAGCGT
TTGTCGGCTTGTACTTTGACAACGCCGATGCCCAGCAATTGAGCCCCCTGATGGATACCGCCGCGGCGCGTTTCAACCAG
GAATTGGGCCTGGAGGACGAGGAAAAGGCGGACTTCAAAATCAAGGCCAAGCAATTCGTGAAGATTTATGGCCAGATGGC
GTCCATTATGCCCTATGAAGTGGTTGCCTGGGAGAAGCTGTTCTGGTTTCTGAAGTTCCTGATTCCCAAGCTGGTGGTTA
AAAGCAAGGATGCCGATGAAATCGATGCCCTGCTGGACTCCGTCGATCTGTCTTCCTATGGCCTGGAACGGGTCAAGCTG
AATCAGGCCATTGGGCTGGATGAATCCGAGACGGAAGTTGACCCCACTAACCCCAATCCTCGGGGCGTTCACGGAGGTGG
CGAAGAACAGGATCCTCTTGATGAGATTATCCAAAGCTTTAACGAACGCTGGTTCCAGGGTTGGGAGGCCACGCCGGAAG
AACAGCGGGTGAAGTTCCTCAGTATCATCAAAAGCATCCAGGCCCACCCGGACTTTGAAGCCAAATACAAGCACAACCAG
GACGCGGATAACCGCACCTTGGCCTTCGAGAAGATCTTCGAGGAAGTCATGCTCGAGCGCCGGAAAATGGATCTGGATTT
GTACCGCTTGCTGGCTAGCGATCCGGCGTTCAAATCATCTCTGCAACAGAGCCTAAGGCATTTGCTGGAGTAG

Protein sequence :
MVSQTNEQALEATIEKYLTGTCLEEQKAIREGIREVPFSPNQGYKLGQAADFNAHYAVDTRLFWQFLESAQAEELDKLKK
HHPDWRSKILERFDRLVKKHGLLHLLKRGLSIDDAHFHLMFPPPLASSSDKVKENFSANLFSSTRQVRYSLTRTLEEIDL
VLFINGLPFATLELKNPWTGQTARYHGQRQYRLERDITQPLLQFGRCLVHMAVDTDEVYMTTKLAGKSTFFLPFNKGHHF
GAGNPPNPGGHKSAYLWREVFSKESIANIIQHFVRLDGGSKTSLAKRTLFFPRYHQLDVVRKLVAHASERGVGQTYLIQH
SAGSGKSHSITWAAYQLIETYPVSAHVRGAKGLDAPLFDSVIVVTDRRLLDKQLRENIREFSEVKNIIAPALRSSDLKQA
LEQGKKIIITTLQKFPFIIDGIEDMSDKCFAVIIDEAHGSQDGSAHGKMNQAMGRDADDEDDQSDPQDKVLNAMRSRKMR
GNASYFAFTATPKNSTLEKFGQQQPDGSFKPFHLYSMKQAIEEGFILDVLANYTTYKSYYEIQKSIEDNPLFDTAKAQKK
LRAYVERNPQTINAKAEIILEHFIPQVVNAKKLRGRAKGMVITQNIETAIRYHQAIGCILRDRGNPFKALVAFSGAKEVD
GIEYTEAEMNGFPEADTRDKFDKDEYRLLVVANKYLTGFDQPKLTAMYVDKKLQGVLAVQALSRLNRAAPQWGKKTEDLF
VLDFFNTVEDIKAAFDPFYTATSLSQATDVNVLHELKDALGDVGVYEWREVEAFVGLYFDNADAQQLSPLMDTAAARFNQ
ELGLEDEEKADFKIKAKQFVKIYGQMASIMPYEVVAWEKLFWFLKFLIPKLVVKSKDADEIDALLDSVDLSSYGLERVKL
NQAIGLDESETEVDPTNPNPRGVHGGGEEQDPLDEIIQSFNERWFQGWEATPEEQRVKFLSIIKSIQAHPDFEAKYKHNQ
DADNRTLAFEKIFEEVMLERRKMDLDLYRLLASDPAFKSSLQQSLRHLLE
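
The DNA and protein sequences above can be checked against each other by translating the coding sequence. The following is a minimal sketch that assumes the standard codon assignments and reads the + strand in frame from the first base. The helper `translate` and the two test fragments (the first 30 and last 18 bases of the DNA sequence above) are illustrative, not part of the database.

```python
from itertools import product

# Standard codon table in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 18 bases of the DNA sequence in this record.
cds_start = "ATGGTCAGCCAAACCAACGAACAGGCCCTA"
cds_end = "AGGCATTTGCTGGAGTAG"

print(translate(cds_start))
print(translate(cds_end))
```

Joining all DNA sequence lines into one string and passing it to `translate` would allow the full product to be compared with the protein sequence in the same way.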

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
VC1765 | NP_231400.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 0.0 | 70
VC0395_A1363 | YP_001217306.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 0.0 | 70
VPI2_0013c | ACA01830.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 0.0 | 70

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
Nhal_0754 | YP_003526321.1 | hypothetical protein | VFG1098 | Protein | 0.0 | 70