Gene Information

Name : Nhal_0892 (Nhal_0892)
Accession : YP_003526454.1
Strain : Nitrosococcus halophilus Nc 4
Genome accession: NC_013960
Putative virulence/resistance : Unknown
Product : type I site-specific deoxyribonuclease, HsdR family
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : 3.1.21.3
Position : 902096 - 905299 bp
Length : 3204 bp
Strand : -
Note : KEGG: sat:SYN_00497 type I restriction-modification system restriction subunit; TIGRFAM: type I site-specific deoxyribonuclease, HsdR family; PFAM: protein of unknown function DUF450; type III restriction protein res subunit; SMART: DEAD-like helicase

DNA sequence :
GTGTCGGACAAGTTGACGGAATCGGCCATTGAGAACCTGGCTATTGAGCTGCTGGCCAATCAGGGCTACCAATACCTCCA
CGGTGCCGATCTGGCCCCTGATGCGCCCAACCCTGGGCGCCCCTCGTTCGGCGATGAGTTCCTAGTGGGCCGGTTGCAGG
ATGCGGTCGCACGGCTAAATCCGACGATCCCGCCGGACGCTCAGGAGCAGGCAATCAAGGAAATCTTGCGCTTGGCCGTG
GAGACCGGCTCGACTAGGGAGCTGTTGGCGGCTAATGAAGCTTTTCATAGGCTCCTGACCAATGGCGTGGAGGTGGAGTT
TCAGCACGAAGGGCGGACCAAGGGCGACAAGGTCTGGCTAGTGGACTTCGCCAATCCGGCGGCGAACGATTGCCTGGTAG
TGAATCAGTTCACCGTCACCACCAAGTCCGGCCACGGGCACGTCAACAAGCGGCCCGACCTGGTGCTGTTCATCAACGGC
CTGCCCCTGGTGGTGGTCGAGTTGAAAAATGCCGCCAGCGAGAATGCCACCGTACGTTCTGCCTATGAACAACTCCAGAC
CTACAAGCATGCTATCCCCAGTCTGTTCTTGGCCAATGGGCTGTTGGTCGCTTCCGACGGGCTGGAGGCGCGCATGGGGT
CACTTTCCGCTGGCTTCAGCCGCTTCATGGCCTGGAAGACGACGGATGGAAAGAAGGAGGCGTCGCACCGGGTCGGGCAG
TTGGAGACTCTGATTCAGGATGTATTGAAGCCCGCCACCTTACTGGAGTTGATCCGCCACTTTACGGTGTTCGAGAAGTC
CCGTCGGGAAGACCCCAAGACGGGGCTGACAACGGTGGAGACGGTGAAGAAGACTGCCGCTTATCACCAGTTCTACGCGG
TGAAGAAGGCGGTGGAATCGACCCTGCACGCTACCGGTACGGACGGCAGCCGCAAGGGCGGGGTGGTGTGGCATACGCAG
GGCAGCGGCAAGTCGCTCTCCATGGTCTTCTATGCGGGCAAGCTGGTGCTGGCATTGGACAACCCGACCATCGTGGTGCT
GACTGATCGCAACGATCTGGACGACCAGCTCTTCGATACCTTCGCCGCCAGTCGCCAATTACTGCGTCAGGAGCCAGTGC
AGGCAGAAAATCGGGACGACCTGCGCGACAAGCTCAAGGTGGCCTCGGGGGGAGTTATCTTCACCACCATGCAGAAGTTC
TCGCCGAAGAATGGCGAGGCGATCTATCCGCTACTTTCCGACCGCCGCAATATCGTGGTGATCGCCGACGAGGCCCACCG
CAGCCAGTATGGGTTCAAGGCCAAGGAGGTGGACATCAAGGATGCGGATGACAACATCGTCGGCAAGAAGACCGCTTACG
GCTTCGCCAAGTATCTGCGCGACGCATTGCCCAATGCCACCTTTATTGGTTTTACCGGCACCCCGGTGGAACTGGACGAC
AAGAATACTCCGGCCGTTTTCGGCGATTATGTGGACGTCTATGACATCGCTCAGGCAGTGGAGGATGGCGCGACAGTACG
CATCTACTATGAGAGCCGATTGGCCAAAGTACGGTTAAAGGAAAAAGAGAAAGAGGCTCTGGATCAGCGTTTCGACCAGG
TCATGGAAGATGCTGCAGATTATGAAACGGGGTTCAGCGATGAGGACGAGTTGAGCGAGAAAGCCAAGGCTAAATGGACA
CAGCTAGAGGCCATTGTGGGCAATCGGCAACGGGTGGAGAACGTGGCCCGCGATCTGGTGGCGCACTTTGAAGAGCGGCA
GAAGATATTCGACGGCAAAGGACTAATCATCGCCATGAGCAGGCGCATCTGCGTCGAACTCTACGACGCCATCGTTACGC
TACGCCTTGGGTGGCATAGCGACGACGATGCCCAGGGTGCCATCAAGGTGATCATGACCGGTTCCAGTGCCGACCCGCAG
GCGATGCAGCCGCACATCCGCAGCAAGGAGGCGCGGAAGGCGATTGGGGAGCGTTTGAAGGACCCGGGCGATCCGCTGAA
GCTGGTTATTGTGCGGGATATGTGGCTGACCGGCTTCGATGCGCCCTGCCTGCATACCCTCTACGTGGACAAGCCGATGA
AGGGACACAACCTGATGCAGGCCATCGCGCGCGTGAACCGGGTCTATAAGGACAAGCCGGGGGGGCTGGTGGTGGACTAC
ATCGGGATCGCCTCCGACCTGAAGCGGGTGCTAGCGATCTATACCGAGAGCGGCGGCAAGGGCCAGCCTACGCTGGATAT
TGATGATGCCGTCCGCGCCATGCAGGAAAAATTCGAGGTCGTCCAACAGATGCTGGCGGGATTTGACTACCGCCGCTACT
TCGGGGCAAACACCTCAGGCAAGCTGACGATCATCCTGGAGGCAGAGGAGTACGTTCTTGCTCTAGAGGACGGCAAGGCG
CGTTTCTCGAAGGAGGTCGACTTGCTGGCGAAGGCGTTTGCGCTCTCCGTGCCGGATGAACGGGCCATGGCCATCAAAGA
CGAGCTCGCCTTTTTCCAGGCGGTGAAGGCGCGGCTGGCCAAGTTTGAGCGCGGCGAAGGCAAGAGCAAGGAAGAGCTGG
ATTCGGCTATTCGGCAGTTGGTAGACGAGGCGGTCGTCTCCGATCAGGTGGTGGACATCTTCGATGCGGCGGGGATCAAA
AAGCCGGATATTTCCATTCTCTCCGATGAGTTCATGGCGGAAATCAAAGGGATGCAGCATCGGAATCTGGCGCTGGAACT
GCTGAGGAAAATTCTCAACGATGAGATTCGCGTTCGCTCCAAAAAGAATCTGGTGCAGAGCAGGGCATTGTCCGAAATGC
TGGAGAGTGCCATCAAGCGCTATCAGAACAACCTGCTGTCGGCCGCCGGGATCATCGAAGAGCTGATCGAACTGGCCAGG
GAGATCAAGGAGGCGGATCGACGAGGCGAAAAACTGGGGCTAACGGAGGATGAGCTGGCCTTCTATGACGCCCTTGAAGT
GAATGACAGCGCAGTACAGGTGCTTGGCGATAACCAGTTGCGCGAGATCGCACGGAAGTTGGTGGAGAAAGTAAAGCAGA
ACGCTACCATCGACTGGACGGTAAAGGAGAGCGTACGCTCCAAGTTGAAGGTGATCGTCAAGCGGATTCTACGCAAGTAT
GGCTACCCGCCAGACAAGCAGGCGACCGCAACCGAGACAATTCTGAAGCAGGCGGAGTTGTTGGCCGAGAGCTGGGCTGT
ATAA

Protein sequence :
MSDKLTESAIENLAIELLANQGYQYLHGADLAPDAPNPGRPSFGDEFLVGRLQDAVARLNPTIPPDAQEQAIKEILRLAV
ETGSTRELLAANEAFHRLLTNGVEVEFQHEGRTKGDKVWLVDFANPAANDCLVVNQFTVTTKSGHGHVNKRPDLVLFING
LPLVVVELKNAASENATVRSAYEQLQTYKHAIPSLFLANGLLVASDGLEARMGSLSAGFSRFMAWKTTDGKKEASHRVGQ
LETLIQDVLKPATLLELIRHFTVFEKSRREDPKTGLTTVETVKKTAAYHQFYAVKKAVESTLHATGTDGSRKGGVVWHTQ
GSGKSLSMVFYAGKLVLALDNPTIVVLTDRNDLDDQLFDTFAASRQLLRQEPVQAENRDDLRDKLKVASGGVIFTTMQKF
SPKNGEAIYPLLSDRRNIVVIADEAHRSQYGFKAKEVDIKDADDNIVGKKTAYGFAKYLRDALPNATFIGFTGTPVELDD
KNTPAVFGDYVDVYDIAQAVEDGATVRIYYESRLAKVRLKEKEKEALDQRFDQVMEDAADYETGFSDEDELSEKAKAKWT
QLEAIVGNRQRVENVARDLVAHFEERQKIFDGKGLIIAMSRRICVELYDAIVTLRLGWHSDDDAQGAIKVIMTGSSADPQ
AMQPHIRSKEARKAIGERLKDPGDPLKLVIVRDMWLTGFDAPCLHTLYVDKPMKGHNLMQAIARVNRVYKDKPGGLVVDY
IGIASDLKRVLAIYTESGGKGQPTLDIDDAVRAMQEKFEVVQQMLAGFDYRRYFGANTSGKLTIILEAEEYVLALEDGKA
RFSKEVDLLAKAFALSVPDERAMAIKDELAFFQAVKARLAKFERGEGKSKEELDSAIRQLVDEAVVSDQVVDIFDAAGIK
KPDISILSDEFMAEIKGMQHRNLALELLRKILNDEIRVRSKKNLVQSRALSEMLESAIKRYQNNLLSAAGIIEELIELAR
EIKEADRRGEKLGLTEDELAFYDALEVNDSAVQVLGDNQLREIARKLVEKVKQNATIDWTVKESVRSKLKVIVKRILRKY
GYPPDKQATATETILKQAELLAESWAV

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 48
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 48
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 48
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 47