Gene Information

Name : Desgi_1004 (Desgi_1004)
Accession : YP_007944218.1
Strain : Desulfotomaculum gibsoniae DSM 7213
Genome accession: NC_021184
Putative virulence/resistance : Unknown
Product : type I site-specific deoxyribonuclease, HsdR family
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1026603 - 1029836 bp
Length : 3234 bp
Strand : -
Note : PFAM: Type I restriction enzyme R protein N terminus (HSDR_N); Domain of unknown function (DUF3387); Type III restriction enzyme, res subunit; TIGRFAM: type I site-specific deoxyribonuclease, HsdR family
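The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the coordinates are 1-based and inclusive and that the coding sequence includes its stop codon. The function names gene_length and protein_length are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the Position field of this record.
length_bp = gene_length(1026603, 1029836)
print(length_bp)                  # 3234, matching the Length field
print(protein_length(length_bp))  # 1077 residues
```

Under these assumptions the result agrees with the listed length of 3234 bp and with the 1077 residues in the protein sequence of this record.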

DNA sequence :
GTGAAGAATAGAATAACTGAGAATGCAATAGAAGAATTTGCCATTGAGCTGTTGGAGAAAGCTGGCTACCAATACATCTA
CGCCCCGGATATCGCGCCCGACAGTGCTACCCCGGAGAGGCAATCCTTTGAAGAGGTACTGTTGCTGGATTGTCTACAAA
CAGCCGTTGGCAGAATAAACCCTAAAATGCCGGCAGATGTTCGGGAAGATGCCATAAAACAAATACAGCGTTTAAATTCA
CCTGAACTGATAACCAATAACGAAGCGTTTCACCGAATGCTTACCGAAGGGATCAAAGTAAGCTATCAGAAAGACGGAAA
TGAAAGAGGGGATTATGTCTGGCTGATTGATTATAAAAACCCAGACAATAACGATTTTATTGTTACCAATCAATTTACGG
TAATCGAAAAAGGAGTAAATAAGCGCCCGGATATTATTCTTTTTGTAAACGGTTTGCCGTTGGTCGTGATCGAACTGAAA
AATCCCGCGGATGAAAATGCAACAGTAAAATCGGCATACAAACAATTACAAACCTACATACAAGCCATCCCGAACCTCTT
CACCTTCAATGCCATCATGGTTATTTCCGATGGTCTGGAAGCAAAAGCAGGTTCACTTTCTGCCGGTTTAAGCCGCTTTA
TGACATGGAAATCATCTGACGGAAAAATTGAAGCATCTCATCTTATCGGGCAACTGGAAACTCTACTTAAAGGCATGCTT
AATAAGGAAACATTGTTAGACCTGATCCGGCATTTTATCGTATTTGAAAAATCCAAAAAAGAAGATAAGAAAACCGGCAT
CATCACCATCCAAACGGTCAAGAAATTAGCGGCCTATCATCAATATTACGCAGTAAACAGAGCGGTAGAATCCACATTGA
GAGCCGCAGGATATTCTTTCATATCTGGAAAGCATAGTTTAAGTATTGTAATGGAATCACCTGAGAGTTATGGTGTCGCC
GGGGTAAAGCAACAACCGGTCGGAGACCGTAAAGGCGGCGTGGTCTGGCATACGCAGGGAAGCGGAAAATCATTATCGAT
GGTATTTTATACCGGAAAAATTGTACTCGCCATGAATAACCCAACCGTAGTAATTATAACAGACCGAAACGATTTGGACG
ATCAGTTGTTTGATACCTTTGCGTCTTCAAAACAACTTCTCAGACAAGACCCCGTGCAGGCAGAAGACAGAGAACACCTG
AAAGAACTTCTAAAAGTTGCCTCGGGCGGCGTGGTTTTTACAACCATTCAAAAGTTTCAACCAGATGAAGGCAATGTTTT
TGAGCAGCTTTCAGTCAGGGAAAATATCATTGTCATAGCAGATGAAGCGCATAGGACACAATATGGTTTTAAGGCGAAAA
CAATTGATGATAGGGATGAACAAGGAACTGTAATCGGTAAAAAAATTGTCTACGGTTTCGCAAAATATATGCGTGATGCC
CTGCCCAATGCCACCTATCTTGGTTTTACGGGAACGCCGATTGAGAGCACAGATATAAATACCCCGGCTGTATTTGGTAA
TTACATTGATGTCTACGATATAATGCAGGCCGTTGATGATGGAGCAACCGTAAGGATTTATTACGAAAGCAGATTGGCTA
AAATCAATCTCAGTGAAGAAGGTAAAAAGCTGGTTGCTGATTTGGACGAAGAATTGGATAAGGATGGTTTGACAGAAACT
CAAAAAGCTAAGGCGAAATGGACGCAACTGGAAGCTTTGATCGGAAGTGCCGATCGCGTAAAACAGGTTGCCCGGGATAT
TATTAATCATTTTGAACAAAGACAGGAAGTATTTGAAGGCAAGGCAATGATTGTGTCCATGTCCAGAAGGATTGCCGCCG
ACTTGTATGATGAAATAATTAAAATCAAACCACAGTGGCACAGTGCTGATTTAAAAAAAGGTGTCATTAAAGTGGTTATG
ACTTCTAATTCTTCCGATGGGCCGGAAATTTCAAAACATCACACGACGAAAGAACAGAGAAGAGCACTGGCTGACAGGAT
GAAGGACCCGGAAGATGATTTGAAACTGGTCATTGTCCGGGATATGTGGCTTACCGGTTTTGATGTTCCTTCAATGCATA
CCCTGTACATCGATAAGCCTATGAAAGGCCATAACCTGATGCAGGCCATTGCCAGGGTTAACAGGGTATATAAGGACAAG
CCCGGTGGTTTGGTGGTGGATTATCTTGGCATTGCATCAGACCTGAAAAAAGCTCTGGCTTTCTATTCTAACAGTGGAGG
AAAAGGCGACCCTGCAATTAGCCAGGAAAAAGCAGTGCAATTTATGTTGGAAAAAATAGAAGTTGTTGCGCAGATGTACC
ATGGGTTTGCTTATGAGAATTATTTTGATGCGGATACATCAACAAAATTATCGATTATCCTGGCAGCAGAGGAGCATATT
CTGGGCCTCGTAAACGGTAAAAAAAGATATATTGATGAGGTAACGGCCTTATCCAAAGCTTTTGCCATTGCTATTCCACA
TGAACAGGCAATGGATGTGAAAGATGAAGTTTCTTTTTTTCAAGCTGTTAAGTCCAGATTGGCAAAGTTCGACAGCACCG
GCGCAGGTAAAACAAACGAAGAAATGGAAACCGCCATCAGGCAGGTTATTGATAAAGCCCTGATAACCGAGCAAGTGATC
GATGTTTTTGATGCGGCGGGTATTAAAAAACCGGATATTTCCATACTTTCAGAAGAATTTCTTTTGGAAGTTAAAAATAT
GGAGCATAAAAATGTTGCTCTTGAAGTCCTTAAGAAATTACTCAATGATGAGATAAAATCAAGAACTAAAAAGAACCTGA
TTCAGAGCAAAGCTTTGATGGAAATGTTGGAGAATTCAATTAAAAAATATCACAACAAGATCTTAACAGCGGCAGAAGTT
ATCGAAGAATTGATTGCACTTGGTAAAGATATTCAAAAAATGGATAAAGAGCCTCAAGAAATGGGTTTGTCGGAATATGA
ATATGCTTTTTATACAGCCATTGCCAACAATGAAAGCGCCAGGGAATTAATGCAAAAAGATAAATTGAGGGAACTGGCTG
TTGTATTATTTGAAAAAGTGAAAGAAAATGCATCAATAGACTGGACAATAAAAGAAAGCGTAAAAGCAAAATTAAAAGTA
ATCGTAAAGCGCACTTTAAGGAAATATGGTTATCCGCCGGATATGCAGAAACTTGCAACAGAAACAGTATTGAAACAGGC
TGAACTGATTGCAGAGGAATTAACTCATAAATAG

Protein sequence :
MKNRITENAIEEFAIELLEKAGYQYIYAPDIAPDSATPERQSFEEVLLLDCLQTAVGRINPKMPADVREDAIKQIQRLNS
PELITNNEAFHRMLTEGIKVSYQKDGNERGDYVWLIDYKNPDNNDFIVTNQFTVIEKGVNKRPDIILFVNGLPLVVIELK
NPADENATVKSAYKQLQTYIQAIPNLFTFNAIMVISDGLEAKAGSLSAGLSRFMTWKSSDGKIEASHLIGQLETLLKGML
NKETLLDLIRHFIVFEKSKKEDKKTGIITIQTVKKLAAYHQYYAVNRAVESTLRAAGYSFISGKHSLSIVMESPESYGVA
GVKQQPVGDRKGGVVWHTQGSGKSLSMVFYTGKIVLAMNNPTVVIITDRNDLDDQLFDTFASSKQLLRQDPVQAEDREHL
KELLKVASGGVVFTTIQKFQPDEGNVFEQLSVRENIIVIADEAHRTQYGFKAKTIDDRDEQGTVIGKKIVYGFAKYMRDA
LPNATYLGFTGTPIESTDINTPAVFGNYIDVYDIMQAVDDGATVRIYYESRLAKINLSEEGKKLVADLDEELDKDGLTET
QKAKAKWTQLEALIGSADRVKQVARDIINHFEQRQEVFEGKAMIVSMSRRIAADLYDEIIKIKPQWHSADLKKGVIKVVM
TSNSSDGPEISKHHTTKEQRRALADRMKDPEDDLKLVIVRDMWLTGFDVPSMHTLYIDKPMKGHNLMQAIARVNRVYKDK
PGGLVVDYLGIASDLKKALAFYSNSGGKGDPAISQEKAVQFMLEKIEVVAQMYHGFAYENYFDADTSTKLSIILAAEEHI
LGLVNGKKRYIDEVTALSKAFAIAIPHEQAMDVKDEVSFFQAVKSRLAKFDSTGAGKTNEEMETAIRQVIDKALITEQVI
DVFDAAGIKKPDISILSEEFLLEVKNMEHKNVALEVLKKLLNDEIKSRTKKNLIQSKALMEMLENSIKKYHNKILTAAEV
IEELIALGKDIQKMDKEPQEMGLSEYEYAFYTAIANNESARELMQKDKLRELAVVLFEKVKENASIDWTIKESVKAKLKV
IVKRTLRKYGYPPDMQKLATETVLKQAELIAEELTHK
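
The relationship between the two sequences above can be sketched in code. The DNA sequence begins with GTG, while the protein begins with M: in bacteria an alternative start codon such as GTG is read as methionine when it initiates translation. The example below is a minimal sketch, not the method used by the database. It assumes the standard codon assignments (which also apply to the bacterial genetic code) and treats the first codon as an initiator. The names CODON_TABLE and translate_cds are hypothetical.

```python
from itertools import product

# Standard codon table built in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; the first codon is read as Met (assumption)."""
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":   # stop codon ends translation
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First 30 nt and last 9 nt of the DNA sequence in this record.
print(translate_cds("GTGAAGAATAGAATAACTGAGAATGCAATA"))  # MKNRITENAI
print(translate_cds("ATGCATAAATAG"))  # MHK (ATG added here only to start the frame)
```

The first call reproduces the opening residues of the protein sequence (MKNRITENAI), and the final codons CAT AAA TAG give the closing H and K followed by the stop.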

Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
hsdR | BAD24840.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-V SCCmec | Protein | 0.0 | 49
hsdR | BAH57699.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-VII SCCmec | Protein | 0.0 | 49
hsdR | YP_251977.1 | type I restriction-modification system restriction subunit | Not tested | SCCmec | Protein | 0.0 | 49
api52 | CAF28526.1 | hsdr-like Type I restriction enzyme | Not tested | YAPI | Protein | 0.0 | 44