Gene Information

Name : GYMC52_2377 (GYMC52_2377)
Accession : YP_004132907.1
Strain : Geobacillus sp. Y412MC52
Genome accession : NC_014915
Putative virulence/resistance : Unknown
Product : HsdR family type I site-specific deoxyribonuclease
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 2441753 - 2444764 bp
Length : 3012 bp
Strand : -
Note : TIGRFAM: type I site-specific deoxyribonuclease, HsdR family; PFAM: protein of unknown function DUF450; type III restriction protein res subunit; helicase; KEGG: gyc:GYMC61_0288 type I site-specific deoxyribonuclease, HsdR family; SMART: DEAD-like helicas
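The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so, but the assumption reproduces the listed Length) and that the final codon is a stop codon. The function name feature_length is illustrative and not part of the database.

```python
# Coordinates copied from the "Position" field of this record.
# Assumption: 1-based, inclusive coordinates on the genome NC_014915.
START, END = 2441753, 2444764


def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates."""
    return end - start + 1


length_bp = feature_length(START, END)

# A complete coding sequence should be a whole number of codons.
codons, remainder = divmod(length_bp, 3)

# Assumption: the last codon is a stop codon, so it adds no residue.
residues = codons - 1

print(length_bp, codons, remainder, residues)
```

Under these assumptions the computed length matches the Length field, and the residue count can be compared with the length of the protein sequence given in the record.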

DNA sequence :
GTGTTTACAGAAGAGCAGCTAGAGCAAGCCGTCATTGAATATTTTCAAGAGCTCGGCTACCCATATATGCCAGCGAAGGA
GTTAAAGCGAGATAAAAAGGACGTTTTATTGCTTGACCGTTTGGAAGAAGCGTTAGTGAAATTGAATCCAGAAGTGCCTG
TTGAGATCATTCGCGAGGTGATGCGCAAAATCCACTATTTTGAAACGAATGACGTGCTTACAAACAACAAAATGTTTCAT
AAGTACTTAACAGAAGCGGTAGTAGTGCCTGAGCTTGTGAATGGAGAGACGGTTTACCATCATGTTCGACTCATCGATTG
GGAAACGCCAGAAAACAACGATTTTCTCGTTGTCAACCAATTGGAAGTCATTGAAAAAGGACAGGAGAAAATTCCAGACA
TCGTGCTGTATGTCAACGGTCTTCCGCTTGTTATTGCAGAGTTAAAAAGCACGTCACGCGAAGAAGTCGATATTGAAGAT
GCATACAAGCAGCTAAAAAATTATATGAACGTCCACATCCCTTCATTGTTCTACTACAATGCGTTTCTTGTCATTAGTGA
CGGGGTTCAGGCGCGCGCGGGAACGATCACCGCGCCGCTTGACCGGTTTATGGCTTGGAAAAAGATCAATATCGAAGATG
ATGTCATAGAAAACCGGGAGTTAGAAACGTTAATATTCGGCTTGTTCGAGCCAAAACGCTTTTTGGATGTCATCAAAAAC
TTTACGCTATTTGCAAACGAAGCAAAAATAATGGCGGCGTATCACCAATATTACGGCATGAAGAAAGCAGTGGCTTCTAC
GATCCAAGCCATTCATACAGATAAACGTGCAGGAGTCATTTGGCATACGCAAGGAAGTGGCAAAAGTTATTCGATGGTGT
TTCTTGCGGGGAACTTAGTCAGACAGGAACAGCTGAAAAATCCAACCATTGTCGTGATCACCGATCGCAACGATTTAGAC
GGACAGTTATTTGAAACGTTTTGCGGGGCGAGTGAATTTTTACGGCAAACACCATTACAAGCGGAAACGCGCAGCCATTT
GAAAGAGTTGTTGGAACATCGGCAAACGGGCGGCATCGTTTTTTCAACGATTCAAAAGTTTGAAGAAGAAACCGGCTTGC
TTTCTGAACGGGAAAACATCATCGTGATGGTTGACGAAGCCCACCGCTCCCAATACGGCGTTGATCCGAAATACGATATC
GAAACGGGGGAGCAAAAGTACGGGTATGCGAAATATTTGCGTGAAGCTTTGCCGAATGCGACGTATATCGCATTTACAGG
GACGCCGATTGAAACGACCGATCGATCGACGACCGGCTTGTTCGGCGATGTCATTGATGTGTATGATATGACCCAAGCGG
TGGAAGACGGGGCGACGGTCAAAATTTATTACGAATCTCGCTTGGCGAAAGTGAAATTGGACGAGAAGAAAATGAATGAA
ATTGACCAAGAATATTGGCGCATGCAAGTTCACGAAGGCGTCGGCGACTATATTATTGAACAAAGCCAAAAAAGCTTGAG
CCGCATGGAGCAAATCATAGGCGACCAAGACCGAATTCGCGAAGTCGTCACGGACATTATCCACCATTACGAGGAGCGCG
AACATCTTGTCAAAGGAAAAGCGATGATCATTGCTTATTCGCGCAACACCGCGTTTGCAATGTATAAAGAGATCATGCGT
CAACGTCCGGACTGGAAAGACAAAGTGAAAATTGTCATGACCGAAAACAATCAGGATCCGGAGGAGCTCGCGAAGCTTGT
CGGAAATAAACAAACGCGGAAACAGCGGGAAAAAGAATTTAAAGATGTCGATCATCCGTTTAAAATCGTCATTGTTGTTG
ATATGTGGCTCACCGGTTTTGACGTGCCGGCGCTCGATACAATGTATATTGACAAGCCCATGAAGGCGCATAACTTGATG
CAAGCGATCGCCCGCGTCAACCGCGTTTATCCGGGGAAAACAGGCGGATTGATTGTTGATTACATCGGCTTAAAGAAAAA
TTTAATGGAAGCGTTGCAAACGTATACGAAGCGCGACCAAGATAAAGTGCAAGAAAATACCCAAGCCCGCGACATCGCGT
TAAACATCCTCGAAGTGTTGCGCAATATGTTCCATTCGTTTGATTATCGCGCCTTTTTCGGTGATAGCGACAAAAAGCGT
TATGAAGTCATTCGCGACGGAGCAGAATTTGTGCAGCAAACGGAAAAAAGAAAATTGCTGTTTATGACGGAAACGAAAAA
GCTGAAAGATGTTTATAAAATTTGCACCGGCTTGCTTTCGAAAGAACAAAAAGAGGAAATTTCCTATTTTATCGCTGTTC
GTTCCTTTATTATGAAATCTTCGCGAACAGGCACACCTGACTTAAAAGAAGTGAATGAACGAATCGCGAAAATGTTGGAA
GAAGCGATTTTGGAAGATGAAGTGATGGTGTTGACCCAAGCGGTTTCATCGGAAAGTTTTGATTTGTTGAATGAGGAGAA
CATCAAAAAATTACGCGCCTTGCCGCAAAAAAATATTGCGTCGACCATTTTAATGCGCGTATTAAAGCAAAAATTGCAAG
ATGTGAAAAAGACAAACATGACGGTGAGCCAAACATTTTCCAAACGTTTTGAAAAAATATTAGAAAAATACAACAATCGA
AATGATTACACGGATGTGTATGAAGTATTTGAGGAATTGCTTAAATTTAAAGAAGAGTTGCAGGCGGCGATTGAAGAAGG
GAAACAGCTTGGCTTAACCGAGGAGGAGAAGGCGTTTTTCGACGTGTTAGGTTCTGACCCGGATATAAAAAAATTAATGG
AAGATGAGGTATTAATTCAAATCGCAAAGGATTTGGCGAAAACGGTAAAGGAAAACCGGACGCACGATTGGGATAAAAAA
GCGCAAGCCCAAGCGCGCATGCGCCTTGAAATTAAGAAGGTGCTGCGCAAGTACGATTATCCGCCAAATAAACAGCCGAA
AGCGGTGGAAGATGTGCTTGAGCAGGCGAAGCTGCAGTGCATGAATATGTAA

Protein sequence :
MFTEEQLEQAVIEYFQELGYPYMPAKELKRDKKDVLLLDRLEEALVKLNPEVPVEIIREVMRKIHYFETNDVLTNNKMFH
KYLTEAVVVPELVNGETVYHHVRLIDWETPENNDFLVVNQLEVIEKGQEKIPDIVLYVNGLPLVIAELKSTSREEVDIED
AYKQLKNYMNVHIPSLFYYNAFLVISDGVQARAGTITAPLDRFMAWKKINIEDDVIENRELETLIFGLFEPKRFLDVIKN
FTLFANEAKIMAAYHQYYGMKKAVASTIQAIHTDKRAGVIWHTQGSGKSYSMVFLAGNLVRQEQLKNPTIVVITDRNDLD
GQLFETFCGASEFLRQTPLQAETRSHLKELLEHRQTGGIVFSTIQKFEEETGLLSERENIIVMVDEAHRSQYGVDPKYDI
ETGEQKYGYAKYLREALPNATYIAFTGTPIETTDRSTTGLFGDVIDVYDMTQAVEDGATVKIYYESRLAKVKLDEKKMNE
IDQEYWRMQVHEGVGDYIIEQSQKSLSRMEQIIGDQDRIREVVTDIIHHYEEREHLVKGKAMIIAYSRNTAFAMYKEIMR
QRPDWKDKVKIVMTENNQDPEELAKLVGNKQTRKQREKEFKDVDHPFKIVIVVDMWLTGFDVPALDTMYIDKPMKAHNLM
QAIARVNRVYPGKTGGLIVDYIGLKKNLMEALQTYTKRDQDKVQENTQARDIALNILEVLRNMFHSFDYRAFFGDSDKKR
YEVIRDGAEFVQQTEKRKLLFMTETKKLKDVYKICTGLLSKEQKEEISYFIAVRSFIMKSSRTGTPDLKEVNERIAKMLE
EAILEDEVMVLTQAVSSESFDLLNEENIKKLRALPQKNIASTILMRVLKQKLQDVKKTNMTVSQTFSKRFEKILEKYNNR
NDYTDVYEVFEELLKFKEELQAAIEEGKQLGLTEEEKAFFDVLGSDPDIKKLMEDEVLIQIAKDLAKTVKENRTHDWDKK
AQAQARMRLEIKKVLRKYDYPPNKQPKAVEDVLEQAKLQCMNM
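
The DNA sequence begins with GTG while the protein sequence begins with M. This is consistent with GTG acting as an alternative start codon, which is read as methionine at the initiation position in bacteria. The sketch below is a minimal, dependency-free illustration using the standard codon table; the function name translate_cds and the set of accepted start codons (ATG, GTG, TTG) are assumptions made for this example and are not taken from the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: start codons commonly accepted for bacterial genes.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    If the first codon is an accepted start codon it is read as M,
    whatever amino acid it would encode at an internal position.
    """
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 18 nt of the DNA sequence in this record.
head = "GTGTTTACAGAAGAGCAGCTAGAGCAAGCC"
tail = "CAGTGCATGAATATGTAA"
print(translate_cds(head))
print(translate_cds(tail))
```

The two printed fragments can be compared with the first ten and last five residues of the protein sequence in the record.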

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
hsdR | BAD24840.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-V SCCmec | Protein | 0.0 | 42
hsdR | YP_251977.1 | type I restriction-modification system restriction subunit | Not tested | SCCmec | Protein | 0.0 | 42
hsdR | BAH57699.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-VII SCCmec | Protein | 0.0 | 42