Gene Information

Name : GWCH70_1642 (GWCH70_1642)
Accession : YP_002949695.1
Strain : Geobacillus sp. WCH70
Genome accession: NC_012793
Putative virulence/resistance : Unknown
Product : HsdR family type I site-specific deoxyribonuclease
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 1718140 - 1721151 bp
Length : 3012 bp
Strand : -
Note : KEGG: fma:FMG_1372 type I restriction-modification system restriction subunit; TIGRFAM: type I site-specific deoxyribonuclease, HsdR family; PFAM: protein of unknown function DUF450; type III restriction protein res subunit; SMART: DEAD-like helicases
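
The Position, Length and protein sequence fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the listed length includes the terminal stop codon; the function names are hypothetical and are not part of the database.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(1718140, 1721151)
print(length_bp)                           # 3012, matching the Length field
print(expected_protein_length(length_bp))  # 1003 residues
```

Under these assumptions the result agrees with the listed length of 3012 bp and with the 1003 residues in the protein sequence of this record.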

DNA sequence :
ATGTTTACAGAAGAGCAATTAGAAAATGTCGTGATTGAGTATTTTCAAGAGCTTGGATATAACTATCTACCCGCAAGTGA
GTTAAAGCGGGATGAGAAAGAGGTTTTGCTGTTTGACCGTTTGGAAGCAGCGTTGGTGAGACTGAATCCGAGTTTGTCTT
TGGATGTCATCCGTGAAGCAATTCGGAAAATCCGTCATTTTGAAACAAACGATGTGTTTACGAACAATAAAGTGTTTCAT
AAGTATTTGACGGAAACGGTGGAAGTGGCGGAGTTTGTGAATGGGGAAACGGTTTATCACCGCGTTCGGCTCATCGATTG
GGAAGTACCTGAAAATAATGATTTTCTTGTTGTCAATCAATTAGAGGTCGTTGAAAAAGGCCAGAAGAAAATCCCTGACA
TTGTGCTCTATGTAAACGGAATCCCGCTTGTCGTGTTCGAGTTAAAAAGTACGTCACGTGAAGAAGTCGATATTGAGGAT
GCGTACAAACAATTGAAAAATTATATGAACGTCCACATTCCTTCCTTATTCTATTACAATGCGTTTCTTGTGATTAGTGA
TGGGGTGAAAGCCCGAGCTGGAACAATCACAGCGCCGCTTGATCGTTTTTTGGCATGGAAAAAGATTCATATCGAAGACG
AGGTTGTTGAAAATCGTGAATTAGAAACATTGATGTACGGATTATTCAATCAAAAACGCTTTTTGGATGTCATAAAAAAC
TTCACGTTGTTTACGAATGAAGCAAAAATTATGGCTGCGTATCACCAATATTATGGAATGAAAAAAGCTATCGAGTCTAC
AATACGGGCAGTTGGGAAAGATGGACGCGCCGGGGTTATTTGGCATACGCAAGGAAGCGGCAAAAGTTATTCCATGGTAT
TCCTTGCTGGAAACTTAGTGAAACGGGAAGAATTGAAAAACCCAACGATTGTCGTCATTACCGACCGGAATGATTTAGAC
GGACAGCTATTCGAAACGTTCTCCGGAGCAAGTGAATTTTTGCGGCAAACACCACAACAGGCGGAAACGCGGAGTCATAT
AAAAGAGCTATTGGAAAATCGCCAAACCGGTGGAATTATTTTTTCAACGATTCAAAAATTTGAAGAAGAAACCGGCTTGC
TTTCTGATCGGGAAAATATCATTGTCATGGTCGATGAAGCTCACCGCTCGCAATACGGTGTCGATCCGAAATATGATATT
GTGACCGGTGAACAAAAGTACGGCTATGCGAAATATTTGCGGGAAGCGTTGCCGAATGCGACGTATATTGCGTTTACTGG
CACACCGATTGAAACAACGGATAAATCGACGACTGGATTGTTCGGTGATGTCATTGATGTGTATGATATGACACAAGCGG
TTCAAGACGGGGCAACCGTGAAAATTTATTATGAATCCCGCTTGGCGAAAGTAAAACTAGACGAGAAAAAAATGAATGAA
ATTGATCAAGAATATTGGAATATGCAAGTCAACGAAGGTGTTGACGACTATATCGTCGAACAAAGCCAGAAAAGCTTAAG
CCGCATGGAGCAAATTATCGGCGATAAAGACCGAATTAGAGAAGTCGTAGCCGACATTATTAGTCATTACGAGGAGCGCG
AAAATCTTGTTGCTGGGAAAGCGATGATTGTTGCCTATTCGCGAAAAACAGCGTTCGCAATGTATAAAGAAATCATGAGA
CAACGCCCCGATTGGAAGGAAAAAGTGAAAATTGTCATGACGGAAAACAACCAAGATCCGGAAGAATTAGCGAAGCTTGT
TGGGAATAAACAAACGCGGAAGCAGCGGGAGAAAGAATTTAAAGATGTCAATCATCCGTTCAAAATCGTGATTGTCGTTG
ATATGTGGCTAACTGGTTTCGACGTTCCAGCGCTTGATACGATGTATATCGATAAGCCGATGAAGGCGCATAACTTGATG
CAAGCCATCGCTCGCGTCAATCGTGTCTATCCGGGCAAGACGGGCGGATTGATTGTCGACTATATCGGTTTAAAGAGAGA
CTTAATGGAAGCACTCAAAACGTATACAAAGCGTGACCAAGATAAAGTGCAGGAAAACGAGCAAGCTCGCGATATCGCGC
TAAATATTCTTGAAGTTCTTCGCAATATGTTTCACGAGTTTGACTACAGTGCGTTTTTCGGGGACAGCGACAAAAAACGT
TATGAAGTCATCCGGGATGGTGCGGAATTTGTTCAACAAACGGAAAAAAGAAAATCACTGTTTATGACAGAAACGAAGAA
GTTAAAGGATGTTTATAAAATTTGTACCGGTTTGCTTTCCAAAGAGCAAAAAGAGGAAATTTCCTACTTTATTGCCGTTC
GTTCTTTTATCATGAAATCTTCGCGAAAAGGAGCACCGGATTTAAAAGAAGTAAATGAACGAATCTCAAAAATGTTAGAA
GAGGCCATTTTAGAAGATGAAGTGATGGTATTAACGCAGGCTTCTTCATCCGAGAGTTTTGACTTGTTAAACGAAGAGAA
TATAAAGAAACTTCGCGCGCTGCCGCAAAAGAATATCGCCGCTAATATTCTCATGCGCGTACTAAAGGAAAAACTGCAAG
ATGTGAAAAAGAAAAACATGACCGTCAGCCAAACGTTTTCCAAACGCTTTGAAAAAATATTAGAAAAATACAACAACCGT
AACGATTATACGGATGTATACGAAGTATTTGAAGAACTCATTAAATTTAAAGAAGAATTGGAAGCAGCGATTCAAGCAGG
AAAACAACTTGGTTTAACGGATGAGGAAAAAGCATTTTTTGATGTGTTAGGCTCAGATCCGGATATAAAAAAATTAATGG
AAGATGAAATTTTAATCAAAATCGCGAAAGAGCTGGCGAAAACAGTGAAAGAAAATCGCACGCACGATTGGGATAAAAAA
GAACAAGCCCAAGCACGCATGCGCCTGCAGATAAAAAAAGTTCTACGTAAATATGATTATCCTCCAAATAAACAGCCAAA
AGCAGTGGAAGATGTATTAATGCAAGCGAAGTTACAGTGTCAGAATATGTGA

Protein sequence :
MFTEEQLENVVIEYFQELGYNYLPASELKRDEKEVLLFDRLEAALVRLNPSLSLDVIREAIRKIRHFETNDVFTNNKVFH
KYLTETVEVAEFVNGETVYHRVRLIDWEVPENNDFLVVNQLEVVEKGQKKIPDIVLYVNGIPLVVFELKSTSREEVDIED
AYKQLKNYMNVHIPSLFYYNAFLVISDGVKARAGTITAPLDRFLAWKKIHIEDEVVENRELETLMYGLFNQKRFLDVIKN
FTLFTNEAKIMAAYHQYYGMKKAIESTIRAVGKDGRAGVIWHTQGSGKSYSMVFLAGNLVKREELKNPTIVVITDRNDLD
GQLFETFSGASEFLRQTPQQAETRSHIKELLENRQTGGIIFSTIQKFEEETGLLSDRENIIVMVDEAHRSQYGVDPKYDI
VTGEQKYGYAKYLREALPNATYIAFTGTPIETTDKSTTGLFGDVIDVYDMTQAVQDGATVKIYYESRLAKVKLDEKKMNE
IDQEYWNMQVNEGVDDYIVEQSQKSLSRMEQIIGDKDRIREVVADIISHYEERENLVAGKAMIVAYSRKTAFAMYKEIMR
QRPDWKEKVKIVMTENNQDPEELAKLVGNKQTRKQREKEFKDVNHPFKIVIVVDMWLTGFDVPALDTMYIDKPMKAHNLM
QAIARVNRVYPGKTGGLIVDYIGLKRDLMEALKTYTKRDQDKVQENEQARDIALNILEVLRNMFHEFDYSAFFGDSDKKR
YEVIRDGAEFVQQTEKRKSLFMTETKKLKDVYKICTGLLSKEQKEEISYFIAVRSFIMKSSRKGAPDLKEVNERISKMLE
EAILEDEVMVLTQASSSESFDLLNEENIKKLRALPQKNIAANILMRVLKEKLQDVKKKNMTVSQTFSKRFEKILEKYNNR
NDYTDVYEVFEELIKFKEELEAAIQAGKQLGLTDEEKAFFDVLGSDPDIKKLMEDEILIKIAKELAKTVKENRTHDWDKK
EQAQARMRLQIKKVLRKYDYPPNKQPKAVEDVLMQAKLQCQNM
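
The protein sequence can be reproduced from the DNA sequence by codon-wise translation. The sketch below is an illustration only, assuming the standard genetic code (whose codon-to-amino-acid assignments are shared with the bacterial table) and assuming the DNA is listed in coding orientation, which its ATG start and TGA end suggest. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 nucleotides of the DNA sequence in this record.
print(translate("ATGTTTACAGAAGAGCAATTAGAAAATGTC"))  # MFTEEQLENV
print(translate("CAGAATATGTGA"))                    # QNM
```

Both fragments match the start (MFTEEQLENV) and end (QNM) of the protein sequence listed above.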

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
hsdR | BAD24840.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-V SCCmec | Protein | 0.0 | 43
hsdR | YP_251977.1 | type I restriction-modification system restriction subunit | Not tested | SCCmec | Protein | 0.0 | 42
hsdR | BAH57699.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-VII SCCmec | Protein | 0.0 | 42
api52 | CAF28526.1 | hsdr-like Type I restriction enzyme | Not tested | YAPI | Protein | 5e-174 | 41