Gene Information

Name : Ethha_0476 (Ethha_0476)
Accession : YP_004090792.1
Strain : Ethanoligenens harbinense YUAN-3
Genome accession: NC_014828
Putative virulence/resistance : Unknown
Product : HsdR family type I site-specific deoxyribonuclease
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 511139 - 514156 bp
Length : 3018 bp
Strand : +
Note : TIGRFAM: type I site-specific deoxyribonuclease, HsdR family; PFAM: protein of unknown function DUF450; type III restriction protein res subunit; KEGG: clo:HMPREF0868_0494 type I site-specific deoxyribonuclease, HsdR family; SMART: DEAD-like helicase

DNA sequence :
ATGCCTTATGCCGAAGCCAATTTTGAAAATGCCGTTATGATTTTACTGGAAGAGCTGGGGTACGCTAAGCTCTATGGCCC
GGATGTAGAACGGGATTTTCATAACCCGCTTTACTTGGATGCGCTCACTGGACAGCTTCCCCGCATTAACCCGATGGCTG
ATCGGCAGGCGGTGGAAGAGGCCCTTCACAAGCTGACGCATATCGAGCATGGTACACTTCTACAGAAAAATAAGCGGTTT
ATGGATTGGCTGCAAAATGGCATAGAGGTTACTTACCAAAAGGGCGGAGAAACGAAAAACGAGCTTGTCCGTTTAATAGA
CTACGAACACCCGGAATACAATATGTTCTGCGCCATCAACCAGTGGACGATAACGGAACATGAGACGAAACGGCCAGATG
TGGCGGTTTTTGTCAACGGCTTGCCTCTTGTGGTAGTGGAATTAAAAACCTGTATGCGCGAAGATACCGATTTTTCCGAC
GGCTACCGGCAGATCAAAAACTATATGAAAGAGATCCCGGTACTGTTTCAATATAACGCTTTCTGTATCATAAGCGACCT
GATCGATTCCAAAGTGGGAACGATCACCTCGGAGTTTGACCGGTTTGTGGACTGGAAAACGGTGAACGGCGATTATGAAG
AAACACAATTCGCCCGTTATGACGTGCTCTTTCGCGGAATGCTGGAGCCAAGGCGTTTTCTGGACATCCTGCGGTACTTT
ATTCTCTTTTCACAGGATATACCGGAAGATCACAAAATACTGGCAGGCTACCACCAGTATTTTGCAGTACGTAAAGCCAT
TGAATCAACTCGCCATGCGGTGGAGACTGACGGTAAGGGCGGCGTGTTCTGGCACACTCAGGGTAGCGGTAAAAGCCTTT
CGATGGTGTTCTACGCCGCACTGCTCCAGCAGGCGCTGGACAGCCCCACCATTATTGTCATTACTGATCGAAACGATTTG
GACGATCAACTCTATGGACAATTTTCTGCCTGCAAAGATTTTTTGCGCCAGACGCCGGAGCACGCTCAAAGCCGCGAACA
TTTGAAAGAACTGTTGGCCGGGCGGCGAGCCAACGGTATCTTTTTTACCACCATGCAGAAGTTTGAAGAGGCTCACGAAC
CGCTGAGTGAACGACGCAACATCATTGTGATGGCGGATGAAGCTCATCGCAGCCAATATGGCCTAACAGAGCGCGTGAAA
AAGGATGGAACGTTAGTTGTCGGCGCAGCGCGCGTTATCCGCGATAGCTTGCCGGATGCAACGTTCATTGGGTTCACAGG
TACGCCGATTTCTTCTAAAGATCGCGACACGCGTGCGGTTTTTGGCGATTACATTGATATTTATGATATGACGCAGGCTG
TAGAAGATGGCGCGACGCGCCCTGTTTATTATGAAAGCCGCGTGATGAACCTCGGCCTGAAGGAGGATGTCCTTCGTCAA
ATTGATACGACCTATGAACTGCTGGCACAAAACGCAAACGAGCAGGACATTGAACGCAGTAAAAAAGAGTTGGGTAATCT
GGAAGCCATCCTTTCTGCACCTGAAACCATTGATACGCTTTGCCGTGACATCATCGCACACTATGAGCAAAACCGCGAGC
ACTTGCTTACCGGCAAGGCGATGATCGTCGCCTATTCCCGCTCAATTGCCATTGATATCTATCATAAGCTATTGGAATTA
CGGCCGGGTTGGCAGAAAAAGGTCAAGGTGGTCATGACATCCGGAAATAACGACCCGGAAGAGTGGCGGAACATTATTGG
CAATAAAAGCTATAAAAGAGAGCTGGCGCGCAAGTTTAAGGACAATGACGATTCGCTAAAAATTGCGATTGTGGTGGATA
TGTGGCTCACCGGATTCGATGTCCCCAGCCTTGCCACAATGTACGTCTACAAACCCATGAGTGGTCATAACCTGATGCAG
GCCATTGCGCGTGTCAACCGCGTTTTTCGAGATAAAGAGGGCGGGCTCGTAGTCGATTATATTGGGATTGCCCGTGCACT
CAAGCGGGCAATGAATGATTACACCGTGCGCGACCGCAGTAATTACGGCAACTTGGATATTGCCAAAACTGCGCTGCCAA
AGTTCGAGGAAAAGCTGCGCGTTTGTGGAGAGCTGCTGTACGGGTTTGATTATTCTGCATTTCTGCTTGAGACGAGCAAT
GACCGCCAGCGCGCTGATCTGATTGCGGGCTGCATTAATTTTGTTTTGGGTAAGAACGCAGATACGCAAAAGACGTTTGC
AACAGAAGCATTGCTACTCAAACAGGCGCGTACACTTTGCCAGAGCCTGCTGAATCGGGAGCAGCGCATGGAATCTGCGT
TGTTTGAAGCTGTGCGCGTTGCGTTAAGTCGCATTTCCGGCACCCAAAAGTTGTCGCTGAAAGAGATCAACGACCGCATC
AATGAGATGTTGCAACAAAGTGTGCATAGCGAGGGCGTTATCAATCTGTTTGGAGAGCAAAGCGCGGAATTTTCATTGTT
TGATCCAGCTTTTTTGGAAGAAATCAGCCGAATGAAGCAGAAAAATTTGGCTGCGGAGCTCTTGCGCAAGCTACTGGCGG
AGCAGATCTCCGCATATCAGCATACCAACTTGGTTCAGGCTGAAAAATTCTCTGACCGCATGCAGAAACTCATGAACGCC
TATCGCAATGGACAAATTACCAATGCCGAAGTAATTGAAGAGTTACAAAAGATGGCAGCAGACATTGCAAAAGCGCACCA
AACCGGCGCATCGCTGGGCCTATCGCCGGAGGAACTGGCATTCTACGACGCCATTACCCGTCCAGAAGCAGTAAAAGATT
TTTATACCAACGACCAGTTGCTGCATATGACCAAAGAATTAGCAGATACACTGCGCCGCAGTCGAACCGTTGACTGGCAG
AAAAAAGAGAGCGCCCGTGCCCAAATGCGTGTGATGGTTAAGCGTCTGTTACGCAAATATAAATATCCCCCCGATGGGAT
GCAGGATGCGATTCAGACTGTGCTGGCGCAGTGCGAGCTATGGACAGATGAAGCATAG

Protein sequence :
MPYAEANFENAVMILLEELGYAKLYGPDVERDFHNPLYLDALTGQLPRINPMADRQAVEEALHKLTHIEHGTLLQKNKRF
MDWLQNGIEVTYQKGGETKNELVRLIDYEHPEYNMFCAINQWTITEHETKRPDVAVFVNGLPLVVVELKTCMREDTDFSD
GYRQIKNYMKEIPVLFQYNAFCIISDLIDSKVGTITSEFDRFVDWKTVNGDYEETQFARYDVLFRGMLEPRRFLDILRYF
ILFSQDIPEDHKILAGYHQYFAVRKAIESTRHAVETDGKGGVFWHTQGSGKSLSMVFYAALLQQALDSPTIIVITDRNDL
DDQLYGQFSACKDFLRQTPEHAQSREHLKELLAGRRANGIFFTTMQKFEEAHEPLSERRNIIVMADEAHRSQYGLTERVK
KDGTLVVGAARVIRDSLPDATFIGFTGTPISSKDRDTRAVFGDYIDIYDMTQAVEDGATRPVYYESRVMNLGLKEDVLRQ
IDTTYELLAQNANEQDIERSKKELGNLEAILSAPETIDTLCRDIIAHYEQNREHLLTGKAMIVAYSRSIAIDIYHKLLEL
RPGWQKKVKVVMTSGNNDPEEWRNIIGNKSYKRELARKFKDNDDSLKIAIVVDMWLTGFDVPSLATMYVYKPMSGHNLMQ
AIARVNRVFRDKEGGLVVDYIGIARALKRAMNDYTVRDRSNYGNLDIAKTALPKFEEKLRVCGELLYGFDYSAFLLETSN
DRQRADLIAGCINFVLGKNADTQKTFATEALLLKQARTLCQSLLNREQRMESALFEAVRVALSRISGTQKLSLKEINDRI
NEMLQQSVHSEGVINLFGEQSAEFSLFDPAFLEEISRMKQKNLAAELLRKLLAEQISAYQHTNLVQAEKFSDRMQKLMNA
YRNGQITNAEVIEELQKMAADIAKAHQTGASLGLSPEELAFYDAITRPEAVKDFYTNDQLLHMTKELADTLRRSRTVDWQ
KKESARAQMRVMVKRLLRKYKYPPDGMQDAIQTVLAQCELWTDEA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 42
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 42
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 42
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 7e-172 41