Gene Information

Name : Arth_0986 (Arth_0986)
Accession : YP_830483.1
Strain : Arthrobacter sp. FB24
Genome accession: NC_008541
Putative virulence/resistance : Unknown
Product : HsdR family type I site-specific deoxyribonuclease
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : 3.1.21.3
Position : 1063872 - 1066994 bp
Length : 3123 bp
Strand : -
Note : KEGG: dge:Dgeo_2012 type I site-specific deoxyribonuclease, HsdR family; TIGRFAM: type I site-specific deoxyribonuclease, HsdR family; PFAM: type III restriction enzyme, res subunit; protein of unknown function DUF450; SMART: DEAD/DEAH box helicase domain

DNA sequence :
ATGAGCGACCTGAACGAGTCAACGGTAGAGCTGGCGGTGCTTCAGTATCTGCGCGAACTCGGCTACACGACGGCCTTCGG
CCCAGACATCGCGCCCGAAGCGGCTGCGGCTGAGCGTTCGTCTTACGAGCAGGTATACCTGTTGAGCCGCTTGCGCGCGG
CGGCGGTTCAGATCAACCGTGGCCTTGATGCGGCTCTCATTGATGAGGCGATCAAGCGCCTCGGCCGGGCGGAATCACAG
AACCCAGTTGCGGAGAACCTTCGCGTCCACGAACTCCTGACCGAGGGCGTTCCGGTCGAACACCGCGACGCCGAAGGCTC
TGTACGGACGACTCGTGTCCGCCTCATCGACTTCGAGGACCCGGCAGGCAACGACTGGCTCGCCGTCAACCAGTTCACAA
TCATCGAGAACGGCAAGAACCGACGTCCGGACGTCGTGCTCTTCTTGAACGGCATGCCGCTTGGCCTGCTGGAGCTGAAG
AACCTGGCCGACGAGCACGCGACGCTCAAGGGAGCCTGGAACCAAATCCAGACCTACCGCCATGACATTCCATCCCTGTT
CACTCCGAACGCGGTGACGGTCGTCAGCGATGGCGTCAGCGCCGCCATGTCGTCATTCACGGGCGGCTTCGAGCACTATG
CGCCGTGGAAAACGATCGACGGACGCGAGGTCGTGATGAATCTGCCCGCAGTCGAGGTGTTGATCAAGGGCGTCTTCGAC
CAGAAGCGCTTCCTCGACATTCTGCAGAACTTCATCGTCTTCAGCGACGAGTCCAAGGGGCTCGTGAAGCGCGTCGCGAA
ATACCACCAGTACTGGGCCGTCAATGCCGCGGTCGAGTCGACCATCGAGGCAGCAGGTCCCGATGGCGACCGCCGCGGCG
GCGTCGTGTGGCACACCCAAGGGAGTGGCAAGTCGATCGAGATGTTGCTCTACGCGGCGAAGATCATGCGCGACATTCGG
ATGGGCAACCCGACGTTGTTGTTCATCACCGACCGCAACGACCTCGACGACCAGCTCTTCGGCGAGGTGTTTGCACCAGC
CGAGATCCTGCCTGAGAAGCCCGTCCAAGCTGACTCCAGAGCAGACCTTCGCAGCCTGCTCCGCCGCGCGTCCGGCGGCA
TCATCTTCACCACCGTGCAGAAGTTCGCCCCCGAGGCTGGTGGCGACACCAACCCGGTACTGACCGACCGCCGCAACGTC
GTCGTTGTCGCCGACGAGGCTCACCGATCCCAATACGGCTTCACCGAGTCCCTTGACGAGCGCACCGGGCAGTTGAAGTC
TGGGCTCGCGAAGCACATGCGCGACGCCCTCCCGAACGCCACCTATCTCGGGTTCACGGGCACACCCATCGAGTCGAACG
ACAAGTCAACTCGCTCCGTGTTCGGTGACTACATCGACATCTATGACCTCACGCGCGCTGTTGAGGACGGTGCCACCGTC
CGGATTTTCTACGAGTCCCGACTCGCGAAGGTGTCCCTCGATGCTGATGTGCACGCTGCAATCGACGAACTCGCCGACGA
AATAACCGAGACCGCAGAAGAGGACGAGGCCACCCGCGCCAAGTCCAGGTGGGCACGGTTGGAAGCCGTCGTGGGCGCGA
ATGACCGTCTCGATGTGATTGCAGGCGACATCGTCGACCACTGGGAGAAACGCCGGACCGAGATGTTCGGCAAGGCCATG
ATTGTCACGATGTCGCGGCGCATCGCCGTCGACCTCTACGACAAGATCGTTAAGCTCAAGCCGGAGTGGCACACCGACGA
CCCGACGACAGGCATGATCAAGGTCGTCATGACCGGTTCGGCGGCCGACCCTCAGGCCTTCCAGCCGCACATCTACGACA
AGAAGACCCGCAAAGACCTTAAACTGCGGGCAAAGGACCCAAACGATTCCCTCGAGATCGTCATCGTCCGCGACATGTGG
CTTACCGGCTTCGACGCCCCGTCGATGCACACTATGTACGTCGACAAACCAATGCAGGGCGCCGGCCTGATGCAAGCCAT
CGCACGGGTGAACCGTACCTTCCGCGACAAGCCCGGCGGCCTGATCGTCGACTACATCGGCGTCGCCACAAATCTGCGCA
GGGCCCTAGCCGAGTACTCCCCCAGCGATCGTGACCAGGCCGGCGTGCCGATTGAGGAGATCGTCTCCGCCATGTTGGAA
AAGCACGACATCGTGCGAGGACTCCTTCACGGTTGCAGGTACAATTCCTCGCCGCTACTGGCGCCCGCCGCACGCCTAGC
CCAGCACGCGCTCGTTCTCGACTTCGTTATGGCCGACCCGGACCGCACCGCACGTTACCTCGACCAAGTGCTCGCGCTAG
CCAAGGCCTTTGCGCTCTGCGGGGCGCGGGATGAAGCAGCTGCGATCCGTAATGACGTGCGGATGTTCGCCGATGTCCGA
GCGGCGACCCTGAAGATCCAGAATCCGGACTCAGGGCGTGCCGGCAGCGGTGCCGTAGAAATAGACACCGCGATCGGGCA
ACTCGTCAACGAAGCCGTCACCGCCGACGAGGTCGTTGATATCTACAAGCTCGCCGGCATTGAAACTCCAGAGCTGTCGA
TCCTGTCGGACGAGTTCCTCGACACCCTGGCCGGGAAGGAGAAGCCCAACCTCCAGATGGGGCTCCTCCGCCGGCTGATC
AACGATCAAATCCGCACCGTCCAGCGCACCAATATCGTTCAGGCACGAAAATTCTCCGAGCAGCTCGACGAGGCAATTAA
CCGCTATACGAACCGCACACTGACGACAGCAGAAATCATTGCCGAGCTCGTCAAGCTCGCCAAAGACATGCGAAACCAGA
ACGACCGTCACAACAGACTCGGTCTTTCTGTCGCCGAGGCTGCATTTTACGATGCCATCGTGCAAAACGACGTCGCTGTC
CTCCAGATGGGCGACGACACGCTAAAGAAGATTGCCGTCAATCTCGTTTCCACCGTCCAGCGGAGCGCCACAATCGACTG
GTCCCTCAAACATTCGGTCCGGGCCGCCATGAGATCCAAAATCCGTCGGCTACTTGCAAGGTACGACTACCCGCCCGATC
ACGAGGAGAAGGCGATTGAGCTGATACTCCGACAAGCTGAGCTAATCGCCGGGACCGAAGCGCAGTCAACGGTACGCACT
TGA

Protein sequence :
MSDLNESTVELAVLQYLRELGYTTAFGPDIAPEAAAAERSSYEQVYLLSRLRAAAVQINRGLDAALIDEAIKRLGRAESQ
NPVAENLRVHELLTEGVPVEHRDAEGSVRTTRVRLIDFEDPAGNDWLAVNQFTIIENGKNRRPDVVLFLNGMPLGLLELK
NLADEHATLKGAWNQIQTYRHDIPSLFTPNAVTVVSDGVSAAMSSFTGGFEHYAPWKTIDGREVVMNLPAVEVLIKGVFD
QKRFLDILQNFIVFSDESKGLVKRVAKYHQYWAVNAAVESTIEAAGPDGDRRGGVVWHTQGSGKSIEMLLYAAKIMRDIR
MGNPTLLFITDRNDLDDQLFGEVFAPAEILPEKPVQADSRADLRSLLRRASGGIIFTTVQKFAPEAGGDTNPVLTDRRNV
VVVADEAHRSQYGFTESLDERTGQLKSGLAKHMRDALPNATYLGFTGTPIESNDKSTRSVFGDYIDIYDLTRAVEDGATV
RIFYESRLAKVSLDADVHAAIDELADEITETAEEDEATRAKSRWARLEAVVGANDRLDVIAGDIVDHWEKRRTEMFGKAM
IVTMSRRIAVDLYDKIVKLKPEWHTDDPTTGMIKVVMTGSAADPQAFQPHIYDKKTRKDLKLRAKDPNDSLEIVIVRDMW
LTGFDAPSMHTMYVDKPMQGAGLMQAIARVNRTFRDKPGGLIVDYIGVATNLRRALAEYSPSDRDQAGVPIEEIVSAMLE
KHDIVRGLLHGCRYNSSPLLAPAARLAQHALVLDFVMADPDRTARYLDQVLALAKAFALCGARDEAAAIRNDVRMFADVR
AATLKIQNPDSGRAGSGAVEIDTAIGQLVNEAVTADEVVDIYKLAGIETPELSILSDEFLDTLAGKEKPNLQMGLLRRLI
NDQIRTVQRTNIVQARKFSEQLDEAINRYTNRTLTTAEIIAELVKLAKDMRNQNDRHNRLGLSVAEAAFYDAIVQNDVAV
LQMGDDTLKKIAVNLVSTVQRSATIDWSLKHSVRAAMRSKIRRLLARYDYPPDHEEKAIELILRQAELIAGTEAQSTVRT

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 45
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 45
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 45
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 44