Gene Information

Name : Pnap_4782 (Pnap_4782)
Accession : YP_973933.1
Strain :
Genome accession: NC_008761
Putative virulence/resistance : Unknown
Product : HsdR family type I site-specific deoxyribonuclease
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : 3.1.21.3
Position : 19345 - 22434 bp
Length : 3090 bp
Strand : +
Note : KEGG: pol:Bpro_1944 type I site-specific deoxyribonuclease, HsdR family; TIGRFAM: type I site-specific deoxyribonuclease, HsdR family; PFAM: type III restriction enzyme, res subunit; protein of unknown function DUF450; SMART: DEAD-like helicases-like
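
The Position and Length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the listed length of 3090 bp is consistent with that reading) and that the final codon is a stop codon; the function name `cds_summary` is hypothetical and not part of the record.

```python
def cds_summary(start, end):
    """Return (length_bp, codons, residues) for a CDS.

    Assumes 1-based, inclusive coordinates and a terminal stop codon.
    """
    length_bp = end - start + 1
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    residues = codons - 1  # the stop codon does not encode a residue
    return length_bp, codons, residues


# Coordinates taken from the Position field of this record.
print(cds_summary(19345, 22434))  # (3090, 1030, 1029)
```

Under these assumptions the expected protein length is 1029 residues.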

DNA sequence :
ATGACCGAAGACCAACTCGAACAAGAAACCCTGGCCTGGCTTCAGGACGTGGGCTACACCTGCCACTGCGGCTACGACAT
CGCGCCTGACGGTCCTGCGCCCGAGCGCAGCAGTTTCAGCCAGGCGCTGCTGCCCTTCCGGCTGCGCGAGGCCATCCACA
AGCTCAATCCCGGCATCCCGACCCCTGCCCGCGAAGACGCCTTCAAGCAGGTTCTTGACTTGGGCATCCCGGCGCTGCTG
AGCGCCAACCGGCATTTCCACAAGCTGCTGGTGGGCGGCGTGCCCGTGGAGTACCAGAAAGACGGCCAGACCCGGGGCGA
CTTCGTGCGCCTGATCGACTGGGCGCAGCCGGCGCGCAACGAGTTTCTGGCGGTCAACCAGTTTTCCCTCAAAGGTGCGC
ACCACACGCGCCGCCCCGACATCATCCTGTTCGTCAACGGCCTGCCCTTGGTGCTGCTCGAACTCAAGAATCCCGCCGAC
CTCAACGCCAATGTGTGGAAAGCCTACGACCAGATCCAGACCTACAAGGCGCAGATCCCGGGCGTGTTCGAGTACAACGA
AGTGCTGGTGATTTCGGACGGCACCGAGGCGCTGCTGGGGTCCTTGTCTAGCAGCAGCGAGCGCTTCATGGCCTGGCGCA
CGATTGACGGCCAGGCGCTGGACCCGCTGGGCCAATTCAACGAGCTGCAGACCCTGGTGCGCGGCGTGCTGGCCCCGGCG
TACCTGCTGGACTACCTGCGCTACTTCGTGCTGTTCGAGGACGACGGCCAGCTGGCCAAGAAAATCGCCGGCTACCACCA
GTTCCATGCGGTCCGCTCGGCCATTACCCAGGTCGTGACCGCCTCTCGCCCCGGCGGCACCCACAAGGGCGGCGTGGTCT
GGCACACCCAGGGCAGTGGCAAGAGCATCACCATGACCTGCTTTGCTGCTCGCGTGATGCAGGAGCCGGCGATGGAGAAC
CCCACCATCGTGGTGATTACCGACCGCAACGACCTGGACGGCCAGCTCTTTGGCGTGTTCAGCCTGGCCCAGGATCTGCT
GCGCGAGCAGCCGGTGCAGGTCAGCACGCGGCAGGATCTGCGGACCAGGCTGGCGAACCGGCCCTCGGGCGGCATCGTGT
TCGCCACCATCCAGAAATTCATGCCGGGCGAGGATGAGGACACCTTCCCTACCCTGTCCGAGCGCCACAACATCGTGGTG
ATTGCCGACGAGGCGCACCGCACCCAGTACGGCTTCGAGGCCAAGCTCAAGGGCAAGCCCGGACACGAGACCTACCAGGT
CGGCTACGCCCAGCACCTGCGCGACGCGCTGCCCAACGCCACCTTCGTGGCCTTCACCGGCACCCCGGTCAGCAGTGAAG
ATCGCGACACGCGCGCCGTGTTCGGCGACTACATCTCGGTCTATGACATGCAGCAGGCCAAGGAGGACGGCGCCACGGTC
GCCATCTACTACGAGTCGCGCCTGGCCAAGTTGAAGCTCAAGGAAGAAGATTTTTCGCTGATCGACGAGGAGGTCGATGA
GCTGGCCGAAGACGAGGAGGAAAGCACCCAGGCCAAGCTCAAAAGCCGCTGGGCTGCCCTGGAGAAGGTGGTCGGCGCTG
AACCGCGCGTGGCCAGCGTGGCGGCCGACCTCGTGGCGCATTTCGAGGAGCGCAACAAGGCCCAGAGCGGCAAGGCCATG
ACGGTGGCCATGAGCCGCGACATCTGCGTGCATCTGTACAACGAGATCGTCAAGCTGCGCCCGGACTGGCACGACCCGGA
TCCAGAAAAGGGCGCCATCAAGATCGTGATGACCGGGTCCAGCAGCGACAAGGCGCTGCTGCGCCCGCACATCTACAGCG
CCCAGGTCAAGAAACGCCTGGAAAAGCGCTTCAAGGATCCGGCCGACCCGCTGCGCCTGGTCATCGTGCGCGACATGTGG
CTCACCGGCTTTGACGCGCCGTGCGTGCATACCCTCTACGTGGACAAGCCCATGAAGGGCCACAACCTGATGCAGGCGAT
TGCCCGGGTCAACCGCGTGTTCAAGGACAAGCAGGGCGGCCTGGTGGTGGACTACATCGGCATCGGCAATGAACTCAAAG
CCGCCATGAAGGAATACACCCAGAGCAAAGGCCGGGGTCGGCCCACGGTGGACGCGCATGAAGCGTATAGCGTGCTGGCC
GAAAAACTCGACATCCTGCAAACGATGCTGCACGGCTACGACTACAGCGGTTTTCTGACCGGCGGCCACAAGGCGCTGGC
CGGCGCCGCCAACCATGTGCTGGGCGCCCAGGACGGCAAGAAGCGCTTTGCCGACACGGCCCTACAGATGAGCAAGGCGT
TCAGCCTGTGCTGCACGCTGGACGAGGCCAAGGCGGTGCGCGAGGAGGTGGCCTTTTTGCAGGGCGTCAAGGTCATCCTG
ACCAAAAAGGATTTGAGCGCGCAAAAGAAGACCGACGAGCAGCGCGACCTGGCCATCCGGCAGATCATCAACTCGGCCGT
GGTGTCGGACAGCGTGGTGGACATTTTCGATGCCGTCGGGCTGGACAAGCCCAACATCGGACTGCTGTCCGACGAGTTCC
TGGCGCAGGTGAAAAACCTGCCGGAGAAGAACCTGGCGGTGGAATTGCTGGAGCGGCTGCTGGAGGGCGAGATCAAGAGC
CGGTTTGCCAGCAACGTGGTGCAGGAGAAGAAGTTTTCCGAGCTGCTTGCCGGTGTCATCAAGCGCTACCAGAATCGCTC
CATCGAGACCGCCCAGGTCATGGAGGAGCTGGTGGCGATGGCCAGGAAGTTTCAGGAGGCGGCCAACCGAGGCGAAGCAC
TGGGCCTCACCGAGGACGAGATCAAGTTTTATGACGCGCTGGCCACCAACGAATCGGCCGTGCGGGAACTGACCGATGAA
ACCCTCAAGAAGATCGCCCATGAGCTGACCGAGAACCTGCGCCAAAACCTCAGCGTGGACTGGTCGGAGCGCGAGAGCGT
GCGCGCCAGGCTGCGCCTGATGGTCAGGCGCATCCTGCGCAAATACAAGTACCCGCCCGACCTGCAGGATGCGGCGGTGG
AACTGGTGCTGCAGCAGGCGCAGGCGCTAGGAACCGTATGGATGATCTAG

Protein sequence :
MTEDQLEQETLAWLQDVGYTCHCGYDIAPDGPAPERSSFSQALLPFRLREAIHKLNPGIPTPAREDAFKQVLDLGIPALL
SANRHFHKLLVGGVPVEYQKDGQTRGDFVRLIDWAQPARNEFLAVNQFSLKGAHHTRRPDIILFVNGLPLVLLELKNPAD
LNANVWKAYDQIQTYKAQIPGVFEYNEVLVISDGTEALLGSLSSSSERFMAWRTIDGQALDPLGQFNELQTLVRGVLAPA
YLLDYLRYFVLFEDDGQLAKKIAGYHQFHAVRSAITQVVTASRPGGTHKGGVVWHTQGSGKSITMTCFAARVMQEPAMEN
PTIVVITDRNDLDGQLFGVFSLAQDLLREQPVQVSTRQDLRTRLANRPSGGIVFATIQKFMPGEDEDTFPTLSERHNIVV
IADEAHRTQYGFEAKLKGKPGHETYQVGYAQHLRDALPNATFVAFTGTPVSSEDRDTRAVFGDYISVYDMQQAKEDGATV
AIYYESRLAKLKLKEEDFSLIDEEVDELAEDEEESTQAKLKSRWAALEKVVGAEPRVASVAADLVAHFEERNKAQSGKAM
TVAMSRDICVHLYNEIVKLRPDWHDPDPEKGAIKIVMTGSSSDKALLRPHIYSAQVKKRLEKRFKDPADPLRLVIVRDMW
LTGFDAPCVHTLYVDKPMKGHNLMQAIARVNRVFKDKQGGLVVDYIGIGNELKAAMKEYTQSKGRGRPTVDAHEAYSVLA
EKLDILQTMLHGYDYSGFLTGGHKALAGAANHVLGAQDGKKRFADTALQMSKAFSLCCTLDEAKAVREEVAFLQGVKVIL
TKKDLSAQKKTDEQRDLAIRQIINSAVVSDSVVDIFDAVGLDKPNIGLLSDEFLAQVKNLPEKNLAVELLERLLEGEIKS
RFASNVVQEKKFSELLAGVIKRYQNRSIETAQVMEELVAMARKFQEAANRGEALGLTEDEIKFYDALATNESAVRELTDE
TLKKIAHELTENLRQNLSVDWSERESVRARLRLMVRRILRKYKYPPDLQDAAVELVLQQAQALGTVWMI
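
The protein sequence above appears to be the direct translation of the DNA sequence on the + strand. The relationship can be illustrated with a small translator, sketched here in plain Python under the assumption that the standard codon assignments apply (they are shared by the bacterial translation table for elongation codons). The names `CODON_TABLE` and `translate` are illustrative and not part of the record, and only short fragments of the sequence are used to keep the example compact.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides of the DNA sequence in this record.
print(translate("ATGACCGAAGACCAACTCGAACAAGAAACC"))  # MTEDQLEQET
# Last 9 nucleotides, ending in the TAG stop codon.
print(translate("ATGATCTAG"))  # MI
```

Both fragments agree with the start (MTEDQLEQET) and end (MI) of the protein sequence listed above.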

• Homologs from PAI DB

Gene | GenBank Accession | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
api52 | CAF28526.1 | hsdr-like Type I restriction enzyme | Not tested | YAPI | Protein | 0.0 | 56
hsdR | BAH57699.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-VII SCCmec | Protein | 0.0 | 48
hsdR | YP_251977.1 | type I restriction-modification system restriction subunit | Not tested | SCCmec | Protein | 0.0 | 48
hsdR | BAD24840.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-V SCCmec | Protein | 0.0 | 48