Gene Information

Name : WQG_11930 (WQG_11930)
Accession : YP_007548261.1
Strain : Bibersteinia trehalosi USDA-ARS-USMARC-192
Genome accession: NC_020515
Putative virulence/resistance : Unknown
Product : type I restriction enzyme HindVIIP R protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1302546 - 1305698 bp
Length : 3153 bp
Strand : +
Note : Type I site-specific restriction-modification system, R (restriction) subunit and helicases COG0610; type I restriction enzyme HindVIIP R protein of Proteobacteria UniRef RepID=T1RH_HAEIN

DNA sequence :
ATGCTGATCAACGAAAACACCATTGAACAATCCGCCATTGCGACTTTGCAATCTTTGGGCTGGGACTACACCTACGGCAA
AACGATTTTGGCAGGTTTAGAACACGAATGGCGGGAGCGAGCCGCTGAGGTGATTTTAAAGCCGCTTTTGGCACAAGCGA
TTGCAAAATTCAACCCGAATTTGCCCGCTTATGAGGTGGAAAATGTGGTGGCACAGGTGTGCCGTGCGGAAAGTGGCGAT
TTGGCGGAGCGTAACCGTCAGGCTTATGATTGGCTGAGAAATGGGGTGAAAATCACTTATCAGTTGGACGGTGAGCAGGT
GTCTGAGGTGGTGCAGCTGATTGATTTTCAGCGTCCGGAAAACAACGATTTTCGTATTGTCAATCAGCTGGATATTGGCG
GTAAAAAAGGCAAACGCATTCCTGATTTGATTGGCTTTGTTAATGGCTTGCCGCTGGTGGTATTTGAGCTGAAAAATCCG
CTCAAAGAAAATGCCGACATTGGCAAGGCGTTTGCCCAACTGCAAACCTATAAAGATGAAATTTCTGATTTGTTTGTGTT
TAATCAGGCACTTGTGATTTCAGACGGCATTGTCGCTCGTATCGGTTCGCTGACCGCCGATTTAGACCGTTTTACCCCTT
GGCGTGTGGTTGATGAAAAAAATCAGAGCAAACGCATTGTGTTTGAAGATGAACTCACTGCCCTGCTGCAAGGCGTAATG
ACACCGCAAAATCTGCTGGATTATGTGCAGAATTTTGTGGTGTTTGAACGAGACGGCAAAAACCGCTTAATTAAGAAAAT
CGGGGCGTATCATCAGTTTTATGGCGTGAATGAAGCGGTGGATTGCACCTTGCTTGCCGCCACAGGCAACCGCAAAATCG
GTGTGTTTTGGCATACGCAAGGTTCGGGCAAATCGCTTTCGATGCTGTTTTATGCAGGTAAAGTATTAAGCCAAAGCAGC
CTGAAAAATCCAACTTTGGTGTTGGTAACCGACCGCAACGATTTGGACGGTCAGCTTTACGCCACTTTTTGCGGTGGTGA
GGCATTGCTCAAACAAACGCCAATCCAAGCCGATGGGCGAGACGAACTCCGCTCTGCCCTTGCCAGTCGTTCAGCAGGCG
GTGTGATTTTTACCACCATTCAAAAATTCGGCTTAATGGAGGGTGAGCTTGCCCACCCTGTGCTAAACGAGCGGGAAAAC
ATCATTGTGATTACCGATGAAGCCCACCGTTCGCAATATGGCTTCAATCAAAAAATCGACCACAAGGGGCAATATAAAGA
AGGCTATGCAAAGCATTTGAGAAGCGCTTTGCCTCATGCCTCTTTTATTGGCTTTACAGGAACGCCAATTGCCCTAGATG
ATAGAGACACCCAAGAAGTCTTCGGTAAATATGTCTCAATTTACGATTTTGAAGATGCGGTGGAAGATGGGGCGACTGTG
CCAATTATTTATGAGCCACGCCAAATCAGCTTAGGTGAAAGCCGTGAATTTGCCAAAGTGATGGAAGAGGCACAGCAGCT
TATTGATGACGATGAAAACAGCGATAACTTCCGCCTGCGTGAAAAACTGCACAGCGTGGATAGCCGTTTGCAAAAAATGG
CGGAAGATATTATTGCCCATTACGATGAGCGTACCAAACAGCAAGACGGCAAGGCGATGATTGTGGTGATGAGCCGTGCC
ATTTGCGTCAAACTCTATGACAAAATCACCGTCCTTCGCCCCGAATGGCACTCAAACGATGTGCATCAAGGCAGCATTAA
AATTGTGATGACAAGCAATGCCAGCGACCCTGCCGAGTGGCAAGTACACAATCAGGACAAAAAAACCTTAGAAAAACGCT
TTAAAGACCCAGACGATCCGCTCAAAATCGTGATTGTGCGAGATATGTGGCTGACAGGGTTTGATGCCCCCTGCTGTAAC
ACAATGTACATTGATAAACCAATGAGCGGACACAACCTGATGCAGGCAATCGCTCGGGTAAATCGGGTATTTCGCAATAA
AAGCCGTGAAAATGGCGGCTTGATTGTGGATTATGTGGGCTTGACCGATGAGCTGGAAAAAGCGATGAAGCAGTACACCA
ACGCAGGCGGCAAAGAAAAGCCGGTGCGGGACATTTCAGCGGTGCTGGAAAAAATGGTGGAACATATCACGGTCATTCGT
GGGCAATTTGCCACGCCGATTGACGGACAAGCAGTTGATATTGCCAAAATGTTGCAAATCAGCGAACCGCCTAAACTACT
CAATACGATTTTGCAGGCTGCCAACCATATTCTTGCCCTAGACCGCATTCAACCGGCTGATAACACTGCCAAAGACAAAA
CCCCACGCAAAAACGCTTTTTTACAGTCGGTACGCTTGGCGAAAAAAGGCTATGCTTTGTGTGGAGCATTAAAAGCGGTT
GAACCCTATAAACAGGAATTGGCGTTTTATGATGCAGTGCGTGCCACCATTATCAAAAACAGCACCGCTCCTCGCAATTC
TTCGAGCGAAAATGACCGCTTGTTGCAGCTTACCGCCTTGATGAATCGTGCGGTACAGTCGGACGGTGTGGTGGATTTAT
TTGATTTGCTGAAAAAAGACCGCCCAAACATCAACCTGCTTTCTGATGAGTTTTTGGAAACAGTGAAAAACAGCCCAACC
AAAGATTTGTGGCTGTCGGCAATGGAACGTTATCTTGCCGCACAACTGCGTGAACAAAGCGGTGCCAATCTTGCCACCAA
AAAAGCATTTGAACAGAAACTCAAAGAGGCAATGAACCAATACCACAACCACAATTTAAGCGTATTGGAAATTTTAGAAG
AGCTGATCGCACTTGCCAAAGAGTTTGAAGCTCGCCAAAAACGAGGGGAAGCGTTGGGACTAAACCCGGCGGAAATGGCG
TTTTATGATGCTTTGGCACGCAATGAAAGTGCGGTGCGGGAAATGGGCGATGAGGTATTGATGAACCTTGCCAAAGACAT
CACCGATAAATTACGCAAATCCGTCACCGTAGATTGGCAATATAAAGACTCGGTGCGTGCCAAAATGCGAACCTTAATCC
GCATTGCCCTGCGTAGCTATAAATACCCGCCCGATTTACAGGCAGAAGCGATTGAGTTTGTGTTGCAACAAGCGGAAGAG
ATCGCCGGAGAATTGAGCGAAAACGACATATAA

Protein sequence :
MLINENTIEQSAIATLQSLGWDYTYGKTILAGLEHEWRERAAEVILKPLLAQAIAKFNPNLPAYEVENVVAQVCRAESGD
LAERNRQAYDWLRNGVKITYQLDGEQVSEVVQLIDFQRPENNDFRIVNQLDIGGKKGKRIPDLIGFVNGLPLVVFELKNP
LKENADIGKAFAQLQTYKDEISDLFVFNQALVISDGIVARIGSLTADLDRFTPWRVVDEKNQSKRIVFEDELTALLQGVM
TPQNLLDYVQNFVVFERDGKNRLIKKIGAYHQFYGVNEAVDCTLLAATGNRKIGVFWHTQGSGKSLSMLFYAGKVLSQSS
LKNPTLVLVTDRNDLDGQLYATFCGGEALLKQTPIQADGRDELRSALASRSAGGVIFTTIQKFGLMEGELAHPVLNEREN
IIVITDEAHRSQYGFNQKIDHKGQYKEGYAKHLRSALPHASFIGFTGTPIALDDRDTQEVFGKYVSIYDFEDAVEDGATV
PIIYEPRQISLGESREFAKVMEEAQQLIDDDENSDNFRLREKLHSVDSRLQKMAEDIIAHYDERTKQQDGKAMIVVMSRA
ICVKLYDKITVLRPEWHSNDVHQGSIKIVMTSNASDPAEWQVHNQDKKTLEKRFKDPDDPLKIVIVRDMWLTGFDAPCCN
TMYIDKPMSGHNLMQAIARVNRVFRNKSRENGGLIVDYVGLTDELEKAMKQYTNAGGKEKPVRDISAVLEKMVEHITVIR
GQFATPIDGQAVDIAKMLQISEPPKLLNTILQAANHILALDRIQPADNTAKDKTPRKNAFLQSVRLAKKGYALCGALKAV
EPYKQELAFYDAVRATIIKNSTAPRNSSSENDRLLQLTALMNRAVQSDGVVDLFDLLKKDRPNINLLSDEFLETVKNSPT
KDLWLSAMERYLAAQLREQSGANLATKKAFEQKLKEAMNQYHNHNLSVLEILEELIALAKEFEARQKRGEALGLNPAEMA
FYDALARNESAVREMGDEVLMNLAKDITDKLRKSVTVDWQYKDSVRAKMRTLIRIALRSYKYPPDLQAEAIEFVLQQAEE
IAGELSENDI

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 48
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 43
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 42
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 42