Gene Information

Name : SFBM_0021 (SFBM_0021)
Accession : YP_004770546.1
Strain : Candidatus Arthromitus sp. mouse isolate
Genome accession: NC_015913
Putative virulence/resistance : Unknown
Product : type I restriction-modification system, restriction protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 27530 - 30565 bp
Length : 3036 bp
Strand : +
Note : -
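
The Position and Length fields above agree if the coordinates are read as 1-based and inclusive. That reading is an assumption that the listed length supports. The record does not state it. A minimal sketch of the arithmetic, using a hypothetical helper `feature_length`:

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(27530, 30565)

# A coding sequence of this length holds length_bp / 3 codons,
# the last of which is the stop codon.
codons = length_bp // 3
residues = codons - 1

print(length_bp, codons, residues)  # 3036 1012 1011
```

Under that assumption the gene spans 3036 bp, or 1012 codons including the stop codon, which corresponds to a 1011-residue protein.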

DNA sequence :
ATGTCATACACAGAAGCGAATTATGAAAATGCTTTGATTGAGGTTTTCAGAGATACTCTTTCTTATAATTATTTATACGG
ACCAGATATAAGCCGTGATTATTCAGATGCTATTTATATGGATGAGCTTACACCATCTCTTAAGAGAATAAACAAAGACA
TCCCACAATCTGCAATTAATGAATGCATAAATAAACTAAGAAACATTGAAGGTGGAACCTTACTTGATAAAAATAGAATA
TTTATGAATTATCTACAAAATGGAGTGAGCGTCAACTGTTTTAATAACGGAAATCAGGAAAGCTATTTAGTTAAGCTTGC
CGATTATAACTGCCTAGAGAAAAATACCTTCACAATAGTTAATCAGTGGACAATTGTTGAAAATAGTGAGAAGAGACCAG
ACCTTATTTTATTTTTAAATGGTATTCCTATTGTGGTATTTGAGCTTAAGTCTCCAAGTCGTGAAGAGACAAGCGTGTCA
GAAGCATATAATCAGCTCAGAAATTATATGTACGAGATACCTTCTCTTTTTAATTACAATGCTTTTTTAGTTATGAGTGA
TCTTGCAGTATCAAAGGCTGGAACTATAACAAGTAGTGAAGATAGATTCATGGAGTGGAAAACTAAGGATGGAAGCTATG
AAAATACCTCTTATGCTCAGTTTGATACTTTTATTGAAGGAATGTTTGATAAAGCTCGCCTACTTGATATCATCAAAAAT
TTTATATGCTTTTCTGAAAATTCAAAAATATTTTCAGCCTATCATCAATATTTCGCTGTTAGAAAAGCTATTAAATCGAC
TCTTAATGCTTTAGAAAGCGATGGAAAAGCTGGAGTATTTTGGCACACACAAGGGAGTGGTAAATCCCTATCTATGCTAT
TTTATGCTCATCTTCTTTGTGAAACCATAAATTCTCCAACTATACTTGTACTTACAGATAGAAATGATTTAGATGATCAA
CTCTTCTCTCAGTTTTCTAAATGTAAAGATTTTCTTTATCAAGTTCCAAATAAGGCAGAAAGCCGTGAGCATTTGAAAAA
ACTTCTTGCTGGAAGAGAGGCAAATGGAATCATTTTTTCAACAATTCAAAAGTTTGAGGAAAGTACAGAGCCTCTATCAG
AGAGAAAAAATATTATAATCATGGCAGATGAGGCCCATAGGGGACAGTATGGATTAGATGAGAAAGTTGATAAAAATACA
GGGCGTATAAGTTTTGGTATAGCTAGAATAATACGAGATAATTTCCCTAACGCAACATATATAGGTTTCACAGGAACTCC
GATTTCATTAAAAGATAGATCTACAATTGAGGTGTTTGGAAATTATATTGATATATATGATATGACTCAAGCTGTAGAAG
ATGGAGCAACTCGCCCAGTTTATTATGAAAGCCGTGTAATTAAATTGAATTTGGATAACTCTATATTGGAGATGATAGAT
AAAGAGTATGAAATTTTATCTAATAACGCTGAGATTCATGCAATTGAGAAGAGTAAAAAAGAACTTGGAAGATTAGAGAG
TATACTTGGAGCAGATCAAACTATAGACTCTTTAACTAGAGATATCGTAATGCACTATGAAGAAAACAGACAGTATGAAT
TAACAGGTAAAGCAATGATAGTTGCATATTCTCGCCCTATTGCTATGAAAATTTATCATAAAATTTTAGAATTAAAGCCT
GAATGGGAAGAAAAAGTATATGTTGTTATGACTTCAGGAAATAACGATCCAGAAGATTGGAGAAAAATAATAGGAAATAA
ACGTTATAAGGATGAACTAGCCAAAAAATTTAAGGATAATGGATCTTCATTTAAAATAGCAATTGTTGTTGATATGTGGC
TTACTGGATTTGATGTTCCATCTCTTGCAACTATGTATATATATAAACCTATGAGTGGTCATAATTTAATGCAGGCTATA
GCTAGAGTTAATAGAGTTTATAAAAATAAAGAGGGCGGACTTATAGTTGATTATATTGGAATTGCAGGAGCTTTGAAAAA
GGCGATGAGCGATTACACAAAGAGAGATAAGGTAAATTATGGGGAGAGTGATATCTCAAAAATTGCTTATCCAAAGTTTA
TAGAAAAACTTGAGGTATGTCGTGCTCTTTTGCATGGGTTTGATTATTCCTCATTTATGCTTAAATCATTAACAGATCTT
ACGCGTGCAAAACTTATAAGTGGAGGCGTTAACTTTCTATCAGATCCATCAAGAAAGGAGGATAAAAAATTCTATATAAA
AGAGTCGCTTCTCTTGCGTCAATCTGTTTCACTTTGTCGCTCTATTTTAACTAAGGATCAAAGACTTGAATCTGCATATT
TTGAAGCACTTCGTACACTACTTACACGCATAACAGGAGAAAATAAACCTCTTTCTTTAAAAGATATAAATAAGCATATT
AATGACTTATTAAAAGAGAGTATAAAAAGTGATGGTATTATAAATTTATTTTCTGATATTGATATTGGGGTTTCTATATT
TGATGAAAAATTTTTAGAAGAAGTTTCAAATATGAAAGAGAAGAATTTGGCAGTTGAAATGTTAAAGAAACTTTTAAATG
AACAGATTTCAGTATATAAAAGGAATAATATTGTTAAATCACAAAAATTTTCGGAAATGTTAAATAAGGCTATGAGGTTA
TATATAAATGGAATGATCACAAATGAAGAGATAATAGAAGAGCTATTAAAGATGGCACGAGATATATCACGCTCCTCAGA
AGAGGCAGAATCACTTGGACTTAGTGATGAGGAGATGGCATTTTATGATGCACTAACTCGACCAGAGGCTATAAAAGATT
TTTACACAAATCAAGAACTTGTAGCACTCACTCGTATGCTTACAGACAGCTTAAGAAGAAGTAGAACTATAGATTGGCAG
AAAAAGGATACTGCGAGAGCAAAAATGCGTAGAATGGTTAGAAAACTTTTAAAAGACTATGATTATCCACCAAAAGGTTT
AGAGGATGCCGTGGCAACAGTATTATCTCAATGCGAAATTTGGGCAGATTTTGGAGAAGAAAGTTATAGAGTTTAA

Protein sequence :
MSYTEANYENALIEVFRDTLSYNYLYGPDISRDYSDAIYMDELTPSLKRINKDIPQSAINECINKLRNIEGGTLLDKNRI
FMNYLQNGVSVNCFNNGNQESYLVKLADYNCLEKNTFTIVNQWTIVENSEKRPDLILFLNGIPIVVFELKSPSREETSVS
EAYNQLRNYMYEIPSLFNYNAFLVMSDLAVSKAGTITSSEDRFMEWKTKDGSYENTSYAQFDTFIEGMFDKARLLDIIKN
FICFSENSKIFSAYHQYFAVRKAIKSTLNALESDGKAGVFWHTQGSGKSLSMLFYAHLLCETINSPTILVLTDRNDLDDQ
LFSQFSKCKDFLYQVPNKAESREHLKKLLAGREANGIIFSTIQKFEESTEPLSERKNIIIMADEAHRGQYGLDEKVDKNT
GRISFGIARIIRDNFPNATYIGFTGTPISLKDRSTIEVFGNYIDIYDMTQAVEDGATRPVYYESRVIKLNLDNSILEMID
KEYEILSNNAEIHAIEKSKKELGRLESILGADQTIDSLTRDIVMHYEENRQYELTGKAMIVAYSRPIAMKIYHKILELKP
EWEEKVYVVMTSGNNDPEDWRKIIGNKRYKDELAKKFKDNGSSFKIAIVVDMWLTGFDVPSLATMYIYKPMSGHNLMQAI
ARVNRVYKNKEGGLIVDYIGIAGALKKAMSDYTKRDKVNYGESDISKIAYPKFIEKLEVCRALLHGFDYSSFMLKSLTDL
TRAKLISGGVNFLSDPSRKEDKKFYIKESLLLRQSVSLCRSILTKDQRLESAYFEALRTLLTRITGENKPLSLKDINKHI
NDLLKESIKSDGIINLFSDIDIGVSIFDEKFLEEVSNMKEKNLAVEMLKKLLNEQISVYKRNNIVKSQKFSEMLNKAMRL
YINGMITNEEIIEELLKMARDISRSSEEAESLGLSDEEMAFYDALTRPEAIKDFYTNQELVALTRMLTDSLRRSRTIDWQ
KKDTARAKMRRMVRKLLKDYDYPPKGLEDAVATVLSQCEIWADFGEESYRV
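
The DNA and protein sequences above can be cross-checked by translating the coding strand codon by codon. The sketch below is illustrative and is not the database's own pipeline. It assumes the standard codon table, which assigns the same amino acids as the bacterial table to the codons shown. The names `CODON_TABLE` and `translate` are hypothetical. Only the first and last codons of the record are translated here, and the same function can be applied to the full sequence.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First nine codons and last six codons of the DNA sequence in this record.
first_codons = "ATGTCATACACAGAAGCGAATTATGAA"
last_codons = "GAAAGTTATAGAGTTTAA"

print(translate(first_codons))  # MSYTEANYE
print(translate(last_codons))   # ESYRV
```

Both fragments match the start (MSYTEANYE) and end (ESYRV) of the listed protein sequence, and the final codon TAA is a stop codon.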

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
hsdR | YP_251977.1 | type I restriction-modification system restriction subunit | Not tested | SCCmec | Protein | 0.0 | 41
hsdR | BAD24840.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-V SCCmec | Protein | 0.0 | 41
hsdR | BAH57699.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-VII SCCmec | Protein | 0.0 | 41