Gene Information

Name : STH1908 (STH1908)
Accession : YP_075737.1
Strain : Symbiobacterium thermophilum IAM 14863
Genome accession: NC_006177
Putative virulence/resistance : Unknown
Product : type I restriction-modification system endonuclease
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 2046785 - 2049976 bp
Length : 3192 bp
Strand : -
Note : -

DNA sequence :
ATGCCCGTCCTGAGCGAAGAGACCCTGGAACAGGCCGCACTCGAATGGCTCCGCGAGCTCGGCTGGGCGGTCGCACATGG
GCCCGACATCTCCCCCGTGGACGCCAACACCCCCGGCACCGAGCGGGAGTCCTACCGCCAGGTTGTGCTGACGGGCCGGC
TGCGGGAGGCCATCCGCGGCCTCAACCCCCACATCCCGGCCTCGGCGCAGGAGGACGCCCTCCGGCAGGTGGTGAGCCCC
AACATCCCCGGTCTGGTGCAGGCCAACCGGCAGTTCCATCGGTGGCTCGTGGAGGGCGTGCCGGTCGAGTTCCAGAAGGA
CGGGGAGACCCGGGGCGACCGGGTGCGGCTGGTGGACTTCACCGACCCGAGTCGCAACGACTGGCTGGCGGTCAACCAGC
TCTCCATTCAGGGGCCGAAGAAGGTGCGCAGGCCGGATATCATCCTGTACCTCAACGGCCTGCCCATCGTCGTCATCGAG
CTCAAGAACCCGGCGGACGAAAGCGCCGACATCTGGGCCGCCTTCAACCAGCTCCAGGCCTACAAGGAGGACATCCCCGA
CCTCTTCGTCTACAACGAACTGCTGGTCATCTCCGACGGCCTCTCCGCCCGCATGGGGTCGCTGACCGCCAACCGGGAGC
GGTTCATGGCCTGGCGGACCATCGACGGCCACGAGATCGACCCCCTGGGCGAGATGCGGGAGCTGGAGACCCTGATCCGC
GGCGCCTTCCGGCACGACCTGCTGCTGGAGTACCTGCGCTACTTCATCCTCTTCGAAGAGGATGGCCACCTCGTCAAGAA
AGTGGCCGGGTACCATCAGTTCCACGCCGTCCGGGCCGTGGTGGAGAGCGTGCTGAAGGCATCGGCCCCGGGCGGCTCCC
GCAAGGGCGGCGTCGTCTGGCACACCCAGGGGGCCGGCAAGTCCCTGGAGATGACCTGCCTCGCCGGCCGGCTGATGAGC
CACCCGGACCTGAAAAACCCCACCATCGTGGTGGTCACCGACCGGAACGACCTGGACAACCAGCTCTTCGGCGTCTTCGC
CGGCGCCACGGAACTCCTGCGGGAGACGCCTGTGCAGGCCGAGACCCGGCCGCGCCTCCGGGAGCTTCTGGCCAACCGGC
CGTCGGGCGGCATCATCTTCACCACCATCCAGAAGTTCATGCCCGGCGAGGACGAGGACACCTTCCCGGTGCTCTCCGAG
CGGACCAACATCATCGTCATCTGCGACGAGGCCCACCGCAGCCAGTACGGCTTCGCCGCCAGGCTGAGCCTGCCTGAGCG
GCGCAGGCGCCCGGCGACCGCACCCATGGGGTCCGGGGGCGACCTGTCCGACCAGATCGCGGCCGAGACGCCCGGCGGGA
ACTACGCCGTCCGCTACGGCTACGCCCAGCACATGCGCGACGCCCTGCCGGGGGCCACCTTCGTCGCCTTCACCGGCACC
CCCGTTGCCCTGGAGGACCGGAACACCCGGGCCGTCTTCGGTGACTACGTGCACATCTACGACGTGCTGCAGGCGGTGAA
GGACGGCGCGACGGTGCCCATCTACTACGAGTCCCGCCTGGCCAAGCTCGACCTGAAGGAGGAGGAGATCCCCCGGATCG
ACGAGGCCGTGGAAGAGCTGACCGAGGACGAGGAGGACGACGCCGCGCGGGCCGCTCAGTACCGCCGCTGGACGGCTCTG
GAGAAGCTGGTGGGCGCGCCGCCTCGCATCCAGAAGGTGGCCGCCGACCTGGTCGCCCACTTCGAGCGGCGCCTGGCGGC
CATGGACGGCAAGGCGATGGTCGTGTGCATGAGCCGGGAGATCTGCGTGCACATGTACAACGCCATCGTCGCCCTGCGGC
CCGAGTGGCACGACCCGGATCCCGAAAAAGGCGTCATCAAGATCGTCATGACCGGTTCCGCCGCCGACAAGCCGCTTCTG
CGGCCGCACATCTACAGCAAGGAGGTCCGCAAGCGGCTGGAGCGGCGCTTCAAGGACCCCAACGACCCGTTCAAGATCGT
CATCGTGCGGGACATGTGGCTCACGGGCTTCGACGCCCCCTGCCTGCACACGATGTACATCGACAAGCCGATGCGGGGCC
ACAACCTGATGCAGGCCATCGCCCGGGTCAACCGGGTCTTCAGGGACAAGCCCGGCGGGCTGGTGGTGGACTACATCGGC
ATCTCGCACGAGCTGAAACAGGCGCTGCGGGAGTACACCGCCGCGCGGGGCCGGGGCGAGCCGGCCATCGATGCGGAGCG
GGCCCTGGACATCCTGCGCGAGAAGATGGATGTGCTTAGGGCGATGCTCCACGGCTGCGACTACTCGGCCTTCCGCACCG
AGGCCATGGCGCTTCTGCCCAAGGTGGCCAACCACATCCTGGGGCTGGAGGACGGACAGAAGCGCTTCGCCGACCACGTG
GTGGCCGCCTCCAGGGCCTTCGCCCTCTGCTGCACCCTGGAGGGCGCCCTGGCCTACCGGGACGAGCTGGCCTTCTTCCA
GGCGGTCAAGGCGGCGCTCTCCAAGCGCGCGGAAACCGACCGGAAGGTGGCGGATGAGCGCAAGGAGGCGGCCCTGCGGC
AGATCATCGCCCAGGCGGTGGTCTCCGACGAGGTGGTGGACATCTTCGCCGCGGCGGGGCTCAGCAAGCCCGACATTTCC
ATCCTCTCGGAGGAGTTCCTCGACGAGGTGCGCCGGATGAAGGAGCGCAACCTCGCCGTCGAGCTGCTGCAGCGCCTGAT
CAAGAACGAGATCAAGGCGCGCTTCGAGACCAACGTGGTCCAGTCGGCGAGGTTCTCGGACCTGCTCCAGCAGGCGCTCA
CCCGCTACCGGAACCGCACCATCGAGACGGCGCAGGTGATCGAGGAGCTCATCGCCATGGCCAGGCGCTTCCAGGAGGAG
GCGCGGCGGGGCGAGCAGCTCGGGCTGAACGAGGATGAGCTCGCCTTCTACGACGCCCTGGCCAGCAACGAGTCGGCCGT
GCGGGAGCTGGGCGACGAGGTGCTGAAGAAGATGGCTGTCGAGCTCACGGAGCGGCTGCGTAAGTCGGTGACCGTGGACT
GGGCGCGCCGCGAGACGGTGCGGGCCCGGTTGAGGGTCATGGTGCGCACGCTGCTCCGACGGTATAAGTACCCGCCGGAC
CGGCAGGAGGCGGCGACGAACCTGGTGCTGAAGCAGGCGGAGGTGCTGTCGCAGGAGTGGGCCACGGCCTAG

Protein sequence :
MPVLSEETLEQAALEWLRELGWAVAHGPDISPVDANTPGTERESYRQVVLTGRLREAIRGLNPHIPASAQEDALRQVVSP
NIPGLVQANRQFHRWLVEGVPVEFQKDGETRGDRVRLVDFTDPSRNDWLAVNQLSIQGPKKVRRPDIILYLNGLPIVVIE
LKNPADESADIWAAFNQLQAYKEDIPDLFVYNELLVISDGLSARMGSLTANRERFMAWRTIDGHEIDPLGEMRELETLIR
GAFRHDLLLEYLRYFILFEEDGHLVKKVAGYHQFHAVRAVVESVLKASAPGGSRKGGVVWHTQGAGKSLEMTCLAGRLMS
HPDLKNPTIVVVTDRNDLDNQLFGVFAGATELLRETPVQAETRPRLRELLANRPSGGIIFTTIQKFMPGEDEDTFPVLSE
RTNIIVICDEAHRSQYGFAARLSLPERRRRPATAPMGSGGDLSDQIAAETPGGNYAVRYGYAQHMRDALPGATFVAFTGT
PVALEDRNTRAVFGDYVHIYDVLQAVKDGATVPIYYESRLAKLDLKEEEIPRIDEAVEELTEDEEDDAARAAQYRRWTAL
EKLVGAPPRIQKVAADLVAHFERRLAAMDGKAMVVCMSREICVHMYNAIVALRPEWHDPDPEKGVIKIVMTGSAADKPLL
RPHIYSKEVRKRLERRFKDPNDPFKIVIVRDMWLTGFDAPCLHTMYIDKPMRGHNLMQAIARVNRVFRDKPGGLVVDYIG
ISHELKQALREYTAARGRGEPAIDAERALDILREKMDVLRAMLHGCDYSAFRTEAMALLPKVANHILGLEDGQKRFADHV
VAASRAFALCCTLEGALAYRDELAFFQAVKAALSKRAETDRKVADERKEAALRQIIAQAVVSDEVVDIFAAAGLSKPDIS
ILSEEFLDEVRRMKERNLAVELLQRLIKNEIKARFETNVVQSARFSDLLQQALTRYRNRTIETAQVIEELIAMARRFQEE
ARRGEQLGLNEDELAFYDALASNESAVRELGDEVLKKMAVELTERLRKSVTVDWARRETVRARLRVMVRTLLRRYKYPPD
RQEAATNLVLKQAEVLSQEWATA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 57
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 46
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 46
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 46