Gene Information

Name : Cylst_5220 (Cylst_5220)
Accession : YP_007149934.1
Strain : Cylindrospermum stagnale PCC 7417
Genome accession : NC_019757
Putative virulence/resistance : Unknown
Product : type I site-specific deoxyribonuclease, HsdR family
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 5908449 - 5911559 bp
Length : 3111 bp
Strand : -
Note : PFAM: Type I restriction enzyme R protein N terminus (HSDR_N); Domain of unknown function (DUF3387); Type III restriction enzyme, res subunit; TIGRFAM: type I site-specific deoxyribonuclease, HsdR family
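The Position, Length, and Strand fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (which is consistent with the stated length of 3111 bp) and that the coding sequence ends in a single stop codon. The helper names `feature_length`, `expected_protein_length`, and `reverse_complement` are illustrative and not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string (A/C/G/T only, case preserved)."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]


print(feature_length(5908449, 5911559))   # 3111
print(expected_protein_length(3111))      # 1036
print(reverse_complement("CTAAGCTAC"))    # GTAGCTTAG
```

Because the strand is "-", a slice of the genome's forward strand at these coordinates would presumably need `reverse_complement` applied before it matches the coding-orientation sequence listed in this record, which begins with ATG and ends with TAG.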

DNA sequence :
ATGAGCGCTACATTCACCGAATCCACAGTTGAACAAGCCACCCTAGACTGGCTAGAAGAACTCGGATACACCCCCCTTAA
TGGTGCTGAAATCGCCCCCGACATACCCCACAGCGAACGCCAAGAATATAACGATGTCGTCCTTATTAACCGCTTGCAAA
CCGCCTTAGAAATTATTAACCCCCAGATTCCCTTTGACGCAATCCAAGACACCATCAAAAAAATTACCCGCACCGAAACC
CCCGTTTTAGAAGAAAATAACCGCCGCTTTCATCAATACCTCACAGAAGGGGTAGATGTTGAATACCAACAACACGGACA
AACTCAATATAAAAAACTTTGGTTAATTGATTTTGATGAAATTGATAACAACGATTGGTTAGCAGTTAATCAATTTACCG
TCATTGAAAATAAAAATAATCGCCGCCCTGATGTCATCATCTTTATTAATGGTTTACCCATCGCTGTAATTGAACTCAAA
AACGCTGTTAAAGAATACGCCACCATCAAAGGTGCATTTAATCAACTGCAAACTTATAAAAAAGATATCCCCTGTTTATT
TTCCTATAACGAAATCCTCGTAATATCAGACGTTGTTAATGCCAGAGTGGGAACCTTAACCGCAGATTGGGAACGGTTTA
TGCCTTGGCGTTCTGTAGATGGTGAAAATATCTTTCCTAAAGGACAATCAGAACTAGAAACAATTATCAAAGGCATCTTT
AATAAAACTACTATATTAGATATTCTCCAATACTTTATCGTTTTTGAAGTAGACCAAGACACGATTATTAAAAAAATCGC
CGGCTATCACCAATACCACGCTGTTAATAAAGCCATTACTGCTACCATCAAAGCCACCCGTCTTAATGGTGATCAAAAAG
TTGGCGTAGTTTGGCATACTCAAGGCAGCGGTAAAAGCCTAACTATGGCCTTCTATGCGGGGAAACTCATTCAAGAACCA
GACATGAGAAACCCCACTCTAGTTATCCTCACCGATAGAAACGACCTTGACGACCAATTATTTAATACCTTCTCATCCTG
CGCTGATTTATTACGCCAAACCCCAGTTCAAGCAGAAAATAGGGAAAGTTTAACAGCATTACTCCAAGTTGCAGCGGGGG
GAATTGTTTTTACCACCATCCAAAAATTTGCCCCAGAGACGGGTAACGAATACCCGGAACTTTCCCCCAGACGCAATATT
GTTTTAATAGCCGATGAAGCACACAGAAGCCAATACGGTTTAAAAGCGAAAGTCGTTCAAAAAGATGATGCGGCTTATAT
TAAATACGGTTACGCTAAATATTTAAGAGACGCCATCCCTAACGCTTCCTTTGTGGGGTTCACAGGTACACCCATTAGCC
AAACTGATAAAAATACTGCTGCCCTGTTTGGTAATTATATCGACATTTACGACATTCAACGCGCTGTGGAAGATGAAGCC
ACGGTCAAAATTTATTATGAGGGACGACTGGCTAAACTAGATTTGGAACCTTCAGAACGTCCGAAAATTGACCCGGAATT
TGAAGAAATCACCGAGGACGAAGAACTCACCAGCAAGGAAAAGCTCAAAAGCAGATGGGCAAGATTAGAAGCTTTGGTGG
GTGCAGAAAAACGCATTGCTCAAATTGCTAAAGATATTGTCGCACATTTCGAGAATCGCACCGCAGCACCGGAACTCAGA
GATGGTAAGGGGATGATTGTTTGCATGAGTCGCAGGATTTGTGCAGACTTGTACAACGCCATCACCCAACTTAAACCAGA
ATGGCACAGTGAAGACGATAGCCAAGGTTTTCTCAAAGTGGTGATGACTGGTTCCGCTGCTGATGAAGAGAAAATGCAAC
CCCACATCCGCAATAAAAAACGCCGCAAAGATTTAGCTAAACGCTTCAAAAAAGCAGATGATCCTTTTAAATTAGTCATA
GTCCGGGATATGTGGTTAACCGGCTTTGATGCCCCAAGTCTGCATAGTCTGTATGTAGATAAACCGATGCAGGGACACAA
TTTGATGCAAGCGATCGCTAGGGTAAATCGCGTCTTTAAAGACAAGCCAGGGGGGTTGGTGGTTGATTACCTGGGTATCG
CTGAACAACTCAAGGAGGCACTGAAAGACTACACCGAAAGCGATAGAGGGGAAACAGGTATTCCTACAGAACTTGCTTTA
GCTGTGTTGCAAGAAAAATATGAAGTGGTGCAAGGGATGTATCACGGCTTCAACTACCAGAAGTTCTTTACAGGTAAACC
CACAGAACGAGTATCAATGATTCCCGCTGCCCTGAATCATATCTTGGGACTAGAGGACGGTAAACAACGCTATATCAAGG
CTGTGACTGAATTATCCCAGGCTTTTGCCCTAGTTAGCAGCACTGATGAGGCGATCGCTATTCGGGATGAAGTGGGATTT
TTTCAAGCTATCAAAGCAGCGATGGTGAAGCATACCACAATTAACGGCAAAAGTCCAGAAGATGTTGATGCAGCAGTTCG
CCAGATTGTCTCGAAAGCGATCGCTAGTGACCAAGTAATAGACATCTTTGCATCTGCTGGACTGAACAAACCAAATATCG
CTGTCCTGTCTGATGAATTTCTCGAAGAAGTGCGGGGTTTACCCTACCGGAATGTGGCCCTAGAAGCCTTACAAAAATTA
ATCAATGACCAAATCAAAGTTAGTTCCCGCAAAAATTTGATTCAATCTCGCTCTTTTCGAGAAATGTTAGAAAACACCAT
CAAAAGATACCAAAATCGTGCTATAGAAACCGCCCAAGTCATCAACGAATTCATTGAACTAGTAAAAGCCATGAGGGAGG
CACAGAAACGGGGTGAAAACTTAGGACTAACTGAAGACGAAACTGCTTTTTATGATGCCCTGGAGGTGAATGACAGCGCG
GTTATCAGCTTGGGAGACAACACCCTCAAAGCGATCGCTCGTGATTTGGTAAAAGCCATCCGCTCTAATTTAACCATTGA
TTGGACTGTGAAGGAAAATGTCCGGGCTAAGTTACGTGTGACAGTTAAACGATTACTCAAAAAATATGGCTATCCACCCG
ATAAACAGGAGAAGGCAACGGCAACAGTGTTGGAACAAGCGGAGTTGTTATGCAAAGACTGGGTAGCTTAG

Protein sequence :
MSATFTESTVEQATLDWLEELGYTPLNGAEIAPDIPHSERQEYNDVVLINRLQTALEIINPQIPFDAIQDTIKKITRTET
PVLEENNRRFHQYLTEGVDVEYQQHGQTQYKKLWLIDFDEIDNNDWLAVNQFTVIENKNNRRPDVIIFINGLPIAVIELK
NAVKEYATIKGAFNQLQTYKKDIPCLFSYNEILVISDVVNARVGTLTADWERFMPWRSVDGENIFPKGQSELETIIKGIF
NKTTILDILQYFIVFEVDQDTIIKKIAGYHQYHAVNKAITATIKATRLNGDQKVGVVWHTQGSGKSLTMAFYAGKLIQEP
DMRNPTLVILTDRNDLDDQLFNTFSSCADLLRQTPVQAENRESLTALLQVAAGGIVFTTIQKFAPETGNEYPELSPRRNI
VLIADEAHRSQYGLKAKVVQKDDAAYIKYGYAKYLRDAIPNASFVGFTGTPISQTDKNTAALFGNYIDIYDIQRAVEDEA
TVKIYYEGRLAKLDLEPSERPKIDPEFEEITEDEELTSKEKLKSRWARLEALVGAEKRIAQIAKDIVAHFENRTAAPELR
DGKGMIVCMSRRICADLYNAITQLKPEWHSEDDSQGFLKVVMTGSAADEEKMQPHIRNKKRRKDLAKRFKKADDPFKLVI
VRDMWLTGFDAPSLHSLYVDKPMQGHNLMQAIARVNRVFKDKPGGLVVDYLGIAEQLKEALKDYTESDRGETGIPTELAL
AVLQEKYEVVQGMYHGFNYQKFFTGKPTERVSMIPAALNHILGLEDGKQRYIKAVTELSQAFALVSSTDEAIAIRDEVGF
FQAIKAAMVKHTTINGKSPEDVDAAVRQIVSKAIASDQVIDIFASAGLNKPNIAVLSDEFLEEVRGLPYRNVALEALQKL
INDQIKVSSRKNLIQSRSFREMLENTIKRYQNRAIETAQVINEFIELVKAMREAQKRGENLGLTEDETAFYDALEVNDSA
VISLGDNTLKAIARDLVKAIRSNLTIDWTVKENVRAKLRVTVKRLLKKYGYPPDKQEKATATVLEQAELLCKDWVA
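The relationship between the DNA sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch, not the method the database used: it assumes the standard codon assignments (which the bacterial code, translation table 11, shares apart from alternative start codons; the gene here starts with ATG, so that difference does not matter). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG order (standard codon assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop at the first stop codon
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the DNA sequence in this record:
print(translate("ATGAGCGCTACATTCACCGAATCCACAGTT"))  # MSATFTESTV
print(translate("GACTGGGTAGCTTAG"))                 # DWVA
```

The two fragments translate to the first ten and last four residues of the listed protein, so the two sequences agree at both ends.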

Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
api52 | CAF28526.1 | hsdr-like Type I restriction enzyme | Not tested | YAPI | Protein | 0.0 | 52
hsdR | YP_251977.1 | type I restriction-modification system restriction subunit | Not tested | SCCmec | Protein | 0.0 | 51
hsdR | BAH57699.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-VII SCCmec | Protein | 0.0 | 51
hsdR | BAD24840.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-V SCCmec | Protein | 0.0 | 50