Gene Information

Name : NIES39_A06620 (NIES39_A06620)
Accession : YP_005067038.1
Strain : Arthrospira platensis NIES-39
Genome accession : NC_016640
Putative virulence/resistance : Unknown
Product : type I restriction-modification system R subunit
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 632281 - 635319 bp
Length : 3039 bp
Strand : -
Note : -
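
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank-style convention, not stated in this record) and that the coding sequence ends in a single stop codon; the function names are hypothetical and only illustrate the check.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the Position field of this record.
length_bp = feature_length(632281, 635319)
print(length_bp)                           # 3039
print(expected_protein_length(length_bp))  # 1012
```

Under these assumptions the computed length agrees with the Length field (3039 bp). Because the gene is on the minus strand, the listed DNA sequence would be the reverse complement of the genome coordinates, which does not affect the length.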

DNA sequence :
ATGGACTCTATCAGTGTTGTTTTCCCACAAAGCGAAAAAGCCTTTGAGCAAGTGATAGAAGACTCTCTTATAAAACAGGG
CTATGTTAAACCCCTAACCCCATTTAACAAAAATCTAGCCATTTTCCCAGATATTGCCCTTAACTTTATCCGTCAAACCC
AACCCAAAGCATGGGCAAAATTAGAAGCCTTACACGGAGAAAATACGGGTAATCAAATCATTACCGACCTGTGTAAATGG
ATGGATACATACGGCTCACTCCACACCCTGCGTCACGGTTTTAAATGTTATGGTCGCACCCTGCGGATTGCCTATTTCAA
AGCTGCCCACAGCCTCAATTCTGACTTGGAAACCTGCTACAAAGCCAATCAACTTGGGATTACCCGCCAACTTCAATATA
GCGATCGCCATCGAAATGAACTCGATATTACTCTCAGTCTCAATGGTATCCCCATTGTCACGATAGAACTGAAAAACCCC
CTCACCGGACAAACCGTAGAACATGGGAAACAACAATATCGTCAAGATCGCGATCCTCACGAAATTATTTTTGAGTTTAA
ACGGCGAACTTTAGTTCATTTTACTGTTGATACCGAAGAAGTCTGGATGACCACCAGACTCGCCGGAAAAGCCACCCATT
TTTTACCCTTTAATCAAGGTGACAATAACGGTGCCGGAAACCCTCGCGACCCCCAAGGACGAACCTATCGCACCGCCTAT
TTTTGGGAAAACATTTTACAACCGGATAGTTTATTAGACCTTCTGGCTCAGTTTCTTCATGTACAAATCGAAGAAAAATA
TGACGATCGCGGGCGAAAATATCAGAAAGAAACCATGATTTTTCCTCGTTTCCATCAATTGCAAGCGGTCCGAAAATTGA
TTGCAGCCACCCAAGAGGAGGGAGTCGGATCTAATTACTTAATAGAACATTCGGCAGGAAGTGGCAAAAGTAATACAATA
GCTTGGTTAACCCATCGTTTGGTTTCTCTGCATAACACAGAAAACCAGCGAATTTTTGAAACTGTCATTGTGATTAGCGA
TCGACGTATTCTTGACCGACAACTACAAGATACTATCTATCAATTTGAACACCGCCAAGGCGTTATTCAAAAAATAGATA
AAGATTCTAAGCAGTTGGCAGAAGCCCTAGAAAGTGCTGTTCCTCTTATTATTACAACCCTCCAAAAATTTCCCTTTGTA
ACCCGACAGTTGCTTAAACTTGCAGAGGAACGAAACGAACAGGGAAGCGGAAGACTAACCACTCGTCGCTGTGCGGTGAT
TATTGATGAAGCCCACAGTTCACAATCTGGAGATACGGCGACAGAATTGAAGGGGGTGCTAGGGGGTGAAACATTACATC
AGAAAGCCCGTGAAATGGCACAAGAGGAAGGATTAGAACATTTAGAACAGATGTTCCTCAGCATGGCAAAACGCTCCCGA
CAAGACAACCTCAGCTTTTTTGCATTTACCGCTACACCCAAGCACAAAACATTGAAATTTTTTGGGCGTGAAGGTGAACC
TTTCCATCGTTACACCATGCGTCAAGCTATTGAAGAAGGCTTTATCATGGATGTCTTAAAAAACTACACCATCTATACCG
CTTACTTTAAACTACTCAAAGACTCTGGTGATGACCCCCATGTTCAACGGAAGCAAGCTGCTAAATCTCTAACTCATTTT
ATGCGCTTACATCCTCATAATATTGCCCAAAAGACTCAAATTATGGTAGAACATTTTCATCATTTTACGCGGCATAAAAT
TGGGGGTAAAGCAAAAGCAATGGTGGTCACAGGTTCGCGCCTGGAAGCCGTTCGTTACAAACAGAGCTTTGATAAGTACA
TTAAAGAAAAAGGCTATGACATCAAGAGTTTAGTTGCCTTCTCCGGAATTGTCAATGATGACAAAATACCGGAAAAAACT
TACACCGAAGAAGAAATGAATCAGGGAATTCGGGAAAAGGAATTAGCCGAAAAATTTGCTGGTGATGATTATCAAGTGTT
ATTGGTGGCGGAAAAATATCAAACGGGTTTCGATCAACCTTTACTCCATACCATGTATGTTGACAAGCGGTTGGCGGGTA
TTAAAGCAGTACAAACCTTGTCTCGCCTGAATCGCACCCACCCTCATAAAGAAGATACTTTTATCCTCGATTTTGTCAAT
AAACGTCAAGAAATTCAAGAGGCTTTTCAACAGTTCTATGAAGGGGCAGAGTTGGGACAGGAAGCAGAACCCGGACAGCT
TTACCACCTTAAAAGTCAACTCGACGCTTCTGGAATATATTTAAGTGAAGAAGTTAACAGATTTTGCACTATTTATTTTA
AGCCAAAACAGCGTCAGAATCCTAGCGATCATCAAGGGATGAATGCTGCTCTAGATCCGGCTGTAGATCGATTTCTGGCG
TTATACGAGCAGGATGTTGAAGCGGCAGAACTCTGGCGACGAAAATTAACAGCTTTTCGCAATTTATATAGTTTTTTAAG
CCAGATTATTCCTTATCAGGATTCTGATTTGGAGCAAGTTTATATTTTTCTCCGCCATTTGGCTACTAAATTATTGCGTC
CCTCAAATCAAGACCAATATGATTTTGATAGTGAGATTAAGCTTGAATATTATCGTCTTCAGAAAATCAGTGAAGGCTCA
ATTCACCTAAAAAAAGGAGAAACGATTCCTCAAGATGGCTCCCCAGCAATTGGTACGGCAATTTTGCGGGAAAATTCTGT
TAAGCTGTCACAGTTAATTGATGTTTTGAATAACCGCTTTGGTACTGACTTTAATCAAGCTGACCAACTATTCTTCGACC
AAATTGTTGAAGCGGCTGTCAACACGGAAGCACTGCAACAAGCAGCCCAAGTCAATTCGGTGAATAAATTTGGTTTACTG
TTTGAGAAAATTGTGGAGTCTCTTTTTGTCGAGCGTGTAGACCAAAATGAAACTATTTTCGCTCGTTATATGAATGACAA
TGATTTTAAAAATGTCGTTTCGGAATGGTTGCTTTCAGCAGTCTATAAACGCCTGTCGGATCCTCATAATTCTCGGTGA

Protein sequence :
MDSISVVFPQSEKAFEQVIEDSLIKQGYVKPLTPFNKNLAIFPDIALNFIRQTQPKAWAKLEALHGENTGNQIITDLCKW
MDTYGSLHTLRHGFKCYGRTLRIAYFKAAHSLNSDLETCYKANQLGITRQLQYSDRHRNELDITLSLNGIPIVTIELKNP
LTGQTVEHGKQQYRQDRDPHEIIFEFKRRTLVHFTVDTEEVWMTTRLAGKATHFLPFNQGDNNGAGNPRDPQGRTYRTAY
FWENILQPDSLLDLLAQFLHVQIEEKYDDRGRKYQKETMIFPRFHQLQAVRKLIAATQEEGVGSNYLIEHSAGSGKSNTI
AWLTHRLVSLHNTENQRIFETVIVISDRRILDRQLQDTIYQFEHRQGVIQKIDKDSKQLAEALESAVPLIITTLQKFPFV
TRQLLKLAEERNEQGSGRLTTRRCAVIIDEAHSSQSGDTATELKGVLGGETLHQKAREMAQEEGLEHLEQMFLSMAKRSR
QDNLSFFAFTATPKHKTLKFFGREGEPFHRYTMRQAIEEGFIMDVLKNYTIYTAYFKLLKDSGDDPHVQRKQAAKSLTHF
MRLHPHNIAQKTQIMVEHFHHFTRHKIGGKAKAMVVTGSRLEAVRYKQSFDKYIKEKGYDIKSLVAFSGIVNDDKIPEKT
YTEEEMNQGIREKELAEKFAGDDYQVLLVAEKYQTGFDQPLLHTMYVDKRLAGIKAVQTLSRLNRTHPHKEDTFILDFVN
KRQEIQEAFQQFYEGAELGQEAEPGQLYHLKSQLDASGIYLSEEVNRFCTIYFKPKQRQNPSDHQGMNAALDPAVDRFLA
LYEQDVEAAELWRRKLTAFRNLYSFLSQIIPYQDSDLEQVYIFLRHLATKLLRPSNQDQYDFDSEIKLEYYRLQKISEGS
IHLKKGETIPQDGSPAIGTAILRENSVKLSQLIDVLNNRFGTDFNQADQLFFDQIVEAAVNTEALQQAAQVNSVNKFGLL
FEKIVESLFVERVDQNETIFARYMNDNDFKNVVSEWLLSAVYKRLSDPHNSR

Homologs from PAI DB

Gene : SAS0025
GenBank accession : YP_042158.1
Product : type I restriction enzyme protein
Virulence or resistance : Not tested
PAI or REI : SCC476
Alignment type : Protein
E-value : 6e-177
Identity : 42