Gene Information

Name : alr4604 (alr4604)
Accession : NP_488644.1
Strain : Nostoc sp. PCC 7120
Genome accession: NC_003272
Putative virulence/resistance : Unknown
Product : type I site-specific deoxyribonuclease chain R
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 5501462 - 5504353 bp
Length : 2892 bp
Strand : +
Note : ORF_ID:alr4604
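
The Position and Length fields agree if the coordinates are read as 1-based and inclusive, which is assumed here rather than stated in the record. A minimal sketch of that arithmetic follows; the function name `feature_length` is illustrative and not part of any database tool.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(5501462, 5504353)

print(length_bp)           # 2892, matching the Length field
print(length_bp % 3 == 0)  # True: the ORF is a whole number of codons
print(length_bp // 3 - 1)  # 963 residues once the stop codon is excluded
```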

DNA sequence :
ATGAAATACCTCACCGAAGACGAAATCGAACAATACCAACTTCAACTTTTGCAAAACCTTGGTTATAGCTACAGCAACGG
CTACGACATCCAACCAGAAGGCATCAAGCAAGAGCGAGAAAGCTTCGGTGAAGTAATTTTAAAACATAGACTCCCACAGG
CAATATCACGAATTAACCCCACCATTCCCCATGATGCCCAATATCAGGCACAACGGGAAATATTTAATATTGCTAGTTCC
GACCTACTCAACAACAACGAAATATTCCATAAATACCTCACAGAAGGCATCACCGTTGAATATCAAAAAAACGGCGAAAC
CAGAGGCGAACCTGTTAAATTAATTGACTGGGAACACCCCGAAAATAACGAATTTCTCGCCGTCAACCAATTTACAGTAA
TTGAAGACAACCATAACCACCGCCCCGATATTGTCCTGTTCATTAACGGCTTACCCCTCGTCGTCATCGAACTAAAAAAC
GCCGCCAATAAAAAAGCCAATCTTAACGCCGCCTATAACCAACTCCAAACCTATAAACGCAGAATCCCTAGCCTATTTAC
CTACAATGCCCTATTAGTAATTTCAGATGGGCTATCCGCCCGTGCTGGTTCACTTACGGCTGGGTTTAACCGCTTCTCCA
CATGGAAAAACCCCACAGGCGAAAACCAGATAAACGAACTAGAAATTTTAACTAATGGGCTACTCAACAAGCAAACTTTA
CTAGATTTAATTCGTCACTTCACTGTATTTGAAAAGTCCAAAACAGAAGACCTGAAAACAGGCATAGTTAGCATTACCAC
CATTAAAAAAATCGCCGCCTATCATCAATATTACGCCGTTAATAAAGCCGTCGAATCTATTATTAATGCCTCATCCCAAG
AAGGCGCACGCAAAGGCGGTGTACTTTGGCATACCCAAGGAAGCGGTAAATCCCTTTCAATGGTATTTCTGGCAGGAAAA
CTGGTTTTAAACGAAAACCTCCAGAACCCCACCATTGTCATGTTGACAGATCGCAATGACTTAGATGATCAACTATTTGA
TACCTTCGCAGGTTGTCAGCAACTCCTCAGACAAGACCCCCAACAAGCAGGAGATAGAGAACAAGTCCGCCAATTACTTA
ACACCAACTCAGGCGGCATCATATTTACCACCGTCCAGAAATTTTCCCCTGCTGATGGCGAAACCCTCTATCCCCAAATC
AGCCCTCGCCCTAATATCATTGTCCTGGCTGATGAAGCCCACCGCAGCCAATACGGTTTCACAGCTAAACAAGTTAATGT
CCTTGATGCTGAAGGTAACGTGATTGGTAAACGCACCAAATACGGCTTTGCAAAATATATTAGACAAGCCTTACCCAATG
CTACCTTTGTCGGCTTCACAGGTACACCCGTAGAACAAACCGACAAAAACACCCCCGCCATTTTTGGTGAATATATCGAC
ATCTACGACATCTCCCAAGCCGTCAAAGATGGGGCAACCGTGCCGATTTATTACGAAAGCCGCTTAGTACAAGTTGATTT
AGACGCAGCAGGTAGACAACTCCTAGATGAACTAGACGAAGACCTGAGCTTTGAAGACCTCAGCACCACCCAAAAAGCCA
AAGCTAAACAAACCAAACTTGAGGCGATTGTTGGTTCTACTAAAAGAATAAGACAGATTGCCCAAGATATTGTCACCCAC
TTTGAGGCACGGCAACAGGTAAACAAAGGCAAAGCCATGATTGTCACCATGAGCCGCCAAATCGCCGTTAATCTCTACGA
TGCCATAATTCAACTCCGCCCTGATTGGCACAGTGAGGATCTCAACTTAGGCAAAATCAAAGTTGTAATTACTACCTCTG
CCGCCGATGAAGGTAATTTAGTTAAACATCACACCAGTAAAGCCCAACGTCAAAACCTCGCCCAACGCCTCAAAGACCCC
GAAAACTCCCTAGAATTAGCCATAGTCTGTGATATGTGGCTCACAGGCTTTGATGCCCCATGTCTGCACACTATGTATAT
TGATAAACCGCTTAAAAGTCATAATTTAATGCAAGCGATCGCCAGAATTAACCGCGTCTATTTTGAAAAAACAGGCGGTT
TGATTGTGGACTATCTAGGACTAGCCACCGAACTCAAAAAAGCCCTATCCTTTTATTCTCAAAGTGGCGGTAAAGGCGAC
CTCACCCTCAATCAAGAAGTGGCGGTGGGACTACTCCTAGCCAAATTGGAAATCGTCGAGCAAATCATATCAGGCTTTAC
CTATCAGCATTATTTTGAGGCTGACACTGGCGAAAAATTGAATATCCTCAAAAATGCCACTAACTACGTAGCTGCACCAA
ACATCAAAGATAGATTTCTCAGTGAAGTTATCGCCCTATCTAAAGCCCATTCTCTCGCCGTACCCCATCCCCAAGCGATC
GCCGCATCAGAAACCATCTCATTTTTCCAAGCCATCCAAGCCAGCCTCAGAAAACTAGAAGGAAGTGGTGATAGTGGTGG
ACTCAGCAACCAAGACATCGAAACCGCCATCCGTCAAGTCGTAGATCAAGCCTTGGTATCCGATGCCGTGATTAACATCT
TTGATGAAGCAGGTATTAAAAATCCTGACATCTCCATCATCTCCGATGAATTTATGGCAGAAGTGCGGGGCATGGAACAC
CAAAACCTCGCGGTGGAACTGCTGCAAAAACTACTCAAGGATGAAGTTAAAACCCGCAGTCGCACAAATATTGTCCAAAG
TCGTAAACTCTCAGAAATGTTGGAAGATGCCCTACGCCGCTATCGCAACCAAGTGATCAGCGTTACGGATATTTTAGAAG
AACTGCTGGAACTAGCAAAAGATACCAAAGCTGCCAACGCCAGGGGTGAAGAACTGGGACTAGAACCCTATGAACTGGCC
TTCCTTTTATGA

Protein sequence :
MKYLTEDEIEQYQLQLLQNLGYSYSNGYDIQPEGIKQERESFGEVILKHRLPQAISRINPTIPHDAQYQAQREIFNIASS
DLLNNNEIFHKYLTEGITVEYQKNGETRGEPVKLIDWEHPENNEFLAVNQFTVIEDNHNHRPDIVLFINGLPLVVIELKN
AANKKANLNAAYNQLQTYKRRIPSLFTYNALLVISDGLSARAGSLTAGFNRFSTWKNPTGENQINELEILTNGLLNKQTL
LDLIRHFTVFEKSKTEDLKTGIVSITTIKKIAAYHQYYAVNKAVESIINASSQEGARKGGVLWHTQGSGKSLSMVFLAGK
LVLNENLQNPTIVMLTDRNDLDDQLFDTFAGCQQLLRQDPQQAGDREQVRQLLNTNSGGIIFTTVQKFSPADGETLYPQI
SPRPNIIVLADEAHRSQYGFTAKQVNVLDAEGNVIGKRTKYGFAKYIRQALPNATFVGFTGTPVEQTDKNTPAIFGEYID
IYDISQAVKDGATVPIYYESRLVQVDLDAAGRQLLDELDEDLSFEDLSTTQKAKAKQTKLEAIVGSTKRIRQIAQDIVTH
FEARQQVNKGKAMIVTMSRQIAVNLYDAIIQLRPDWHSEDLNLGKIKVVITTSAADEGNLVKHHTSKAQRQNLAQRLKDP
ENSLELAIVCDMWLTGFDAPCLHTMYIDKPLKSHNLMQAIARINRVYFEKTGGLIVDYLGLATELKKALSFYSQSGGKGD
LTLNQEVAVGLLLAKLEIVEQIISGFTYQHYFEADTGEKLNILKNATNYVAAPNIKDRFLSEVIALSKAHSLAVPHPQAI
AASETISFFQAIQASLRKLEGSGDSGGLSNQDIETAIRQVVDQALVSDAVINIFDEAGIKNPDISIISDEFMAEVRGMEH
QNLAVELLQKLLKDEVKTRSRTNIVQSRKLSEMLEDALRRYRNQVISVTDILEELLELAKDTKAANARGEELGLEPYELA
FLL
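
The protein sequence corresponds to a codon-by-codon translation of the DNA sequence above. The sketch below shows that relationship on the first and last stretches of the ORF. It assumes the standard codon assignments (which bacterial translation table 11 shares), and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the ORF and its final 12 bases, copied from the record.
print(translate("ATGAAATACCTCACCGAAGACGAAATCGAA"))  # MKYLTEDEIE
print(translate("TTCCTTTTATGA"))                    # FLL
```

Passing the full 2892 bp sequence to `translate` in the same way should give the 963-residue protein listed in the record.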

Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity
hsdR | YP_251977.1 | type I restriction-modification system restriction subunit | Not tested | SCCmec | Protein | 0.0 | 47
hsdR | BAH57699.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-VII SCCmec | Protein | 0.0 | 47
hsdR | BAD24840.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-V SCCmec | Protein | 0.0 | 47
api52 | CAF28526.1 | hsdr-like Type I restriction enzyme | Not tested | YAPI | Protein | 0.0 | 45