Gene Information

Name : Turpa_0053 (Turpa_0053)
Accession : YP_006438222.1
Strain : Turneriella parva DSM 21527
Genome accession: NC_018020
Putative virulence/resistance : Unknown
Product : Restriction endonuclease, type I, EcoRI, R subunit/Type III
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 21720 - 24731 bp
Length : 3012 bp
Strand : -
Note : PFAM: Type I restriction enzyme R protein N terminus (HSDR_N); Type III restriction enzyme, res subunit; COGs: COG0610 Type I site-specific restriction-modification system R (restriction) subunit and related helicase; InterPro IPR014001:IPR007409:IPR00693
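
The Position and Length fields can be cross-checked with a short calculation. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GenBank convention, which this record does not state). The function name `feature_length` is illustrative and not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


length_bp = feature_length(21720, 24731)  # Position field of this record
codons = length_bp // 3                   # codon count, including the stop codon
residues = codons - 1                     # amino acids once the stop is dropped

print(length_bp, length_bp % 3, residues)  # prints: 3012 0 1003
```

The result agrees with the stated Length of 3012 bp, and the 1003 residues match the length of the protein sequence listed in this record.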

DNA sequence :
ATGAGTCCCCAAACCAACGAAGCCGCCCTCGAAACAGCGATTGAGCATTATCTGCTCAACGTGCACAAATACAAAAGGCT
CGAAGACGCCGACTTTAGTCAGGATATTGCCCTCTTCAAAGCCGAGATTATTGCTTTCGTCAAAGACACTCAGGCCGAAA
CCTATGCCACCATTGAACGCACCAACGACGATCGCACCGATAAGCTGATTATTGACGATCTGGTTAAGGCGCTGAATTCC
CTCGGTGCGCTCGAAGTGATGCGCCATGGCTTCAAATGCTTTGGCCGAACCATTCGCATCGCGTACTTTCAGCCCGCGCA
TGGCATGACACCCGAGCTCGAAGAGCTATTCCATAAGAACCGCTTTCGCGTGATTCGCCAGTTACATTATAGTAGCCAGA
ATCGCAACTCGCTCGATATGGTGATTGTCTTGAATGGCATACCGATTATCACCGTCGAACTCAAGAACCATTTCACAGGC
CAGAATGTATCGCACGCACGGCGGCAATACCAGAACGACCGCGATCCGCGTGAGATTATCTTCACCTTTAAGAAACGCAC
CTTAGTGCATTTCGCCGTTGACCCTGATCTCGTTTACATGACGACCAAGCTTAATGGCGCGAGCACGTATTTTCTGCCAT
TCAACAAAGGACAGAATGAGGGTGCTGGCAATCCAGTCGCACCAGCGGGCAAACACAGAACCCACTATTTATGGGAAGAT
GTTTTCAGTCCCGTCGTTCTGCTCGATATCATTGGGCGATTCTTGCACATAGAGAAGAAAGAGCGGCAGATTGAAGTGAC
ACGCGCAGGCAAGAGTGATTTGCAGAAGGTGACCAGCGAATCGCTGATCTTTCCGCGCTACCACCAGCTGGATGTCGTGC
GTAAACTACTGGCAAACGCTAAGGCCAAAGGCCCTGGCCGCAACTATTTGGTGCAGCATTCCGCCGGTAGTGGAAAGAGT
AACTCAATCGCCTGGCTCGCCCACCGGCTATCCAGCCTGCACAATGACAAGGACGAGAAGATATTCCATACGGTGATCGT
TATCACTGACCGGCTGGTGCTCGACAGGCAACTACAAGAGACCATCTATCAATTTGAGCACAAGCAGGGTGTTGTGGTCA
AGATTGACCGTGATTCTGCGCAATTAGCAGAAGCTATTCAAAACAGCACACCCATTATCGTCACGACGCTGCAGAAGTTT
CCGTTTGCGGCAAGCCATATCGAAAGGCTTGAATCGCGTAACTTTGCCATCATTGTCGATGAGGCACACAGCTCGCAGTC
GGGTGAGGCTGCCACCGAAGTGCGCCACCTGCTTTCGATGAACGAAATTGAGCAGACCGTCAGGCAGCGTGCAGAAGAGG
AGGATCTATCCGATATCGACGAGGCGATACTTAAGGCGGCAGCGGGCCGCAGTCGCCAGAAGAATCTCAGCTACTTTGCT
TTCACTGCCACGCCCAAGGAAAAGACCCTCTCGATCTTTGATGAACCCGGTGAGAATGGTAATTCCCCATTCCATTTGTA
CAGTATGCGGCAGGCGATTCAGGAACACTTTATCAAGGATGTTCTGGAGAATTACACCACATACAAAACCTATTACAAGC
TGGTGAACACGGCGGGGGAGGACCCGCTCGTCCCGAAATCCAAGGCTGCAAAAGCATTGGCGCGTTTTATGAGTCTGCAT
CCGCACAATATTGCTCAGAAGACCGAGGTGATGATAGAGCATTTTCGGCACCATACGATGCACAAGATCGGTGGTCGCGC
GAAGGCGATGGTGGTGACATCTTCGCGCCTGCATGCTGTCCGGTACAAAGAGGCATTTGATAAGTATATTCAGGACAACA
ACTACCAAGGGATCAAGACACTCGTCGCCTTCTCGGGAACAGTCATCGATAAAGACATTCCGGGCGTGAGCTACACCGAA
GTCGGCATGAATGGTGGCATCAAAGAGAAAGAGCTTCCGGAGAAGTTTGCCACAAATGAATACCAGGTCTTGCTCGTGGC
AGAGAAATACCAGACTGGGTTTGATCAACCGCTATTACATACCATGTACGTAGACAAGCGGCTTGCAGGCATTCAGGCGG
TGCAGACTCTCTCGCGCCTCAATAGAACCCATGCCGGTAAGGAAGATACCTTCGTACTCGACTTCTATAATGAAACTGAA
GACATCTTTGAATCGTTCAAGCCCTATTACAAGATAACCGAATCAGGTGGCCATGCGGATTACGCCAGGCTGGCCGAACT
CAAGTCAGAGATCGACACGGCGCAGATTCTGCATCATGATGAGATCGACGCTTTCTGTAACGTGTTCTTCGCTCCGCAGG
AAATCGAGTCGAAAAAGGATCACGGGCGCCTGGAATCTATACTGCAGTCAGCTGTAGACCGGTTCAAAGACCTGGCCACC
GAGGCCCGCGAAGACTATCGTGCCAAGCTGAAATCCTTCCAGCTACTCTATAGCTACCTTTCGCAGATGGTTCCCTTTCA
AGATGCAGAGTACGAGAAATACTATACAGTTATTCGGTACTACATCAAGAAGTTGCCATTACCGATCGGCGATCCGATAC
CCGAGGTCGATGATGACATAAACCTCAAGTACTATCGTCTGCAAAAGATCAGCGAAGGGCGGATTGATCTTGAATCGGGT
ACTGCCAACCCATTGAAAGGCCCTATGGATGTCGGTACTGGAAATCCCGACGAAGAAAAGATCAGGCTTTCTGAGTTAGT
CGATATGCTCAATGAACGCTTTGGCACCGACTTTACCCAGGCCGATCAGCTATTCTTTGATCAAGTCGCCGAAGAGGCCG
TTAATGACGAGGCCTTGCAAGCCGCAGGTCGCGTTAACACACTCGATAACTTCAAGCTCGTTTTCGATCAGGCACTCATG
GATTACTTCATCAAGCGCATGGACGGCAATGAGAAGATCTTTACGAAGCTGATGAACGACGAATCGTTCAGGGCAGTGGC
GTCCGGGCATTTGCTGAAAAAGGTTTATGGGCGGATAAGGGAAGATGGGTAG

Protein sequence :
MSPQTNEAALETAIEHYLLNVHKYKRLEDADFSQDIALFKAEIIAFVKDTQAETYATIERTNDDRTDKLIIDDLVKALNS
LGALEVMRHGFKCFGRTIRIAYFQPAHGMTPELEELFHKNRFRVIRQLHYSSQNRNSLDMVIVLNGIPIITVELKNHFTG
QNVSHARRQYQNDRDPREIIFTFKKRTLVHFAVDPDLVYMTTKLNGASTYFLPFNKGQNEGAGNPVAPAGKHRTHYLWED
VFSPVVLLDIIGRFLHIEKKERQIEVTRAGKSDLQKVTSESLIFPRYHQLDVVRKLLANAKAKGPGRNYLVQHSAGSGKS
NSIAWLAHRLSSLHNDKDEKIFHTVIVITDRLVLDRQLQETIYQFEHKQGVVVKIDRDSAQLAEAIQNSTPIIVTTLQKF
PFAASHIERLESRNFAIIVDEAHSSQSGEAATEVRHLLSMNEIEQTVRQRAEEEDLSDIDEAILKAAAGRSRQKNLSYFA
FTATPKEKTLSIFDEPGENGNSPFHLYSMRQAIQEHFIKDVLENYTTYKTYYKLVNTAGEDPLVPKSKAAKALARFMSLH
PHNIAQKTEVMIEHFRHHTMHKIGGRAKAMVVTSSRLHAVRYKEAFDKYIQDNNYQGIKTLVAFSGTVIDKDIPGVSYTE
VGMNGGIKEKELPEKFATNEYQVLLVAEKYQTGFDQPLLHTMYVDKRLAGIQAVQTLSRLNRTHAGKEDTFVLDFYNETE
DIFESFKPYYKITESGGHADYARLAELKSEIDTAQILHHDEIDAFCNVFFAPQEIESKKDHGRLESILQSAVDRFKDLAT
EAREDYRAKLKSFQLLYSYLSQMVPFQDAEYEKYYTVIRYYIKKLPLPIGDPIPEVDDDINLKYYRLQKISEGRIDLESG
TANPLKGPMDVGTGNPDEEKIRLSELVDMLNERFGTDFTQADQLFFDQVAEEAVNDEALQAAGRVNTLDNFKLVFDQALM
DYFIKRMDGNEKIFTKLMNDESFRAVASGHLLKKVYGRIREDG
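
The relationship between the DNA and protein sequences above can be illustrated with a small translation routine. This is a sketch under stated assumptions: it uses the standard codon assignments (which bacterial translation table 11 shares for sense and stop codons), and it treats the DNA as already written in coding orientation, since the sequence as printed begins with ATG and ends with TAG even though Strand is listed as "-". The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 nucleotides of the DNA sequence in this record.
print(translate("ATGAGTCCCCAAACCAACGAAGCCGCCCTC"))  # prints: MSPQTNEAAL
print(translate("GAAGATGGGTAG"))                    # prints: EDG
```

Both fragments reproduce the corresponding ends of the listed protein sequence (MSPQTNEAAL... and ...EDG). Passing the full 3012 bp sequence to `translate` would be the natural next check.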

Homologs from PAI DB

Gene     GenBank Accn  Product                            Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity
SAS0025  YP_042158.1   type I restriction enzyme protein  Not tested               SCC476      Protein         0.0    43