Gene Information

Name : Turpa_2677 (Turpa_2677)
Accession : YP_006440822.1
Strain : Turneriella parva DSM 21527
Genome accession: NC_018020
Putative virulence/resistance : Unknown
Product : type I site-specific deoxyribonuclease, HsdR family
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2757168 - 2760299 bp
Length : 3132 bp
Strand : +
Note : PFAM: Domain of unknown function (DUF3387); Type I restriction enzyme R protein N terminus (HSDR_N); Type III restriction enzyme, res subunit; TIGRFAM: type I site-specific deoxyribonuclease, HsdR family; COGs: COG0610 Type I site-specific restriction-mod
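
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state but which agrees with the listed length. The function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
start, end = 2757168, 2760299

length_bp = feature_length(start, end)  # nucleotides in the gene
codons = length_bp // 3                 # codons, including the stop codon
residues = codons - 1                   # amino acids, excluding the stop codon

print(length_bp, codons, residues)      # 3132 1044 1043
```

Under that assumption the gene spans 3132 bp, matching the Length field, and would encode 1043 residues once the stop codon is excluded.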

DNA sequence :
ATGCCGACCGCAAAAGTCAACGAAGATCTGCTCGAACAAGCAGCGCTGCAATGGTTCAAGGAGCAGGGTTATACTCATAT
TCACGGTTCCACCATCGCCCCCGGCGAACCGGCAGCCGAGCGTGAATCATTTGAAGAAGTCGTTCTTTCAGGCAGAATAC
GCGATGCTCTTGCGCGAATCAATCCGACGCTTTCCACTGAAGTCATCGATGAAGCGCACAAGCAGCTCATGCGTATGGAT
GCCCCGACCTGCCTTATCAACAACCGAACCTTTCACCGCTACGTTACCGACGGCATCAGCGTGTCCTATTCCAAGAACGG
TGATGAGCGCGTAGAGCCGGTCAAGATCATAGACTTCGAGAAGCCAGAGAACAACGACTGGCTGGTGGTGAACCAGTTCG
CGATCATTGCCAAGCCGTACCATCGGCGGCCGGACATCATTGTTTTTCTAAATGGTTTGCCGTTGGTAGTGTTTGAGCTA
AAAAACATGGTTGGGGCACCAACAGTTGCGGATGCTTACAGACAATTGCAAACGTACAAGGCCCAGATTCCAAAGTTGTT
TAATTACAACGAACTAATGGTCGTTTCTGACGGCAATGCCACAACACTGGGTACCCTGACTGCTGATGAATCAAGGTTCA
TGCCATGGAAATATATTGAAAGCGAAAAACTGATTTCTGGTGGTAAGCTCAGTATACAGGTGGTAATCGAGGGCGTTTTT
GAAAAGCGCCGATTTCTTGATTTGATTCGTTACTTTGTGGTCTATGAAGACAATTATGGCGGTGAAGTCATCAAGAAGAT
TGCTGGCTACCATCAGTTTCATGCCGTCAACCGCGCCTTGGCTGAGACCGTGAGAGCAACGGGTGCCAACGGTGACCGTC
GAATTGGTGTGGTCTGGCATACCCAGGGCTCAGGTAAGAGCCTCACCATGGCATTTTATGCCGGGCGTGTCGTGCAATCC
GCTGCGATGGAAAATCCGACAGTGTTGGTCATTACAGATCGCAACGATCTCGATGACCAGCTGGCCGGTACCTTTTCGCG
CTGCCATGAAATCTTGCGGCAAAACCCGACACAGGCAGAGAACCGCGATGACCTGAAAGAAAAGCTCAAGGTAGCATCAG
GTGGCATTATCTTCACGACTGTTCAGAAGTTCTTTCCTGATGAAAAGGGCATGAAGCACCCTTTACTTTCAGACCGGCGC
AATATCGTGGTGATTGCCGACGAAGCGCACCGCAGCCAGTATAGCTTCGGTGCACGCGTGGTTGACGTCAAAGGTGCCAG
TGGCCAGGTTGAAGGCAAAGAGATCACTTATGGTTTCGCTCAGCATATGCGCGACGCTTTGCCCAACGCCAGTTTCATCG
GCTTTACCGGCACGCCGATTGAGTTATCAGACAAGAATACACGCGCAGTGTTCGGTGAATACATATCGGTCTACGATATT
CAACGGGCAGTCGAAGACGGCGCCACAGTGCGGATTTACTACGAAAGCCGTCTCGCCAAGATTGCACTCGATGAGAAAGA
ACGCCCGAAGCTCGACGCTGACTTTGAAGAGGTGACAGAGGGCGAAGAAGACCTCAAAAAAGAGAAGCTGAAATCGCGTT
GGGCTCAGCTCGAAGCACTGGTCGGTGCGCCTAAGCGGGTAAAGTTGATTGCTGAAGATATTGTTCGCCATTGGGAAGGC
AGAAAGGCAGCCATGAAAGGCAAGGCTATGATTGTCTGTATGAGCCGTCGCATCTGCGTCGATATGTACAATGCAATTGT
CAAGATTCGCCCAAACTGGCACAGCGAAAATGACGCCGAAGGTTCAATCAAGATTATCATGACGGGCTCGGCCAGTGATC
CGGCTGAGTATCAACCCCATGTGCGCAGTAAACAGCGCCGTGAAGACATGGCAAAGCGTTTCAAGAAATCAGCAGATTCT
CTTGAGATCGTCATTGTCAGGGACATGTGGCTGACGGGCTTCGATGCCCCGTGCGCCCACACCATGTACGTCGACAAACC
GATGAAGGGCCACGGCCTGATGCAGGCCATAGCCCGCGTTAACCGGGTGTTTGAAGATAAGAAAGGTGGCTTGGTTGTCG
ACTACCTTGGCCTTGCTGACAGCCTCAAGTCTGCCTTAGCCAATTACACCGACAGTGGCGGCGAAGGTAAGCCCACATTC
GACCAGGACGATGCCGTTGCCGTGATGCTAGAGAAATATGAAATCTGCTGCGGTCTATTCCACGGCTTCGACTGGACTAA
ATTCAAGAGTGGTTCGCCTGCAGAACGTTTGGGCATCATGCCTCTGGCTCAAGAACATATCCTATCCCAAAAAGATGGTG
CAGAGCGGCTGATTCAGCATGTCGCAGAGCTAACAAATGCCATGGCACTGGCGATGCCGCATGATGAAGCAAAGAAAATT
GTCGAGGATGTCGCATTCTTTCAAGCAATCAAAGCTGTTTTGACCAAGCCCACAACAAGACAGGCCAGGACCGAAGAGGA
TATGAACCGGGCAATTCAGCAGATTATCTCCCGCGCACTGGTCTCAGATGAGGTCATTGATGTATTCAGGGCGGCCGGGC
TTAAGAAGCCCGATGTCAGTATTCTCTCCGATGAGTTCCTCATGGAAATTCAGGGCATGAAGCATAAGAACCTGGCTATC
GAACTATTACGCCGTCTGCTCAATGACGAAATCAAAACCCGTGGTCAACGCAACGTCGTTGAATCCCGCTCGTTCTCTAA
GATGCTCGAAGATGCGATACAGAAATATAAGAACCGTGCAATCGAGACGGTACAAGTGATCGAAGAACTGATCAAGCTCG
CGAAAGAACTTCGTGACGCCCATGCCCGCGGAGAACAACTCGGCCTGACAGAAGACGAGATGGCTTTTTACGATGCTCTT
GAGGTAAGCGACTCGGCAGTGAAAATCATGGGCGACAAGGTTCTCAGCGAGATTGCGCGCGAGCTCGTTAAATCGATCAA
AGCAAACGTCTCTATCGATTGGACGGTTCGTGAGAACGTGCGGGCCAAGCTGCGAACCGTCGTGAAGCGCATTCTGCGCC
AGAGTGGCTATCCGCCAGACAAGCAAGAGAAGGCTGTCGAGACTGTACTTGAGCAGGTGGAGAGATTATCTGAGCAGTGG
GCGGTGGCATGA

Protein sequence :
MPTAKVNEDLLEQAALQWFKEQGYTHIHGSTIAPGEPAAERESFEEVVLSGRIRDALARINPTLSTEVIDEAHKQLMRMD
APTCLINNRTFHRYVTDGISVSYSKNGDERVEPVKIIDFEKPENNDWLVVNQFAIIAKPYHRRPDIIVFLNGLPLVVFEL
KNMVGAPTVADAYRQLQTYKAQIPKLFNYNELMVVSDGNATTLGTLTADESRFMPWKYIESEKLISGGKLSIQVVIEGVF
EKRRFLDLIRYFVVYEDNYGGEVIKKIAGYHQFHAVNRALAETVRATGANGDRRIGVVWHTQGSGKSLTMAFYAGRVVQS
AAMENPTVLVITDRNDLDDQLAGTFSRCHEILRQNPTQAENRDDLKEKLKVASGGIIFTTVQKFFPDEKGMKHPLLSDRR
NIVVIADEAHRSQYSFGARVVDVKGASGQVEGKEITYGFAQHMRDALPNASFIGFTGTPIELSDKNTRAVFGEYISVYDI
QRAVEDGATVRIYYESRLAKIALDEKERPKLDADFEEVTEGEEDLKKEKLKSRWAQLEALVGAPKRVKLIAEDIVRHWEG
RKAAMKGKAMIVCMSRRICVDMYNAIVKIRPNWHSENDAEGSIKIIMTGSASDPAEYQPHVRSKQRREDMAKRFKKSADS
LEIVIVRDMWLTGFDAPCAHTMYVDKPMKGHGLMQAIARVNRVFEDKKGGLVVDYLGLADSLKSALANYTDSGGEGKPTF
DQDDAVAVMLEKYEICCGLFHGFDWTKFKSGSPAERLGIMPLAQEHILSQKDGAERLIQHVAELTNAMALAMPHDEAKKI
VEDVAFFQAIKAVLTKPTTRQARTEEDMNRAIQQIISRALVSDEVIDVFRAAGLKKPDVSILSDEFLMEIQGMKHKNLAI
ELLRRLLNDEIKTRGQRNVVESRSFSKMLEDAIQKYKNRAIETVQVIEELIKLAKELRDAHARGEQLGLTEDEMAFYDAL
EVSDSAVKIMGDKVLSEIARELVKSIKANVSIDWTVRENVRAKLRTVVKRILRQSGYPPDKQEKAVETVLEQVERLSEQW
AVA
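
The protein sequence above appears to be the direct translation of the DNA sequence on the + strand. A minimal sketch of that relationship follows. It assumes the standard codon assignments (the record does not name a translation table), and the `translate` helper is hypothetical. Only the first 30 and last 12 nucleotides of the DNA sequence are used, to keep the example short.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
# Assumption: the record does not state which translation table was used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":                   # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGCCGACCGCAAAAGTCAACGAAGATCTG"  # start of the DNA sequence
last_12 = "GCGGTGGCATGA"                     # final line of the DNA sequence

print(translate(first_30))  # MPTAKVNEDL
print(translate(last_12))   # AVA
```

Both fragments agree with the start (MPTAKVNEDL) and end (AVA) of the listed protein sequence. The final TGA codon is a stop codon and is not represented in the protein.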

Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity
api52 | CAF28526.1 | hsdr-like Type I restriction enzyme | Not tested | YAPI | Protein | 0.0 | 49
hsdR | BAH57699.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-VII SCCmec | Protein | 0.0 | 48
hsdR | BAD24840.1 | type I restriction-modification system endonuclease homologue | Not tested | Type-V SCCmec | Protein | 0.0 | 48
hsdR | YP_251977.1 | type I restriction-modification system restriction subunit | Not tested | SCCmec | Protein | 0.0 | 48