Gene Information

Name : HMPREF9137_1803 (HMPREF9137_1803)
Accession : YP_004329475.1
Strain : Prevotella denticola F0289
Genome accession: NC_015311
Putative virulence/resistance : Unknown
Product : type III restriction enzyme, res subunit
Function : -
COG functional category : S : Function unknown
COG ID : COG4951
EC number : -
Position : 2089015 - 2091903 bp
Length : 2889 bp
Strand : -
Note : identified by match to protein family HMM PF00270; match to protein family HMM PF04851

DNA sequence :
ATGATGAATAATGAACAATACCAACAGCTTCTCCTGCGTTACGAAGCATTAGAAAGAGAAAACGAACGGTTGAAAGTTCT
GCTCCGTAAACATGGGATAACGTATGTGCCAGATAGAATTAAAATCCCTGACACATCACCTTACTCACCCATTTCTTTCC
CTCCAGTCCGACTCTCTCTTGATGGGAAGATCCGGTTATTCCGTAGTCTTTTCAGAGGAAGGGAAGATGTTTATGCAAGA
CGATGGCAAAGTCGGACTACTGGCAAAGGTGGCTATCAGCCTGTTTGTACCAACGAATGGCGAACAGGTGTTTGTGACAA
GCGGCGATATAAATGTGCTGAATGTCCCAATCGTAGTTTTGAGACATTAAATGATAGAGCCGTTTACAGGCATTTAGAAG
GTAGGAATGAGGATTGTCTTGATGTGATCGGGCTATACACCATAATGCCAGATAACTCTTGCGCTTTCTTGTGTACAGAT
TTCGATGATAAAAACTGTGAGCATGGTTACAAAGGTGATGTAAAGGCTTTTATAGATGTCTGCAAAGAGTGGAGTATTCC
TTACGCCATTGAACGATCTCGTTCAGGAAACGGTGCACATGTGTGGATATTCTTTGAAGATGTTTTACCAGCTTATAAAG
CACGACAAGTAGGAAGTGCTATTCTTACAGAAGCTATGAATCGGAATGGACGAATGTCGTTCAAATCGTACGATCGTTTC
TTTCCCAATCAAGACTATCTTCCTGAAGGGGGATTGGGTAACCTCGTTGCACTTCCCTTGCAGGGACGTGCCCGAAGAAA
GTTGAATACTGTTTTCGTTGATGATGACTTCCTTGCTTATCAAGATCAGTGGGCTTTCCTTTCTCAAATTAAGAAGATGA
CAGCAGGGACAGTTGAGAAAGTCTTAGCAGAGCATAGACAAGCAGAGTTGGGACCTTTATCTGTATCTGACGAGCAGAAA
CCTTGGACACCACCAATTGTGCAACCTATTGGTAATAGCGACATCTATGAAAACGTAGAAATAGTTAAAGCCGATAAACT
GTATATTCCGATAAAGGCTGTGTCGGCAAAGGTGCTGAATCATCTGAAGCGTATTGCTGCTTTTAAGAACCCGGAGTATT
ATCGGAGACAGGCAATGCGCTTGTCCACTTATTCTGTGCCGCGTGTTATATCTTGTTGTGAGTTTACTGACGATTATTTA
CTGATGCCGCGTGGTTGCGAAGATGCCGTTACCTCCTTTTTGAATAAGAATAATATTGACTTCACACTGACGGACAAAAC
CCAACCTGGGCAGCCTGTTTCCGTTACATTCCAGGGTGAGCTGCGCACAGAGCAGCAACAGGCTGTTGATTGTATTTCTG
CATTCGATAATGGCGTTCTTTACGCTACAACAGCGTTCGGTAAGACTGTTGCGGCTGCAGCACTCATAGCGTATAAGAAA
GTCAACACGCTGATACTCGTCCATTCAAAAGCCTTGCTGGGTCAGTGGAAAGAACGGTTATCAGAGTTTCTCAATATTGA
TTATCAAAAGCCAGACCGTCCGAAAAGAAGAGGAAGGTATAAGGCATTTTCGCCTATTGGTTGCCTTGATTCTACAGGTA
ATGTCTTACATGGAATGATAGACGTCGCCTTGATGCAATCGTGTTTAGATGGAGATGAGGTGAAACCGTTTGTACGTGAT
TATGGAATGGTGATAGTTGATGAGTGTCACCATGTCTCTTCCGTTACTTTTGAGCGTGTGTTGAAAGGCGTGACGGCTCG
TTACGTGTATGGATTAACTGCTACACCGATTAGAAAAGACGGTCATCAGCCCATTATTTTTATGCAGTGCGGCCCTATTC
GGTTCTCTGCCGATGCTGCAGGCTTGATGGCACAGCAAGGTTTCCATCGTTACTTTATTCCTCGGTTTACATCTTTCCGG
CCACTTACAGAAGATAAACAGAATATCAGCTCGCTTTATCAGTCTTTGGTGGATGATGAGGTTCGTAACAATCTGATTGT
TGATGATGTGTGTAAGGCAGTAGATGCCGGGCGTATCCCAATGGTCCTCACCAGTAGGACGGCACATGTTGAAATCCTTT
CACAGATGTTGCATGACAGGGGCAAAGATGTCATTCTGTTGACGGGGTCCGGCACTGCCAAAGAGAAAAAGACCGCACTT
CATCAATTGAGGATTGCTCCGGTAAAAAACAGACTGGTAGTTATAGCTACGGGAAAATATGTCGGTGAAGGTTTTGATTA
CCCACGATTAGACACCTTGTTTCTTGCTCTGCCCATTTCATGGAAGGGACTTCTGGCACAGTATGCCGGTCGGCTTCATC
GTGAATACCCCGGCAAAATAGATGTGCGCATCTACGATTACATTGACCTTCATCACCCAGTTTATGACAGTATGTACAAA
CGACGCCTGAAAGGTTACGCCTCTATTGGCTACAAGGTGGCAGATGTATCTTCTCCAGCATTATTTGATTCTTTATCCGA
CCTTGACCTATCGAGCAGCGAAGGACAGATTTTCAATGGCAAGACATTCTTTGTTCCTTTCTGTAAAGACCTTCTTGCTG
CCAAACACTCCATTATCATCTCGTCTCCGAAGCTGTATCATGTTCAGCAGAACCAACTCACCGATATGTTTAAAAACTTA
CTTGTCAATGGCATTAACATCATTGTACTGTCTAAGCAGTCTGACGAACAGACCGATTATCTGAAGTCATTAGGTATCAC
TGTGAATACGAACAAAATCCTCTCTCACAGTTGTGCCATCATCGACAGAACCATCGTCTGGTACGGCGGCATTCACCTGC
TCGGTTTCTCAACAGAAGAAGATAATATTATTAAACTTTCAGATAATGCACGATTAGCCGCAGAACTGATAGGGGCGTTA
ATGGAATGA

Protein sequence :
MMNNEQYQQLLLRYEALERENERLKVLLRKHGITYVPDRIKIPDTSPYSPISFPPVRLSLDGKIRLFRSLFRGREDVYAR
RWQSRTTGKGGYQPVCTNEWRTGVCDKRRYKCAECPNRSFETLNDRAVYRHLEGRNEDCLDVIGLYTIMPDNSCAFLCTD
FDDKNCEHGYKGDVKAFIDVCKEWSIPYAIERSRSGNGAHVWIFFEDVLPAYKARQVGSAILTEAMNRNGRMSFKSYDRF
FPNQDYLPEGGLGNLVALPLQGRARRKLNTVFVDDDFLAYQDQWAFLSQIKKMTAGTVEKVLAEHRQAELGPLSVSDEQK
PWTPPIVQPIGNSDIYENVEIVKADKLYIPIKAVSAKVLNHLKRIAAFKNPEYYRRQAMRLSTYSVPRVISCCEFTDDYL
LMPRGCEDAVTSFLNKNNIDFTLTDKTQPGQPVSVTFQGELRTEQQQAVDCISAFDNGVLYATTAFGKTVAAAALIAYKK
VNTLILVHSKALLGQWKERLSEFLNIDYQKPDRPKRRGRYKAFSPIGCLDSTGNVLHGMIDVALMQSCLDGDEVKPFVRD
YGMVIVDECHHVSSVTFERVLKGVTARYVYGLTATPIRKDGHQPIIFMQCGPIRFSADAAGLMAQQGFHRYFIPRFTSFR
PLTEDKQNISSLYQSLVDDEVRNNLIVDDVCKAVDAGRIPMVLTSRTAHVEILSQMLHDRGKDVILLTGSGTAKEKKTAL
HQLRIAPVKNRLVVIATGKYVGEGFDYPRLDTLFLALPISWKGLLAQYAGRLHREYPGKIDVRIYDYIDLHHPVYDSMYK
RRLKGYASIGYKVADVSSPALFDSLSDLDLSSSEGQIFNGKTFFVPFCKDLLAAKHSIIISSPKLYHVQQNQLTDMFKNL
LVNGINIIVLSKQSDEQTDYLKSLGITVNTNKILSHSCAIIDRTIVWYGGIHLLGFSTEEDNIIKLSDNARLAAELIGAL
ME

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
Z1129 NP_286664.1 helicase Not tested TAI Protein 6e-147 41
Z1568 NP_287072.1 helicase Not tested TAI Protein 6e-147 41