Gene Information

Name : BN4_12249 (BN4_12249)
Accession : YP_007494635.1
Strain : Desulfovibrio piezophilus C1TLV30
Genome accession: NC_020409
Putative virulence/resistance : Unknown
Product : putative type I restriction enzyme HindVIIP R protein
Function : -
COG functional category : -
COG ID : -
EC number : 3.1.21.3
Position : 2390047 - 2393202 bp
Length : 3156 bp
Strand : +
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology

DNA sequence :
ATGACAACTAACGGTAGCTCCCCCATGATTAACGAAGACGCTCTCGAAAAACTCGCCATCAGTTGGTTTGAAGACGAAGG
CTACACCAACGTTCACGGCCCGGACTTGAACCCCGAGGTGGACGGGAGCGGGGCGCGGGCGCGGCTGGATGACGTGTTGT
TGCTTAACCCGTTGCGTGCGGCCATCGAGCGGATTAATCCCCAGCTTCCGGCAAGCACGGTTGACGAGGTGCTCCATCTG
GTTCAGAAGCTGTCCCACCCGATCACGGTCAAGGCAAACCAGGAATTTCATCGGCTACTCCGCGAGGGCGTGGATGTCAG
CTACAAGCGGGACGGGGAGAACGTCGAAGACCGCGCCTTCCTCATTGATTTTCACGACGTAAACGCCAACACGTTCTGGA
TTGTGGATCAGCTCACCATCCGGGGAAGCAAGGGCAATCGTCGTCCCGACCTGATCGTGTATCTCAACGGCCTGCCGCTG
GCCGTCATCGAGTTGAAGTCCCCTGTCAAAGAAGACGTTGGCGTTGACGAGGCCTTTCATCAGCTCCAGACCTACAAGCA
GGAGTTGGTGGACCTTGCCATGTTCAACGAGGCGCTGGTGGCCTCGGACGGCATCCAGGCCCGTGTGGGTTCCCTGACCG
CCAATCGTGAATGGTTCCTGCCATGGCGTGCCGTAAAGTCGGAAGAGGACCGCCCGTCCTTCGAGTATGAACTCAAGGGC
ATTGTGAAAGGCTTCTTCGATCGGACGTTGCTCCTGGAATACATCCGCGATTTCGTCCTGTTCGAGGCGGACGATTCCAG
CACGATCAAGAAGATCGCCAGCTACCACCAGTTCCACGGCGTACGCCAAGCCGTGGCCGCAGCCGTGAAAGCGGCTTCGG
AACATGCACCGAACGAGCTGAAAGGGCGCGGCGGCGTCATCTGGCACACGCAGGGTTCCGGCAAATCCATCTCCATGTGC
TGTCTCGCCGGAAAACTCATCCGCCATCCCGATTTGGCCAACCCGACCCTCGTTGTTATCACCGACCGAAACGACCTGGA
CGGGCAGCTCTACGAGACTTTCTGCAAGGCCGGTGATCTGCTGGCCGACAGTCCGATACAGGCGGACGATCGGGGCGAAC
TCCGCCAAATTCTCAACGAGAAGCAGTCCGGTGGCATCGTCTTTACCACCATCCAGAAATTTTCCCTCGACAAGGACGAG
ACCAAATTCCCGGTCTTGTCCGACCGCCGGAACATCATCGTCATCGCCGACGAGTGCCACCGCTCCCAGTACGGCTTCAA
GGGCAAGCTGGATGAAAAGCGGAATGCGTTCGTTGCCGGGTATGCCCAGCACATGCGCGACGCCTTGCCCAACGCCACGT
TCACCGGCTTCACCGGCACGCCCATATCCCAAGAGGACAAGAACACCCAGGCGGTGTTTGGCGAATACGTCAGTATCTAC
GACATCGAGCAGGCCCAGCTTGATGGCGCGACCGTCCCTATCTTCTACGAAAGCCGCTTGGCCAAGCTCGACCTCAATCA
GGATGAGTTGCCGCAAATCGATGAAAAAGTAGACGAGGTAACGGAGGATGAGGAGACCTCCCAGGCCGAAAAAACCAAGG
GCAAGTGGGCAGCCCTGGCCAAGCTGGTCGGGGCGGAACCCCGTATCCGGCAGGTAGCCGAAGATCTGGTCGACCACTTT
GAAGCCCGCCTGGAGGTCGTGGACGGCAAGGCCATGATCGTCTGCATGAGCCGTGATATCTGCGTTGCAATGTTCGATGC
GCTCGTGAAATTGCGTCCGCAATGGGCAGGAAATCAATTGGCTGACGGCACCTATGATCCATCGGACGGGGCAATCAGGA
TCATCATGACGGCCAGCGCGTCCGATCGGCGGGAGTTGCAGAACCACCACTACAGCAAGACCCAGAAGAAGGCGCTGGAG
AAACGGTTCAAAGACCTGGATGATCCGCTGAAGATCGTCATCGTCCGCGACATGTGGCTGACCGGCTTCGACGCACCGTG
CGCACACACCATGTACATCGACAAGCCCATGAAGGGGCACAACCTCATGCAGGCCATCGCTCGCGTAAACCGCGTGTTCA
AGGGCAAGCAAGGCGGTCTGGTTGTCGACTACATCGGCATCGCCAGCGAACTGAAGAACGCGCTGCATACCTACACCCGG
AGCGGCGGGCGTGGAGGCCAGCCGACTATCGATGTGTACGAGGCGTTCTATGTGCTGAAGGAGAAACTCCAGACGGCACG
CGATATCTTCCACAAGTTCGATTACTCAGCCTATAAGACCAAGGCTGTCGAGCTGCTGCCCCCTGCTGCGGACTTTGTTG
TCGCCACCGAGGAGCGCAAGAAAGAGTTTTTTGATGTCGTGGTCGCCATGACCCGTGCCCAGTCCCTTTGCGGCACCCTG
GACGAAGCCGTGGCCATACGGGATGAGATCACCTTCTTTCAGGCCGTGAAGATCTTCATCGACAAGACGACAGCCACCAA
GGGCAAACAAACGCGGCAAGAAAAAGACGCGATCCTGAATCAACTCCTCGCCCGCGCCGTGGTCCCTGAAGGCGTGGACG
ACATCTTCGCTCTGGCGGGCCTCGACAAGCCGGATATCTCCATTTTGTCCGAAGAATTCCTCGACGACGTCCGCAACATG
GAGCACCGCAACCTAGCCGTCGAATTGCTCGAAAAATTGCTCAGAGACGAAATTAGCGCCCGTTCACGCCGCAATACGAC
GCAGGAGCGCAAGTTCTCCGAGCGGCTCAAGGAATCCCTGCTCAAATACCGTAATCGCGCCATCGAAACCGCCCAGGTCA
TCGAAGAACTTATCCAGATGGCCAAGGACTTGAACGAAGCGCTCAAGCGCGGCGACAAGCTCGGCCTCAACCCCAGCGAA
CTGGCATTCTATGACGCCCTGGAAGAGAACGAGTCTGCCGTCCGTGAATTAGGTGATGACGTCCTCAAGAAGATAGCCAA
GGAACTGACCGAGAAGCTCAGGAAGAATGTGACGGTAGACTGGCAGCATAAGGATTCGGTCCGAGCAAAGATGCGGAACT
TGGTTCGCAGGATTTTGAAGAAGTACAAGTACCCGCCGGATGCCCAGAAAGAGGCCGTGGCCGAGGTGCTGCGGCAGGCC
GAGAGCTTGGCGGATGATTGGAGCGAGGCAGCGTAA

Protein sequence :
MTTNGSSPMINEDALEKLAISWFEDEGYTNVHGPDLNPEVDGSGARARLDDVLLLNPLRAAIERINPQLPASTVDEVLHL
VQKLSHPITVKANQEFHRLLREGVDVSYKRDGENVEDRAFLIDFHDVNANTFWIVDQLTIRGSKGNRRPDLIVYLNGLPL
AVIELKSPVKEDVGVDEAFHQLQTYKQELVDLAMFNEALVASDGIQARVGSLTANREWFLPWRAVKSEEDRPSFEYELKG
IVKGFFDRTLLLEYIRDFVLFEADDSSTIKKIASYHQFHGVRQAVAAAVKAASEHAPNELKGRGGVIWHTQGSGKSISMC
CLAGKLIRHPDLANPTLVVITDRNDLDGQLYETFCKAGDLLADSPIQADDRGELRQILNEKQSGGIVFTTIQKFSLDKDE
TKFPVLSDRRNIIVIADECHRSQYGFKGKLDEKRNAFVAGYAQHMRDALPNATFTGFTGTPISQEDKNTQAVFGEYVSIY
DIEQAQLDGATVPIFYESRLAKLDLNQDELPQIDEKVDEVTEDEETSQAEKTKGKWAALAKLVGAEPRIRQVAEDLVDHF
EARLEVVDGKAMIVCMSRDICVAMFDALVKLRPQWAGNQLADGTYDPSDGAIRIIMTASASDRRELQNHHYSKTQKKALE
KRFKDLDDPLKIVIVRDMWLTGFDAPCAHTMYIDKPMKGHNLMQAIARVNRVFKGKQGGLVVDYIGIASELKNALHTYTR
SGGRGGQPTIDVYEAFYVLKEKLQTARDIFHKFDYSAYKTKAVELLPPAADFVVATEERKKEFFDVVVAMTRAQSLCGTL
DEAVAIRDEITFFQAVKIFIDKTTATKGKQTRQEKDAILNQLLARAVVPEGVDDIFALAGLDKPDISILSEEFLDDVRNM
EHRNLAVELLEKLLRDEISARSRRNTTQERKFSERLKESLLKYRNRAIETAQVIEELIQMAKDLNEALKRGDKLGLNPSE
LAFYDALEENESAVRELGDDVLKKIAKELTEKLRKNVTVDWQHKDSVRAKMRNLVRRILKKYKYPPDAQKEAVAEVLRQA
ESLADDWSEAA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 58
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 45
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 45
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 45