Gene Information

Name : Mhar_1856 (Mhar_1856)
Accession : YP_005920836.1
Strain : Methanosaeta harundinacea 6Ac
Genome accession: NC_017527
Putative virulence/resistance : Virulence
Product : Type I site-specific deoxyribonuclease chain R
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2007705 - 2010650 bp
Length : 2946 bp
Strand : -
Note : -
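
The Position and Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption that is consistent with the stated length); the helper name `feature_length` is hypothetical and not part of any database tooling.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(2007705, 2010650)
codons = length_bp // 3          # includes the terminal stop codon
residues = codons - 1            # amino acids, excluding the stop codon

print(length_bp)       # 2946
print(length_bp % 3)   # 0
print(residues)        # 981
```

The result agrees with the Length field (2946 bp), and the residue count agrees with the 981 residues in the protein sequence listed in this record.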

DNA sequence :
ATGAGCCCCACCGATACCTCGGAGAAGAATCTGGAATCCCTAATCGTGCAGAGCCTGATAAGCGAGGCTGGATATGTGGA
AGGCGATTCCAAGGATTACGACCGGGATCATGCGGTGGATCTCATCAAGCTCCTGGAGTTTCTCCTCTCCACCCAGCCTC
AAGCGGTGAACCAGCTGGGCCTCGATGGAGAGGGGCTCAAGCTCCAGCAATTCTTAGCTCGCCTCCAGGGAGAGATAGCC
AAACGTGGTGTCATAGACGTCCTCCGCAACGGCATAAACCACGGTCCTGCCCATATAGACGTATTCTACGGCACGCCCTC
TCCAGGAAACCTGAAAGCGGCCGAACTCTTTGCCGCCAACGTCTTCAGCGTTACGCGCCAACTCCACTACAGCAAGGACG
AGACCCAGCTCGCCTTCGACTTGTGCCTCTTCATCAACGGCCTGCCCATAGCCACCTTTGAGCTGAAAAATAGCCTGACC
AAGCAGACCGTCGAAGACGCGGTCCAGCAGTACAAACGGGACCGCAACCCTCGGGAGCTGCTCTTCCAATTCGGACGCTG
CGCGGTTCATTTCGCAGTTGATGACCATGAGGTGCGCATGTGCACCCATCTCCAGGGCAAAAGCTCCTGGTTCCTGCCTT
TCAACAAAGGCTACGATGACGGTGCAGGCAACCCTCCAAATCCAAACGGCATCAAGACCGACTACCTCTGGCGCGAGATC
CTCACTCGCGAGGGCTTGACCGATATCCTTGAGACCTATGCTCAAGTTGTTCAAAAGAAGAACGAGAAGACAGGCAGAAA
GAAGCTCGAACAGATCTTCCCCCGGTACCATCAGCTCGACGTGGTTCGCAAGCTGCTTGCCAGTGCTCAACATCGGGGTG
CTGGAAAGCGCTATCTTATCCAGCATTCTGCAGGAAGCGGCAAGAGCAATTCCATAGCCTGGCTTGCGCATCAGCTCATC
GGACTGGAGCGAGATGGCAAGACCATCTTCGACTCGATCATCGTGGTCACCGACAGGCGCGTCCTCGACAAGCAGATCAA
AGACACCATCAAGCAGTTTGCTCAGGTCTCTGCCACCGTGGGGCATGCAGAGCGCTCGGCAGACTTGCGGCAATACCTAG
CTGGTGGCAAGAAGATCATCATCACCACAGTTCAAAAGTTCCCTGTAGTTCTGAAGGCGATTGGGGACGAACACAGAGGA
CATTCATTCGCCATAATCATCGACGAAGCCCACTCCAGCCAGGGAGGCCGAACCTCTGCCAAGATGAGCATGGTCTTATC
AAAAGAGGGCGGCGAAAAGGAAGCCGAGACGCCAGAGGATAGGATCAACCGCATTATGGAATCTCGAAAGATGCTTTCCA
ATGCCAGTTATTTCGCTTTCACCGCGACTCCCAAAAACAAGACCCTGGAGATTTTCGGCGAGGCCTACATGGAAGGCGAT
GATGTCAAGCACCGTCCTTTCCACAGCTACACCATGAAACAGGCCATACAGGAGGGTTTCATCCTGGACGTGCTCAAGAA
CTACACTCCGGTTGATAGCTATTACCGCCTGGTCAAGAAAGTTGAAGACGACCCGGAGTTCGATACGAAGAAAGCCCAGA
AGAAGCTGCGCCGCTACGTGGAGCACAATGAGCACGCCATAAAGATGAAGGCCGAGATCATGGTGGACCATTTCCACGAT
CAAGTCTTGGCCAAGAAAAAGATCGGCGGTCAGGCAAGGGCTATGGTTATCACAAGCGGTATCGCTCTGGCCATCGAATA
CTACCACGCCATCAGCAACTACCTGATAGAGCGCAAGAGTCCCTGGCAGGCGATAGTAGCCTTTTCAGGCGAGCATGAAT
ACGGCGGACAGAATGTAACCGAAGCTTCACTCAACGGTTTTGCCAGCAACAAGATTCCGGAAACTTTTCAGGATGAACCG
TACAGATTCCTGGTAGTCGCCGAAAAATTCCAGACAGGCTATGACGAGCCCCTTCTGCACACCATGTACGTGGATAAGCC
TCTATCAAGCATAAAGGCGGTACAGACCCTCTCCCGCCTCAACAGGGCTCATCCACAAAAGCATGATACGTTCGTGCTCG
ATTTCTTCAATGATGCGGATACGATCATGAAAGCGTTCGAGCCGTACTACCGGACCACGATATTGAGCGAAGAAACTGAC
CCCAATAAGCTGCACGATTTGAAGGCGGATCTGGACGGCTACCAGGTATACAACCCGCAACAGGTCGAGCATTTTGTCAA
TTTATATCTTGATGGTGTGGATAGAGAAAAGCTAGATCCGATTCTCGACACATGCGTCGCCACATATAACGAACAGCTGG
ATGAGGATGGCCAGATAGACTTCAAGAGCAAGGCCAAGGGATTCATTCGAACCTACGGATTTTTAGCCTCCATTTTGCCA
TTTACGAATGCTGAATGGGAAAAGCTTTCGATCTTTCTGAACTTCCTCACCCCAAAGCTGCCTGCGCCCAAAGAGGAAGA
CCTTTCTAAAGGCATCATCGAAGCAATCGATATGGATAGCTACAGGGTTGAAGTGCGGTCATCGATTGATATTATCCTGG
AAGACAAGGATGGAGTGGTTGACCCAGTTCCCACCAGTGCTGGTGGGCATAAACCAGAGCCGGAACTCGATCTGCTAAGT
AATATCATCAAAGCATTTAATGACCAGTTCGGCAATATCGAGTGGAAAGACATCGATCGAATTCACAAGATAATTAGCGA
AGAGATTCCAGCAAAGGTTTCAGCTGACAAAGCATATCAGAACGCAATGAAGAACTCAGACAAACAGAATGCTCGTATTG
AACATGATAAGGCCCTACAAAGAGTTATAATCGAGTTATTGAATGATCAGACTGAACTTTTCAAGCTGTTCAGCGACAAT
CCAACATTCAGAAAGTGGCTATCTGATATGAATTTCTTGGTCACTTATAATGAGGCCACGGGTTAA

Protein sequence :
MSPTDTSEKNLESLIVQSLISEAGYVEGDSKDYDRDHAVDLIKLLEFLLSTQPQAVNQLGLDGEGLKLQQFLARLQGEIA
KRGVIDVLRNGINHGPAHIDVFYGTPSPGNLKAAELFAANVFSVTRQLHYSKDETQLAFDLCLFINGLPIATFELKNSLT
KQTVEDAVQQYKRDRNPRELLFQFGRCAVHFAVDDHEVRMCTHLQGKSSWFLPFNKGYDDGAGNPPNPNGIKTDYLWREI
LTREGLTDILETYAQVVQKKNEKTGRKKLEQIFPRYHQLDVVRKLLASAQHRGAGKRYLIQHSAGSGKSNSIAWLAHQLI
GLERDGKTIFDSIIVVTDRRVLDKQIKDTIKQFAQVSATVGHAERSADLRQYLAGGKKIIITTVQKFPVVLKAIGDEHRG
HSFAIIIDEAHSSQGGRTSAKMSMVLSKEGGEKEAETPEDRINRIMESRKMLSNASYFAFTATPKNKTLEIFGEAYMEGD
DVKHRPFHSYTMKQAIQEGFILDVLKNYTPVDSYYRLVKKVEDDPEFDTKKAQKKLRRYVEHNEHAIKMKAEIMVDHFHD
QVLAKKKIGGQARAMVITSGIALAIEYYHAISNYLIERKSPWQAIVAFSGEHEYGGQNVTEASLNGFASNKIPETFQDEP
YRFLVVAEKFQTGYDEPLLHTMYVDKPLSSIKAVQTLSRLNRAHPQKHDTFVLDFFNDADTIMKAFEPYYRTTILSEETD
PNKLHDLKADLDGYQVYNPQQVEHFVNLYLDGVDREKLDPILDTCVATYNEQLDEDGQIDFKSKAKGFIRTYGFLASILP
FTNAEWEKLSIFLNFLTPKLPAPKEEDLSKGIIEAIDMDSYRVEVRSSIDIILEDKDGVVDPVPTSAGGHKPEPELDLLS
NIIKAFNDQFGNIEWKDIDRIHKIISEEIPAKVSADKAYQNAMKNSDKQNARIEHDKALQRVIIELLNDQTELFKLFSDN
PTFRKWLSDMNFLVTYNEATG
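
The DNA and protein sequences above can be checked against each other by translating the coding sequence. The sketch below assumes the standard codon assignments and that the DNA is given 5' to 3' on the coding strand (it begins with ATG and ends with TAA, even though the gene lies on the minus strand of the genome). The function `translate` is a hypothetical helper written for illustration; only the first 30 and last 12 nucleotides of the record are used.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":            # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the DNA sequence in this record.
print(translate("ATGAGCCCCACCGATACCTCGGAGAAGAAT"))  # MSPTDTSEKN
print(translate("GCCACGGGTTAA"))                    # ATG
```

Both fragments match the corresponding ends of the listed protein sequence (it starts with MSPTDTSEKN and ends with ATG), which suggests the two sequences are in register.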

• Homologs from PAI DB

Gene          GenBank Accn    Product                         Virulence or Resistance  PAI or REI  Alignment Type  E-value  Identity (%)
VC1765        NP_231400.1     type I restriction enzyme HsdR  Not tested               VPI-2       Protein         2e-176   42
VC0395_A1363  YP_001217306.1  type I restriction enzyme HsdR  Not tested               VPI-2       Protein         2e-176   42
VPI2_0013c    ACA01830.1      type I restriction enzyme HsdR  Not tested               VPI-2       Protein         6e-176   42

• Homologs from VFDB (virulence genes)

Gene       GenBank Accn    Product                                         ID of source DB  Alignment Type  E-value  Identity (%)
Mhar_1856  YP_005920836.1  Type I site-specific deoxyribonuclease chain R  VFG1098          Protein         1e-176   42