Gene Information

Name : MCA0274 (MCA0274)
Accession : YP_112808.1
Strain : Methylococcus capsulatus Bath
Genome accession: NC_002977
Putative virulence/resistance : Virulence
Product : type I restriction-modification system, R subunit
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 258432 - 261440 bp
Length : 3009 bp
Strand : -
Note : identified by similarity to OMNI:NTL01XF02722; match to protein family HMM PF00270; match to protein family HMM PF04313; match to protein family HMM PF04851
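
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the listed length); the helper name gene_length is hypothetical and used only for illustration.

```python
# Minimal sketch: check the reported gene length against its coordinates.
# Assumption: Position uses 1-based, inclusive coordinates.

def gene_length(start: int, end: int) -> int:
    """Return the number of bases spanned by an inclusive coordinate pair."""
    return end - start + 1

start, end = 258432, 261440          # values from the Position field
length_bp = gene_length(start, end)  # should match the Length field
codons = length_bp // 3              # complete codons, stop codon included
protein_residues = codons - 1        # assumes the final codon is a stop

print(length_bp, codons, protein_residues)  # 3009 1003 1002
```

Under these assumptions, the 3009 bp coding sequence corresponds to 1003 codons, that is 1002 amino-acid residues plus one stop codon.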

DNA sequence :
ATGAGCACGACTGACACCAGCGAAAAGGGCCTCGAAGCCCTGATCGTGCGCGACCTGGTCGCCAGCGGCTACGTACAGGG
CCATGCCGCGGACTACAACCGCGATGTGGCGCTGGACGTGACCCAGTTGCTCGCCTTTCTTCGGGCGACGCAGCCGAAAG
TTGTCGAAACGCTGAACCTGGGCGCCGAAGGCATTCAGCGCACCCAGTTTTTGCACCGCCTGCAGGGCGAGATCACCAAG
CGCGGCGTGGTGGACTGTTTGCGCCGCGGTGTTAGCCACGGCCCGGTACACGTGGACCTTTACAAGCTGTTGCCCACCCC
GGGCAATGCTGCCGCTGCCGAGGCCTTCGGCAAGAACATTTTCAGCGTCACGCGGCAGGTGCGCTACAGCAACGACGAGT
CCCAGCGCGCGCTGGATATGGTGATTTTCATCAACGGCCTGCCGGTGCTCACCTTCGAGCTGAAGAACTCGCTCACCAAG
CAGACCGTCGCGGACGCCATCGTGCAGTACCAGACCGATCGCAACCCGGATGAGCTGCTGTTCCAACTCGGCCGCTGCGT
TGCCCACATGGCGGTGGACGACGCCGAGGTGCGCTTTTGCACTCACCTGACCGGCAAGACCTCCTGGTTCCTGCCTTTCA
ACCAAGGTTGGAACAGCGGCGCAGGCAACCCACCGAATCCTCATGGTCTGAAGACCGACTATCTATGGAAGCAGGTGCTG
GTGAAGGAGTCGCTGGCTAACATCATCGAGAACTACGCGCAGCTGGTAGAGGAGGAAGCGGAAGACGCCAATGGCCGCAA
GCGCAAGACGCGCAAACAGATCTTCCCCCGGTACCACCAACTTCGCACTGTTCGCGCCCTGTTGCGGCGCAGCCAAGCGG
ATGGTGTCGGCAAGCGTTACCTGATCCAGCATTCAGCCGGCAGCGGCAAGAGCAACACGATTGCCTGGCTGGCCCATCAG
CTGGTGGAACTGAAGACGGCGGCGGATGCGGGGCAGGCCCAGTTCGACTCCGTCATCGTCATCACCGACCGCCGTGCGCT
GGACACGCAGATTGCCCGCACGATCCGGTCCTACGACCATGTGGCCTCGATCTACGGTCATTCGGAAAGCGCCGAGGAAC
TGCGCACTTTCCTGCGCCGAGGCAAGAAGATCATCGTCACCACGGTGCAGAAGTTCCCGTTCATCCTGGACGAGCTGGGG
GATCTCGGCGACAAGAAGTTCGCGCTGTTGATCGACGAGGCCCACTCCAGCCAGGGCGGCAAGACGACGGCCAAGATGCA
TCTGGCCTTGTCCGGACAAGCCGCGGAAGGCGGCGAGGACGAAGAGGAAGAATCGGTCGAGGACAAGGTCAACGCCTTGA
TCGAATCCCGCAAGATGCTGGCCAACGCCAGCTACTACGCCTTCACGGCCACGCCCAAGACAAAGACCTTGGAGCTCTTC
GGCGAGCGTCAGGTTGTCGGCGACACGGTGCAGTTCCGCTCGCCCGAGGAGCTGACGTATACCACCAAGCAGGCGATTCA
GGAAGGCTTCATCCTCGACGTGATCGCCAACTACACCCCGGTGTCGAGCTTCTATCACATTGCCAAGACCGTCGAGCACG
ATCCGGAGGTGGACAAAGCCAAGGCGCTGAAGAAGATTCGGCGCTACGTGGAATCCCACGACAAGGCGATCCGTCGCAAG
GCGGAGATCATGGTCGATCACTTTATTGAACAGGTGATCGGCGCCAAGAAGATCGGCGGCAAGGCGCGCGCGATGATCGT
CTGCAACGGAATCGCACGGGCCATCGACTACTTCCGTGAGGTTTCAGACTACCTTCGCGAGATCAAGAGCCCATACAAGG
CCATCGTGGCGTACTCCGGCGATTTCGAGGTCGGCGGGGTGAAGAAGACCGAGGCCGATCTCAACGGATTCCCGAGCAAG
GACATTCCGGCCAAGCTCAGGCAAGACCCGTATCGCTTCCTGATCGTCGCGAACAAGTTCGTCACCGGCTTCGACGAGCC
GCTGCTGCACACCATGTATGTGGACAAACCCCTGGCGGGCGTCCTGGCCGTGCAGACGCTCTCGCGACTGAACCGCGCGC
ATCCGCAGAAGGCCGACACGTTCGTGCTCGACTTCGCCGACAACGCGGAGGCCGTCAAGGCGGCCTTCCAGGAGTACTAC
CGCGCCACGATCCAGGAGGGGGAGACCGATCCGAACAAGCTGCACGACCTGAAGAGCGATCTGGACGCCCAGCAGGTCTA
CAGCTGGCAACAGGTCGAAGACCTCGTCGCGCAGTACCTTGGCGGAGCGGAGCGGGATCAGCTCGATCCAATCCTCGATG
CCTGCGTCGCGGAGTACGTCGAGAAGCTCTCTGAAGACGACCAAGTGAAGTTCAAGGGCAAGGCAAAGGCCTTCGTCCGC
AGCTACGGCTTCCTGGCGGCCATTCTGCCCTACGGGCATCCGGCATGGGAGAAGCTGTCGATCTTCCTCAACTTCCTGAT
TCCGAAGCTGCCTGCCCCCAAGGAGGAGGATCTGTCCAAGGGTGTGCTGGAGGCCATCGACATGGACAGCTACCGCGCTC
AGGCCCAGGCGTCCATGCGCATGGCGATGGATGACGCAGATGCCTTTGTCGAACCTCCACCCCCCGGAGGTAGCGGAGGC
AGCGGCGAACCAGAGCTGGACAGGCTGTCGAACATCATCAAGCAGTTCAACGACCTGTTCGGCAACATCGAGTGGCATGA
CGCCGACAAGATTCGCAAGGTCGTCACCGAAGAGATTCCGGCGCGCGTCGCGCAGGACAAGGCCTACCAGAACGCACAGG
CGAACTCGGGCAAGCAAAACGCCAGGCTGGAGCATGACAAAGCGCTCAACCGCGTGGTGCTGGAGCTGCTCGACGACCAC
ACCGAACTCTTCAAGCAGTTCAGCGACAACCCGAACTTCAAGCGCTGGCTGGCGGACATGGTGTTCGACTCGACCTACCG
CCCAGGACAGAAACCGTCAGTGCCGCCTCAATCGGGCGCCCAAGCCTGA

Protein sequence :
MSTTDTSEKGLEALIVRDLVASGYVQGHAADYNRDVALDVTQLLAFLRATQPKVVETLNLGAEGIQRTQFLHRLQGEITK
RGVVDCLRRGVSHGPVHVDLYKLLPTPGNAAAAEAFGKNIFSVTRQVRYSNDESQRALDMVIFINGLPVLTFELKNSLTK
QTVADAIVQYQTDRNPDELLFQLGRCVAHMAVDDAEVRFCTHLTGKTSWFLPFNQGWNSGAGNPPNPHGLKTDYLWKQVL
VKESLANIIENYAQLVEEEAEDANGRKRKTRKQIFPRYHQLRTVRALLRRSQADGVGKRYLIQHSAGSGKSNTIAWLAHQ
LVELKTAADAGQAQFDSVIVITDRRALDTQIARTIRSYDHVASIYGHSESAEELRTFLRRGKKIIVTTVQKFPFILDELG
DLGDKKFALLIDEAHSSQGGKTTAKMHLALSGQAAEGGEDEEEESVEDKVNALIESRKMLANASYYAFTATPKTKTLELF
GERQVVGDTVQFRSPEELTYTTKQAIQEGFILDVIANYTPVSSFYHIAKTVEHDPEVDKAKALKKIRRYVESHDKAIRRK
AEIMVDHFIEQVIGAKKIGGKARAMIVCNGIARAIDYFREVSDYLREIKSPYKAIVAYSGDFEVGGVKKTEADLNGFPSK
DIPAKLRQDPYRFLIVANKFVTGFDEPLLHTMYVDKPLAGVLAVQTLSRLNRAHPQKADTFVLDFADNAEAVKAAFQEYY
RATIQEGETDPNKLHDLKSDLDAQQVYSWQQVEDLVAQYLGGAERDQLDPILDACVAEYVEKLSEDDQVKFKGKAKAFVR
SYGFLAAILPYGHPAWEKLSIFLNFLIPKLPAPKEEDLSKGVLEAIDMDSYRAQAQASMRMAMDDADAFVEPPPPGGSGG
SGEPELDRLSNIIKQFNDLFGNIEWHDADKIRKVVTEEIPARVAQDKAYQNAQANSGKQNARLEHDKALNRVVLELLDDH
TELFKQFSDNPNFKRWLADMVFDSTYRPGQKPSVPPQSGAQA
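
The DNA and protein sequences above can be checked against each other by translating the coding sequence. The following is a minimal sketch, assuming the DNA is already given in the coding orientation (it begins with ATG and ends with TGA) and that the standard amino-acid assignments apply; the function name translate is hypothetical and not part of the database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the DNA sequence listed in this record.
print(translate("ATGAGCACGACTGACACCAGCGAAAAGGGC"))  # MSTTDTSEKG
```

Passing the full DNA sequence (line breaks included) to translate should reproduce the listed protein sequence if the assumptions above hold.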

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
VC1765 | NP_231400.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 6e-167 | 41
VC0395_A1363 | YP_001217306.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 6e-167 | 41
VPI2_0013c | ACA01830.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 2e-166 | 41

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
MCA0274 | YP_112808.1 | type I restriction-modification system, R subunit | VFG1098 | Protein | 3e-167 | 41