Gene Information

Name : Msil_3575 (Msil_3575)
Accession : YP_002363825.1
Strain : Methylocella silvestris BL2
Genome accession : NC_011666
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 3934665 - 3937595 bp
Length : 2931 bp
Strand : +
Note : PFAM: type III restriction protein res subunit; protein of unknown function DUF450; SMART: DEAD-like helicases; KEGG: mma:MM_2976 type I restriction-modification system restriction subunit
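
The Position and Length fields agree if the coordinates are read as 1-based and inclusive: 3937595 - 3934665 + 1 = 2931 bp, which is a whole number of codons (977). A minimal Python sketch of that check follows; the function name `feature_length` is hypothetical, and the inclusive-coordinate convention is an assumption inferred from the numbers in this record.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(3934665, 3937595)
codons = length_bp // 3  # whole codons in the open reading frame

print(length_bp, codons)  # 2931 977
```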

DNA sequence :
ATGAAAACAGATACCTCGGAAAAGGGACTCGAGGCCCTGATCGTCGCTGGAATGACGGGCCGCACCTCGGCGCCGTCCGG
CGGCGGATTCTCCGAGGAGCCGGAGCCCTTCGTCGGCCTGCATAACTGGTTGCTCGGAAATCCGAAGGACTATGATCGGG
CATGGACGGTCGATCTTGTGCAATTGCGCGCCTTTGTGGGCTCCACGCAACGGCCGTTGGTGGAAGCCTTCGATCTCGAC
AACGACAGCCCGGCGCGGCAGAAATTCCTTGCCCGGCTTCAAGGCGAAATCGGCAAGCGCGGCGTCATCGACGTTCTGCG
CCACGGCGCGAAGCATGGCGCGCATGATGTGGACCTGTTCTATGGCACTCCGTCCCCGGGCAACGCCAAGGCCGCCGAAC
GCTTTGCGCTGAACAGGTTCTCGGTCACGCGCCAGCTTCGCTACAGCCGTGACGATACCGCCCATGCGCTCGATCTCGCG
CTGTTCATCAATGGCTTGCCGATCGCAACGTTCGAACTGAAGAACAGCCTGACGAAACAGACAGTCGAAGACGCCGTTGA
GCAATACAAACGCGACCGCGATCCGCGTGAGAAGCTCTTCGAATTCGGCCGGTGTATCGTGCATCTTGCGGTGGACGACG
CGCAGGTGAAGTTCTGCACCCAGCTGAAGGGCAAGGCATCGTGGTTCCTGCCCTTCAACAAGGGCTGGAACGATGGCGCC
GGCAACCCGCCGAATCCCACAGGCATCAAGACCGACTATCTTTGGAAGGATATCCTCACGCCGCTCAGCCTGACGGACAT
CATCGAGAACTATGCCCAGATCGTTGAGCGCAAAGACCCGAAGACCAACCGGACCAAGCGGGATCAGCTTTTCCCGCGCT
TTCATCAGCTCGATGTGGTGCGCAAGCTCCTCGCGGATGCGAAGGCGAAGGGCGCTGGCCGGCGCGTGCTGATCCAGCAT
TCGGCGGGATCAGGGAAATCAAATTCAATTGCGTGGCTGGCGCACCAGCTCGTGCGGTTGGCGAATGGCGGAGGTCAGGT
CTTCGATTCCGTGGTCGTTGTAACCGACCGCCGAATTCTCGATCAGCAAATCCGCGACACCATCAAGCAGTTCGCCCAAG
TTGGCGCGACGGTCGGGCATGCCGAGCATTCCGGCGATCTTCGCCGCTTCATCGCCGACGGCAAGAAGATCATCATCACC
ACGGTTCAGAAGTTCCCGTTCATCCTCGATGACATCGGCGCGCAGCACAAAGACAGACGCTTTGCGATCCTCATCGACGA
GGCGCATTCCAGCCAGGGCGGCAAAGCGGCGGCGGCTTTGAACGCAGCGTTGACCGGCGCGGAAGACGGCAACGAGGACG
AAACCGTCGAAGACAAGATCAATGCGATCATGGAGCAACGGAAGATGCTCCCGAACGCAAGCTATTTCGCGTTTACAGCG
ACGCCGAAGAACAAGACGCTTGAGATATTTGGCGAGCCGTTCCCCGAAGGCGATGTCGTCAAACACCGCCCGTTCCACAG
CTACACGATGAAGCAAGCGATCCAGGAAGGCTTCATTCTGGACGTGCTTCGCTATTACACGCCCGTTAACAGCTACTATC
GGCTGGTCAAGACGGTCGACGAGGATCCGGAGTTCGATACGAAACGCGCGACAAGGAAGCTTCGCCGCTATGTCGAGAGC
AACGACCATGCCATCAGGCTCAAGGCTGAGATCATGGTCGATCACTTCCACGAGCAGGTGCTCGCGTTGAACAAGATCGG
TGGCCAGGCGCGGGCGATGGTGGTGACTTCAGGAATCGAACGCGCGATCCAGTACTATCAGGCGGTGAGCGCCTATCTGG
TCGAACGCAAGAGCCCTTATCGTGCGATCGTCGCCTTTTCGGGCGAGCATGAATTCTGCGGAGTGAAAGTCTCCGAGGCC
AGCCTCAACGGGTTTCCCTCGAAGGATATCGTCGATCAGATCGAAACCGATCCGTATCGATTCCTGATCTGCGCCGACAA
ATTTCAGACCGGGTACGACCAGCCGCTTCTGCATTCCATGTATGTGGACAAGGCCCTGTCGGGCATCAAAGCGGTTCAGA
CCCTGTCGCGTCTCAACCGCGCACACCCCCAGAAGTACGACACCTTCGTTCTGGATTTCATGAACGATACCGAGACGATC
CGCGCATCGTTCGACAAGTTCTATCGAACAACGATCCTGAGCGACGAAACCGATCCAAACCGGCTTCACGATCTCAAGGC
CACGCTGGACGGGTATCAGGTCTACGATCCGGCCCAGATTGACCAGCTCGTTGGTTTGTATCTCTCGGGCGCTGATCGCG
ATCAGCTCGATCCGATCCTCGATGCTTGCGTCACCACTTACAACGACAGCCTCGACGAGGACGGGCAGGTTGACTTCAAG
GGCAAAGCAAAGGCATTCGCACGGACCTACGCATTCATTTCCGCGATTCTTCCCTACACGACCGGAATGGGAAAAGCCCT
CGATCTTATTGAACTTCTTGCTGCCAAAGCTGCCGGCGCCGCGCGAGGAAGACCTCTCCAAGGGAATTCTCGAAGCCATC
GATATGGACAGCTACCGCGTGGAGAAGCAGGCCGCGCAAAGAGTGCAATTGTCCGATCAAGACGCGGAAATCGATCCCAT
CCCAGCCGAAGGCGGCGGCCACAAGGCCGAACCCCAACTCAATCGGCTGTCAAATATCATTCGAAGCTTCAACGATCTCT
TCGGCAACATCACATGGGCGGACACCGATCGTATTCGTCGCCTGATCGCCATCGAGATCCCCGACAAGGTTGCGGCCAAC
GCGGCCTATCAGAACGCGAAGTTAAACTCCGACAAACAGAACGCCCGGATCGAACACGACAAAGCGCTGGCTGGCGTAAT
CATCGGGCTGATGAAGGACGACACCGAACTGTTCAAGCAGTTCAGCGATAA

Protein sequence :
MKTDTSEKGLEALIVAGMTGRTSAPSGGGFSEEPEPFVGLHNWLLGNPKDYDRAWTVDLVQLRAFVGSTQRPLVEAFDLD
NDSPARQKFLARLQGEIGKRGVIDVLRHGAKHGAHDVDLFYGTPSPGNAKAAERFALNRFSVTRQLRYSRDDTAHALDLA
LFINGLPIATFELKNSLTKQTVEDAVEQYKRDRDPREKLFEFGRCIVHLAVDDAQVKFCTQLKGKASWFLPFNKGWNDGA
GNPPNPTGIKTDYLWKDILTPLSLTDIIENYAQIVERKDPKTNRTKRDQLFPRFHQLDVVRKLLADAKAKGAGRRVLIQH
SAGSGKSNSIAWLAHQLVRLANGGGQVFDSVVVVTDRRILDQQIRDTIKQFAQVGATVGHAEHSGDLRRFIADGKKIIIT
TVQKFPFILDDIGAQHKDRRFAILIDEAHSSQGGKAAAALNAALTGAEDGNEDETVEDKINAIMEQRKMLPNASYFAFTA
TPKNKTLEIFGEPFPEGDVVKHRPFHSYTMKQAIQEGFILDVLRYYTPVNSYYRLVKTVDEDPEFDTKRATRKLRRYVES
NDHAIRLKAEIMVDHFHEQVLALNKIGGQARAMVVTSGIERAIQYYQAVSAYLVERKSPYRAIVAFSGEHEFCGVKVSEA
SLNGFPSKDIVDQIETDPYRFLICADKFQTGYDQPLLHSMYVDKALSGIKAVQTLSRLNRAHPQKYDTFVLDFMNDTETI
RASFDKFYRTTILSDETDPNRLHDLKATLDGYQVYDPAQIDQLVGLYLSGADRDQLDPILDACVTTYNDSLDEDGQVDFK
GKAKAFARTYAFISAILPYTTGMGKALDLIELLAAKAAGAARGRPLQGNSRSHRYGQLPRGEAGRAKSAIVRSRRGNRSH
PSRRRRPQGRTPTQSAVKYHSKLQRSLRQHHMGGHRSYSSPDRHRDPRQGCGQRGLSEREVKLRQTERPDRTRQSAGWRN
HRADEGRHRTVQAVQR

• Homologs from PAI DB

Gene          GenBank Accn    Product                         Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity (%)
VC1765        NP_231400.1     type I restriction enzyme HsdR  Not tested               VPI-2       Protein         6e-145  43
VC0395_A1363  YP_001217306.1  type I restriction enzyme HsdR  Not tested               VPI-2       Protein         6e-145  43
VPI2_0013c    ACA01830.1      type I restriction enzyme HsdR  Not tested               VPI-2       Protein         1e-144  43

• Homologs from VFDB (virulence genes)

Gene       GenBank Accn    Product               ID of source DB  Alignment Type  E-val   Identity (%)
Msil_3575  YP_002363825.1  hypothetical protein  VFG1098          Protein         3e-145  43
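
Rows in both homolog tables start with the gene and GenBank accession and end with the E-value and identity, so those four fields can be read without knowing how many words the product name contains. The Python sketch below illustrates one way to do that; the `Hit` type and `parse_hit` function are hypothetical names, and the whitespace-delimited row layout is an assumption based on the rows shown in this record.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    gene: str
    accession: str
    evalue: float
    identity: int


def parse_hit(row: str) -> Hit:
    """Read the first two and last two whitespace-separated fields of a homolog row."""
    tokens = row.split()
    if len(tokens) < 4:
        raise ValueError(f"row has too few fields: {row!r}")
    return Hit(tokens[0], tokens[1], float(tokens[-2]), int(tokens[-1]))


# Rows copied from the PAI DB and VFDB tables of this record.
rows = [
    "VC1765 NP_231400.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 6e-145 43",
    "VC0395_A1363 YP_001217306.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 6e-145 43",
    "VPI2_0013c ACA01830.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 1e-144 43",
    "Msil_3575 YP_002363825.1 hypothetical protein VFG1098 Protein 3e-145 43",
]

hits = [parse_hit(r) for r in rows]
best = min(hits, key=lambda h: h.evalue)  # smallest E-value among the listed rows
print(best.gene, best.accession, best.identity)
```

The middle columns (product, tested status, island name, alignment type) are deliberately left unparsed, because their word boundaries cannot be recovered reliably from whitespace alone.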