Gene Information

Name : Metfor_1467 (Metfor_1467)
Accession : YP_007249008.1
Strain : Methanoregula formicicum SMSP
Genome accession: NC_019943
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1419510 - 1422185 bp
Length : 2676 bp
Strand : +
Note : PFAM: MutS family domain IV; MutS domain II; MutS domain V; MutS domain I; MutS domain III; TIGRFAM: DNA mismatch repair protein MutS

DNA sequence :
ATGGTGCGCGAGCCCGGACCCGAGAACAAGGCCGAGAATGGCAATGGTGCGCGGCTGACCCCGGTCATGCAGCAGTACCA
CGAGATGAAGGAGCAGCATCCCGATACCATCCTGTTTTTCCGGATCGGGGATTTCTACGAGACCTTCAACGATGATGCAA
AACTGGTTTCGCGCGAACTGGATATTGTCCTGACCTCCCGGTCAAAGAGCGGGGATAACCCGATCCCGCTCGCGGGTGTC
CCGTACCATGCAGCGGAAGGATATATTGCAAAACTGATCGCCAAAGGGTACCGCGTAGCAGTCTGCGAACAGGTCGGTGA
TCCGAAAACCACAAAGGGGGTCGTGAAACGCGAGATCGCCCGCGTCATCACACCGGGCACCGTGATAGACCCGGCCCTTG
TCCCGTCAACCGCTGCCACGTACCTGATGGCCGCCCTCCCGGATGCAAAACAAAAAGAGTGGGGCATCTCGCTCCTCGAC
ATTTCGACAGGAGAGTTCTTCGCTGCCATTGTCCCCCATGATCCCATCCTTGAATCCCTGGGATCGGAGATCGCCCGGTA
CCGGCCCGCGGAGTGCATCGTCCCGGCAAACCTTCCAGACACGTTCCGCGATCGTATCCGCGATGCGGGCGTCATCGTGA
ATGCCTGCCGGGACGAGCTCTTCACCTGCGACCGGGCGGAAAAACTTCTCTGCGGGCACTTCGGTACCGCATCCCTTGGA
GGATTCGGGTTCGAGAGCCGGTCCTGTGCCACCGGCGCGGCCGGTGCTGCCCTTGCCTATGCGCTGGAGACACAGCATGC
CCCGCTGACCCACATCCGTGCCCTCTCGCTCCGGAACTCCTCCGAATCGCTGGTGCTGGACGCCGTCACGCTCCGGAACC
TCGAGGTGCGGGAGAGTATCCGGGGAGGAAAAGGCGCAACGCTCCTCTCCTCGCTCGATTTGACAAAGACTCCGATGGGA
AGCCGGCTCCTGGACCGGTACCTCTCCCGCCCGCTTACGGACATTGCCGAGATCAACCGGCGCCTGGACGCAGTGGAGTT
CCTGGCCGGCAGGACTGCAGCACGGATCGCGTTCCGCGACAGCCTGAAGGCCTGTGCCGATATCGAGCGCATTGCAGCCC
GGATCGCGTACGGGAACGCCGGGCCCCGCGATCTGGTTGCGCTGGCAGACACTCTTGAAACCCTGCCGGCGCTGAAACAG
TGCTTTTCCACCCCCAAAGAAAATGTACCGGCTCTCGCAGCAGAAGCGATCAACGCCATTCACAACCTCCCTGAGATAAT
CGCGCTCATCCGGAACGCCATTGCCGACGATCCGCCCGCAGTTGCCCGGAATGGCGGGATCATCCGGCCCGGATACAGCG
GGGAGCTCGACAGCATCCGCGGCGTGCTGCACTCCGGGAAGGACTGGATCGTTAAACTCCAGGAGAAGGAGCGCGAGGCC
ACCGGCATCAGATCCCTCAAAGTGGGGTACAACCGGATCTTCGGGTATTACATTGACGTGACAAAACCCAACCTTTCGCT
TGTCCCGCCCCGGTACGAGCGCAAGCAGACCACAGCCACGGGCGAGCGGTATACAATCCCCGAGCTGCGCGAGAAAGAGA
CCCTCATCACCAACGCCGACGAGCGCGTACTCTCCCTGGAACGCGAACTCTACGTGCAGCTCCTCGGGATCCTCAAAAAG
GATATCCCGGCCATCCAGGAAACCGCGAACGCCATCGCAGTCCTCGATGTTGCCGCCGCCCTTGCCGAATCAGCGCAGGT
ACGGAACTACGTGCGCCCGCAGCTGGACGAGAGCGACGATGTCGTCATCCGCGACGGCAGGCACCCGGTGGTGGAGCAGG
GAGTATCCGGCGGCTTTGTCCCGAACGATACCGAACTCTCCGGCAGCGGGACGCAGATCATGATCATCACCGGCGCCAAC
ATGGCCGGTAAGTCTACCTACATGCGGGCTGCCGCGCTCATCTGCATCATGGCGCAGGCCGGCAGTTTTGTCCCGGCCCG
GCACGCCCGGATCGGCATCCTCGACCGGATCTTCACGAGAGTGGGTGCATTCGATGACCTTGCAAGCGGCCAGAGCACCT
TCTTTGTCGAGATGCTGGAGCTGGCAAACATCCTCAATAACGTCACCCCAAAGAGCCTCGTGATCCTGGACGAGATCGGC
AGGGGCACGAGCACGGCGGACGGCTCCTCGATTGCCCGGGCCGTACTCGAGTTCCTGCACGGGAAAGGCAGTGCAGGGCC
AAAGACCCTCTTTGCCACGCACTTCCACGAGCTCATCGGCATGGAAGAGAAGCTCAAACGTGTAAAGAACTATCACTTCG
CCGTCAGGGAGACAAAAGACGAGGTGGTCTTCCTCCGGAAGATCATCCCCGGCGCAACCGACAAAAGCTACGGTATCCAT
GTTGCACGGCTGGCCGGTATCCCGAAAAAAGTCACCGAGCGTGCCGAGGCACTCCTTGACGAGGACCTGAACGCACCGGT
GAAAAACGGGTCACGCCCGCAGCGGTACACCCAGATCCTCCTTGTCGATGACAAAGCAGAAACCCCGGCCCCTGCAAAGA
ACCCGGTGCTGGATGAACTTGAGGCCATCAACCCGGATGAGATGACCCCCCTCCAGGCACTCGCAACGATTGCGGAACTG
AAGCGGAAACTAAAAAGGGATGGCGGGAACCCATGA

Protein sequence :
MVREPGPENKAENGNGARLTPVMQQYHEMKEQHPDTILFFRIGDFYETFNDDAKLVSRELDIVLTSRSKSGDNPIPLAGV
PYHAAEGYIAKLIAKGYRVAVCEQVGDPKTTKGVVKREIARVITPGTVIDPALVPSTAATYLMAALPDAKQKEWGISLLD
ISTGEFFAAIVPHDPILESLGSEIARYRPAECIVPANLPDTFRDRIRDAGVIVNACRDELFTCDRAEKLLCGHFGTASLG
GFGFESRSCATGAAGAALAYALETQHAPLTHIRALSLRNSSESLVLDAVTLRNLEVRESIRGGKGATLLSSLDLTKTPMG
SRLLDRYLSRPLTDIAEINRRLDAVEFLAGRTAARIAFRDSLKACADIERIAARIAYGNAGPRDLVALADTLETLPALKQ
CFSTPKENVPALAAEAINAIHNLPEIIALIRNAIADDPPAVARNGGIIRPGYSGELDSIRGVLHSGKDWIVKLQEKEREA
TGIRSLKVGYNRIFGYYIDVTKPNLSLVPPRYERKQTTATGERYTIPELREKETLITNADERVLSLERELYVQLLGILKK
DIPAIQETANAIAVLDVAAALAESAQVRNYVRPQLDESDDVVIRDGRHPVVEQGVSGGFVPNDTELSGSGTQIMIITGAN
MAGKSTYMRAAALICIMAQAGSFVPARHARIGILDRIFTRVGAFDDLASGQSTFFVEMLELANILNNVTPKSLVILDEIG
RGTSTADGSSIARAVLEFLHGKGSAGPKTLFATHFHELIGMEEKLKRVKNYHFAVRETKDEVVFLRKIIPGATDKSYGIH
VARLAGIPKKVTERAEALLDEDLNAPVKNGSRPQRYTQILLVDDKAETPAPAKNPVLDELEAINPDEMTPLQALATIAEL
KRKLKRDGGNP

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
mutS AAA80578.1 DNA mismatch repair protein Virulence SPI-1 Protein 2e-122 41

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Metfor_1467 YP_007249008.1 DNA mismatch repair protein MutS VFG0562 Protein 1e-135 43