Gene Information

Name : mutS (trd_A0449)
Accession : YP_002523731.1
Strain :
Genome accession : NC_011961
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0249
EC number : -
Position : 435188 - 437833 bp
Length : 2646 bp
Strand : -
Note : identified by match to protein family HMM PF00488; match to protein family HMM PF01624; match to protein family HMM PF05188; match to protein family HMM PF05190; match to protein family HMM PF05192; match to protein family HMM TIGR01070
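The Position and Length fields above are related by simple inclusive-coordinate arithmetic, which the following minimal sketch illustrates. It assumes the coordinates are 1-based and inclusive and that the feature is a complete coding sequence ending in one stop codon; the function names are hypothetical and not part of any database tool.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(nt_length: int) -> int:
    """Residue count for a complete CDS: one codon per residue, minus the stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_length // 3 - 1


length_bp = feature_length(435188, 437833)   # coordinates from the Position field
print(length_bp)                             # 2646, matching the Length field
print(expected_protein_length(length_bp))    # 881 residues expected
```

Strand "-" means the gene is encoded on the reverse strand of the genome; the length calculation is the same for either strand.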

DNA sequence :
ATGGGATCGACGCGACGAGCCATCCAGCACCGGTTGCCGGCGACCAAGGAGTCATCGTCCGACGATTTGGTCCCCTCGCG
TCGTCAGTACCTGCGCCTCAAGGCACACTATCCCCATGCCATCTTGCTCTATCGGCTCGGCGACTTCTACGAAGCGTTCG
ACGAGGATGCCCGCATCGTCGCCCGCGACGCCCGCATCACCCTCACCTCACGCTCGTTCGGCCGTAACGGTCGGGTGCCG
ATGGCCGGCATCCCACACCACGCGTTGAACCACTATCTGGCACGCCTCTTGGCCGCGGGACATACCATTGCTATCGCCGA
GCAGGTCAGCGAGCCAGGCAAGGGGCTGGTCGAGCGCGCAGTCACGCGGGTCCTCACGCCGGGAACGGTAGCTGAGGCCG
CGTTGCTCCCCACAACCGAGAATCGCTACCTCGCAGCGGTCGCCAGACTTCCCGAGCGAACCGGGCTCGCCTGGGTCGAT
GTGAGTACCGGCGAGTTCGCGGTTCTCGAACTCAGCGGAGCGGAGCGCGACCTGCTCCTGGCCGAGGAGTACGCGCGACT
CGCTCCCGCCGAAACGCTCGTCCCCGACACAGATGAAATTCCTCTACCGCCCGGCGGGTGCTTGACCCGGCTCGAGCACT
GGCATTTCGAGCCGGAACGAGCTGCTCAACGCCTGCGTGCCCTCTTCGCGACGCGCTCGCTCGCCCCCTTCGGCTGCCAA
CATCTTCCCGCTGCACTCGCCGCCGCGGGCGCCATCGTGGTCTACCTGGAGCGCACCAATCCGGCGTTGCTCTCTCTCCT
GACCAGTCTCCGGACCGAAGTGCCAGCGCGTCGTGTCGGGCTGGATGCTGCCACGCGGCGCAATCTGGAACTCACGCGCA
GCCTGGGTACCGGTGGAACGCGCGGCAGCCTGCTCGGCGTGCTCGACCGCACGGTGACGCCGATGGGTGCCCGCACCCTT
CGCCGCCTGGTCAGCGAGCCCCTGCGGGACCTGGACGAGCTCCGCCGTCGCCAGCACATCGTCGGCGCGCTCCGAGCAAC
CCCCGAACTCCGCTCCCGCTTGGGGTCGATCCTCCTGGCCGCTGGTGATCTCGAGCGCTTGACCAGCAAGATCGTCCAGG
GCTCAGCGACCATCCGCGATTTCGCGACACTTCGCCAAGCGCTGGCGACTGCCGAGGCCCTCCGCGGTGCCCTCCAGGCG
AGCGGCGAGCCAGCTCTGCAGCGCTTCGCCGACGACTTCATCTCCTGCCCGGAACTCGCCGCGCTCCTGGAACGAGCGCT
GATCGAAGACAGCGATGGCCCACGCCTCCGACCAGGCTTTTGTCCGGAACTCGATGCCGTGCTCGCGGCGGTCGAGGAAA
CGCGCCGCTTCTTGGCCACGCTCGAACAGCGGGAGCGCGAGCGGACCGGAATCCGCTCCCTCAAAGTCGGTTACAACAAA
GTTTTCGGCTACTACATCGAGGTAACTCGGCCCCATCTCAGCCGAGTGCCTCCGGACTACGTCCGCAAGCAGACTGTCGC
CACCGGTGAGCGCTTCATCACCCCCGAACTCAAAGACGCTGAGGCACGTCTGCTCGCAGCCGAGGCCGAAATCGCCGAAC
TCGAGCGCGCTGCACTGGCACGCTTGACCCGCGAGGTCACCACGCGGACCAACGAACTCTTGCGCCTTGCTGGCTGGATC
GCCTGGCTCGATGCCTTCCGCTCCCTGGCCGAAGTCGCGGCGCAGTACGACTGGAGCTGCCCGGAACTGGACGAGTCGGA
CACGATCCTGATCGAGGGTGGGCGTCATCCCGTCGTCGAAGTGCTGCTCGATGGACAGCCGTTCGTCCCCAACGATTGCC
AGCTGGGCGGCGATGGACCACGCCTCCTCCTGGTCACCGGACCGAACATGGGCGGCAAGAGCACCTATCTTCGGCAAGTC
GCCTTGATCGTGCTCTTGGCGCAGATCGGTTCCTTCGTCCCCGCGGCACGCGCCCGCATCGGACTCGTCGATCGCATCTT
CTGTCGTGTCGGTGCACACGACGATCTGCCTGGTGGGCAGAGCACCTTCATGGTCGAGATGGTGGAAACGGCCACCATTC
TCCGCCAGGCCACCCAGCGCAGCCTCGTGATCCTCGACGAAGTCGGACGGGGCACCGCCACACAGGACGGGCTCGCCATC
GCCTGGGCTGTACTGGAAGACCTGCACGATCGGGTCGGAGCTCGCACGCTTTTTGCGACCCACTTCCTCGAGCTGACAGC
ATTGGAGGCCGAATTGCCAGGCGTCGCCAACGTCCATGTCGCGGCGATGGAGCAAGACGGGCGAGTGGTCTTCTTGTATC
GCGTTCGACCTGGCGCGGCCGACCGGGCCTACGGCATCCATGTGGCCCGGCTGGCCGGCCTCCCTCCTTGGGTAGCTGAT
CGAGCCGAGCGATTGCTGATCGGCCGGCCCGCTCCCACGCCGGCCGCGCCGCACTCCGAAACAGCCGAGCACCCGCACGG
GCTGACCGCAAGCCCGCACCAGCTCGCGCTTCCCGGCTTCCCCACTCGTCGCCATGCCGCTGAGGAACTGGCTCGCGCGC
TGCTCGAGCTCGATCTCGCCAATCTCACGCCGCGCCAAGCGCTGGACTGGCTCTTCGAGCAGCGCGCCAAGCTGGGCAGA
GCGTAG

Protein sequence :
MGSTRRAIQHRLPATKESSSDDLVPSRRQYLRLKAHYPHAILLYRLGDFYEAFDEDARIVARDARITLTSRSFGRNGRVP
MAGIPHHALNHYLARLLAAGHTIAIAEQVSEPGKGLVERAVTRVLTPGTVAEAALLPTTENRYLAAVARLPERTGLAWVD
VSTGEFAVLELSGAERDLLLAEEYARLAPAETLVPDTDEIPLPPGGCLTRLEHWHFEPERAAQRLRALFATRSLAPFGCQ
HLPAALAAAGAIVVYLERTNPALLSLLTSLRTEVPARRVGLDAATRRNLELTRSLGTGGTRGSLLGVLDRTVTPMGARTL
RRLVSEPLRDLDELRRRQHIVGALRATPELRSRLGSILLAAGDLERLTSKIVQGSATIRDFATLRQALATAEALRGALQA
SGEPALQRFADDFISCPELAALLERALIEDSDGPRLRPGFCPELDAVLAAVEETRRFLATLEQRERERTGIRSLKVGYNK
VFGYYIEVTRPHLSRVPPDYVRKQTVATGERFITPELKDAEARLLAAEAEIAELERAALARLTREVTTRTNELLRLAGWI
AWLDAFRSLAEVAAQYDWSCPELDESDTILIEGGRHPVVEVLLDGQPFVPNDCQLGGDGPRLLLVTGPNMGGKSTYLRQV
ALIVLLAQIGSFVPAARARIGLVDRIFCRVGAHDDLPGGQSTFMVEMVETATILRQATQRSLVILDEVGRGTATQDGLAI
AWAVLEDLHDRVGARTLFATHFLELTALEAELPGVANVHVAAMEQDGRVVFLYRVRPGAADRAYGIHVARLAGLPPWVAD
RAERLLIGRPAPTPAAPHSETAEHPHGLTASPHQLALPGFPTRRHAAEELARALLELDLANLTPRQALDWLFEQRAKLGR
A
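The correspondence between the DNA and protein sequences above can be checked with a short translation sketch. It assumes the DNA sequence is listed 5' to 3' on the coding strand (it begins with ATG and ends with TAG) and that the standard codon assignments apply; the function name translate is hypothetical.

```python
from itertools import product

# Standard codon table, built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 12 nt of the DNA sequence listed above.
print(translate("ATGGGATCGACGCGACGAGCCATCCAGCAC"))  # MGSTRRAIQH
print(translate("GGCAGAGCGTAG"))                    # GRA
```

Passing the full 2646 bp sequence to the function in the same way gives a translation that can be compared against the listed protein sequence.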

• Homologs from PAI DB

Gene  GenBank Accn  Product                      Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity (%)
mutS  AAA80578.1    DNA mismatch repair protein  Virulence                SPI-1       Protein         3e-117  42

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn    Product                           ID of source DB  Alignment Type  E-val   Identity (%)
mutS  YP_002523731.1  DNA mismatch repair protein MutS  VFG0562          Protein         1e-119  42