Gene Information

Name : mutS (trd_1109)
Accession : YP_002522319.1
Strain : Thermomicrobium roseum DSM 5159
Genome accession : NC_011959
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0249
EC number : -
Position : 1088704 - 1091358 bp
Length : 2655 bp
Strand : -
Note : identified by match to protein family HMM PF00488; match to protein family HMM PF01624; match to protein family HMM PF05188; match to protein family HMM PF05190; match to protein family HMM PF05192; match to protein family HMM TIGR01070
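
The Position and Length fields above can be cross-checked against each other and against the sequences listed below. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the reported length includes the stop codon; the function names gene_length and protein_length are hypothetical and are not part of the database.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(nt_length: int) -> int:
    """Residues encoded by a CDS, assuming nt_length includes the stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return nt_length // 3 - 1


if __name__ == "__main__":
    # Coordinates taken from the Position field of this record.
    length_bp = gene_length(1088704, 1091358)
    print(length_bp)                  # 2655, matching the Length field
    print(protein_length(length_bp))  # 884 residues expected in the product
```

Under these assumptions the result agrees with the record: 2655 bp corresponds to 885 codons, that is 884 residues plus one stop codon.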

DNA sequence :
ATGGTGATCGCCGTGCCGAGTAGGCAGTGGGCGATCAGCGTCCGTACACTATCGTCAGGGGTGGCTGTGTCGACACCGAT
CCGCAAGCAGTATCTGGAGATCAAGCGACAGGTTCCCGATGCTCTGCTCTTGTTCCGGTTGGGCGATTTTTACGAACTCT
TCGACGACGATGCGGAAGTCGCCAGCCGGATTCTCGATATCGCCTTGACGTCGCGGGATCTCGGCAAGGGACAGCGGGTG
CCGATGGCTGGCATCCCGGCCCATGCTGCCGAGCCGTACATCGCCAAGTTGGTCGCTGCCGGTTACCGGGTCGCGCTCTG
CGACCAGATCGGTACACCGGACGGTCGTAACCTGGTCGAGCGTCGCATCACCCGGATCCTCACCAGGGGAACCATCACCG
AGCCGGCCATGCTCGACGCGCGGCGCAATACGTACATCGCCGCTGTCCTGCTCGAGTCGTCACGGGCCGGCCTGGCGTAT
GCGGATCTTTCCACGGGAGAGTTCGCCGCGACCGAGTGGGTTGCGGAACAGCTCGAGGAACTCCGTGCAGCGGTGGAACG
GGAACTCTTGCGGATCGCACCGGCTGAACTCGTGCTCCCGGCAGGTCGTCGCGGGGTGATCGGCGTCGAGGGAGCTGCCG
TCACCGAGCTGGAGGAGCGCGCCTGGCGCGAGCACGAGGCACGTCGCGTCCTGCACGAGCACTTCGGTGTTGAAACGCTC
GCGAGCTTCGGGCTCGCGGACCGCCCGGCGGCGCTGCGGGCAGCGGGCGCGCTCCTCGGGTATCTTCTGGACACGCAGGT
GGGGCAGCTACCCCAGCTCGACGATCTCGTCGTGTACCAGACCGATTCGTTCATGACGCTCGACGCGGTGACGCGACGCA
ACCTCGAGCTGCTCGAGTCGGCACGCGGGGAGCGCGCTCATTCACTGGTGTCGGTTCTCGATCGGACGGAAACACCGATG
GGTGCGCGCTTGCTCCGGCGCTGGCTCAGCCAGCCGCTCCTCGATGTGGGCGCGATTCGGCAGCGGCAGGAGCGCGTGGC
GGCGCTCGTCGAGGAGACGCTCGTCCGTGCCCGGCTGGGTATCCTCCTGGCCGGCGTCGCCGACCTGGAACGGTTGGCCA
ATCGTGTCTTGACCGGGCATGTGACACCTCGTGAATTACGCCAACTCGGTCACTCGCTCGCTCGCTTGCCCGAAATCGCC
GAGATAGCGGGCCGACGACCAGAACTTGCACCCCTCTCCGCTCTGCCGGACCTCCTGCCGGCGGCTCGTCTCATCGAGTC
AGCGATCGTCGAGGATCCGCCCCCGAGTCTCGGCCAGGGGCATGTGATCCGTGCCGGGTTTGCGCCGGAGCTGGACGAGC
TGCGGGAACGGGCACGTTCGGCGCGCGAGTGGATCGCCTCGTTGGAGCAGCGCGAGCGGGAGCGCACCGGTATCCGATCC
CTGAAGGTGGGCTACAACAAAGTTTTTGGCTACTACATCGAGGTGAGCCATGCCAATCGGCATCTCGTGCCGCCCGATTA
CCAGCGTAAGCAGACATTGGTCGGGGCTGAGCGCTACGTAACGCCGGAACTCCGCGAGTTCGAGAGCATGGTCTTGCAGG
CGGAGGAACGAATCGCGGCACTCGAGGAGGAGGTCTACCGGCGCGTCGTCAAAGAATTGGCGAGCTGCGCTGCACAGATC
CGGCGAGCAGCTCAGCTCGTGGCTGAATTGGATGTCTACCGTGCGCTCGCTGAGGTGGCAGTGGACCGGCGGTACGTCCG
CCCGGTTGTCGACGAGAGTACGGTTTTGGAAATCAGGGGTGGGCGCCATCCCGTCGTGGAGACGACGCTGGAAGCGGGCC
GATTCGTCTCCAACGACGCCCGGCTCGATACCGAGAGCGATCAGATCGTGATTCTCACCGGTCCCAACATGGCAGGAAAA
TCGACCTTTCTCCGTCAAGTCGCACTGATCGTGCTCTTGGCGCAGGTCGGTTCTTTCGTGCCAGCCGAGTTCGCTCGGAT
CGGGCTGGTGGATCGCATTTTCACCCGCATCGGTGCCCAGGACGACATCGCGGCCGGTCAGAGTACCTTCATGGTGGAAA
TGGTCGAGACGGCATCGATTCTGCGCCAAGCGACGCTCCGCTCGCTCGTTGTTTTGGACGAGGTCGGTCGGGGAACGAGC
ACGTACGACGGTTTGGCGATCGCTCGTGCAGTGGTGGAGTACTTGCACAACCATCCACGACTCGGCTGCCGAACGCTCTT
CGCGACCCACTACCACGAGTTGACGGAACTGGAGCGTGTCTTACCACGAGTCCGTAACTACCGCATGGACGTGTTGGAAG
AAGGTGATCGCGTCGTCTTTCTCCACCGAGTCGTGCGGGGTGGAGCGGACAAGAGCTATGGGATCCATGTGGCCCAGCTC
GCTGGCCTGCCGCACGCGGTGGTTCGTCGGGCACGGGAGATCCTGCAGGAGCTCGAATCAGCACGCAGCGGTGAGCACAC
GCGGCGGCGCCAGAGCATGGCCAAGGAGGTACCGCTGACGATCCAGCTCACGCTCTTCAGCCCGCCTCATCCTGTCTTGG
AACGACTGCGCTCGCTGGAACTCGACGGGATGACGCCGCTCGAGGCGCTGACGACACTCTACGAGTTACAGCGGTTAGCG
CAGGAAGCCGAGTGA

Protein sequence :
MVIAVPSRQWAISVRTLSSGVAVSTPIRKQYLEIKRQVPDALLLFRLGDFYELFDDDAEVASRILDIALTSRDLGKGQRV
PMAGIPAHAAEPYIAKLVAAGYRVALCDQIGTPDGRNLVERRITRILTRGTITEPAMLDARRNTYIAAVLLESSRAGLAY
ADLSTGEFAATEWVAEQLEELRAAVERELLRIAPAELVLPAGRRGVIGVEGAAVTELEERAWREHEARRVLHEHFGVETL
ASFGLADRPAALRAAGALLGYLLDTQVGQLPQLDDLVVYQTDSFMTLDAVTRRNLELLESARGERAHSLVSVLDRTETPM
GARLLRRWLSQPLLDVGAIRQRQERVAALVEETLVRARLGILLAGVADLERLANRVLTGHVTPRELRQLGHSLARLPEIA
EIAGRRPELAPLSALPDLLPAARLIESAIVEDPPPSLGQGHVIRAGFAPELDELRERARSAREWIASLEQRERERTGIRS
LKVGYNKVFGYYIEVSHANRHLVPPDYQRKQTLVGAERYVTPELREFESMVLQAEERIAALEEEVYRRVVKELASCAAQI
RRAAQLVAELDVYRALAEVAVDRRYVRPVVDESTVLEIRGGRHPVVETTLEAGRFVSNDARLDTESDQIVILTGPNMAGK
STFLRQVALIVLLAQVGSFVPAEFARIGLVDRIFTRIGAQDDIAAGQSTFMVEMVETASILRQATLRSLVVLDEVGRGTS
TYDGLAIARAVVEYLHNHPRLGCRTLFATHYHELTELERVLPRVRNYRMDVLEEGDRVVFLHRVVRGGADKSYGIHVAQL
AGLPHAVVRRAREILQELESARSGEHTRRRQSMAKEVPLTIQLTLFSPPHPVLERLRSLELDGMTPLEALTTLYELQRLA
QEAE

• Homologs from PAI DB

Gene  GenBank Accn  Product                      Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity
mutS  AAA80578.1    DNA mismatch repair protein  Virulence                SPI-1       Protein         6e-124  41

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn    Product                           ID of source DB  Alignment Type  E-val   Identity
mutS  YP_002522319.1  DNA mismatch repair protein MutS  VFG0562          Protein         2e-130  42