Gene Information

Name : mutS (ECOK1_3108)
Accession : YP_006102232.1
Strain : Escherichia coli IHE3034
Genome accession: NC_017628
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3170288 - 3172849 bp
Length : 2562 bp
Strand : +
Note : identified by similarity to SP:P23909; match to protein family HMM PF00488; match to protein family HMM PF01624; match to protein family HMM PF05188; match to protein family HMM PF05190; match to protein family HMM PF05192; match to protein family HMM TIG

DNA sequence :
ATGAGTACAATAGAAAATTTCGACGCCCATACGCCCATGATGCAGCAGTATCTCAAGCTGAAAGCCCAGCATCCCGAGAT
CCTGCTGTTTTACCGGATGGGCGACTTTTATGAACTGTTTTATGACGACGCAAAACGGGCATCGCAACTGCTGGATATTT
CACTGACCAAACGCGGTGCTTCAGCGGGAGAGCCAATCCCGATGGCGGGGATTCCCTACCATGCGGTGGAAAACTACCTC
GCCAAACTGGTGAATCAGGGCGAATCCGTCGCTATTTGTGAACAGATTGGCGATCCGGCGACCAGCAAAGGGCCGGTTGA
GCGCAAAGTTGTCCGTATCGTTACGCCGGGCACCATCAGCGATGAAGCGCTGTTGCAGGAACGTCAGGACAACCTGCTGG
CGGCTATCTGGCAGGACAGTAAAGGTTTCGGCTACGCAACGCTGGATATCAGTTCCGGTCGTTTTCGCCTGAGCGAACCG
GCAGACCGCGAAACGATGGCGGCAGAACTGCAACGTACGAATCCAGCGGAATTACTGTATGCGGAAGATTTCGCCGAGAT
GTCGCTGATTGAAGGTCGCCGCGGCCTGCGTCGTCGCCCGCTGTGGGAGTTTGAAATCGACACCGCTCGCCAGCAATTGA
ACCTGCAATTTGGCACCCGAGATCTGGTCGGTTTTGGTGTGGAGAACGCACCACGCGGACTTTGTGCTGCCGGTTGTCTG
TTGCAGTATGCGAAAGATACCCAACGCACGACCCTGCCGCATATTCGTTCTATCACTATGGAACGTCAGCAGGACAGCAT
CATTATGGATGCCGCGACGCGTCGTAACCTGGAAATTACTCAGAACCTGGCCGGCGGTGCGGAAAATACGCTGGCTTCAG
TGCTCGACTGCACTGTAACGCCGATGGGCAGCCGTATGCTGAAACGCTGGCTGCATATGCCAGTGCGCGATACCCGCGTG
TTGCTTGAGCGCCAGCAAACTATTGGCGCATTGCAGGATTTCACCGCTGAGCTACAGCCGGTACTGCGTCAGGTCGGCGA
CCTGGAACGTATTCTGGCGCGTCTGGCGTTGCGTACCGCTCGCCCGCGCGATCTGGCCCGTATGCGTCATGCCTTCCAGC
AACTGCCGGAGTTGCGTGCGCAGTTAGAAAATGTCGATAGTGCACCGGTACAAGCGCTGCGTGAGAAGATGGGCGAGTTT
GCCGAACTGCGCGACCTGCTGGAGCGAGCAATCATCGACACGCCGCCGGTGCTGGTACGCGATGGTGGTGTTATTGCGAC
TGGTTATAACGAAGAACTGGATGAATGGCGTGCGCTGGCCGACGGCGCGACCGATTATCTGGAACGTCTGGAAGTCCGCG
AGCGTGAACGTACCGGCCTGGACACGTTAAAAGTTGGCTTTAATGCTGTGCACGGCTACTACATTCAAATCAGCCGTGGG
CAAAGCCATCTGGCACCAATCAACTACATGCGCCGCCAGACGCTGAAAAACGCCGAGCGTTACATTATTCCGGAGCTGAA
AGAGTACGAAGATAAAGTTCTCACCTCAAAAGGCAAAGCGCTGGCACTGGAAAAACAGCTTTATGAAGAGCTGTTCGACC
TGTTGCTGCCGCATCTGGAAGCGTTGCAACAGAGCGCGAGCGCGCTGGCAGAACTTGATGTGCTGGTTAACCTGGCGGAG
CGGGCCTATACCCTGAACTACACCTGCCCGACCTTTATTGATAAACCTGGCATTCGTATTACCGAAGGTCGCCATCCAGT
GGTTGAACAGGTACTGAACGAACCGTTTATCGCTAACCCGCTAAACCTGTCGCCACAGCGCCGCATGTTGATCATCACCG
GTCCGAACATGGGGGGTAAAAGTACCTATATGCGCCAGACCGCGTTGATTGCGCTGATGGCGTATATCGGCAGTTACGTC
CCGGCGCAAAAGGTCGAGATTGGGCCGATCGATCGCATCTTTACCCGCGTAGGTGCCGCGGATGATCTGGCTTCCGGACG
TTCAACCTTTATGGTGGAGATGACCGAAACCGCTAATATTCTGCATAACGCCACCGAATACAGTCTGGTGTTGATGGACG
AGATTGGGCGCGGAACGTCCACTTACGATGGCCTGTCGCTGGCATGGGCGTGTGCGGAAAATCTGGCAAATAAGATCAAA
GCGTTGACGCTGTTTGCTACCCACTATTTCGAGCTGACCCAGTTACCGGAGAAAATGGAAGGCGTCGCCAACGTGCATCT
CGATGCACTGGAGCACGGCGACACCATTGCCTTTATGCATAGCGTGCAGGATGGCGCAGCGAGCAAAAGCTACGGCCTGG
CGGTTGCAGCTCTGGCGGGTGTGCCAAAAGAGGTTATTAAGCGCGCACGGCAAAAACTGCGTGAGCTGGAAAGCATTTCG
CCGAACGCCGCTGCTACGCAAGTGGATGGTACACAAATGTCTTTGCTGTCCGTACCGGAAGAAACTTCGCCTGCGGTCGA
GGCATTGGAAAACCTCGACCCGGATTCACTGACTCCGCGTCAGGCGCTGGAATGGATTTATCGCTTGAAGAGTCTGGTGT
AA

Protein sequence :
MSTIENFDAHTPMMQQYLKLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYL
AKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEP
ADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCL
LQYAKDTQRTTLPHIRSITMERQQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRV
LLERQQTIGALQDFTAELQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLENVDSAPVQALREKMGEF
AELRDLLERAIIDTPPVLVRDGGVIATGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRG
QSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAE
RAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYV
PAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIK
ALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESIS
PNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
mutS AAA80578.1 DNA mismatch repair protein Virulence SPI-1 Protein 0.0 90

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
mutS YP_006102232.1 DNA mismatch repair protein MutS VFG0562 Protein 0.0 96