Gene Information

Name : UWK_01711 (UWK_01711)
Accession : YP_007467922.1
Strain : Desulfocapsa sulfexigens DSM 10523
Genome accession: NC_020304
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1914893 - 1917562 bp
Length : 2670 bp
Strand : -
Note : PFAM: MutS family domain IV; MutS domain II; MutS domain V; MutS domain I; MutS domain III; TIGRFAM: DNA mismatch repair protein MutS

DNA sequence :
ATGACTAAAGCTCCAAAAATCACTCCCATGCTCCAACAGTACCTTGAAATCAAGGAGCAATATCAAGATGCTATTCTCTT
TTACAGGATGGGCGATTTTTATGAAATGTTTTTTGAAGATGCTGCGGTTGCTTCAAAAATTCTCGGAATAACCCTGACAT
CGCGAAACAGTAAAGACGCTACCAATAAAGTTCCCATGTGTGGCATTCCCTATCATGCAGCAAGCGGATACCTTGCGAAA
CTTGTTAAGGCAGGACGCCGTGTTGCAATCTGTGAACAAACAGAAAATCCCAGTGAAGCCAAGGGCATAGTGCGCCGTGA
AGTAGTTCGTGTCGTCTCCCCTGGGGTGGTAGTAGATTCCGGAATTCTTGATGACAAGGACAACCTCTATGTTGCTGCAA
TCTGCTGTAAAGGTAAAGGAAATGACACTCTTTACGGAATCAGTTTTCTTGATCTCAGTACAGGTGCATTTCTCTTAGGT
GAATTTCTAGACACTACCAACAATGGTGAAAGCATCCTCGACCAGCTCACCCGCATGACACCTGCAGAATTGCTGGTAAA
TGAAAACGACCTCGATCTCATTGGAGGTCTGGTCGATACAGCCACCACCCTTCTTCCCGGCTTATGTGTCACCCAGCGCC
CTGCAACTCAGTTTCATTTTTCAAGCTGTGAAGAGCTTCTCATTGAACATTTTAAGGTAAACAATTTAGCGGGATTTGGC
TGCAACACATTAAAACAGGGAGTCATTGCAGCCGGAGTCCTCCTTGACTATGTCATTGAAACGCAGAAAAGTGACATCAG
CCATATAGAGAAACTCACTCCAATTGATCTTGAGCTTATCCTTCAGATTGACGACTCATCCAGAAGAAACCTTGAACTCA
CCCAGACAATTATAGGGTCTCAGCGTGAGGGCTCCCTCCTTTCTGTTCTTGACCACAGCTGCACACCGATGGGTGCCAGA
ATGCTCAAGCAGGAAATTCTCTTTCCCCTTCAGAACGTTGAACGCATTAACGCACGTCTTGGTGCTGTACGGTTTCTTTA
TGGCCACACAGCCATCCGCAATACCTTCCGGGAACTTCTCACCACCATCTATGACGTTGAACGACTAAACAGTCGTATGG
TCCTGGGAAACGGAAATGGTCGTGACATGCTTGCGTTAAAACAATCTCTTGCGAAACTCCCTGCCATCAGGGAACTGCTA
CTCCAATGTGATGCAGAGCATATACGGAAAATTGGAGAAGACCTGGATGTTCTTGCAGATCTCCACCAACTCCTCGAAAA
CACCATTCATGAAGAAGCCCCCATCACCCTCAGGGAAGGACGATTGATAAAGGAAGGGTACAACGAAGAACTCGACGAAC
TGATGCATATTCAGCGCCACGGCAGACAACTTATCCTTGATCTCGAAAGTCAGGAGCGCAACGCCACGGGTATAGCAAAA
CTTAAAGTCGGATTTAATAAGGTCTTTGGCTATTTCATCGAAGTAAGCCGACTCCAGTCCGCCAATGTCCCCGATACCTA
TATTCGAAAACAAACCCTTGTCAACGCAGAACGCTTTATCACACCTGAACTCAAGGAGTTTGAAACAAAGGTGCTTGGTG
CTCAGGATAGACGGCTCGAACTCGAATACCAACTCTTCGTTGAAATTCGCTCCCAACTCGCCAGCGAAAGTTCACGGCTA
TTAAAAAGTGGCGCATTACTCGCCAAAACTGATTTTCTAGTCTGCCTTGCCGAGGTTGCTCACCTTTATCGCTACAAGTG
CCCAGAAGTTAACAACGGCGATTCTATCGACATCATTGAGGGGAGACATCCTGTCATTGAACGATCTCTCCCGAATGGCA
AATTTGTCCCCAATGATGTACACCTTGATCAGGAAACAGAAGAAGTGCTTATTATAACCGGTCCCAATATGGCTGGAAAA
TCCACCATCCTCCGTCAAACAGCACTCATTGTCCTTATGGCACAAATGGGTTCCTTTGTTCCAGCAAAAGAAGCTTCCAT
TGGTGTCGTTGACAGGATCTTTACTCGAGTCGGAGCCATGGATGATCTCAGACGTGGTCAGTCAACGTTTATGGTTGAGA
TGAACGAAACCGCCAATATTCTCAACAATGCTACCGAAAAAAGCCTGGTCATTCTTGATGAAATCGGCCGAGGCACCTCA
ACTTTTGATGGCCTCTCCATAGCGTGGGCAGTAGCAGAGGATCTTGTTCAAAAAAACAATAAAGGCGTCAAAACCCTCTT
TGCTACACATTATCACGAGCTCACCGACCTTGCCAGGACCGAGGAAAGGGTCCGAAACTACTCCATTGCAGTTCGTGAAT
GGAACGACACCATTATCTTTCTCCACAAACTTGTGAAAGGTGGAACCAATCGTTCCTATGGTATTCAGGTTGCTGGCCTT
GCCGGAGTCCCAGAGCGTGTCGTCAGGCGTGCTGGAGAGATTCTTAAAAACATTGAACAAGGAGAATTCAATCACGACGG
GACACCAAGCATCGCTAAAAGTTCTAATCCCCGAAAACCCCGTGGTAAAAAACATCCTAACCAGTTATCTCTCTTCCCTC
CAGCTCAACAGGATCCATTACGTACACTGCTCCAAAATATTAGCGTCGACGACCTCAGCCCCCGTCAGGCGCTTGATCTT
ATCTACGAACTTATGAAGCACCTGAAGTAA

Protein sequence :
MTKAPKITPMLQQYLEIKEQYQDAILFYRMGDFYEMFFEDAAVASKILGITLTSRNSKDATNKVPMCGIPYHAASGYLAK
LVKAGRRVAICEQTENPSEAKGIVRREVVRVVSPGVVVDSGILDDKDNLYVAAICCKGKGNDTLYGISFLDLSTGAFLLG
EFLDTTNNGESILDQLTRMTPAELLVNENDLDLIGGLVDTATTLLPGLCVTQRPATQFHFSSCEELLIEHFKVNNLAGFG
CNTLKQGVIAAGVLLDYVIETQKSDISHIEKLTPIDLELILQIDDSSRRNLELTQTIIGSQREGSLLSVLDHSCTPMGAR
MLKQEILFPLQNVERINARLGAVRFLYGHTAIRNTFRELLTTIYDVERLNSRMVLGNGNGRDMLALKQSLAKLPAIRELL
LQCDAEHIRKIGEDLDVLADLHQLLENTIHEEAPITLREGRLIKEGYNEELDELMHIQRHGRQLILDLESQERNATGIAK
LKVGFNKVFGYFIEVSRLQSANVPDTYIRKQTLVNAERFITPELKEFETKVLGAQDRRLELEYQLFVEIRSQLASESSRL
LKSGALLAKTDFLVCLAEVAHLYRYKCPEVNNGDSIDIIEGRHPVIERSLPNGKFVPNDVHLDQETEEVLIITGPNMAGK
STILRQTALIVLMAQMGSFVPAKEASIGVVDRIFTRVGAMDDLRRGQSTFMVEMNETANILNNATEKSLVILDEIGRGTS
TFDGLSIAWAVAEDLVQKNNKGVKTLFATHYHELTDLARTEERVRNYSIAVREWNDTIIFLHKLVKGGTNRSYGIQVAGL
AGVPERVVRRAGEILKNIEQGEFNHDGTPSIAKSSNPRKPRGKKHPNQLSLFPPAQQDPLRTLLQNISVDDLSPRQALDL
IYELMKHLK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
mutS AAA80578.1 DNA mismatch repair protein Virulence SPI-1 Protein 1e-135 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
UWK_01711 YP_007467922.1 DNA mismatch repair protein MutS VFG0562 Protein 1e-149 43