Gene Information

Name : mutS (gvip140)
Accession : NP_923979.1
Strain : Gloeobacter violaceus PCC 7421
Genome accession: NC_005125
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0249
EC number : -
Position : 1107922 - 1110594 bp
Length : 2673 bp
Strand : -
Note : This protein performs the mismatch recognition step during the DNA repair process

DNA sequence :
ATGGCCGACCCCGCCACTCAACTCTCCCAGTGGGACTTTCGCCGCTTCGCTCGATCGGATCTGACCCCGATGCTCCAGCA
GTACGTCGAGGTCAAGGCCCAGCATCCCCACTGCCTGCTCCTGTATCGGATGGGGGATTTTTACGAAACGTTCCTGGCGG
ACGCCGAAATCGTCTCGCGCGAGCTGGAAATCGTGCTTACCGGCCGCCAGGCGGGCGACAAAATCGGCCGCATCCCGATG
GCGGGGATTCCCCACCACGCTCTGGAGCGCTACTGCGCCCAATTGATCGAAAAGGGCTACGCGGTGGTGATTTGCGATCA
AGTCGAATCGCCCGAGCAGGCCAAAGAGCGCGCCCGGCAGGCGAAAGTCGCTCGTCGCAGCAAGAGCGACGGCGACGCGC
CGCTGTTGCCCCTGCTGCTGGAGGACGGCGAGCAGATTGACTGGGAAGGTGCCGAGAGCGTCCTGGTGCGCCGGGCGGTC
ACCCGGGTGCTCACCCCCGGTACGGTGCTCGAAGATCAGTTGCTGGTGGGACGGCGCAATAACTATCTGGCGGCGCTGGT
GCAGGCGGGGGAGTGCTGGGGATTGGCCTTCGCGGACATTTCGACCGGCGAATTTCAGGTGACCCAGCTGGAGAGTGCCG
AGGCACTGGTGCAGGAGTTGTTGCGCCTGCAGCCGGCCGAAGTGCTCCTGTCCGGCGACGCGCCCGATCCGCTGGTGCTG
CTGCGGCCCGGCGAGGCTTCCAGCGAACGGCCCGAGTGCCTGCCGTCCCAGTTTTGCTATACGCTGCGTCCCCGGCGTTA
TTTCGAATTGGACGAAGCGCGGCGGTTGCTGATGGAAACGTTCGGCGTGCGCTCACTCGAAGGCTTCGGCTGCGAAAATC
TGCCGCTTGCCGTGCGGGCGGCCGGGGGGTTGGTGCAGCATTTGCTGGAGACGCAGCGGGGAGTGTCGATTCCCCTGGAG
GGCATCCGCACCTACACCCTCTCGCAGTACCTGATTCTCGATCACCAGACCCGGCGCAACCTCGAACTGACCCAGACCGT
GCGCGACGGCGCGCAGTACGGCTCGCTTTTGTGGGCGCTCGACCGCACCCGCACGGTGATGGGGGGGCGCGCCCTCAGGC
GCTGGCTTTTGCAGCCGTTGCTCGATACGCGCGCCATCGGCCGCCGTCAGGATTCGGTGGCCGAGTTGTACGACGAGGGG
CTGTTGCGCGAACGGCTCCAGCGCATTCTTGAATCGGTCTACGACCTGGAGCGGCTCGCGGGACGCTGCGGTTCGGGCAC
CGCCAACGCCCGCGATCTGGTGGCTCTGGGCGAATCGCTGCTCAAATTGCCCGCTCTGGCCGAGGCGGTGGCCGCGAGCA
CCAGTCCCTACCTCAAAGCCCTTCAATCCATTCCTGTCGAACTGGAGCGGCTTGGAGAGAAGCTGCGCCGCACGCTCGTG
GATACCCCGCCGCTCATCCTCACCGAGGGCGGTCTGATCCGGGCGGGCGTCCACCCGGAACTGGAAGGAATGCGCGGGCA
ACTTATCGAAGATCGCGACTGGCTGGTGGATCTCGAAGCGCGCGAGCGCGCCCGCACGGGCATCCAGACTCTCAAGGTGG
GCTTTAACAAAGCCTTCGGCTACTACCTGTCGATCTCGCGCGGCAAGGCCGAAAAAGCGCCCCCCGAGTACCTGCGCAAA
CAGACCCTCACCAACGAGGAGCGCTACATCACCCCCGAACTCAAAGAGCGCGAAACCCGCATCCTCAACGCCCAGCAGCA
GACCAACCAGCTTGAGTACGATATTTTTAACATCCTCAGGCAGGAGGCGGGCCGTCACGTTTCGGCTCTGCGGCAGGTGG
CCCGGCGCGTGGCCGCCCTCGATGCCCTGGCCGGTCTGGCTGAGGTGGCCGTCTACCACGACTATTGCCGTCCGGTGCTC
GGCGAAGGGCGCGAGGTGCACATCGAGGCAGGCCGCCACCCTGTGATCGAACAGGCGATCCCGGCGGGTTTTTTCGTGCC
TAACGACGCGCGCATGGGAGCCGAAGCCGAGCCGGACTTGATCATCCTCACCGGCCCGAACATGTCCGGCAAATCGAGCT
TCATCCGCCAGGTGGCATTGATTCAGCTGTTGGCCCAGGTGGGGGCCTTTGTGCCCGCCAGGGGGGCTGTGCTCGGGGTG
GCCGATCGCATCTTTACGCGCGTGGGAGCGGTCGACGATCTGGCCACCGGCCAATCGACCTTCATGGTCGAGATGACCGA
GACGGCGAATATCCTCAACCACGCCACCCCCCGCTCACTGGTCTTGCTCGATGAAATCGGCCGAGGGACGGCGACTTTCG
ATGGGCTGGCCATCGCCTGGGCGGTGGCTGAGTACCTGGCAAGCCACATCCGCTGCCGGACCATTTTCGCCACCCACTAC
CACGAGCTGAACGAACTGGCTTCGGTGGTCAGTGGTGTCGCCAATTATCAGGTGACTGTGCAGGAACTGGCCGATCGAAT
CGTCTTTTTGCACCGGGTCACCCCCGGCGGGGCGGATCGCTCCTACGGCATCGAGGTGGGCCGGTTGGCCGGATTGCCGC
CTTCGGTGGTGGCGCGGGCGCGCACGGTCCTGGCCCAGGTCGAACAGCATTCGCAAATTGCCGTGGGCTTGCGCGATTCT
AACGGCAGCGCCTCCGAGTCGGCCGCTGGCTAA

Protein sequence :
MADPATQLSQWDFRRFARSDLTPMLQQYVEVKAQHPHCLLLYRMGDFYETFLADAEIVSRELEIVLTGRQAGDKIGRIPM
AGIPHHALERYCAQLIEKGYAVVICDQVESPEQAKERARQAKVARRSKSDGDAPLLPLLLEDGEQIDWEGAESVLVRRAV
TRVLTPGTVLEDQLLVGRRNNYLAALVQAGECWGLAFADISTGEFQVTQLESAEALVQELLRLQPAEVLLSGDAPDPLVL
LRPGEASSERPECLPSQFCYTLRPRRYFELDEARRLLMETFGVRSLEGFGCENLPLAVRAAGGLVQHLLETQRGVSIPLE
GIRTYTLSQYLILDHQTRRNLELTQTVRDGAQYGSLLWALDRTRTVMGGRALRRWLLQPLLDTRAIGRRQDSVAELYDEG
LLRERLQRILESVYDLERLAGRCGSGTANARDLVALGESLLKLPALAEAVAASTSPYLKALQSIPVELERLGEKLRRTLV
DTPPLILTEGGLIRAGVHPELEGMRGQLIEDRDWLVDLEARERARTGIQTLKVGFNKAFGYYLSISRGKAEKAPPEYLRK
QTLTNEERYITPELKERETRILNAQQQTNQLEYDIFNILRQEAGRHVSALRQVARRVAALDALAGLAEVAVYHDYCRPVL
GEGREVHIEAGRHPVIEQAIPAGFFVPNDARMGAEAEPDLIILTGPNMSGKSSFIRQVALIQLLAQVGAFVPARGAVLGV
ADRIFTRVGAVDDLATGQSTFMVEMTETANILNHATPRSLVLLDEIGRGTATFDGLAIAWAVAEYLASHIRCRTIFATHY
HELNELASVVSGVANYQVTVQELADRIVFLHRVTPGGADRSYGIEVGRLAGLPPSVVARARTVLAQVEQHSQIAVGLRDS
NGSASESAAG

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
mutS AAA80578.1 DNA mismatch repair protein Virulence SPI-1 Protein 4e-114 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
mutS NP_923979.1 DNA mismatch repair protein MutS VFG0562 Protein 1e-125 42