Gene Information

Name : mutS (RB7237)
Accession : NP_867713.1
Strain : Rhodopirellula baltica SH 1
Genome accession : NC_005027
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0249
EC number : -
Position : 3841348 - 3844023 bp
Length : 2676 bp
Strand : +
Note : This protein performs the mismatch recognition step during the DNA repair process
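
The Position, Length and sequence fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention; the record itself does not state this) and that the final codon is a stop codon. The function name feature_length is hypothetical, chosen only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates taken from the "Position" field of this record.
start, end = 3841348, 3844023

length_bp = feature_length(start, end)  # should match the "Length" field
codons = length_bp // 3                 # complete codons, including the stop
protein_len = codons - 1                # assumption: the last codon is a stop

print(f"gene length: {length_bp} bp")
print(f"codons: {codons}, expected protein length: {protein_len} aa")
```

Under these assumptions the computed gene length agrees with the Length field (2676 bp), and the expected protein length is 891 residues.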

DNA sequence :
ATGACTCCCATGATGCGACAGTACCACGAGGCGAAAGAGGCGTGCGGTGACGCATTGCTCTTCTTTCGCATGGGCGATTT
CTACGAACTGTTTTTGGACGACGCCAAGGTCGCTGCGGGGATCTTGGGGCTGACCCTCACCAGCCGCGACAAAGACAGCG
AAAACCCCACGGCGATGGCGGGTTTTCCCCACCACCAATTGGATCAGTACCTGCAAAAACTGATCCGTGCGGGCTTTCGA
GCTGCGGTTTGCGAGCAGGTCGAAGATCCCAAAGCAGCAAAAGGCTTGGTCCGCCGCGAAATCACTCGGGTGGTCAGTGC
CGGCACGTTGACCGACGAGGGATTGCTCGACCCAAAAGAACCGAACTACTTGGCGGCCGTTTTCGCACCCAGCCAGAAAG
CTCGCGAAAAAGCTCAGAAAGAAGCGGCGAAGACAAACGACCCTAGCGGTGGCGACGTCGTCGGAATCGCTTGGGCGGAA
CTCTCCAGCGGTCGTTTCGAGGCCGGCGTTTTTCCGCGAGCCCGTTTGGATGATGAACTTGCTCGCATCGGTCCCGCGGA
AGTGCTTCATTGCGAAGACGATGCTTCCGTTCATCCCGACCCAACCGCAACCTGGTCCTGGACCGCTCGTCCCGCTTGGA
GCTACGCAGCCGCCGACGCCGAAAAATCGCTCTGCAAACAGCTTTCAGTCGCCAACTTGGAAGGCCTCGGCTTCGAAGAC
AACGGCGATGTTGCGATCCGAGCCGCTGGTGCGGTGCTGTGCTACCTCAAAGAAACCCAGCGTGGCTCACTGGATCACTT
TCGTTCGTTGACCTGCCACAACCGCAGCCCGGTTTTGCAGATCGACGCCGCGACGCGTCGCAGTCTCGAAATCACTCGAA
CGATGCGAACGGGATCCCGCGAAGGTGCCTTGCTCGGCGTGATCGATCGCACCGTGACGCCAATGGGTTCGCGAATGCTC
GCCGATCACTTGGCCGCTCCGCTCATCGATGCGGACGCAATCACCTATCGAACCGACGCGGTGGATGAGTTCGTTCGAAA
CAACAATCTACGAAGCGACATCCGCACGGTCCTCGGTGACACATACGACCTGACTCGGTTGCTCGCCCGAGTCGCCACCG
GACGCACCGGACCGCGTGACTTGCGACAGATCGCCGTGACTCTCAGTGGCCTTCCCGCACTCAAAGCCCGCTTGGCCGAA
CGAGATAGCGCGTGTCTAACCCGTCTGGAATCGGAACTGCATCTCTGTCCCGAACTTCGCGAACAACTCGAATCCGCACT
CAACGACGAATGCCCGCTGTCGGCCGCCGACGGCAACTTCATTCGCGAAGGATTTGACTCCGAACTCGATACGCTCCGCG
AATTGGCTCGCGGCGGCAAACGCTGGATCGCTGAATATCAACAGCGGCAAATGGACGAAACCGGCATCGCCAATTTGAAA
GTCGGTTACAACCGCGTCTTTGGTTACTACCTGGAAGTTAGCAACGCACACAAAGACAAGATTCCTGCGGATTTCATTCG
CAAGCAAACACTAAAAAATTGCGAACGGTACATCACGCCGGAGCTAAAAGAATACGAAGAGAAGGTTCTCGCCGCGGATG
AAAAAGCGTCCAGCCGTGAGCAAATGCTTTTCACGCTGCTCCGCGAAAACACGCACAAGCATTTGGCAATTCTGCAAGAA
GTCGCCAATGCCATCGCGATGACCGATGTGGTCGCGTCGCTGGCCGAAGTTGCCGCGCAACATCATTGGGTCCGTCCGAC
ACTGACCGATGACAGCGTGCTTCGCATCGAAGGTGGCCGACACCCGGTGTTGGACGTCACGATGGCACAGGGCGAGTTCG
TTCCCAACGACTGTATTCAAAGCCCCGAAACAGGAATGATCTTGCTGATCACCGGCCCCAACATGGCCGGCAAGAGCACG
TACATTCGCCAAGTCGCTTTGATCACCTTGCTGGCACAAACCGGCAGCTTCGTCCCCGCAACGTCCGCCGAAATTGGAAT
CGCTGATCGTATCTTTGCTCGGGTTGGCGCGAGCGATGAACTCAGTCGCGGCCAAAGCACGTTCATGGTCGAGATGGTTG
AGACCGCTCGGATTCTGAACACAGCAACCTCACGCAGCTTGGTCATCCTTGACGAAATCGGTCGCGGAACCAGCACGTAC
GACGGCTTGTCGTTGGCCTGGGCAATCACCGAGCACTTACACGAACAGATCGGGGCGAGAACGCTTTTCGCAACGCACTA
TCACGAACTCGCGGCCCTCCAAGAAACGCTTCCACGCGTCGCCAACCTCAGCGTCGCGGTGAAGGAATGGCAAGACGAAG
TGGTGTTTTTGCACCGCATCGTGCCGGGGAGTGCTGACAAGAGTTATGGCATTCAAGTCGCTCGGTTGGCCGGAATTCCG
GTCGAAGTCAACGAGCGTGCCAAGGATGTTCTGGCACAACTCGAAGCGGATCACCGCGACAGTCTCGACCGCCCCACGAT
TGCGCCACCAAGCGGAGTCAACGGAAAGGGCTCCGGCGATACCTATCAACTGACCTTGTTTGGCTACGCCGATCACCCGC
TGATCCAAGAGATCGAAACAGTTGACATTGACTCGATGTCACCGATTCAAGCTTGGCAGTTCCTGCAGGAAGCAAAAGCG
AAACTCTCCGCAGGTCCAAAAGCGGTGAAGGGGTAA

Protein sequence :
MTPMMRQYHEAKEACGDALLFFRMGDFYELFLDDAKVAAGILGLTLTSRDKDSENPTAMAGFPHHQLDQYLQKLIRAGFR
AAVCEQVEDPKAAKGLVRREITRVVSAGTLTDEGLLDPKEPNYLAAVFAPSQKAREKAQKEAAKTNDPSGGDVVGIAWAE
LSSGRFEAGVFPRARLDDELARIGPAEVLHCEDDASVHPDPTATWSWTARPAWSYAAADAEKSLCKQLSVANLEGLGFED
NGDVAIRAAGAVLCYLKETQRGSLDHFRSLTCHNRSPVLQIDAATRRSLEITRTMRTGSREGALLGVIDRTVTPMGSRML
ADHLAAPLIDADAITYRTDAVDEFVRNNNLRSDIRTVLGDTYDLTRLLARVATGRTGPRDLRQIAVTLSGLPALKARLAE
RDSACLTRLESELHLCPELREQLESALNDECPLSAADGNFIREGFDSELDTLRELARGGKRWIAEYQQRQMDETGIANLK
VGYNRVFGYYLEVSNAHKDKIPADFIRKQTLKNCERYITPELKEYEEKVLAADEKASSREQMLFTLLRENTHKHLAILQE
VANAIAMTDVVASLAEVAAQHHWVRPTLTDDSVLRIEGGRHPVLDVTMAQGEFVPNDCIQSPETGMILLITGPNMAGKST
YIRQVALITLLAQTGSFVPATSAEIGIADRIFARVGASDELSRGQSTFMVEMVETARILNTATSRSLVILDEIGRGTSTY
DGLSLAWAITEHLHEQIGARTLFATHYHELAALQETLPRVANLSVAVKEWQDEVVFLHRIVPGSADKSYGIQVARLAGIP
VEVNERAKDVLAQLEADHRDSLDRPTIAPPSGVNGKGSGDTYQLTLFGYADHPLIQEIETVDIDSMSPIQAWQFLQEAKA
KLSAGPKAVKG

• Homologs from PAI DB

Gene  GenBank Accn  Product                      Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity (%)
mutS  AAA80578.1    DNA mismatch repair protein  Virulence                SPI-1       Protein         1e-133  42

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn  Product                           ID of source DB  Alignment Type  E-val   Identity (%)
mutS  NP_867713.1   DNA mismatch repair protein MutS  VFG0562          Protein         5e-145  43