Gene Information

Name : Mahau_1930 (Mahau_1930)
Accession : YP_004463928.1
Strain : Mahella australiensis 50-1 BON
Genome accession: NC_015520
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0249
EC number : -
Position : 2050429 - 2053077 bp
Length : 2649 bp
Strand : -
Note : COGs: COG0249 Mismatch repair ATPase (MutS family); InterPro IPR005748: IPR007695: IPR007860: IPR007696: IPR 007861: IPR000432; KEGG: cth:Cthe_0777 DNA mismatch repair protein MutS; PFAM: MutS III domain protein; DNA mismatch repair protein MutS domain pr

DNA sequence :
ATGGCCTCGCTTACGCCTATGATGCAGCAATATCTACAGCTAAAGGAACGCTATAAGGATTGTCTGTTGTTTTTCCGCTT
GGGGGACTTCTATGAGATGTTTTTCGACGACGCTGTGTTGGCTTCTAAAGAGCTAGAGCTGACGCTTACCGGCCGTGATT
GCGGCATGGAGGAGAGGGCGCCCATGTGCGGCGTGCCGTATCATGCTGTCGATACCTATATAGCACGCCTGGTGGAGAAA
GGTTATAAAGTCGCCATATGTGAACAGATGGAGGACCCAGCGCTAGCCAAAGGGTTGGTGGAGCGCGATGTCATACGCAT
AATCACGCCGGGCACCATAACAGATGGTTCTATGCTCGATGAAAAGGAGAACAACTATCTCTTGTGCGCTCATGTCAATG
GCGATAACTGCGGCATAGCCTTTGTGGATATATCTACCGGTCGGTGCAGCATCACGCAGTTGCAGACGGCTGGCCTGGCT
GATGAATTGGCGCGTATACAACCTGCCGAGATGATGGCCAATGAGCCTTTCTTTGATCAGGCCGGTATGCTCAAGACCGT
TCAACAGCGCCTAGATATAAAGCCGGGACATTGCAGCGTTGAATTTGACGATGTCGATAAGGCCTATGCCATGCTGGAGG
CCAATATGAGCGCCGACGTTTTGGATTACGTATCCAAAGAGGAAATGCCGCAGGCGGTATGCGCCCTGGCATCGCTCATA
TCATATCTTATAGAAACACAAAAGACCGCTCTTGCCAATATAGGCGGGATAGAGGTTTACCATATTCAGCAATATATGAT
ACTCGATGCGGCTACCCGCCGCAACCTCGAATTATGCGAGACTATGCGCAGCGGCAGCCACAAGGGTACACTTATGTGGG
TGTTGGATCATACGTCTACGGCTATGGGTGGGCGTATGCTCAAGTCTTGGATAGAGCAGCCCCTTTTGAACATAAATGCA
TTGAATGAACGGCAGGAGGCCGTAGAGGCCATGGCAAATCAGCCATTGTGGAAGGATGATATAAAAGAGGCATTATCCGG
CATATACGATATAGAGCGATTGATGAGCAAAGCGGTATATGGCAATATAAATGCCCGCGACCTTATAGCGCTCAAACAGT
CTCTTGGCAGATTGCCGCGTTTGAATGAGCTTGCGCTACAAGGCAAGGCGGCGCGGTTGAAAACATTGGGGCAGCGCATA
GATGTTATGGATGACATCTATACCCTTATAGATAAAGCTATAGCCGATGATCCGCCTTTATCTGTAAAAGATGGAAATAT
AATAAAAGATGGGTATGACCAGTCGGTCGATGAATTGCGCGATATATCGCATAATGGCCGTCAATGGATAAGTCGTCTGG
AACAGCAGGAACGCGACAGGACCGGCATAAAGTCGCTTAAAGTCGGCTACAACAAGGTATTTGGTTATTATATAGAAGTA
ACTAAATCGTATTACGATATGGTACCGGCCGATTATATACGCAAGCAAACGCTGGCCAATGCCGAGCGCTATATCACGCC
GGAATTGAAGGAGATGGAGAATAAGATACTCAGCGCATCAGAACGATTGGTGGCCTTAGAGTATCAGATATTCGCCGATA
TACGCGATACGGTGGTGGGGCATATAAAGCGCGTTCAGCAGACCGCATCTGCAATAGCCGAGCTGGACTGCCTGTGTTCG
CTGGCCGATGCTGCCATTGAAAATCATTATGTGCGGCCGGTGTTAAATGAAGGGCAGCGTATAGTGATACAAAATGGCAG
ACATCCCGTGGTGGAGAAAGTGTTGCCGCCTCATACATTTGTACCCAACGATACGCTGCTCGATAATGGCGAGGATATGG
TATGTATAATTACCGGTCCCAATATGGCAGGCAAAAGCACATATATGCGACAGGTAGCGTTGATAGTGCTTATGGCTCAG
ATAGGCAGCTTTGTGCCGGCTGATATGGCCGAAATAGGTATAGTCGATCGCATATTCACGCGGGTGGGGGCGTCGGATGA
CCTGTCCACCGGCCAAAGCACATTTATGGTGGAGATGACCGAAGTAGCCCATATACTCCATAATGCCACCGCAAAAAGCC
TACTCATATTGGATGAGATAGGCCGTGGAACTAGCACGTTCGACGGCTTAAGTATAGCATGGGCCGTTATAGAATATGTG
GCCGACCCTGGGCGCTTAGGCGCTAAGACCCTCTTTGCTACTCATTATCATGAGCTTACCGAGCTTGAAGGCAGATTGAC
GGGCGTTAAGAATTATTATATATCCGTGAGAGAACATGGTGATGATGTCATCTTCCTAAGGAAGATAATGCGTGGGGGCA
GCGGCAGGAGCTTTGGTATACAAGTCGCCAGGCTGGCCGGTTTGCCTCAGGATGTCATAGACCGTGCCAGAGAGATACTG
GATATATTGAATGCCTCCGATATAAATAAAAAATCAATAAGCGGTAATATACTGGGCGTTAAAGATAGACCCAAGCTTAA
GTTGAAGCAACAAATGGATATATTCTCTTATAAAATAGACGGTATAATGGCGTATATAAAAGGATTAGACGTAAACTCGA
TGACGCCTATAGAGGCTTTGAATGTTCTGCACGATATACAGAGCCAGGTGTTGGATATATATGATAAAAAGGCGGGTGAG
GCGCTATAG

Protein sequence :
MASLTPMMQQYLQLKERYKDCLLFFRLGDFYEMFFDDAVLASKELELTLTGRDCGMEERAPMCGVPYHAVDTYIARLVEK
GYKVAICEQMEDPALAKGLVERDVIRIITPGTITDGSMLDEKENNYLLCAHVNGDNCGIAFVDISTGRCSITQLQTAGLA
DELARIQPAEMMANEPFFDQAGMLKTVQQRLDIKPGHCSVEFDDVDKAYAMLEANMSADVLDYVSKEEMPQAVCALASLI
SYLIETQKTALANIGGIEVYHIQQYMILDAATRRNLELCETMRSGSHKGTLMWVLDHTSTAMGGRMLKSWIEQPLLNINA
LNERQEAVEAMANQPLWKDDIKEALSGIYDIERLMSKAVYGNINARDLIALKQSLGRLPRLNELALQGKAARLKTLGQRI
DVMDDIYTLIDKAIADDPPLSVKDGNIIKDGYDQSVDELRDISHNGRQWISRLEQQERDRTGIKSLKVGYNKVFGYYIEV
TKSYYDMVPADYIRKQTLANAERYITPELKEMENKILSASERLVALEYQIFADIRDTVVGHIKRVQQTASAIAELDCLCS
LADAAIENHYVRPVLNEGQRIVIQNGRHPVVEKVLPPHTFVPNDTLLDNGEDMVCIITGPNMAGKSTYMRQVALIVLMAQ
IGSFVPADMAEIGIVDRIFTRVGASDDLSTGQSTFMVEMTEVAHILHNATAKSLLILDEIGRGTSTFDGLSIAWAVIEYV
ADPGRLGAKTLFATHYHELTELEGRLTGVKNYYISVREHGDDVIFLRKIMRGGSGRSFGIQVARLAGLPQDVIDRAREIL
DILNASDINKKSISGNILGVKDRPKLKLKQQMDIFSYKIDGIMAYIKGLDVNSMTPIEALNVLHDIQSQVLDIYDKKAGE
AL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
mutS AAA80578.1 DNA mismatch repair protein Virulence SPI-1 Protein 1e-144 41

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Mahau_1930 YP_004463928.1 DNA mismatch repair protein MutS VFG0562 Protein 4e-159 42