Gene Information

Name : Thein_1045 (Thein_1045)
Accession : YP_004625880.1
Strain : Thermodesulfatator indicus DSM 15286
Genome accession : NC_015681
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0249
EC number : -
Position : 1102208 - 1104781 bp
Length : 2574 bp
Strand : -
Note : COGs: COG0249 Mismatch repair ATPase (MutS family); InterPro: IPR000432:IPR007696:IPR005748:IPR007695:IPR007860:IPR007861; KEGG: gsu:GSU1822 DNA mismatch repair protein MutS; PFAM: MutS III domain protein; DNA mismatch repair protein MutS domain protein; M
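
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene ends in a single stop codon; the helper name `feature_length` is hypothetical and not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


# Coordinates copied from the Position field of this record.
start, end = 1102208, 1104781

length_bp = feature_length(start, end)  # should agree with the Length field
codons = length_bp // 3                 # complete codons, including the stop
residues = codons - 1                   # assumes exactly one terminal stop codon

print(length_bp, codons, residues)
```

Under these assumptions the computed length matches the 2574 bp listed in the record, and the residue count matches the length of the protein sequence given in this entry.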

DNA sequence :
ATGGTACACATTACCCCTATGTTTCGTCAGTATCTCGAGATTAAAGAAAAGTACCCAGATGCCATCCTTTTTTTCCGTCT
GGGTGATTTTTACGAAATGTTTTTTGAAGATGCAGAGCTAGCTTCTCGCATCCTTGATATTGCCCTTACGTCGCGAGACA
AAGGCACAAAAGAAAAGGTCCCTATGTGTGGAGTGCCGGCGGCAAACGCCGCCCATTACATAAACCGCCTGGTATCTGCT
GGCTATAAGGTAGCTATATGTGAACAGGTGGAAGATCCCAAACAGGCCAAAGGCATAGTAAAAAGGGAAGTTATCCGGGT
GGTAACTCCAGGGCTCAATCTGGACGAAGAGACGCTCACCTCTAAAGATAATCGTTTTCTTGTAAGCCTTTTCCCTGGAA
AGGCTTGGGGAATGGCCCATCTAGACCTCTCAACCGGAGATTTTAAAGTCACCGAAATTCATAGCGAAGAAGAAATGTTA
AACGAACTTTTCCGCCTTGAGCCTAAAGAAATTCTACTACCAGAAACCCTTAAAGATAGCGCTTTAGAAAGAAAAATTCG
CGAACTTATACCGCATATTTTTATTTCCTATCGAGTTTTTATTAATGCAAAACAAAGGGCTGAAGAGCTAATCAAAGAAA
GATATCAAGTGGCAGACCTCACTGGTTTTGGCCTTAGCCAAGCTCCTGCGGCTCTCTGTGCAGCCGCAACCCTTCTTGAT
TACGTAATAGAAACTCAGAAGGAAGTCTCAAGTCATCTAGGCGTCCCTAAATTTTATTACCTTTCACAGTTTTTAATAAT
AGATGAAGCTACCAAGAGAAATCTAGAAATACTACGCAATAATCTTGATGGTAGCCTTAAAGGAAGCCTTCTCTGGGTAC
TTGATAAGACCCTCACTCCGATGGGCGGCAGGCTCCTTAAAGAATGGCTTCTTTATCCCCTAAGAAATCTAGAGTCAATA
GAAGCACGCCTTGAGGCAGTAGCTTATTTAGTAGATGAACCTTCTAAACGCAAAAATTTACGCGAACTACTGGCTCGCAT
TGCTGATGTGGAAAGACTTACTGGCCGTGCTGCCATGGGGGTGGCAAATCCCCGCGATTTATTGGCCCTAAAAGATTCCC
TAAAAATGGTTCCCCAGCTAAAAGAACTTTTACCAGAAAAAATTTCCCCTTTGCTCGACGCCATTAAGGAAAACCTTTTG
GTGCCAGGAGATTTAGTCCAAAACCTAGAAAAAACTATTCGCGAAGAGGCCCCAGTTAATTTCAAAGAAGGGGGCGTCAT
CAAAGACGGTGTTCACGAAGAACTCGACGAACTACGCCGTTTAAAAGACGATGCCCTTTCTTTTCTGGCTGAACTGGAAA
CCCGAGAACGAGCTCGCACAGGGATCCCCAACCTTAAGGTGGGCTACAATCGCGTATTCGGCTACTACATTGAAGTCTCT
AAAAGCCATTTATCAAAAGTCCCAGATAATTACATTCGCAAACAAACCCTGGTGGGCGGAGAGCGCTTTATTACCCCTGA
GCTAAAAGAATTCGAAGCCAAAGTTCTTTCGGCCGATGAACGTATAAAAGAACTTGAACAGGAGCTTTTCCTGGAGATAA
GAAAAAACGTAGCCGAAAAGGCACAAGAATTAAAAAAGCTCGCTAGAGCACTAGCTACTTTAGATGTATTGGCTTCTCTA
GCTGAAGTGGCGGTTACCAACAATTACATTCGTCCCAAAATTATCGAAGAACCGGGGATACAAATCAGAGAAGGCCGCCA
TCCAGTAGTGGAAAAGGCTCTACCTTCGGGTTCTTTTGTGCCCAATAGTGTAAAACTGGACCTCAAAGAAAACGTTGTTC
TTGTTATCACTGGCCCTAACATGGCCGGGAAATCAACAATTTTGCGCCAGACAGCCCTTATAACCCTTCTTGCCCATGTA
GGTTCTTTTGTGCCCGCTGAAGAGGCTACTATTGGGCTTTGTGACCGCATATTTTCACGGATAGGGGCTTCTGACCAGCT
CTCTCGCGGACGCAGCACCTTTATGGTGGAAATGTCTGAATGTGCTAACATCTTACATCAGGCCACTTCAAGAAGCCTTG
TAATCCTTGACGAAATCGGCCGAGGCACCAGCACTTATGATGGCCTGGCCATTGCGTGGGCAGTGGCAGAGTTTTTGCAT
GAAAAAAAGATTATGACGCTTTTTGCCACTCACTATCACGAACTAGTAGAGCTTGCAGGAGAATATCCTGGTATAAAGAA
CTTCAACGTGGCGGTTAAGACCTTTGAAGACCAAATAATTTTTCTTTATCGCCTACTACCTGGGCCAGCCAGTGAGTCCT
ATGGGGTACAAGTGGCTGCCCTGGCAGGGTTGCCAAAAGAAGTAATTGCCAGAGCAAAAGATATTTTAAAATCTTTGGAA
AACAAAACTTCTCCTCCTTTAAAAGCTAAAAAAGAAAAGAAAAGGCAAAAAAGCCTGTTCTCACCAGATGATATTTTAAA
GCGCCAGATATTAGGAATAGATCCAGATCGCCTGAGCCCACTTGAAGCTCTACAAAAACTCTATGAGCTCAAAGCACTGG
CGGAGAAGCATTAA

Protein sequence :
MVHITPMFRQYLEIKEKYPDAILFFRLGDFYEMFFEDAELASRILDIALTSRDKGTKEKVPMCGVPAANAAHYINRLVSA
GYKVAICEQVEDPKQAKGIVKREVIRVVTPGLNLDEETLTSKDNRFLVSLFPGKAWGMAHLDLSTGDFKVTEIHSEEEML
NELFRLEPKEILLPETLKDSALERKIRELIPHIFISYRVFINAKQRAEELIKERYQVADLTGFGLSQAPAALCAAATLLD
YVIETQKEVSSHLGVPKFYYLSQFLIIDEATKRNLEILRNNLDGSLKGSLLWVLDKTLTPMGGRLLKEWLLYPLRNLESI
EARLEAVAYLVDEPSKRKNLRELLARIADVERLTGRAAMGVANPRDLLALKDSLKMVPQLKELLPEKISPLLDAIKENLL
VPGDLVQNLEKTIREEAPVNFKEGGVIKDGVHEELDELRRLKDDALSFLAELETRERARTGIPNLKVGYNRVFGYYIEVS
KSHLSKVPDNYIRKQTLVGGERFITPELKEFEAKVLSADERIKELEQELFLEIRKNVAEKAQELKKLARALATLDVLASL
AEVAVTNNYIRPKIIEEPGIQIREGRHPVVEKALPSGSFVPNSVKLDLKENVVLVITGPNMAGKSTILRQTALITLLAHV
GSFVPAEEATIGLCDRIFSRIGASDQLSRGRSTFMVEMSECANILHQATSRSLVILDEIGRGTSTYDGLAIAWAVAEFLH
EKKIMTLFATHYHELVELAGEYPGIKNFNVAVKTFEDQIIFLYRLLPGPASESYGVQVAALAGLPKEVIARAKDILKSLE
NKTSPPLKAKKEKKRQKSLFSPDDILKRQILGIDPDRLSPLEALQKLYELKALAEKH
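
The protein sequence above corresponds to a codon-by-codon translation of the DNA sequence. The following is a minimal sketch of that translation, assuming the standard codon assignments (which the bacterial translation table shares for internal codons) and assuming the DNA is already given in coding orientation, as the leading ATG suggests. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the DNA sequence in this record.
print(translate("ATGGTACACATTACCCCTATG"))
```

Passing the full DNA sequence, line breaks included, to `translate` would be expected to reproduce the protein sequence listed in this record.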

• Homologs from PAI DB

Gene  GenBank Accn  Product                      Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity (%)
mutS  AAA80578.1    DNA mismatch repair protein  Virulence                SPI-1       Protein         1e-143  43

• Homologs from VFDB (virulence genes)

Gene        GenBank Accn    Product                           ID of source DB  Alignment Type  E-val   Identity (%)
Thein_1045  YP_004625880.1  DNA mismatch repair protein MutS  VFG0562          Protein         2e-156  44