Gene Information

Name : Thicy_0500 (Thicy_0500)
Accession : YP_004536752.1
Strain : Thioalkalimicrobium cyclicum ALM1
Genome accession: NC_015581
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein mutS
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0249
EC number : -
Position : 626924 - 629512 bp
Length : 2589 bp
Strand : -
Note : SMART: DNA mismatch repair protein MutS, C-terminal; DNA mismatch repair protein MutS, core; TIGRFAM: DNA mismatch repair protein MutS, type 1; KEGG: tcx:Tcr_1594 DNA mismatch repair protein MutS; HAMAP: DNA mismatch repair protein MutS, type 1; PFAM: DNA

DNA sequence :
ATGACTAACACCGTCTCAGACCATGTCGCCAACCACACGCCTATGATGCAACAGTATCTTAAGATCAAAGCAGAACATTC
GGATCGGCTAGTGTTTTATCGCATGGGCGATTTTTACGAATTATTTTATGACGATGCCCTGAAAGCAGCGCGTTTACTCG
ACATTACCCTGACCCAGCGCGGTCAATCGGCCGGCAAGCCGATCCCAATGGCCGGGATTCCACATCACAGTGCCGAAGGT
TATTTGGCGAAACTTGTGAAATTAGGCGAGTCTATTGCTATTTGTGAACAAGTCGGTGATGTTAGCAACAAAGGCCCTGT
TGAACGAAAAGTCGTGCGAATTCTAACCCCTGGTACTCTAACGGAAGACAGCTTGCTTGAAGCGCGCCAAGACAACTTGC
TGGTCGCCTGGTCACAACAGGGCAAGCAGATAGGGATTAGCTGGCTAGATGTCGCTAGTGGTCGCTTTGAGGTGACCGCT
TTTGATAATCAAGACGATGCCGTAAATGAATTACATCGACTTAATCCCGCCGAATTAATTTTTGCCGAATCGGTCACTCA
CCCTGATGAAAGTCTAAGCGCCCATAGCCATAACCTGCCAGATTGGTTATTTCAAGAAGCCGCGGCCAAGCGCTTATTAC
TAGAACATTTTAGCACTCGGGATTTAAGCCCATTTGGTTGCGAAAATGCACCCGCCAAAAGCGGTGCCGCTGCGGCTCTA
CTCTATTATGCACAATCCATGTTGCAACAACCCTTGCATCAAGTAACCAGCTTGCAAAGTTATCAAACAAACGAGTATTT
GACGCTGGATGCTATTACACGCCGCAATCTAGAAATAGATAGTCATCAACAAGGTTTTCAGCATCACACGCTATTTCATT
TAATTGACCAATGTCAAACAGCGATGGGCAGCCGATTATTGCGGCGCTGGTTACGCCAACCTTTACGCAATCGAACTCAC
ATTCGGCAAAGATTGAATGTGGTTGACAGTTTACTCCACAGCCAAGAGTACCCCATTTTACAAGAACACTTAAAGCCAAT
TGGCGATCTAGAACGTATTTTAAGCCGTGTTGCTTTAGGCAGTGCTCGTCCGCGTGACCTGAGCCAACTTAGCCGTGGGC
TAAACGCGCTACCAGGCCTGCTAAGCTGGGCAAAAGACTGGGGGGCCTTAGATGCATTAACGGCTCAAATTGACCCTTTT
CACGAATTAGGCGACGAACTTAACCGCGCCTTAGTAGCAAACCCGCCTTTATTATTGCGCGATGGGGGTGTTTTTAAATC
CGGTTATGATGAACAGCTGGATGAATTATTGGCGCTAAAAACTCAAGCGGGTGATTTTTTAACCGATTTAGAGACTCGCG
AACGCGAGCGCACCGGCTTAAACAGCTTAAAAATCGGCTTTAATCGGGTGCAAGGCTATTACATTGAATTAAGTAAACAA
TATAGTGATCAAGTACCATTAGATTATACGCGGCGCCAAACTTTAAAAAATGCCGAACGTTACATCACCGCTGAATTAAA
AAACTTTGAAACCCAGATTCTCAGTGCCGATGATCGTGCTCAAGCCCGTGAAAATTGGCTCTACGAACAACTCCTTGGCA
AGATTCAAGCGCAATTAATGGTCTTACAGCAAACGGCAAATGCGCTAGCAACCCTTGATGTATTAGCCAATTTTGCGGCA
CAAGCTATGGCACGCAACTACGCTAAACCCCAGTTTCGTGAGGAACCAGGCCTGGTTATTGAGCAAGGGCGACATCCAAC
AGTAGAAGCCCTGTCTCATGAACCCTTTATTGCCAATGATGCTGACTTTAATGAACAACGCCGTTTACACATTATTACCG
GTCCCAACATGGGCGGTAAATCGACCTATATGCGCCAAACAGCGATTATTACCATTCTTGCACATATCGGTTGTTTCGTG
CCCGCCAAGCAGGCCTGTTTCGGTCCAATTGATCGTATTTTTACCCGCATAGGCGCATCCGATGATTTAACGTCGGGGCG
ATCAACCTTCATGGTTGAGATGACAGAAACCGCGCATATATTACGCCATGCTAGTAACCAATCACTGATTCTAATGGATG
AGGTAGGCCGCGGCACCTCTACATTTGACGGACTTGCTCTCGCTTGGGCAATTGGCGAATACTTGGCTACTGAGGTTAAA
GGTTATTGTTTATTTGCAACCCATTATTTTGAACTGACCAGTCTTGCAGAGCAATTTGACAACACCGTTAATAGTCATTT
AACCGCCGTTGAGCACCAAGACAGTATTATTTTTTTACATCAAGTCAAACCGGGTCCAGCATCACAAAGCTATGGCTTAC
AAGTTGCCGCCTTGGCCGGGGTTCCCGCGGTGGTGATTACCCAAGCCAAAGCGCGCTTAAACGAACTCGAGCAACCAAGA
CCCGCCTTGGTAAACCCATCGAGCCTTGAAAGTGCCGTTAAGGATCATCATCTGCAGTTTGACCTCTTTAATAACCCAGA
ACCTGACCCCATAATAACGGCTATACAAAGCCTTGAGCCGGATAACCTAACGCCTAAACAAGCGCTTGAACTGATTTATC
AATGGCGTAGTCAGATTAACAAGCCTTAA

Protein sequence :
MTNTVSDHVANHTPMMQQYLKIKAEHSDRLVFYRMGDFYELFYDDALKAARLLDITLTQRGQSAGKPIPMAGIPHHSAEG
YLAKLVKLGESIAICEQVGDVSNKGPVERKVVRILTPGTLTEDSLLEARQDNLLVAWSQQGKQIGISWLDVASGRFEVTA
FDNQDDAVNELHRLNPAELIFAESVTHPDESLSAHSHNLPDWLFQEAAAKRLLLEHFSTRDLSPFGCENAPAKSGAAAAL
LYYAQSMLQQPLHQVTSLQSYQTNEYLTLDAITRRNLEIDSHQQGFQHHTLFHLIDQCQTAMGSRLLRRWLRQPLRNRTH
IRQRLNVVDSLLHSQEYPILQEHLKPIGDLERILSRVALGSARPRDLSQLSRGLNALPGLLSWAKDWGALDALTAQIDPF
HELGDELNRALVANPPLLLRDGGVFKSGYDEQLDELLALKTQAGDFLTDLETRERERTGLNSLKIGFNRVQGYYIELSKQ
YSDQVPLDYTRRQTLKNAERYITAELKNFETQILSADDRAQARENWLYEQLLGKIQAQLMVLQQTANALATLDVLANFAA
QAMARNYAKPQFREEPGLVIEQGRHPTVEALSHEPFIANDADFNEQRRLHIITGPNMGGKSTYMRQTAIITILAHIGCFV
PAKQACFGPIDRIFTRIGASDDLTSGRSTFMVEMTETAHILRHASNQSLILMDEVGRGTSTFDGLALAWAIGEYLATEVK
GYCLFATHYFELTSLAEQFDNTVNSHLTAVEHQDSIIFLHQVKPGPASQSYGLQVAALAGVPAVVITQAKARLNELEQPR
PALVNPSSLESAVKDHHLQFDLFNNPEPDPIITAIQSLEPDNLTPKQALELIYQWRSQINKP

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
mutS AAA80578.1 DNA mismatch repair protein Virulence SPI-1 Protein 1e-176 50

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Thicy_0500 YP_004536752.1 DNA mismatch repair protein mutS VFG0562 Protein 0.0 52