Gene Information

Name : mutS (TERTU_2828)
Accession : YP_003074221.1
Strain : Teredinibacter turnerae T7901
Genome accession: NC_012997
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0249
EC number : -
Position : 3113720 - 3116314 bp
Length : 2595 bp
Strand : +
Note : identified by similarity to SP:P27345; match to protein family HMM PF00488; match to protein family HMM PF01624; match to protein family HMM PF05188; match to protein family HMM PF05190; match to protein family HMM PF05192; match to protein family HMM TIG

DNA sequence :
ATGAATCAAGAAGTTAAGCAATCAGAAGTCAACCTCGAGCAGCACACCCCAATGATGCAGCAGTACCTGCGCATCAAAGC
CCAACACCCCAATGAACTGGTTTTTTACCGGATGGGGGACTTTTACGAACTGTTTTACGAAGACGCCCGCAAAGCTGCAA
AGCTGCTGGATGTAACGCTCACCGCGCGCGGCAAATCCAATGGTGAGCCCATTCCCATGGCGGGCGTGCCCTACCACGCT
GCTGAAAACTATCTGGCGAAGCTGGTAAAACTCGGAGTATCAGTCGCAATTTGCGAGCAGATAGGCGATCCAGCCACCAC
CAAAGGGCCTGTCGAGCGCAAAGTGATGCGCGTTGTTACGCCAGGTACGGTGAGCGATGAAGCATTATTGGACGAACACA
GGGACAACTGGCTGGTAGCAATTAGCGCGCACGAAAGCCAATTCGGTATTGCCTGCCTTGATATGGGCAGTGGGCGCTTT
AGTGTATTCGAAATAGAAGGCGAAGATGCGCTGATCAGTGAGATCGAACGCTTGCGACCTGCCGAAATTCTGGCGCCTGA
CCTGCTCACCTTGCCACCGGGTGTGCGCAATAAAGCCGGTTATCGCGGTCGCCCGGAGTGGGAGTTTGATATCGAATCCG
GATTGAGATCGCTATGCGCGCACTTTGCGACCAAAGATCTCGACGGCTTCGGTTGCCGCGGCCTTACGGTCGCGCTCGGC
GCGGCGGGCTGCCTTTATGCGTACGCGAAGGAAACCCAGCGCACCGAGCTTAGCCATATCGCGAGCCTGGTGGTTGAAAA
CCCCGATAACACCGTCAGCCTGGACGCAGCGACGCGGCGCAATCTGGAACTGGATATTAATCTCAACGGCAGCGAAGAAA
ACACGCTGTTCAGTGTGCTGAACACCACCGCGACAGCGATGGGCGGCCGCCTTTTGCGCCGCTGGATAAACACCCCGTTG
CGCGACCTACACACACTGCATTCACGACAGTCAGCGATTGCTGCGCTACTGGAAAACTACCGGTTTGAACAAGTGCAGCA
GGAGCTTAAGCATATTGGCGACCTGGAACGAATTCTAGGCCGCATCGCGCTGCGTTCTGCCCGCCCCCGTGACCTGACCC
GCCTGCTGAATTCGCTCGCGATCTACCCTCAGCTACAACCACTACTGAAATCGGCGGAATGCGAAACTCTCGCAACCCTG
GCCAGCGAGATAAATGAGTTCCCGGGTTTAGTACAAGAGTTGGACAAGGCCCTGGTGGAAAATCCACCCGTTGTCATCCG
GGAAGGCGGTGTGATCGCCGAGGGCTACGACGAAGAGCTGGATGAACTGCGCGGAATCAGCACAAACGCGGGCGAATTCC
TTGTCAAACTGGAAACTCAGGAGCGCGAGCGCACCGGCTTAAACACGTTGAAAGTCGGCTACAACCGGGTTCACGGTTAC
TTCATTGAAATCAGCAAGTCACAGGCCGAGAAAGCCCCCGCTGAGTATATTCGACGGCAGACACTGAAAAATGCCGAACG
TTTTATTACACCCGAGCTGAAAACGTTTGAAGATAAAGCGCTGTCTGCAAAAAGCCGGGCGCTATCGCGGGAAAAAGCAC
TCTACGAGCAGCTGATCGAGAAACTGAACGAACACCTGCGGGAGTTGCAGATTTCTGCTGTCGCGGTGGCGGAGTTGGAC
GTGCTTAATACCTTTGCTGAGCGGGCCCACGCTTTGAAACTGGTCAAGCCGGAGTTCCGCGGCGAAGCAGGCATCGAAAT
CGAAAAAGGCCGGCACCCGGTAGTTGAACAGGTATTGACCGACCCATTCATTCCTAATGACCTCACCCTGAATGCGCAGC
AGCGCATGCTGATAATCACTGGCCCCAACATGGGCGGTAAATCAACCTACATGCGCCAGACCGCGTTAATCGTACTGCTT
GCTCAAGTAGGAAGCTATGTGCCTGCTGAAGCGTGCAGGCTGGGACTGGTTGATCGTATTTTTACCCGTATTGGCTCGTC
GGACGATCTCGCTGGCGGTCGTTCGACCTTTATGGTCGAAATGACAGAAACGGCAAATATCCTTAATAATGCGACCAGCG
ACAGTCTTGTATTGATGGACGAAATTGGCCGGGGCACATCCACTTACGATGGTCTTTCCCTGGCGTGGGCCTGTGTGGAG
CACCTGGCGGAAAAGCTCAAGTCCTTCACCCTGTTTGCTACACATTACTTTGAAATTACTGCGCTCCCTGCACAGCTACC
TACCGTGAAGAATGTTCACCTTGACGCCACCGAATATCAGGACAATATCGTTTTTCTGCACAACATCCAGGCCGGGCCGG
CAAGCAAGAGCTACGGCTTACAAGTGGCTAAGCTGGCCGGAATTCCCGGGGCAGTCCTACGCCAGGCGAAGGACGTATTA
CACAAACTTGAGACTGGCAAGCCAGAAAGCCCGGCTCCGGTGGCGAGTCGTTCAAGCAAACCCAGTATGCAGGCGGATAT
GTTTGCTGAACCTCAGCCCAGCAAGGTCGAGAAACGGTTGGCGACAGTAATACCGGATGACCTGAGCCCCAGACAGGCAC
TAGAGCTGGTTTACGAACTGAAAAAGCTGATTTAG

Protein sequence :
MNQEVKQSEVNLEQHTPMMQQYLRIKAQHPNELVFYRMGDFYELFYEDARKAAKLLDVTLTARGKSNGEPIPMAGVPYHA
AENYLAKLVKLGVSVAICEQIGDPATTKGPVERKVMRVVTPGTVSDEALLDEHRDNWLVAISAHESQFGIACLDMGSGRF
SVFEIEGEDALISEIERLRPAEILAPDLLTLPPGVRNKAGYRGRPEWEFDIESGLRSLCAHFATKDLDGFGCRGLTVALG
AAGCLYAYAKETQRTELSHIASLVVENPDNTVSLDAATRRNLELDINLNGSEENTLFSVLNTTATAMGGRLLRRWINTPL
RDLHTLHSRQSAIAALLENYRFEQVQQELKHIGDLERILGRIALRSARPRDLTRLLNSLAIYPQLQPLLKSAECETLATL
ASEINEFPGLVQELDKALVENPPVVIREGGVIAEGYDEELDELRGISTNAGEFLVKLETQERERTGLNTLKVGYNRVHGY
FIEISKSQAEKAPAEYIRRQTLKNAERFITPELKTFEDKALSAKSRALSREKALYEQLIEKLNEHLRELQISAVAVAELD
VLNTFAERAHALKLVKPEFRGEAGIEIEKGRHPVVEQVLTDPFIPNDLTLNAQQRMLIITGPNMGGKSTYMRQTALIVLL
AQVGSYVPAEACRLGLVDRIFTRIGSSDDLAGGRSTFMVEMTETANILNNATSDSLVLMDEIGRGTSTYDGLSLAWACVE
HLAEKLKSFTLFATHYFEITALPAQLPTVKNVHLDATEYQDNIVFLHNIQAGPASKSYGLQVAKLAGIPGAVLRQAKDVL
HKLETGKPESPAPVASRSSKPSMQADMFAEPQPSKVEKRLATVIPDDLSPRQALELVYELKKLI

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
mutS AAA80578.1 DNA mismatch repair protein Virulence SPI-1 Protein 0.0 54

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
mutS YP_003074221.1 DNA mismatch repair protein MutS VFG0562 Protein 0.0 56