Gene Information

Name : BC1003_1226 (BC1003_1226)
Accession : YP_003906494.1
Strain :
Genome accession: NC_014539
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0249
EC number : -
Position : 1409914 - 1412589 bp
Length : 2676 bp
Strand : -
Note : TIGRFAM: DNA mismatch repair protein MutS; PFAM: DNA mismatch repair protein MutS domain protein; MutS II domain protein; MutS III domain protein; MutS IV domain protein; KEGG: bpy:Bphyt_2520 DNA mismatch repair protein MutS; SMART: DNA mismatch repair pr

DNA sequence :
ATGGGCACTCAAACCGCAGCGGCCAGCGACGTCGCACAACACACGCCAATGATGCAGCAGTATCTACGCATCAAAGCGAA
GCACCCGGGCACGCTGGTGTTCTACCGGATGGGCGACTTCTACGAACTGTTCTTCGACGACGCGGAAAAAGCCGCGCGCC
TGCTCGATCTCACGCTCACGCAGCGCGGTGCGTCGGCGGGCAATCCGATCAAGATGGCGGGCGTGCCGCATCACGCGGTC
GAACAGTACCTGGCGAAGCTGGTGAAGCTCGGCGAGTCGGTGGCCATCTGCGAGCAGATCGGCGATCCGGCCACGTCGAA
GGGTCCGGTGGAGCGCAAGGTCGTGCGCGTCGTCACGCCGGGCACGCTCACGGACGCCGCGCTGCTGTCCGACAAGAACG
ACGTGTTCCTCATGGCAATGTGCGTTGCGCACAATCGCCGCGGCGTCGCGACAAGCGTCGGGCTCGCCTGGCTGAATCTC
GCGAGCGGCGCGTTGCGGCTCGCAGAAGTCGCACCCGACCAGGTCGCCGCCACGCTCGAACGGATAGACCCCGCGGAAAT
TCTGGTCGCCGATACCGACGCCGACACCGCCGGCTCAGCCCCGCCCACGGGCAATCGCGCGTTGACGCGCGTGCCGGCGT
GGCATTTCGACGTCGCCTCGGGTACGCAGCGTCTGTGCGACCAGTTGGAAGTGGCCGGGCTCGACGGCTTCGGCGCGCAT
TCGCTCACCTGCGCGTGCGGCGCGGCGGGCGCGCTGCTGCTGTACGCTGCCGCGACGCAAGGGCAGCAACTGCGGCATGT
GCGCAGCCTGAAGGTGGAACACGAATCGGAATATATCGGCCTCGACCCCGCCACGCGGCGCAACCTCGAGATCACGCAGA
CCCTGCGGGGCACCGAATCGCCCACGCTGTGCTCGTTGCTCGACACCTGTTGCACGACGATGGGCAGCCGTTTGCTGCGT
CACTGGCTGCATCATCCGCCGCGCGATCCGGTGCATGCCCAATCGCGCCAGCAGGCTGTCGGCGCGCTGCTCGAGGCGCC
CGCCGGCGCGGACGTCGACGCGCTGCGCGGCGCGCTGCGCCATATCTCGGATATCGAGCGGATCACTGGCCGTCTCGCGC
TGCTGTCGGCGCGCCCTCGCGATCTGTCGAGCTTGCGCGATACCTTTATCGCGTTGCCCGAACTGCGCGCAAAACTCGCG
GCCGTCACGTCGAATGCGCAGTCGCTCGCGCGCATCGATACGTCGCTCGAGCCGCCGCAAGCATGTGTCGAGCTGCTCGT
GCGGGCCATCGCGCCCGAACCCGCCGCGATGGTGCGCGACGGCGGCGTGATCGCCCGCGGCTACGATGCTGAGCTTGACG
AACTCAGGGATATTTCGGAGAACTGCGGGCAGTTCCTGATCGATCTCGAAACGCGCGAGCGCGCACGCACGGGCATCGGC
AATCTGCGTGTCGAATACAACAAGGTGCACGGCTTCTATATCGAAGTCACGCGCGGCCAGACCGACAAGGTTCCGGACGA
TTATCGCCGCCGTCAGACGTTGAAGAATGCCGAGCGCTACATCACGCCGGAACTCAAGACATTCGAAGACAAGGCGCTGT
CCGCGCAGGAACGCGCACTGGCCCGTGAGCGCTCGCTGTATGAAGCGCTGTTGCAGGCGCTGTTGCCCTTCATTCCGGAC
TGCCAGCGCGTTGCCTCGGCACTCGCGGAGCTCGATCTTCTCGCCGCTTTCGCCGAACGTGCTCGCGCGCTCGACTGGGT
CGCGCCGCGCTTTTCGGCGGACGCGGGCATCGACATCGAGCAGGGCCGGCATCCGGTCGTGGAAGCCCAGGTGGAACAGT
TCACCGCAAACGACTGCACGCTCACACCGGAGCGCAAACTGCTGCTGATCACCGGCCCCAACATGGGCGGTAAATCGACC
TTCATGCGGCAGACGGCGCTGATCACGCTGCTCGCGTATGTGGGCAGCTACGTGCCGGCGCGGCGCGCGGCATTCGGCCC
CGTCGACCGCATTTTTACCCGCATCGGCGCCGCCGACGACCTCGCGGGCGGCCGTTCGACCTTCATGGTGGAAATGACCG
AAGCAGCCGCGATTCTGAACGACGCTACGCCGCAAAGCCTCGTGCTGATGGACGAAATTGGCCGCGGCACCTCGACGTTC
GACGGCCTCGCGCTCGCGTGGGCGATCGCGCGGCATCTGCTCGCGCACAACGGTTGCCATACCCTCTTCGCAACGCACTA
CTTCGAACTGACGCAGTTGCCGGCGGAATTTCCCCAGGCGGCCAACGTGCATCTGTCGGCGGTTGAGCACGGGCGCAGCA
TTGTGTTCCTGCATGCGGTCAACGAAGGGCCAGCAAGCCAGAGCTATGGTTTGCAGGTCGCACAGCTCGCCGGCGTGCCG
AACGCGGTAATCCGCTCGGCCCGCAAGCATCTCGTTTATCTCGAACAACAATCGGCGGGACAACCGGCGCCGCAGCTCGA
CTTGTTCGCCGCGCCGCCGGCACTGCTGGAAGATGCGGACGACGACGAGCCGCAAACCGGGCAGGAGGCGCCGGCACTCC
AGGCGCTCGTCGAGCGGCTGCGCGGCATCGATCCAAACGATCTGCGCCCACGCGAAGCGCTCGATCTGCTGTATGAACTG
CACGAGTTGGCCGCTGCGCCGGATGCCGGTCGTTGA

Protein sequence :
MGTQTAAASDVAQHTPMMQQYLRIKAKHPGTLVFYRMGDFYELFFDDAEKAARLLDLTLTQRGASAGNPIKMAGVPHHAV
EQYLAKLVKLGESVAICEQIGDPATSKGPVERKVVRVVTPGTLTDAALLSDKNDVFLMAMCVAHNRRGVATSVGLAWLNL
ASGALRLAEVAPDQVAATLERIDPAEILVADTDADTAGSAPPTGNRALTRVPAWHFDVASGTQRLCDQLEVAGLDGFGAH
SLTCACGAAGALLLYAAATQGQQLRHVRSLKVEHESEYIGLDPATRRNLEITQTLRGTESPTLCSLLDTCCTTMGSRLLR
HWLHHPPRDPVHAQSRQQAVGALLEAPAGADVDALRGALRHISDIERITGRLALLSARPRDLSSLRDTFIALPELRAKLA
AVTSNAQSLARIDTSLEPPQACVELLVRAIAPEPAAMVRDGGVIARGYDAELDELRDISENCGQFLIDLETRERARTGIG
NLRVEYNKVHGFYIEVTRGQTDKVPDDYRRRQTLKNAERYITPELKTFEDKALSAQERALARERSLYEALLQALLPFIPD
CQRVASALAELDLLAAFAERARALDWVAPRFSADAGIDIEQGRHPVVEAQVEQFTANDCTLTPERKLLLITGPNMGGKST
FMRQTALITLLAYVGSYVPARRAAFGPVDRIFTRIGAADDLAGGRSTFMVEMTEAAAILNDATPQSLVLMDEIGRGTSTF
DGLALAWAIARHLLAHNGCHTLFATHYFELTQLPAEFPQAANVHLSAVEHGRSIVFLHAVNEGPASQSYGLQVAQLAGVP
NAVIRSARKHLVYLEQQSAGQPAPQLDLFAAPPALLEDADDDEPQTGQEAPALQALVERLRGIDPNDLRPREALDLLYEL
HELAAAPDAGR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
mutS AAA80578.1 DNA mismatch repair protein Virulence SPI-1 Protein 5e-176 51

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
BC1003_1226 YP_003906494.1 DNA mismatch repair protein MutS VFG0562 Protein 0.0 53