Gene Information

Name : Vapar_3589 (Vapar_3589)
Accession : YP_002945472.1
Strain :
Genome accession: NC_012791
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0249
EC number : -
Position : 3791082 - 3793739 bp
Length : 2658 bp
Strand : +
Note : This protein performs the mismatch recognition step during the DNA repair process
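
The Position and Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual convention for GenBank-style records, but an assumption here); `feature_length` is a hypothetical helper name.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1

# Coordinates taken from the Position field of this record.
length_bp = feature_length(3791082, 3793739)

# A protein-coding gene should span a whole number of codons;
# the last codon is the stop codon and encodes no residue.
codon_count = length_bp // 3
residue_count = codon_count - 1

print(length_bp, codon_count, residue_count)
```

Under that assumption the computed length agrees with the Length field, and the residue count agrees with the length of the protein sequence listed in this record.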

DNA sequence :
GTGAAAACGACGACCGCGCAACCCTCCCCCTCCACCAGCGACTTCTCCGGGCATACGCCCATGATGGCGCAGTACCTGGG
CCTCAAGGCGAACCATCCGGACACCCTGCTGTTCTACCGGATGGGCGATTTCTACGAGCTGTTCTGGGCCGACGCCGAAA
AGGCCGCGCGCCTGCTCGACATCACGCTCACCCAGCGCGGCCAGTCGGCCGGCCAGCCCGTGGTGATGTGCGGCGTGCCC
TTCCATGCGGTCGACACCTATCTTGCGCGGCTCATCAAGCTGGGCGAATCGGTGGCCATCTGCGAGCAGGTGGGCGAAGT
CGGCGCGAGCAAGGGCCCGGTCGAGCGCAAGGTGGTGCGGGTGGTCACGCCCGGCACGCTGACCGATTCGGAACTGCTCA
ACGACAAGAGCGAATCGCTGCTGCTCGCAGTGCATGCGGGCACGCGCAACTTCTGCGGCCTGGCCTGGCTCAGCGTGACC
GGCGCGGAGCTGCGGCTGGCCGAATGCCCGGCCGATGCGCTCGAAGCCTGGATCGCGCGCATCGCGCCGAGCGAGCTGCT
CTACAGCGCCGAGGTGACGCCGGCCTTCGAGCAGCGCCTGAAGGCCGCGCGCGCGGCCACGCCCTTCACGCTCTCGATCC
GCCCCGCGTGGCAGTTCGACGGCGGCCTGGGCGAGCGCAAGCTCAGCGAGCAGATGGGCAGCAACAGCCTCGCGGCCTGG
AACGCCGAATCGCTCGCCAACGCGCATGCCGCCGCGGCCGCGCTGCTGGGCTATGCCGAGCACACGCAGGGCCGCGCGCT
GTCGCATGTGCAGCGCCTGTCGGTGGAGCGCGACGGCGACCTGGTCGAGTTGCCGCCCACCACGCGGCGCAACCTCGAAC
TGGTGCAGACATTGCGCGGCGAAGACTCGCCCACGCTGTTCTCGCTGCTCGACACCTGCATGACGGGCATGGGCAGCCGG
CTGCTCAAGCGCTGGCTGCTGTCGCCCCGGCGCGACCGCGGCGAGGCGCAGGCCCGGCTCGAAGCCATTGCGGCGCTCCA
GTCCACGGTGCTGGGCGGCACCGCCGCGCCCTGGCGCACCTTGCGCGAACAGCTCAAGAACACCAGCGACGTGGAACGCA
TCGCGGCGCGCATCGCGCTCAGGCAGGTGCGCCCGCGCGAACTGCTCGCGCTGCGCCTCGCGCTTGCAAAGGCCGAGCAG
CTGGCCCCGGCGCTTCCGGCGTCCGGCGAACTGCTGGGCGGCATCATCGAACGGCTCGCGCCGCCGTCGGGCTGCGCCGA
CCTGCTGGCGAGTGCGATCAAGCCCGACCCCTCGGCGCTGGTGCGCGACGGCGGCGTGATCGCCACCGGCCACGACGCCG
AACTCGACGAGCTGCGCGCGATCAGCGAGAACTGCGACGATTTTTTGCTGAAGCTCGAAGTCAGCGAACGCGAGCGCACC
GGCATCAGCAACCTGCGGGTGCAGTTCAACCGCGTGCACGGCTTCTACATCGAGGTCACGCAAAGCGCGCTCTCCAAGGT
GCCCGACAACTACCGCCGCCGCCAGACGCTGAAGAATGCCGAGCGCTTCATCACGCCCGAGCTCAAGGCCTTCGAGGACA
AGGCGCTGAGCGCGCAGGACCGGGCCCTTGCCCGTGAGAAGTGGCTCTACGAGCAATTGCTCGACGCGCTGCAGCCCTCG
GTGCCCGCGCTCACGCAGCTGGCCGGCGCCATTGCCACGCTCGATGCGCTGTGCGCGCTGGCCGAGCGCTCGCACACGCT
GCACTGGCGCGCGCCGAGCTTCGTCTCGCACCCCTGCATCGAGATCCAGCAGGGCCGCCATCCGGTGGTGGAGGCGCGGC
TGGCCGAAAAATCGTCGGGCGGCTTCATCGCCAACGACACGCAGCTCGGGCCGCAGCAGCGCATGCAGGTGATCACCGGC
CCCAACATGGGCGGTAAGTCAACCTACATGCGGCAGGTGGCGATCATCGTGCTCCTGGCTTCCATCGGCTCGCACGTGCC
GGCGGCGGCCTGCCGGCTCGGGCCGATCGACGCCATCCACACCCGCATCGGTGCCGCGGACGACCTGGCCAACGCGCAGT
CGACCTTCATGCTCGAGATGACCGAGGCCGCGCAAATTTTGCACAGCGCCACCGCCCAGTCGCTGGTGCTGATGGACGAG
ATCGGCCGCGGCACCAGCACCTTCGACGGCCTGGCGCTGGCCGCGGGCATCGCGGCCCAGCTGCACGACCGCAGCAAGGC
CTTCACGCTCTTTGCCACCCACTACTTCGAGCTGACCGAATTCCCGGCCACGCACCACTGCGCCGTGAACATGCATGTGA
GCGCCACCGAGGCGGGCCGCGACATCGTGTTCCTGCACGAAATGCAGCCCGGCCCGGCCAGCAAGAGCTACGGCATCCAG
GTGGCGCGGCTCGCCGGCATGCCGGCCGCAGTGGTCAACCACGCGCGGCAGGCGCTCGAGGCGCTCGAGTCGCAACACGC
CCAGACGCGCGCGCAGGTCGACCTGTTCGCCCCGCCCCCGGCGGCCGAAACGCCGATGGCCAGCGCCGTAGAATCCGCCC
TGGCCGCGCTCGATCCTGATGCGATGACGCCACGGGAGGCGCTCGACGCGCTCTATGCCTTACAAAAACTGAACACGCGC
GAACGCGGCGCCGCGTAG

Protein sequence :
MKTTTAQPSPSTSDFSGHTPMMAQYLGLKANHPDTLLFYRMGDFYELFWADAEKAARLLDITLTQRGQSAGQPVVMCGVP
FHAVDTYLARLIKLGESVAICEQVGEVGASKGPVERKVVRVVTPGTLTDSELLNDKSESLLLAVHAGTRNFCGLAWLSVT
GAELRLAECPADALEAWIARIAPSELLYSAEVTPAFEQRLKAARAATPFTLSIRPAWQFDGGLGERKLSEQMGSNSLAAW
NAESLANAHAAAAALLGYAEHTQGRALSHVQRLSVERDGDLVELPPTTRRNLELVQTLRGEDSPTLFSLLDTCMTGMGSR
LLKRWLLSPRRDRGEAQARLEAIAALQSTVLGGTAAPWRTLREQLKNTSDVERIAARIALRQVRPRELLALRLALAKAEQ
LAPALPASGELLGGIIERLAPPSGCADLLASAIKPDPSALVRDGGVIATGHDAELDELRAISENCDDFLLKLEVSERERT
GISNLRVQFNRVHGFYIEVTQSALSKVPDNYRRRQTLKNAERFITPELKAFEDKALSAQDRALAREKWLYEQLLDALQPS
VPALTQLAGAIATLDALCALAERSHTLHWRAPSFVSHPCIEIQQGRHPVVEARLAEKSSGGFIANDTQLGPQQRMQVITG
PNMGGKSTYMRQVAIIVLLASIGSHVPAAACRLGPIDAIHTRIGAADDLANAQSTFMLEMTEAAQILHSATAQSLVLMDE
IGRGTSTFDGLALAAGIAAQLHDRSKAFTLFATHYFELTEFPATHHCAVNMHVSATEAGRDIVFLHEMQPGPASKSYGIQ
VARLAGMPAAVVNHARQALEALESQHAQTRAQVDLFAPPPAAETPMASAVESALAALDPDAMTPREALDALYALQKLNTR
ERGAA
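
The relationship between the DNA and protein sequences above can be illustrated with a short translation sketch. It assumes the standard codon assignments (shared by the bacterial genetic code, NCBI translation table 11) and that a GTG start codon is read as methionine; `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard codon assignments, listed in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: these alternative start codons are translated as Met
# when they initiate a bacterial coding sequence.
ALT_STARTS = {"GTG", "TTG", "CTG"}

def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_is_start and codon in ALT_STARTS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 60 nt and last 18 nt of the DNA sequence in this record.
head = "GTGAAAACGACGACCGCGCAACCCTCCCCCTCCACCAGCGACTTCTCCGGGCATACGCCC"
tail = "GAACGCGGCGCCGCGTAG"

print(translate(head))                        # start of the protein
print(translate(tail, first_is_start=False))  # end of the protein
```

The two fragments reproduce the first 20 and last 5 residues of the listed protein sequence, which is consistent with the GTG start being read as M and TAG acting as the stop codon.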

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
mutS | AAA80578.1 | DNA mismatch repair protein | Virulence | SPI-1 | Protein | 5e-165 | 47

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
Vapar_3589 | YP_002945472.1 | DNA mismatch repair protein MutS | VFG0562 | Protein | 5e-177 | 49