Gene Information

Name : Psta_4270 (Psta_4270)
Accession : YP_003372778.1
Strain : Pirellula staleyi DSM 6068
Genome accession: NC_013720
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0249
EC number : -
Position : 5577034 - 5579643 bp
Length : 2610 bp
Strand : +
Note : KEGG: glo:Glov_1145 DNA mismatch repair protein MutS; TIGRFAM: DNA mismatch repair protein MutS; PFAM: DNA mismatch repair protein MutS domain protein; MutS II domain protein; MutS IV domain protein; MutS III domain protein; SMART: DNA mismatch repair pro
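
The Position and Length fields agree if the coordinates are read as 1-based and inclusive. A minimal sketch of that check follows; the inclusive-coordinate convention is an assumption, and the helper name `feature_length` is illustrative, not part of the record.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(5577034, 5579643)

# A coding sequence should be a whole number of codons;
# the final codon is the stop codon and contributes no residue.
codons, remainder = divmod(length_bp, 3)
residues = codons - 1

print(length_bp, codons, remainder, residues)  # 2610 870 0 869
```

The 869 residues match the length of the protein sequence listed in this record.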

DNA sequence :
ATGGCCGTTACCCCCATGATGCAGCAGTATCTCGATGCGAAGCAAGCGTGCGGCGATGCGCTCCTCTTGTTTCGCATGGG
AGATTTCTACGAACTGTTCCACGACGATGCCCGCACTGCTTCGCGCGTGCTGGGGCTCACCCTGACGACGCGCGACAAAG
GGGAAAATCCGGTCCCGATGGCCGGCTTTCCCTACCATCAGCTCGAAGGCTATCTGGCCAAGCTGATCGGCGGCGGCCTG
CGTGCAGCGGTGTGCGAGCAAGTGGAAGATCCACGCCAAGCCAAGGGGCTGGTGAAGCGCGAAGTGACACGTGTGGTCAC
TCCCGGCACACTGACCGACGACGCCCTGCTCGACCCGCGCGAAAGCAACTACCTGGCGGCGATGGTGCTCCCCGATACGC
TTCAGCCGCACACGCCGGTGGGACTCGCTTGGGCTGATCTGTCGACCGGAAGGTTTCAAGCAGCGGTGTTTCCCTTCGCG
CGGCTCGGCGATGAACTCGCGCGGCTGCAGCCTTCGGAATGCCTGCTGGGAGATGATCAGCCTCCCCCAACTTGCCCGTT
CCCGCCACGGATGATGATCACCCGTCGGCCCGAGTGGACCTTCGCCCGCGATACGTCGCAAGCGGTACTCCAAAAACAGC
TGCAGGTCGCCTCGCTCGAAGGGTTTGGCTTCGACGAGAGTGACATGCTGGCCATTCGCGCTGCGGGGGGAATTCTCGAG
TACCTGCGCGAGACCCAAAAAACTTCGCTCGATCACATCGATCGTTTGCTGCCGTATCGCTCGGGGGAATCGCTCGAAAT
CGATGAAGCGACCCGGCGCAGCCTCGAGATCACGCGGACCTTTCGGAGCGGCGCTCGCGAAGGCTCGCTGCTGTCGGTCA
TCGATCAAACGATCACTCCTCCCGGCAGCCGATTGCTGGCCGACTGGGTCGGCGCGCCACTGACCAACCTGGCCGCCATC
GGCGCGCGGCAAGATGCGGTGGAACTCCTCCGCAACAGCGCCACCGTTCGGCGCCAAATTCGTGAGGAACTGGCGGGCGT
TTACGATCTCGAGCGGCTGATTGCGCGGGTGACCACGCTGCGCGCAAGTCCTCGCGATTTGGCGTTTGTCGGACGAACAC
TCGCGCGACTGCCGCAGCTCAAAGCGCTCGTTGCCCATCTGCGTGCGCCGCTGCTCGACGATCTGCAAACGCGGCTCGAC
GAATCTCCTGCGCTCCGCGATTTGCTGGCCGCTGCGCTCGAGGACGATTGCCCGCTTTTGGCCCGCGATGGGAACTTCAT
TCGCCAAGGTTTTCACGGCGAGCTCGATCGCCTGCGCGAGATGGCCCATGGTGGCAAAGCGTGGATCGCGCGCTATCAGG
CCGATCAGATCGAAAAGACGGGAATTCCCAATCTGAAGGTCGCCTTCAACAAAGTCTTTGGCTACTACATCGAGATCACC
AACGCGCAAAAAGAGAAGACGCCGCCCGAGTATATTCGCAAGCAAACGGTCGCTTCCGCCGAGCGCTACATTACCCCCGA
GCTGAAAGAGTACGAAGAAAAAGTCCTCACCGCCGACGAGCGTTCGAAAGAGCTCGAGTATCAGCTGTTTGTAGAGCTGC
GCGACAAGACGCATCAATTTGCCCGCGCTTTGCGGATGACAGCGGCGGCGATTGCCGAGCTCGATGTCCTCGCGGCGCTC
GCGCAGCTGGCCGATCGTCCCGACTATTGCCGCCCGGTGATGACCGAGGATCAAGTGGTCGAGATCGTTGAAGGTCGGCA
CCCGGTGCTCGATGCCATTTTGCCGCGCGGCACGTTTGTGCCGAACGATACCACGCTCGGCACCGACGGAGGACTGGTGA
TGCTGATCACCGGTCCGAACATGGCGGGCAAGAGCACCTACATTCGGCAGGTCGCCGTGCTGTCGCTTCTGGCGCACGTG
GGGAGTTTTTTACCAGCGTCGCGCGCTACGATTGGAATTTGCGACCGGATTTTCGCCCGCGTTGGCGCAAGCGATGAGCT
GTCGCGCGGTCAAAGCACCTTCATGGTCGAGATGACCGAAACCGCGCGGATTTTGAATTCCGCCACAGCGCGCAGCCTCG
TGATCCTCGACGAAATTGGGCGTGGCACAAGCACCTACGATGGCATTTCGCTCGCCTGGGCGATCGTCGAACATCTGCAC
GATCAGATCGGCTGTCGCACGCTGTTTGCCACGCACTACCACGAACTCACCGACCTGGCTGGCTCGCTCGCTGGCGTCCG
CAACCTGAGCGTCGCGGTGCGCGAGTGGCAAGATCAAGTGGTGCTGCTTCACAAGATTGTGCCCGGCGCAGCCGACAAAA
GTTATGGCATTCACTGCGCTCGATTGGCCGGTGTTCCGCGGAGCGTGAACGAACGGGCCAAACAGATTCTCGCGAAACTC
GAGGGAGAAAATCTCGACACCGAGGGACGGCCGAAGCTGATTGCGCGGACCAAAAAATCGCGCAAGGGAGACCTGCAGCT
GACACTGTTTGCCCCGGAAGAGCATCCACTCCTCGAGCAGCTGCGGCAACTCGATTTGGCGGGGCTCACGCCGCTGCAGG
CTATGCAGTGGCTTGCCAACTGGCAGCAAGAGATCGGCCCAAAAAAGTAG

Protein sequence :
MAVTPMMQQYLDAKQACGDALLLFRMGDFYELFHDDARTASRVLGLTLTTRDKGENPVPMAGFPYHQLEGYLAKLIGGGL
RAAVCEQVEDPRQAKGLVKREVTRVVTPGTLTDDALLDPRESNYLAAMVLPDTLQPHTPVGLAWADLSTGRFQAAVFPFA
RLGDELARLQPSECLLGDDQPPPTCPFPPRMMITRRPEWTFARDTSQAVLQKQLQVASLEGFGFDESDMLAIRAAGGILE
YLRETQKTSLDHIDRLLPYRSGESLEIDEATRRSLEITRTFRSGAREGSLLSVIDQTITPPGSRLLADWVGAPLTNLAAI
GARQDAVELLRNSATVRRQIREELAGVYDLERLIARVTTLRASPRDLAFVGRTLARLPQLKALVAHLRAPLLDDLQTRLD
ESPALRDLLAAALEDDCPLLARDGNFIRQGFHGELDRLREMAHGGKAWIARYQADQIEKTGIPNLKVAFNKVFGYYIEIT
NAQKEKTPPEYIRKQTVASAERYITPELKEYEEKVLTADERSKELEYQLFVELRDKTHQFARALRMTAAAIAELDVLAAL
AQLADRPDYCRPVMTEDQVVEIVEGRHPVLDAILPRGTFVPNDTTLGTDGGLVMLITGPNMAGKSTYIRQVAVLSLLAHV
GSFLPASRATIGICDRIFARVGASDELSRGQSTFMVEMTETARILNSATARSLVILDEIGRGTSTYDGISLAWAIVEHLH
DQIGCRTLFATHYHELTDLAGSLAGVRNLSVAVREWQDQVVLLHKIVPGAADKSYGIHCARLAGVPRSVNERAKQILAKL
EGENLDTEGRPKLIARTKKSRKGDLQLTLFAPEEHPLLEQLRQLDLAGLTPLQAMQWLANWQQEIGPKK
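
The protein sequence above can be reproduced by translating the DNA sequence codon by codon. The sketch below assumes the standard codon assignments (which NCBI translation table 11, commonly used for bacteria, shares for elongation codons); `translate` is an illustrative helper, and only the first 30 and last 21 nucleotides of the record are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Any character other than A, C, G, T raises KeyError here.
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 nt and last 21 nt of the DNA sequence in this record.
print(translate("ATGGCCGTTACCCCCATGATGCAGCAGTAT"))  # MAVTPMMQQY
print(translate("GAGATCGGCCCAAAAAAGTAG"))           # EIGPKK*
```

Both fragments agree with the start (MAVTPMMQQY) and end (EIGPKK) of the listed protein sequence; the trailing `*` is the TAG stop codon.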

• Homologs from PAI DB

Gene  GenBank Accn  Product                      Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity
mutS  AAA80578.1    DNA mismatch repair protein  Virulence                SPI-1       Protein         2e-129  41

• Homologs from VFDB (virulence genes)

Gene       GenBank Accn    Product                           ID of source DB  Alignment Type  E-val   Identity
Psta_4270  YP_003372778.1  DNA mismatch repair protein MutS  VFG0562          Protein         5e-140  43