Gene Information

Name : NAL212_3137 (NAL212_3137)
Accession : YP_004296061.1
Strain : Nitrosomonas sp. AL212
Genome accession: NC_015222
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0249
EC number : -
Position : 3118396 - 3120993 bp
Length : 2598 bp
Strand : -
Note : TIGRFAM: DNA mismatch repair protein MutS, type 1; PFAM: DNA mismatch repair protein MutS, C-terminal; DNA mismatch repair protein MutS-like, N-terminal; DNA mismatch repair protein MutS, connector; DNA mismatch repair protein MutS, core; DNA mismatch rep

DNA sequence :
ATGAAGCCCTCTAAAGAGCAGCATACGCCGATGATGCAACAATACTTGCGCATTAAAGCACAGTATCCGGATATGTTGAT
GTTTTATCGTATGGGGGATTTTTATGAGTTATTTTTTGATGATGCCGAGAAAGCTGCAAAACTGCTTGGGATTACATTAA
CCCGACGCGGGGCCTCTGCAGGTGAGCCCATCAAGATGGCGGGGGTGCCCTATCATGCAGCCGAGCAATATCTGGCTAAG
CTGGTCAAGGCTGGCGAGTCCATCGCGATTTGTGAGCAGGTAGGTGATCTGGCTACCAGTAAAGGACCGGTAGCTCGCGA
GGTTGTCCGCATCCTCACACCCGGAACATTAACGGATGCAGCATTACTGGAGGATAAGCGCAATTGTATTTTGCTGTCAT
TATGGATCCATGAGTCGATTCTGGGGTTGGCATGGTTGAATCTGGCTGCGGGCCAATTACGGATATTGGAGACATCGCCG
CAAAATATGCTGAGCGAATTGGAGCGCTTGCAGCCGGCGGAAATACTTATACCTGAAGCGCTTACGTTGGATGAGTTGCA
AGGAAAGAAATGGGTTATAAAGCGACTACCGGCGTGGCAGTTTGATCGCGATAACGCGATTAGCAATCTGAAACGACAGT
TTGCAACGCATGATCTTTGCGGATTTGGCTGCGAGGATTTGTCTACGGCTCTGTGCGCGGTGAGTGCGTTATTGGAATAT
GTGCGTTTAACGCAGGGGGCGGCTGCGCTTCATATCACATCATTACAAGCTGAGCGGGAAAGTGTTTATGTCCGCATGGA
TGCCGCAACACGCCGCAATCTCGAAATTTCTGAAACCATCCGGGGTGAAAGATCGCCCACGCTATTGTCCTTGCTAGATA
CCTGCTCAACCAATATGGGTAGCCGTTTATTGCAATTTTGGCTACATCATCCATTGCGCGATCGAGGCGAAATTCAAAAA
CGGCTGGATAGCATTACCGTCTTGATCGGAGAGAATGGATCAAGCTATGTGGCGGTGCGCAATCTTTTACGGCAGATCGT
AGATATTGAGCGGATTACGGCGCGTATTGCGCTTAAATCAGCGCGTCCGAGGGATTTATCGGGTTTGCGCGACAGCTTGA
AACTATTTCCTGAAATTATTACAACGCTTGCGCAGTGCCACAGCGAGAGAATTGATCGATTGATTCAGGCGCTGCAAGTT
GAGCCAGCCCTAGGTGAACGATTGAATAAAGCCTTGCTGGAAGAGCCCGGCGTAGTGTTGCGCGAGGGTAATGTGATTGC
CGATGGTTATGATATTGAATTGGATGAATTGCGGGCATTGCAAAACAATTGCGGAGAATTCTTGTTACAGCTGGAAAGCC
GTGAAAAGGAACGTACCGGTATTCCCAACCTGAAAGTCGAATATAACCGCGTGCACGGTTTCTATATCGAAGTCACGCAT
GCGCATAGCGAGAAAATTCCTGTGGATTATCGGCGCAGACAAACTCTAAAAAGTGCTGAACGTTATATTACGCCTGAACT
GAAAGCTTTCGAAGATAAGGCATTATCGGCGCAGGATCGGGCACTGGCGCGAGAAAAATATTTATACGATGAATTAATTG
AGACACTGATACAGCATGTTCCTGCATTGCAGGAAATGGCGCTCAGCGTTGCCGAGATCGATGTGCTGTGCACATTGGCA
GAGCGTGCGCAAGTACTTGACTACACTGCACCCTATTTATCCAATGAGGATATTGTCAGCATCGATACCGGTCGACATCC
GGTGGTGGAAAGCCAGGTTGAGAATTTTGTCGCTAACGATGTTCAGTTGGGCACCCAACAAACGGGAATACAGCAAATGC
TGCTGATTACCGGACCCAACATGGGTGGGAAATCTACCTACATGCGACAAATAGCACTGATTAGCTTGCTTGTGCATTGC
GGGTGTTATGTTCCGGCTAAAAAGGCGTGCATTGGCATACTTGACCAAATCTTTACCCGTATCGGTGCCGCTGACGATCT
TGCCAGCGGCCGTTCCACGTTCATGGTGGAAATGACGGAAACAGCCAATATACTCCATAACGCAACAGCGCAGAGCCTGG
TGCTGATGGATGAAGTTGGCCGCGGTACATCGACTTTTGATGGTTTGGCGTTGGCATTTGCTATTGCCCGGCATTTACTG
ACAAAAAACCGCAGTCTTACTTTATTTGCTACGCATTACTTCGAACTGACAAAACTTGCAGAAGAGTTTAAGCAAGTCAA
GAACGTTCATCTGGATGCGGTGGAATATAAACAACATATCGTGTTTCTGCATAAAGTTACCGAGGGGCCGGCAAGTCAGA
GCTATGGTTTGCAAGTGGCGGCATTGGCAGGGGTTCCCGAATCAGCGATTAGAGTAGCAAGAAAATACCTGATTAATCTC
GAGCAGGAAAGTATCCATAGGGAGCCGCAATTGGACTTATTCGCTTTGTCGGTTGCGGAGCCGGAAGAGATTGTTATGCA
AGAACACCCCGTAATTCCAATGCTTCGAAATCTTTCGCCTGATGAGCTCAGTCCCAGGCAGGCATTGGAGCAGATTTATT
TACTGAAAAAAATGACAGATAGCGAACATGACAGCTGA

Protein sequence :
MKPSKEQHTPMMQQYLRIKAQYPDMLMFYRMGDFYELFFDDAEKAAKLLGITLTRRGASAGEPIKMAGVPYHAAEQYLAK
LVKAGESIAICEQVGDLATSKGPVAREVVRILTPGTLTDAALLEDKRNCILLSLWIHESILGLAWLNLAAGQLRILETSP
QNMLSELERLQPAEILIPEALTLDELQGKKWVIKRLPAWQFDRDNAISNLKRQFATHDLCGFGCEDLSTALCAVSALLEY
VRLTQGAAALHITSLQAERESVYVRMDAATRRNLEISETIRGERSPTLLSLLDTCSTNMGSRLLQFWLHHPLRDRGEIQK
RLDSITVLIGENGSSYVAVRNLLRQIVDIERITARIALKSARPRDLSGLRDSLKLFPEIITTLAQCHSERIDRLIQALQV
EPALGERLNKALLEEPGVVLREGNVIADGYDIELDELRALQNNCGEFLLQLESREKERTGIPNLKVEYNRVHGFYIEVTH
AHSEKIPVDYRRRQTLKSAERYITPELKAFEDKALSAQDRALAREKYLYDELIETLIQHVPALQEMALSVAEIDVLCTLA
ERAQVLDYTAPYLSNEDIVSIDTGRHPVVESQVENFVANDVQLGTQQTGIQQMLLITGPNMGGKSTYMRQIALISLLVHC
GCYVPAKKACIGILDQIFTRIGAADDLASGRSTFMVEMTETANILHNATAQSLVLMDEVGRGTSTFDGLALAFAIARHLL
TKNRSLTLFATHYFELTKLAEEFKQVKNVHLDAVEYKQHIVFLHKVTEGPASQSYGLQVAALAGVPESAIRVARKYLINL
EQESIHREPQLDLFALSVAEPEEIVMQEHPVIPMLRNLSPDELSPRQALEQIYLLKKMTDSEHDS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
mutS AAA80578.1 DNA mismatch repair protein Virulence SPI-1 Protein 0.0 51

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
NAL212_3137 YP_004296061.1 DNA mismatch repair protein MutS VFG0562 Protein 0.0 53