Gene Information

Name : Nit79A3_3482 (Nit79A3_3482)
Accession : YP_004696603.1
Strain : Nitrosomonas sp. Is79A3
Genome accession: NC_015731
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein mutS
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0249
EC number : -
Position : 3691899 - 3694499 bp
Length : 2601 bp
Strand : -
Note : SMART: DNA mismatch repair protein MutS, C-terminal; DNA mismatch repair protein MutS, core; TIGRFAM: DNA mismatch repair protein MutS, type 1; KEGG: nit:NAL212_3137 DNA mismatch repair protein MutS; HAMAP: DNA mismatch repair protein MutS, type 1; PFAM:

DNA sequence :
ATGAAAACCCAGAAAGATCAGCATACCCCCATGATGCAGCAGTACCTGCGCATCAAGGCACAACACCCAGACATGCTGAT
GTTTTATCGCATGGGGGATTTTTATGAGTTGTTTTTTGACGATGCGGAGAAAGCGGCAAAGCTCTTGGGAATAACTTTAA
CGCAACGCGGCGCTTCTGCCGGGGAACCGATTAAAATGGCGGGAGTGCCCTATCATGCAGCAGAGCAGTATCTGGCAAAA
CTGGTCAAGTCAGGTGAATCCGTAGCCATTTGCGAGCAGGTGGGTGATCCGGCGACCAGCAAAGGCCCGGTTGCACGTGA
AGTCACCCGTATCATCACGCCTGGAACATTGACGGATGCGGCATTGCTTGAGGATAAGCGCGACTGCATATTATTGGCTT
TATGGGTGCACGAATCGATTCTGGGTCTGGCTTGGCTGAATCTGGCAGCGGGGCAATTACGCGTGATGGAGACATCACCA
CAAAATCTGCTGAGTGAGCTGGAGCGTCTGCAACCATCTGAGATCCTATTGCCAGAATCACTTAAACAAGCTGAAATACA
AGGTAAAAATTGGGCGCTAAAGCGATTGCCGTTGTGGCAGTTTGATCGTGATACTGCAATCAATAATCTTACGCGGCAAT
TTGAAACGCATGATCTGTCTGGCTTTGGTTGTGAGGATTTGCCTATTGCGCTGTGCGCTGCCGGTGCCCTACTGGAATAC
GCCCGCCTGACTCAAGGATCTGCTGCGCTTCCTATCACATCATTACAAGCTGAGCGGGATAGTATTTATATTCGGATGGA
TGCTGCCACACGCCGCAATCTGGAAATTTCTGAAACCATACGCGGTGAACGCTCACCTACGTTGTTGTCATTACTGGATA
CATGCTCAACCAATATGGGCAGCCGTTTGTTGCAATTTTGGCTGCATCATCCATTACGGGATCATGCAGCAATACAAAAG
CGGCTTGATAGCGTTGCAGCCTTAATCGGAGAAAGTGAGCAGAATAATTATTGGGTCGCGCGAGACCTGCTTCGGCAATT
TGTAGATGTTGAACGGATTACTGCCCGCATTGCACTCAAATCAGCGCGCCCGAGAGACTTATCGGGTTTGCGTGATAGCC
TGAAACTATTGCCAGAAGTCATTCAAGCCATGGCAAATGGTTCCAGTGAGAGAATTAGTCAATTGATTCAGGCTATGCAG
ATAGAACCTGCGCTCTTTGAGTTATTGAGAAAATCTTTGTTGGAAGAACCTGGTGTGGTGTTGCGTGAGGGCAATGTGAT
TGCCGATGGTTATGATGCCGAACTGGACGAATTACGTGCCCTGCAAAATAATTGCGGTGAATTCTTACTGCAGTTGGAGA
TTCGTGAAAAAGAACGCACCGGCATTCCTAATCTCAAGGTGGAATATAACCGTGTGCATGGTTTCTATATTGAAGTCACG
CACGCACACAGCGAGAAAATTCCCGCTGATTACCGGCGCAGACAAACACTAAAAAGTGCTGAGCGTTATATCACGCCGGA
ACTAAAAGCTTTCGAAGATAAAGCATTATCCGCGCAAGACCGGGCATTGGCGCGGGAGAAGTATTTATACGATGAACTTT
TGAATGTACTTCTGGGTTATATCCATCCATTGCAGAAAATGGCTGCGAGCGTTGCGGAAATTGATGTTTTATGTGCATTT
GCTGAGCGTGCACAAGCGCTTGACTATACTGCACCATATTTGTCGCATGAGGAAATTTTTGAGATAGATACCGGTCGCCA
CCCGGTTGTGGAAAGTCAGGTAGAAAACTTTGTTGCCAATGATGTTCAGTTAGGTGCTGACTATACCGGGAAACCGCAAA
TGTTGATAATTACTGGCCCCAATATGGGCGGTAAATCAACCTATATGCGGCAAATCGCGCTGATCGCTTTACTTGCACAT
TGCGGCAGTTATGTGCCTGCCAAAAGAGTACGTCTGGGTAGGCTGGATCAGATCTTTACCCGTATTGGCGCTGCCGATGA
TCTTGCCAGCGGGCGTTCCACATTTATGGTGGAAATGACGGAAACCGCCAATATACTCCATAACGCAACAGCGCAAAGCC
TAGTGCTTATGGATGAAGTCGGGCGCGGCACTTCCACTTTTGACGGCTTGGCCCTAGCGTTTGCCATTGCCCGGTATCTA
TTGAGTAAAAATCGCAGCTTTACACTTTTTGCCACGCATTACTTTGAATTAACGAAACTGGCTGAGGAGTTTAAACAAAT
CCAAAATGTCCATTTGGACGCCGTGGAGTATAAACATCGCATTGTTTTTCTGCATAAAGTAGCCGAGGGTCCGGCCAGTC
AAAGCTACGGTTTGCAAGTTGCTGCACTGGCAGGTGTTCCAGAGTCAGTGATCAAAGTGGCCAGAAAGCACTTGATTAAA
CTGGAGCAGGAAAGTGTAAAAAAGAAACCTCAGTTGGATCTGTTTTCCTTTTCCATTCCAGAACCGGAAGAAGATATTAC
GCAGGAACATCCATTGATTGCAATGTTACAAAACCTCTCGCCGGATGAACTAAGCCCCAGGCAAGCGCTGGAGCAGCTTT
ATTTATTGAAGAAAGCGATAGATGCGACAAATAACAGCTAA

Protein sequence :
MKTQKDQHTPMMQQYLRIKAQHPDMLMFYRMGDFYELFFDDAEKAAKLLGITLTQRGASAGEPIKMAGVPYHAAEQYLAK
LVKSGESVAICEQVGDPATSKGPVAREVTRIITPGTLTDAALLEDKRDCILLALWVHESILGLAWLNLAAGQLRVMETSP
QNLLSELERLQPSEILLPESLKQAEIQGKNWALKRLPLWQFDRDTAINNLTRQFETHDLSGFGCEDLPIALCAAGALLEY
ARLTQGSAALPITSLQAERDSIYIRMDAATRRNLEISETIRGERSPTLLSLLDTCSTNMGSRLLQFWLHHPLRDHAAIQK
RLDSVAALIGESEQNNYWVARDLLRQFVDVERITARIALKSARPRDLSGLRDSLKLLPEVIQAMANGSSERISQLIQAMQ
IEPALFELLRKSLLEEPGVVLREGNVIADGYDAELDELRALQNNCGEFLLQLEIREKERTGIPNLKVEYNRVHGFYIEVT
HAHSEKIPADYRRRQTLKSAERYITPELKAFEDKALSAQDRALAREKYLYDELLNVLLGYIHPLQKMAASVAEIDVLCAF
AERAQALDYTAPYLSHEEIFEIDTGRHPVVESQVENFVANDVQLGADYTGKPQMLIITGPNMGGKSTYMRQIALIALLAH
CGSYVPAKRVRLGRLDQIFTRIGAADDLASGRSTFMVEMTETANILHNATAQSLVLMDEVGRGTSTFDGLALAFAIARYL
LSKNRSFTLFATHYFELTKLAEEFKQIQNVHLDAVEYKHRIVFLHKVAEGPASQSYGLQVAALAGVPESVIKVARKHLIK
LEQESVKKKPQLDLFSFSIPEPEEDITQEHPLIAMLQNLSPDELSPRQALEQLYLLKKAIDATNNS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
mutS AAA80578.1 DNA mismatch repair protein Virulence SPI-1 Protein 0.0 51

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Nit79A3_3482 YP_004696603.1 DNA mismatch repair protein mutS VFG0562 Protein 0.0 53