Gene Information

Name : Hneap_0653 (Hneap_0653)
Accession : YP_003262553.1
Strain : Halothiobacillus neapolitanus c2
Genome accession: NC_013422
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0249
EC number : -
Position : 707090 - 709735 bp
Length : 2646 bp
Strand : -
Note : KEGG: aeh:Mlg_1485 DNA mismatch repair protein MutS; TIGRFAM: DNA mismatch repair protein MutS; PFAM: DNA mismatch repair protein MutS domain protein; MutS II domain protein; MutS IV domain protein; MutS III domain protein; SMART: DNA mismatch repair prot

DNA sequence :
GTGACTGAATCAAAAGATCTAAGCCAACACACGCCGATGATGCAGCAATTCTGGACGATGAAACAGGCGCACCCGGATGT
GTTGCTGTTTTATCGTATGGGGGATTTTTACGAGCTGTTTTACGCCGATGCCGAGCGGGCGGCGCGCATTCTCGATTTGA
CACTGACGACGCGCGGGCAGTCGGCAGGCGAGCCGATTCCGATGGCGGGTGTTCCGGTTCATGCCTACGAGAGCTATCTG
GCGCGGTTGATTCGCGCGGGCGAATCGGTGGCCATTTGCGAGCAGATCGGTGAAACCAAAACCAAAGGCCCGATGGAGCG
TGCGGTGGTGCGGGTCGTCACACCCGGAACGGTCACGGATGAGGCCTTGCTCGATCAGCGCGAAGGCAACCGCTTGGCGG
CATTGGTGCCGCTGGCAACCACGCCACCGGAATACGGGTTGGCGCATCTGGATCTGGCGGCAGGCGATTTCGTGCTCATG
CGGCTCGATGATGCGGCGCTGACGGCCGAGCTGGCGCGAATCGATCCGCGTGAATTGCTGTTGCCGGAATCGCTGGCCGA
GGCCGCCGACACGGCGGCGAAGATAGGCGTGGACCCCAAACGTTGGCGTACGCGCGCCGATTGGCAGTTCGATGCCAAAC
GCGGGCAGGCGGCCTTGCTCAAACACTGGCAGATTCACGATCTTAAAAGTTTTGGCGTGACGGAAATCCATCAACCGGCG
CTGGGTGCGGCCGCCATTTTGCTGACCTATGTAGCCGAGACCCAGCGTAGTGCCGTGCCGCATATCGAGCGCCTGCGGGT
GGAGCACCTGGGCGATGCCCTGCTGATTGACCGCAACACCCGTCGCCATCTGGAGCTCTTCACTTCAAATCAAGAAGGAA
GTCACGATGACGGCCGTTCGGCAGCCACGCTGATCAACCTGCTGGATGAGACGGTGACCGCGCACGGCTCGCGGCTGCTC
AAGCATTGGCTAGGTCGCCCGCTGCGTGATCAGGCCGTGTTGCGGCATCGGCAGCAGGCGATTGGCGAACTGATCGAGCG
CGGCAAGATCAATGCGCTGCGCGAATCGTTGCGCGGTATCAACGATATTGAACGCATCACCACCCGCATCGTGATGGGCA
GCGCCCGCCCCCGTGATTTGTCCGGGCTGCGCGATGCCCTTGGTGTATTGCCCGCGCTGAGTGCGCAACTCAACCAACTC
GACCTGCCCTTATGGCGCGATCTGGCCGTTCGGCTGACCGATCAACCCGCCCCGCGTGAATTGCTGAACCGCGCACTGGT
GACCCAACCACCCGTGTGGCTGCGCGATGGCGGCGTGATTGCCGCCGGATTCGATGCCGAACTCGACGAATTGCGCCACC
TTTCTGAACACGCGGACGACGCCCTGAATGCGCTCGAAGCCCAAGCGCGACTGCAAAGCGGTATTCAGTCCTTGAAGATC
GCCTACAACCGTGTGCAGGGGTTCTATTTTGAAGTCAGCCGGTTGCAGGCCGAAAAAATGCCACCGCAGTTTATTCGCCG
CCAGACGCTCAAATCGGTGGAGCGCTATACGACCGAAGAGCTGAAAACCTTCGAAGATCGCGTGTTGTCCGCCCGCGACC
GCGCCTTGGCACGCGAACAAGGGCTCTTCACCGAATTGTTGCAAACCCTCGCGACGCACCAGAGCGCCCTGCGCCGCATG
GCCGAAGCCATTGCCGAGGTCGATGTGCTGCACAGTTTGGCGCGGGTGGCCGAGTGCCAGCGCTGGGTGGCACCGGAACT
CGGCAGTGAACCGGGCATCCACATCGAAGCGGGACGACATCCGGTGATTGAAGCCCTGACCAAACAAACCTTAGGGAATC
AGCCCTTCACACCGAATGATTGCGAACTCACGCCAAACCGGCAACTGTTGATGATTACCGGCCCGAACATGGGCGGTAAA
TCGACCTATATGCGGCAAACGGCGTTGATCGTGCTGCTGGCGCACATTGGCGCGTTCGTCCCTGCTACCCGCGCGCGTAT
CGGTCCGATCGATCGCATTTTCACCCGCATCGGCGCGGGCGATGATCTGGCCTCCGGCCGTTCGACTTTTATGGTCGAGA
TGACCGAAACGGCAGAAATCCTGCACACGGCGACCGAAAATTCACTGGTATTGATCGATGAAATCGGTCGGGGCACGTCG
ACCTTCGATGGCCTGGCACTGGCCTGGGCCGTGGCGGAGCACCTGATTCGCCGCAACCGCGCGCTCACGCTGTTCGCCAC
CCATTACTTCGAGCTGACTCAACTGACCGAGCGCTTCGATACGGTCCGAAACGTACACCTCGATGCCGTCACACACAAGG
ACGATTTGATTTTTCTGCACAGCGTGAAAGATGGCCCGGCCAGCCAGAGTTACGGCATCAAGGTCGCTGCGCTGGCCGGT
TTGCCCCGGGAGGCTATTCGGCGAGCACAAGCGTTACTAAAACAACTAGAGCAGCAACACCCCGTGGGAGCGGCCACGCC
GCAGCTCGATTTGTTTGCCGCGCCCGAAGTAACCGATGCAATTGAGGAACCTGAGATTGAGCCGCACCCGTTGATTACCG
CGCTCGAAAAACTCGACCCGGACATACTCACGCCGAAGCAGGCGCTGGATTTGATTTATGCCTGGCGCAATGAACTTAAG
AAGTAA

Protein sequence :
MTESKDLSQHTPMMQQFWTMKQAHPDVLLFYRMGDFYELFYADAERAARILDLTLTTRGQSAGEPIPMAGVPVHAYESYL
ARLIRAGESVAICEQIGETKTKGPMERAVVRVVTPGTVTDEALLDQREGNRLAALVPLATTPPEYGLAHLDLAAGDFVLM
RLDDAALTAELARIDPRELLLPESLAEAADTAAKIGVDPKRWRTRADWQFDAKRGQAALLKHWQIHDLKSFGVTEIHQPA
LGAAAILLTYVAETQRSAVPHIERLRVEHLGDALLIDRNTRRHLELFTSNQEGSHDDGRSAATLINLLDETVTAHGSRLL
KHWLGRPLRDQAVLRHRQQAIGELIERGKINALRESLRGINDIERITTRIVMGSARPRDLSGLRDALGVLPALSAQLNQL
DLPLWRDLAVRLTDQPAPRELLNRALVTQPPVWLRDGGVIAAGFDAELDELRHLSEHADDALNALEAQARLQSGIQSLKI
AYNRVQGFYFEVSRLQAEKMPPQFIRRQTLKSVERYTTEELKTFEDRVLSARDRALAREQGLFTELLQTLATHQSALRRM
AEAIAEVDVLHSLARVAECQRWVAPELGSEPGIHIEAGRHPVIEALTKQTLGNQPFTPNDCELTPNRQLLMITGPNMGGK
STYMRQTALIVLLAHIGAFVPATRARIGPIDRIFTRIGAGDDLASGRSTFMVEMTETAEILHTATENSLVLIDEIGRGTS
TFDGLALAWAVAEHLIRRNRALTLFATHYFELTQLTERFDTVRNVHLDAVTHKDDLIFLHSVKDGPASQSYGIKVAALAG
LPREAIRRAQALLKQLEQQHPVGAATPQLDLFAAPEVTDAIEEPEIEPHPLITALEKLDPDILTPKQALDLIYAWRNELK
K

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
mutS AAA80578.1 DNA mismatch repair protein Virulence SPI-1 Protein 0.0 50

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Hneap_0653 YP_003262553.1 DNA mismatch repair protein MutS VFG0562 Protein 0.0 52