Gene Information

Name : Haur_4078 (Haur_4078)
Accession : YP_001546838.1
Strain : Herpetosiphon aurantiacus DSM 785
Genome accession: NC_009972
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0249
EC number : -
Position : 5208304 - 5211090 bp
Length : 2787 bp
Strand : -
Note : TIGRFAM: DNA mismatch repair protein MutS; PFAM: DNA mismatch repair protein MutS domain protein; MutS III domain protein; MutS II domain protein; MutS IV domain protein; KEGG: rca:Rcas_3531 DNA mismatch repair protein MutS

DNA sequence :
ATGTCAAAAATGACGATGTGGCAGCAATATCTGTCGATTAAGCAAAAATATGCCGATGTGATTCTGTTTTTCCGGCTAGG
CGATTTTTACGAAACTTTCGGCGATGATGCCAAGTTGATTGCCGAGGTGCTTGATATTACCCTGACGGTGCGTGGCCTCA
GCAGCGATGAAAATACGCCGATGGCGGGTGTGCCCTATCACGCCGCCGATAATTATATTGAGCAACTGGTCAGTCGGGGC
TATCGCGTGGCAATCTGCGAACAAATGGATGAGATGGTGCACAAAACCTTGCAAAAACGTGAGGTTGTGCGGATTGTCAC
GCCAGGCACTCTGACCGAGCCAACCATGCTCCAAGCTGAACGCAATAGCTACCTTGCGGCAATTTTGGTTGATCGTGGCA
ATGTTGGTTTGGCGTATGCCGACCTAACCACCGGCGAGTTTTGTGCCACCGAATTGCGCGGCAACGAGGCTTTGAAGCAA
CTTGAAGGCGAATTAGCGCGTTTGGGTGCGGCTGAACTCTTGGTTTCCGATGCTCCTGAGTTGCGTCCAGCCGGCATGGA
AATTGCCAAAAAGCAGTTAGCCCAAGATCTTGCACCAATGCGCAAGGCTGAACGCGAACGCTTGCTACCCCACGAACGCA
CCGCCAAAAAAGTTGAAGGTAATAACGAAAGCACGTGGGTTCAAGGCAATGTCACCCAATGGCCTAATTGGCATTGGGAT
GCCCGCACCGCTCGCGATGCCTTGCTCAATCAATTTAAAAGCCAATCGCTTGATGGCTTTGGCTTGGGCAATAAAGCCCT
CGCAACCCGCGCCGCAGGAGCATTAATTCAATATTTGCATGAAACGCAGCGCGATAGTGTGGCCCAAGTTCGCAGCTTGC
GGGTCTATGATACAACTCGTTTTATGTTTCTCGACCCCCAAACGCGGCGTAATTTGGAGCTAACCGAAGGTGCTGGTGGC
CAACGCAAAGGCTCGTTGATTGCGGTGCTCGACCAAACCCGCACGCCGATGGGTGCACGCCTGTTACGCCAATGGATCTC
ACAACCGCTGATTGAGCTTGGCCCACTGACCGAGCGCCAACAAGCCGTCAGTTGTTTTGTTGAAGAAACCTTGGTCCGCG
GCGAGTTACGGGCCTTATTCAAAGGGGTTGGCGATATCGAACGCACAATCAATCGGGTGGTGCAAGGCATTGCCACCCCG
CGTGATTTAGTGCGATTGCGCGAAGCGCTGCGCCTAACTCCCGATATTTTGAGCCAAATCGAGCGTACAGGTTTGCGCTC
AACCAGCCCAACCGAGGCTGCGCCAAGTGATGATGATCTGTTTGATGACGAGCCAACAAGCAATCAAATCGATGCTTGTG
CCGATATTTGCGAATTACTCGAACAGGCGATTGCCGATGATCCGCCCGCCTTGCTTGGCACATGGGATAACGCCCGCAGC
GACGAAAATGTGATTCGCAAGGGTCATGCTGCCGAAATTGATGCAATTGTTGAGGCTACTCGCGATGCCGCCCGTTGGAT
CAACGAACTTGAAGCCAAGGAACAGCAACGCACTGGCATCAAAACGCTCAAAGTCAGCTATAACAAAGTCTTTGGCTATT
ACATCGAGGTAACCAAGGCCAGCGGCGAAACCCGCATTCCCGATGATTACATTCGCAAACAAACCTTGGTCAATGCCGAG
CGCTATATCACGCCAGAACTCAAAGAATATGAATCGCTGATTTTGAATGCCTCAGAAGCCTTGAACGAAAAAGAGCGCCA
AGCATTTCGCCTGATTTTGCGCCATTTGGCCAACGCTGGCAATCGTTTGCTCGATTTAGCGCGAGCAATCGCCGAGTTTG
ATGTCTATAGCACCTTGGCCGAGGTGGCGGTGCGTCAGCGTTTTGTGCGGCCAACCTTGCGCCTCGACGATGTATTTGTG
ATTCAAGGCGGGCGACATCCTGTGGTTGAGCACAATTTGAACGAGCCATTTACCCCGAATGATGCTCATTTTGATGCTGA
CCATCAGATTATTGTGCTGACTGGACCAAACATGTCGGGCAAAAGCACCTTTTTGCGCCAAGTGGCTTTGATTGGCTTAA
TGGCCCAAATCGGCTCGTTTGTGCCCGCTGATTATGCCGAAATTGGCCTACTCGACCGAATTTTCACGCGGATTGGGGCA
CAAGACGATATTGCTACCGGCCAATCGACCTTTATGGTTGAAATGATCGAGACCGCCAATATTTTACATAATGGGTCACC
ACGATCGCTGATTATTCTCGATGAAATTGGCCGTGGCACCAGCACCTACGACGGGCTTTCGATTGCCCGCGCTGTGGTCG
AATATATTCATAATCAGCCGCGCTTACGAGCCAAAACCCTGTTTGCAACCCACTACCACGAACTGACCGAGCTGGCTAAC
ATCTTGCCACGGGTGCATAATTGGACGTTGGCGGTGGCCGAAGAAGGCGATCATGTAGTGTTTTTGCGCAAAGTGATCGA
GGGTGCGGCTGATCGCTCATATGGAATTCATGTGGCTCAAATGGCGGGCTTGCCCCCAGCCGTGATTAAACGTGCTACCG
AAGTGCTGAGCGAGCTTGAAGGTAAGGGTGATCGGGAGCAGCGCCGCGAGGCCATGCGTCGCATGAACGCAGCAGGCAGT
TCGGCTGTGCCCCAAATGTCGCTATTTGCGAGCAACGAGCCAAATCCAGCGGTTGAGCTATTGCGCGAAATGGATGTAAC
CCAACTAACCCCAATCGAAGCCCTAACCAAACTTTACGAATTACAACGTTTGGCTAAAGTGGAGTGA

Protein sequence :
MSKMTMWQQYLSIKQKYADVILFFRLGDFYETFGDDAKLIAEVLDITLTVRGLSSDENTPMAGVPYHAADNYIEQLVSRG
YRVAICEQMDEMVHKTLQKREVVRIVTPGTLTEPTMLQAERNSYLAAILVDRGNVGLAYADLTTGEFCATELRGNEALKQ
LEGELARLGAAELLVSDAPELRPAGMEIAKKQLAQDLAPMRKAERERLLPHERTAKKVEGNNESTWVQGNVTQWPNWHWD
ARTARDALLNQFKSQSLDGFGLGNKALATRAAGALIQYLHETQRDSVAQVRSLRVYDTTRFMFLDPQTRRNLELTEGAGG
QRKGSLIAVLDQTRTPMGARLLRQWISQPLIELGPLTERQQAVSCFVEETLVRGELRALFKGVGDIERTINRVVQGIATP
RDLVRLREALRLTPDILSQIERTGLRSTSPTEAAPSDDDLFDDEPTSNQIDACADICELLEQAIADDPPALLGTWDNARS
DENVIRKGHAAEIDAIVEATRDAARWINELEAKEQQRTGIKTLKVSYNKVFGYYIEVTKASGETRIPDDYIRKQTLVNAE
RYITPELKEYESLILNASEALNEKERQAFRLILRHLANAGNRLLDLARAIAEFDVYSTLAEVAVRQRFVRPTLRLDDVFV
IQGGRHPVVEHNLNEPFTPNDAHFDADHQIIVLTGPNMSGKSTFLRQVALIGLMAQIGSFVPADYAEIGLLDRIFTRIGA
QDDIATGQSTFMVEMIETANILHNGSPRSLIILDEIGRGTSTYDGLSIARAVVEYIHNQPRLRAKTLFATHYHELTELAN
ILPRVHNWTLAVAEEGDHVVFLRKVIEGAADRSYGIHVAQMAGLPPAVIKRATEVLSELEGKGDREQRREAMRRMNAAGS
SAVPQMSLFASNEPNPAVELLREMDVTQLTPIEALTKLYELQRLAKVE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
mutS AAA80578.1 DNA mismatch repair protein Virulence SPI-1 Protein 3e-101 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Haur_4078 YP_001546838.1 DNA mismatch repair protein MutS VFG0562 Protein 2e-112 43