Gene Information

Name : Cri9333_2068 (Cri9333_2068)
Accession : YP_007142456.1
Strain : Crinalium epipsammum PCC 9333
Genome accession: NC_019753
Putative virulence/resistance : Virulence
Product : DNA mismatch repair protein MutS
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2347017 - 2349755 bp
Length : 2739 bp
Strand : +
Note : PFAM: MutS family domain IV; MutS domain II; MutS domain V; MutS domain I; MutS domain III; TIGRFAM: DNA mismatch repair protein MutS; COGs: COG0249 Mismatch repair ATPase (MutS family); HAMAP: DNA mismatch repair protein MutS, type 1; InterProIPR005748:I

DNA sequence :
ATGAGCGCAGACATCGAAGCATTCACCTATCCAGAAACCCAAAATAGTAGAGCGGGTCAGCATCTTGAGGCGATGACGCG
CGTTTCTGCTAATAATGCTGATTATCGGACAGTGGAAAGGCAAAAGCTGACTCCGATGATGCAGCATTTTGTGGAAGTTA
AGGAACAGTATCCCCATGCGTTGCTGTTATATCGTGTAGGGGATTTTTATGAGACGTTTTTTCAGGATGCGCGATCGCTC
GCAGAGTCGCTGGAACTGGTTTTAACTTCTAAGGAGTCGGGTAAAGATATTGGTCGAGTGCCGATGTCGGGTATTCCTCA
TCATGCTTTGGATAGATATTGTACGCTGTTGGTGGAAAAAGGGTTTGCGATCGCCATTTGCGACCAAGTAGAAGATGCAG
CAGAGGCGGCGGCGCAAGGTCGTCAGGTGCGCCGAGAAGTAACGCGGGTGTTAACTCCTGGGACGTTACTAGAAGAAGGG
ATGTTAAATGCGCGTCGTAATAACTTTTTAGCAGCAGTGGTAATTGCTGGGGAACATTGGGGTTTAGCTTACGCAGATAT
TTCTACAGGGGAATTTTTAACTACTCAATCAAATAATTTAGAACACCTCACGCAGGAATTAATGCGTTTGCAACCTGCTG
AGGTGCTAGTACCTGTAAATGCGCCAGATTTAGGTGGTTTTCTACGACCAGGGCAAAAGTCAGATTATTTGCCAGAGTGT
TTACCACCATCATTCTGTTATGCACTGCGATCGCAATATCCTTTTTCTTTATCCGAAGCTAAACAAAGATTACTGGAAAA
GTTAAAAGTGCGATCGCTTGAAGGGATGGGTTGTGAACATCTCCCCCTTGGTGTGCGTGCTGCGGGTGGGTTGCTGGAAT
ACTTAGAAGACACCCAAAAAGGTAATCAAGTACCTTTACAAACCCTACGGACTTATACCCTTGCCGATTATCTAATTATT
GATAACCAAACCCGTCGCAATTTGGAAATTACTCAAACTGTTCGGGATGGTACATTACATGGATCACTATTATGGGCGCT
AGATAGAACCAGTACGGCAATGGGTGGACGTGCTTTGCGCCGATGGTTGTTACAACCATTAGTAGATATAAAAGGCATTG
AAGCACGGCAAAATACGATTCAAGAATTAGTTGAAAATACGGCTTTACGCCAAGATTTACAACAGTTATTGCGCCAAATT
TATGATTTAGAAAGATTAACTGGTCGCTCTGGTTCCGGTAGAGCTAATGCTAGAGATTTAATCGCTTTGGCAGATTCATT
ATTAAGATTACCAGAATTAGCTATATTAGCATCTACTGGTGAATCTCCCTTTTTGAAAGCTGTGCAAAAAGTGCCTCCAA
TGTTGCAGGAATTGGGGCAACAAATTCGTAATCATATAGTAGATTCGCCTTCTCAACATTTAATGGAAGGAAATTTAATT
CGTCCTGGTGTAAACGAGTTATTAGATGAGATGCGAGGTGCTGCTGAAGGCGATCGGCAATGGATTGCTAATTTAGAAGT
TACAGAAAGAAACCGGACTGGTATTTCTACACTAAAGGTAGGTTTTAATAAAACTTTTGGTTATTACATTAGTATTTCTA
GATCAAAAGCGGATCAGGTTCCCGACAATTATATTCGCAAACAAACTTTAACTAATGAGGAGCGTTACATCACTCCAGAT
TTGAAGGAAAGAGAAGCGCGAATTTTAACAGCGCGGGAAGATTTAAATAAGCTGGAATATGAAATATTTTGTGCTTTACG
GGCAGAAGTTAGTGAACAAGCAGAGCAAATTCGTCATGTTTCACGCGCTGTTGCTGCTGTCGATGTTTTATGTGGTTTAG
CAGAAGTAGCGGTACATCAGGGTTATTGTTGTCCGCAAATGGAACAAGGGCGAGAAATTAATATTATTGAAGGTCGTCAT
CCAGTAGTAGAACAATCTCTGCCAGCAGGATTTTTTGTGCCGAATTCGACTAATTTAGGAAGTTCGGAGTTGTCAGAAAG
TCCTGATTTAATTATTCTTACTGGCCCCAATGCGAGTGGTAAAAGTTGTTATTTACGGCAGGTAGGATTAATTCAATTAA
TGGCACAAACGGGGAGTTTTGTACCAGCGCAGTCAGCAAGGTTGGGAGTGTGCGATCGCATTTTTACCCGTGTTGGTGCT
GTTGATGATTTAGCCACAGGTCAATCTACCTTTATGGTAGAAATGAATGAAACTGCTAATATTCTCAACCATGCTACGCC
TAAATCTCTAGTTTTATTAGATGAAATTGGGCGTGGTACTGCTACATTTGATGGTCTTTCTATTGCTTGGGCGGTAGCGG
AATATTTAGCTAGTGATATTAGGGCAAGAACTATTTTTGCTACGCATTACCACGAATTGAACGAATTAGCATCTATTTTG
CCGAATGTAGCAAATTATCAAGTAACAGTCAAAGAGTTACCCGACCAAATTATCTTTTTACACCAAGTACAACCAGGAGG
CGCGGATAAATCTTATGGTATTGAAGCTGGACGTTTAGCAGGTTTACCAGCATCGGTAATTTTACGCGCTAGGCAGGTAA
TGGGGCAAATTGAGCAGCATAGTAAAATTGCTGTGGGATTACGTGAAGGGATTGGTAGTAATAAACCTAGTGGGAAATCA
GCATCAGGTCGTTCTGGACGTAAGAAAAAGGCTGCTGATGTTTCTAATGGAGAGCAGGAAATAGTGGGAAAAGAGGATAA
TTCGAGCGAGAACGAGTGA

Protein sequence :
MSADIEAFTYPETQNSRAGQHLEAMTRVSANNADYRTVERQKLTPMMQHFVEVKEQYPHALLLYRVGDFYETFFQDARSL
AESLELVLTSKESGKDIGRVPMSGIPHHALDRYCTLLVEKGFAIAICDQVEDAAEAAAQGRQVRREVTRVLTPGTLLEEG
MLNARRNNFLAAVVIAGEHWGLAYADISTGEFLTTQSNNLEHLTQELMRLQPAEVLVPVNAPDLGGFLRPGQKSDYLPEC
LPPSFCYALRSQYPFSLSEAKQRLLEKLKVRSLEGMGCEHLPLGVRAAGGLLEYLEDTQKGNQVPLQTLRTYTLADYLII
DNQTRRNLEITQTVRDGTLHGSLLWALDRTSTAMGGRALRRWLLQPLVDIKGIEARQNTIQELVENTALRQDLQQLLRQI
YDLERLTGRSGSGRANARDLIALADSLLRLPELAILASTGESPFLKAVQKVPPMLQELGQQIRNHIVDSPSQHLMEGNLI
RPGVNELLDEMRGAAEGDRQWIANLEVTERNRTGISTLKVGFNKTFGYYISISRSKADQVPDNYIRKQTLTNEERYITPD
LKEREARILTAREDLNKLEYEIFCALRAEVSEQAEQIRHVSRAVAAVDVLCGLAEVAVHQGYCCPQMEQGREINIIEGRH
PVVEQSLPAGFFVPNSTNLGSSELSESPDLIILTGPNASGKSCYLRQVGLIQLMAQTGSFVPAQSARLGVCDRIFTRVGA
VDDLATGQSTFMVEMNETANILNHATPKSLVLLDEIGRGTATFDGLSIAWAVAEYLASDIRARTIFATHYHELNELASIL
PNVANYQVTVKELPDQIIFLHQVQPGGADKSYGIEAGRLAGLPASVILRARQVMGQIEQHSKIAVGLREGIGSNKPSGKS
ASGRSGRKKKAADVSNGEQEIVGKEDNSSENE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
mutS AAA80578.1 DNA mismatch repair protein Virulence SPI-1 Protein 2e-121 41

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Cri9333_2068 YP_007142456.1 DNA mismatch repair protein MutS VFG0562 Protein 3e-130 42