Gene Information

Name : Sthe_2804 (Sthe_2804)
Accession : YP_003321039.1
Strain :
Genome accession: NC_013524
Putative virulence/resistance : Unknown
Product : 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Function : -
COG functional category : C : Energy production and conversion
COG ID : COG1012
EC number : 1.2.1.8
Position : 411338 - 412870 bp
Length : 1533 bp
Strand : +
Note : KEGG: ssn:SSON_4498 5-carboxy-2-hydroxymuconate semialdehyde dehydrogenase; TIGRFAM: 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase; PFAM: Aldehyde Dehydrogenase

DNA sequence :
ATGACCACAACGACCGACTACGCGGCCTTGAGCAAGCGGCTGCTCGATGACCTGCCGCCGCTCCAGAACTACATCGGCGG
GGAATGGACTCCCGGCCCGATCGGCGAGACCTTCATGTCCATCAACCCGGCGACGAACCAGCCGATCGCGACGGTCCACT
CCGCCGGGCGCGCCGGAGCGCAGCAGGCCGTTATTGCCGCCCGCGAGGCGTTCGAGAGCGGCGCCTGGTCACGGATGCCC
GCCGCCGACCGCGCCCGGGCCCTGCGCCGGATCGCGGAGCAGATCCGCAAGCGGGCCGATGAAATCTCGGTGCTCGAGAC
GAGCGACACCGGCATCCCGATCACCCAGATCCGGGCCGGGCAGGTGCTGCGCGCCGCCGACAACTTCGACTTCTTCGCCG
AGATGGCGACGCAGATCACTGGCGAGACCTTCCCGGTCGAGGGCACCTTCCTCAACTACACCGTCCACAAGCCGGTAGGG
GTCGCCGTGCTGATCACACCCTGGAATACGCCCTTCATGCTGGAGACGTGGAAGGTCGCCCCGGCGCTCGCGGCCGGCAA
CACCTGCATCCTCAAGCCCGCAAGCTGGTCGCCGATCTCGGCCTACCTGCTGGCGAAGGCCATCGAGGAGGCGGATCTGC
CGCCCGGCACCTTCAACCTCGTCTACGGCTCCGGCGAGACGGTCGGCACCGCGCTCGTCGCCCACCCGGAGGTTAACCTG
GTCTCCTTCACCGGCGAGACGACAACCGGCAAGCAGTTGATGCGCACCGGCGCCGAGACGCTGAAGCGCTTCTCGATGGA
GCTGGGTGGGAAGTCACCCGTCATCGTCTTCGCCGACGCCGACCTGGACCGCGCGCTCGACGCCGCTATCTTCGGCGTCT
TCTCCCTCAACGGCGAGCGCTGCACCGCCGGCACCCGCCTCTTCCTGGAGCGCCCGATCTACGACGACTTCGTGAGCCGC
CTGATCGAGCGCGTGCGCCGGATTCGCGTCGGCGATCCGCTCGACCCGGCGACCGAGGTCGGCCCACTGATCCACCGGCG
GCACCTGGAGCGGGTCATGGGCTATCTCGACATCGCCCGGGAGGAAGGCGCCACCATCGCCGTTGGCGGCCGGCGACCGG
ACCGACCCGAACTGGCGGACGGCAACTACCTGGAGCCGACGGTGATCGTCGACGTGCGCAACGAGATGCGCGTGGCCCAG
GAGGAGATCTTCGGCCCGGTCCTCACCGTGATCCCGTTCGAGGACGAGGCCGAGGTCCTGCGCATGGCGAACGATGTCCG
CTACGGCCTGGCCGCCTACCTCTGGACGTCGGAAGTCAGCCGCGCCACGCGGCTCGCGCCCGAGATCGAGTCCGGGATGG
TCTGGGTTAACTCCCAGAACGTCCGCGATCTGCGGACGCCGTTCGGCGGCATGAAGGAGAGCGGCATCGGCCGCGAGGGC
GGCCGCTACTCCTTCGAGTTCTACACCGAGACGAAGACGATCCACGTGGCGCTCGGGCAGCATCGCATCCCGAAGTTCGG
GGCGACGTCCTGA

Protein sequence :
MTTTTDYAALSKRLLDDLPPLQNYIGGEWTPGPIGETFMSINPATNQPIATVHSAGRAGAQQAVIAAREAFESGAWSRMP
AADRARALRRIAEQIRKRADEISVLETSDTGIPITQIRAGQVLRAADNFDFFAEMATQITGETFPVEGTFLNYTVHKPVG
VAVLITPWNTPFMLETWKVAPALAAGNTCILKPASWSPISAYLLAKAIEEADLPPGTFNLVYGSGETVGTALVAHPEVNL
VSFTGETTTGKQLMRTGAETLKRFSMELGGKSPVIVFADADLDRALDAAIFGVFSLNGERCTAGTRLFLERPIYDDFVSR
LIERVRRIRVGDPLDPATEVGPLIHRRHLERVMGYLDIAREEGATIAVGGRRPDRPELADGNYLEPTVIVDVRNEMRVAQ
EEIFGPVLTVIPFEDEAEVLRMANDVRYGLAAYLWTSEVSRATRLAPEIESGMVWVNSQNVRDLRTPFGGMKESGIGREG
GRYSFEFYTETKTIHVALGQHRIPKFGATS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
hpaE AAO17179.1 HpaE Not tested tcd island Protein 3e-113 53
ORF SG19 AAN62241.1 putative aldehyde dehydrogenase Not tested PAGI-3(SG) Protein 1e-68 42