Gene Information

Name : hpaE (ECIAI1_4574)
Accession : YP_002389802.1
Strain : Escherichia coli IAI1
Genome accession: NC_011741
Putative virulence/resistance : Unknown
Product : 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Function : -
COG functional category : C : Energy production and conversion
COG ID : COG1012
EC number : 1.2.1.16
Position : 4647063 - 4648529 bp
Length : 1467 bp
Strand : -
Note : Evidence 2a : Function of homologous gene experimentally demonstrated in another organism; PubMedId : 7737515, 2061311, 8550403; Product type e : enzyme
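
The Position, Length and Strand fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is the convention under which the record's own numbers agree. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start, end = 4647063, 4648529   # Position field, assumed 1-based and inclusive
declared_length = 1467          # Length field, in bp

# An inclusive interval [start, end] spans end - start + 1 bases.
span = end - start + 1

# A protein-coding gene should be a whole number of codons. The final
# codon is the stop, so the protein has one residue fewer than codons.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span == declared_length, remainder, residues)
```

For this record the span equals the declared 1467 bp and divides evenly into 489 codons, that is, 488 residues plus a stop codon.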

DNA sequence :
ATGAAAAAAGTAAATCATTGGATCAACGGCAAAAATGTTGCAGGTAACGACTACTTCCAGACCACCAATCCGGCAACGGG
TGAAGTGCTGGCGGATGTGGCCTCTGGCGGTGAAGCGGAGATCAATCAGGCGGTAGCGGCAGCGAAAGAGGCGTTCCCGA
AATGGGCCAATCTGCCAATGAAAGAGCGTGCGCGCCTGATGCGCCGTCTGGGCGATCTGATCGACCAGAACGTGCCGGAG
ATCGCCGCGATGGAAACCGCGGACACGGGCCTGCCGATCCATCAGACCAAAAATGTGTTGATCCCACGCGCTTCTCACAA
CTTTGAATTTTTCGCGGAAGTCTGCCAGCAGATGAACGGCAAGACTTATCCGGTCGACGACAAGATGCTCAACTACACGC
TGGTGCAGCCGGTAGGCGTTTGTGCACTGGTGTCACCGTGGAACGTGCCGTTTATGACCGCCACCTGGAAAGTCGCGCCG
TGTCTGGCGCTGGGCAATACCGCGGTACTGAAAATGTCGGAACTCTCCCCGCTGACCGCCGACCGCCTGGGTGAGCTGGC
GCTGGAAGCTGGTATTCCGGCGGGCGTGCTGAACGTGGTACAGGGCTACGGCGCAACCGCAGGCGATGCGCTGGTCCGTC
ATCATGACGTGCGTGCCGTGTCGTTCACCGGCGGTACGGCGACCGGGCGCAACATCATGAAAAACGCCGGGCTGAAAAAA
TACTCCATGGAACTGGGCGGTAAATCGCCGGTGCTGATTTTTGAAGATGCTGATATTGAACGCGCGCTGGACGCCGCCCT
GTTCACCATCTTCTCGATCAACGGCGAACGCTGCACCGCCGGTTCGCGCATCTTTATTCAGCAAAGCATCTACCCGGAAT
TCGTTAAACGCTTTGCCGAACGCGCCAACCGTCTGCGCGTGGGCGATCCGAACGATCCGAATACCCAGGTTGGCGCGCTT
ATCAGCCAGCAGCACTGGGAAAAAGTCTCCGGCTATATCCGTCTCGGCATTGAAGAAGGCGCAACCCTGCTGGCGGGCGG
CCCGGATAAACCGTCCGACCTGCCTGCGCACCTGAAAGGCGGCAACTTCCTGCGCCCAACCGTGCTGGCAGACGTTGATA
ACCGTATGCGAGTCGCCCAGGAAGAGATTTTCGGGCCGGTCGCCTGCCTGCTGCCGTTTAAAGACGAAGCCGAAGGCTTA
CGTCTGGCAAACGACGTGGAGTACGGCCTCGCGTCGTACATCTGGACACAGGATGTCAGCAAAGTGTTACGCCTGGCGCG
TGGCATTGAAGCTGGCATGGTGTTCGTCAACACCCAGAACGTGCGTGACCTGCGCCAGCCATTTGGCGGCGTAAAAGCCT
CCGGCACCGGGCGTGAAGGCGGTGAGTACAGCTTCGAAGTGTTCGCGGAAATGAAGAACGTCTGCATTTCCATGGGCGAC
CATCCAATTCCGAAATGGGGAGTCTGA

Protein sequence :
MKKVNHWINGKNVAGNDYFQTTNPATGEVLADVASGGEAEINQAVAAAKEAFPKWANLPMKERARLMRRLGDLIDQNVPE
IAAMETADTGLPIHQTKNVLIPRASHNFEFFAEVCQQMNGKTYPVDDKMLNYTLVQPVGVCALVSPWNVPFMTATWKVAP
CLALGNTAVLKMSELSPLTADRLGELALEAGIPAGVLNVVQGYGATAGDALVRHHDVRAVSFTGGTATGRNIMKNAGLKK
YSMELGGKSPVLIFEDADIERALDAALFTIFSINGERCTAGSRIFIQQSIYPEFVKRFAERANRLRVGDPNDPNTQVGAL
ISQQHWEKVSGYIRLGIEEGATLLAGGPDKPSDLPAHLKGGNFLRPTVLADVDNRMRVAQEEIFGPVACLLPFKDEAEGL
RLANDVEYGLASYIWTQDVSKVLRLARGIEAGMVFVNTQNVRDLRQPFGGVKASGTGREGGEYSFEVFAEMKNVCISMGD
HPIPKWGV
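
The protein sequence can be spot-checked against the DNA sequence by translating codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1, whose codon-to-amino-acid assignments are shared by the bacterial table 11). The DNA sequence as listed begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the Strand field is "-", and no reverse complement is applied here. The function name `translate` and the `head`/`tail` variables are illustrative.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' marks a stop)."""
    dna = "".join(dna.split()).upper()  # drop line breaks and other whitespace
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

head = "ATGAAAAAAGTAAATCATTGGATCAACGGC"   # first 30 bases of the DNA sequence
tail = "CATCCAATTCCGAAATGGGGAGTCTGA"      # last line of the DNA sequence

print(translate(head))  # MKKVNHWING
print(translate(tail))  # HPIPKWGV*
```

Both fragments match the start and end of the protein sequence listed above, with the trailing `*` corresponding to the TGA stop codon.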

Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
hpaE | AAO17179.1 | HpaE | Not tested | tcd island | Protein | 0.0 | 93
ORF SG19 | AAN62241.1 | putative aldehyde dehydrogenase | Not tested | PAGI-3(SG) | Protein | 2e-75 | 42
orf17 | AAO17183.1 | Orf17 | Not tested | tcd island | Protein | 4e-74 | 41
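
The homolog rows above can be ranked or filtered programmatically. The sketch below copies the three hits from the table and assumes the Identity column is a percentage. The `Hit` type and the `MIN_IDENTITY` cutoff are hypothetical, chosen only for illustration.

```python
from typing import NamedTuple

class Hit(NamedTuple):
    gene: str
    accession: str
    island: str      # "PAI or REI" column
    e_value: float
    identity: int    # assumed to be percent identity

# Rows copied from the homolog table above.
hits = [
    Hit("hpaE", "AAO17179.1", "tcd island", 0.0, 93),
    Hit("ORF SG19", "AAN62241.1", "PAGI-3(SG)", 2e-75, 42),
    Hit("orf17", "AAO17183.1", "tcd island", 4e-74, 41),
]

MIN_IDENTITY = 90  # hypothetical cutoff, not taken from the record

# Hits at or above the cutoff, then all hits ordered by ascending E-value
# (ties broken by higher identity first).
close = [h.gene for h in hits if h.identity >= MIN_IDENTITY]
ranked = sorted(hits, key=lambda h: (h.e_value, -h.identity))

print(close)
print([h.gene for h in ranked])
```

With this cutoff only the tcd island hpaE hit is retained, and the E-value ordering matches the order in which the table lists the hits.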