Gene Information

Name : hpaE (S4653)
Accession : NP_839765.1
Strain : Shigella flexneri 2457T
Genome accession: NC_004741
Putative virulence/resistance : Unknown
Product : 5-carboxy-2-hydroxymuconate semialdehyde dehydrogenase
Function : -
COG functional category : C : Energy production and conversion
COG ID : COG1012
EC number : -
Position : 4546509 - 4547975 bp
Length : 1467 bp
Strand : -
Note : residues 1 to 488 of 488 are 99.18 pct identical to residues 1 to 488 of 488 from GenPept : >emb|CAA86041.1| (Z37980) 5-carboxy-2-hydroxymuconate semialdehyde dehydrogenase [Escherichia coli]

DNA sequence :
ATGAAAAAAGTAAATCATTGGATCAACGGCAAAAATGTTGCAGGTAACGACTACTTCCAGACTACCAATCCGGCAACGGG
TGAAGTGCTGGCGGATGTGGCCTCTGGCGGTGAAGCGGAGATCAATCAGGCGGTAGCGGCAGCGAAAGAGGCGTTCCCGA
AATGGGCTGATCTGCCGATGAAAGAGCGTGCGCGCCTGATGCGCCGTCTGGGCGATCTGATCGACCAGAACGTGCCTGAG
ATCGCCGCGATGGAAACCGCGGACACGGGCCTGCCGATCCATCAGACCAAAAATGTGTTGATCCCACGCGCTTCTCACAA
CTTTGAATTTTTCGCGGAAGTCTGCCAGCAGATGAACGGCAAGACCTATCCGGTTGACGACAAGATGCTCAACTACACGC
TGGTGCAACCGGTAGGCGTTTGTGCGCTGGCGTCGCCGTGGAACGTGCCGTTTATGACCGCCACCTGGAAGGTCGCGCCG
TGTCTGGCGCTGGGCAATACCGCGGTGCTGAAGATGTCGGAACTCTCCCCGCTGACCGCCGACCGCCTGGGTGAGCTGGC
GCTGGAAGCTGGTATTCCGGCGGGCGTGCTAAACGTGGTACAGGGCTACGGCGCAACCGCAGGCGATGCGCTGGTCCGTC
ATCATGACGTGCGTGCCGTGTCGTTCACCGGCGGTACCGCCACCGGGCGCAACATCATGAAAAACGCCGGGCTGAAAAAA
TACTCCATGGAGCTGGGCGGTAAATCACCGGTGCTGATTTTTGAAGATGCCGATATTGAGCGTGCGCTGGACGCCGCCCT
GTTCACCATCTTCTCAATCAACGGCGAACGCTGCACCGCCGGTTCGCGCATCTTTATTCAGCAGAGCATCTATCCAGAAT
TCGTTAAACGCTTTGCCGAACGCGCCAACCGTCTGCGCGTGGGCGATCCGAACGATCCGAATACCCAGGTTAGGGCGCTT
ATCAGTCAGCAACACTGGGAAAAAGTCTCCGGCTATATCCGTCTCGGCATTGAAGAAGGCGCAACCCTGTTGGCGGGCGG
CCCGGATAAACCGTCCGACCTGCCTGCGCACCTGAAAGGCGGCAACTTCCTGCGCCCAACGGTGCTGGCGGACGTAGATA
ACCGTATGCGCGTTGCCCAGGAAGAGATTTTCGGGCCGGTCGCCTGCCTGCTGCCGTTTAAAGACGAAGCCGAAGGCTTA
CGCCTGGCAAACGACGTGGAGTACGGCCTCGCGTCGTACATCTGGACACAGGATGTCAGCAAAGTGTTACGCCTGGCGCG
TGGCATTGAAGCAGGCATGGTGTTCGTCAACACCCAGAACGTGCGCGACCTGCGCCAGCCATTTGGCGGCGTAAAAGCCT
CCGGCACCGGGCGTGAAGGCGGTGAGTACAGCTTCGAAGTGTTCGCGGAAATGAAGAACGTCTGCATTTCCATGGGCGAC
CATCCAATTCCGAAATGGGGAGTCTGA

Protein sequence :
MKKVNHWINGKNVAGNDYFQTTNPATGEVLADVASGGEAEINQAVAAAKEAFPKWADLPMKERARLMRRLGDLIDQNVPE
IAAMETADTGLPIHQTKNVLIPRASHNFEFFAEVCQQMNGKTYPVDDKMLNYTLVQPVGVCALASPWNVPFMTATWKVAP
CLALGNTAVLKMSELSPLTADRLGELALEAGIPAGVLNVVQGYGATAGDALVRHHDVRAVSFTGGTATGRNIMKNAGLKK
YSMELGGKSPVLIFEDADIERALDAALFTIFSINGERCTAGSRIFIQQSIYPEFVKRFAERANRLRVGDPNDPNTQVRAL
ISQQHWEKVSGYIRLGIEEGATLLAGGPDKPSDLPAHLKGGNFLRPTVLADVDNRMRVAQEEIFGPVACLLPFKDEAEGL
RLANDVEYGLASYIWTQDVSKVLRLARGIEAGMVFVNTQNVRDLRQPFGGVKASGTGREGGEYSFEVFAEMKNVCISMGD
HPIPKWGV

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
hpaE AAO17179.1 HpaE Not tested tcd island Protein 0.0 93
ORF SG19 AAN62241.1 putative aldehyde dehydrogenase Not tested PAGI-3(SG) Protein 4e-74 42
orf17 AAO17183.1 Orf17 Not tested tcd island Protein 7e-73 41