Gene Information

Name : EcolC_3703 (EcolC_3703)
Accession : YP_001726641.1
Strain : Escherichia coli ATCC 8739
Genome accession: NC_010468
Putative virulence/resistance : Unknown
Product : 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Function : -
COG functional category : C : Energy production and conversion
COG ID : COG1012
EC number : -
Position : 4053496 - 4054962 bp
Length : 1467 bp
Strand : +
Note : TIGRFAM: 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase; PFAM: Aldehyde Dehydrogenase; KEGG: sbo:SBO_4411 5-carboxy-2-hydroxymuconate semialdehyde dehydrogenase
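
The Position and Length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention, but the reported length is consistent with it) and that the gene is a complete coding sequence ending in a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Position and Length fields of this record.
START = 4053496
END = 4054962
REPORTED_LENGTH_BP = 1467


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START, END)
print(length_bp == REPORTED_LENGTH_BP)     # True
print(expected_protein_length(length_bp))  # 488
```

The result, 488 residues, is the length the protein sequence in this record should have if the annotation is internally consistent.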

DNA sequence :
ATGAAAAAAGTAAATCATTGGATCAACGGCAAAAATGTTGCAGGTAACGACTACTTCCAGACCACCAATCCGGCAACGGG
TGAAGTGCTGGCGGATGTGGCCTCTGGCGGTGAAGCGGAGATCAATCAGGCGGTAGCGGCAGCGAAAGAGGCGTTCCCGA
AATGGGCCAATCTGCCGATGAAAGAGCGTGCGCGCCTGATGCGCCGTCTGGGCGATCTGATCGACCAGAACGTGCCAGAG
ATCGCCGCGATGGAAACCGCGGACACCGGCCTGCCGATCCATCAGACCAAAAATGTGTTGATCCCACGCGCTTCCCACAA
CTTTGAATTTTTCGCGGAAGTCTGCCAGCAGATGAACGGCAAGACCTATCCGGTTGACGACAAGATGCTCAACTACACGC
TGGTGCAGCCGGTGGGCGTTTGTGCGCTGGTATCGCCGTGGAACGTACCGTTTATGACCGCCACATGGAAGGTCGCGCCG
TGTCTGGCGCTGGGCAATACCGCGGTACTGAAAATGTCGGAACTCTCCCCGCTGACCGCTGACCGCCTGGGTGAGCTGGC
GCTGGAAGCCGGTATTCCGGCAGGCGTGCTGAACGTGGTACAGGGCTACGGCGCAACCGCAGGGGATGCGCTGGTTCGTC
ATCATGACGTACGTGCCGTGTCGTTCACCGGCGGTACGGCCACCGGGCGCAACATCATGAAAAACGCCGGGCTGAAAAAA
TACTCCATGGAACTGGGCGGTAAATCGCCGGTGCTGATTTTTGAAGATGCCGATATTGAACGCGCGCTGGACGCCGCCCT
GTTCACCATCTTCTCGATCAACGGCGAGCGCTGCACCGCCGGTTCGCGCATCTTTATTCAGCAAAGCATCTACCCGGAAT
TCGTTAAACGCTTTGCCGAACGCGCCAACCGTCTGCGCGTGGGCGATCCGAACGATCCGAATACCCAGGTTGGGGCGCTT
ATCAGCCAGCAACACTGGGATAAAGTCTCCGGCTATATCCGTCTCGGCATTGAAGAAGGCGCAACCCTGCTGGCGGGCGG
CCCGGATAAACCGTCTGACCTGCCTGCACACCTGAAAGGCGGCAACTTCCTGCGCCCAACGGTGCTGGCGGACGTAGATA
ACCGTATGCGCGTTGCCCAGGAAGAGATTTTCGGGCCGGTCGCCTGCCTGCTGCCGTTTAAAGACGAAGCCGAAGGCTTA
CGTCTGGCAAACGACGTGGAGTACGGCCTCGCGTCGTACATCTGGACACAGGATGTCAGCAAAGTGTTACGCCTGGCGCG
TGGCATTGAAGCAGGCATGGTGTTCGTCAACACCCAGAACGTGCGTGACCTGCGCCAGCCATTTGGCGGCGTAAAAGCCT
CCGGCACCGGGCGTGAAGGCGGTGAGTACAGCTTCGAAGTGTTCGCGGAAATGAAGAACGTCTGCATTTCCATGGGCGAC
CATCCAATTCCGAAATGGGGAGTCTGA

Protein sequence :
MKKVNHWINGKNVAGNDYFQTTNPATGEVLADVASGGEAEINQAVAAAKEAFPKWANLPMKERARLMRRLGDLIDQNVPE
IAAMETADTGLPIHQTKNVLIPRASHNFEFFAEVCQQMNGKTYPVDDKMLNYTLVQPVGVCALVSPWNVPFMTATWKVAP
CLALGNTAVLKMSELSPLTADRLGELALEAGIPAGVLNVVQGYGATAGDALVRHHDVRAVSFTGGTATGRNIMKNAGLKK
YSMELGGKSPVLIFEDADIERALDAALFTIFSINGERCTAGSRIFIQQSIYPEFVKRFAERANRLRVGDPNDPNTQVGAL
ISQQHWDKVSGYIRLGIEEGATLLAGGPDKPSDLPAHLKGGNFLRPTVLADVDNRMRVAQEEIFGPVACLLPFKDEAEGL
RLANDVEYGLASYIWTQDVSKVLRLARGIEAGMVFVNTQNVRDLRQPFGGVKASGTGREGGEYSFEVFAEMKNVCISMGD
HPIPKWGV
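
The protein sequence should follow from the DNA sequence codon by codon, since the gene lies on the + strand. The sketch below spot-checks this for the first 30 and last 27 nucleotides of the DNA sequence above. It assumes the standard codon assignments (for sense codons these match the bacterial translation table); the helper names are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; stop codons are rendered as '*'."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 27 nucleotides, copied from the DNA sequence of this record.
first_30 = "ATGAAAAAAGTAAATCATTGGATCAACGGC"
last_27 = "CATCCAATTCCGAAATGGGGAGTCTGA"

print(translate(first_30))  # MKKVNHWING
print(translate(last_27))   # HPIPKWGV*
```

Both fragments match the start and end of the listed protein sequence, with the trailing `*` marking the TGA stop codon that is not written in the protein record.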

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
hpaE | AAO17179.1 | HpaE | Not tested | tcd island | Protein | 0.0 | 93
ORF SG19 | AAN62241.1 | putative aldehyde dehydrogenase | Not tested | PAGI-3(SG) | Protein | 2e-75 | 43
orf17 | AAO17183.1 | Orf17 | Not tested | tcd island | Protein | 3e-74 | 41