Gene Information

Name : BC1003_4087 (BC1003_4087)
Accession : YP_003909314.1
Strain :
Genome accession: NC_014540
Putative virulence/resistance : Unknown
Product : 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Function : -
COG functional category : C : Energy production and conversion
COG ID : COG1012
EC number : -
Position : 629209 - 630666 bp
Length : 1458 bp
Strand : +
Note : KEGG: bpy:Bphyt_5826 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase; TIGRFAM: 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase; PFAM: Aldehyde Dehydrogenase

DNA sequence :
ATGCGAATAGACCATCTGATCAACGGCAAGACGGCCGGTGCGAAAGACTACTTTGAAACTGTCAACCCGGCGACCCAGCA
AGTGCTCGCCGAAGTCGCGCGTGGCGGCGCAGAAGAAATCCACGCTGCAGTGCGCGCCGCCAAAGAGGCGTTTCCCGCCT
GGGCAGCAAAGTCCGCGTCGGAGCGCGCGAAGCTTGTTCGCAAACTCGGTGAGCTGATTGCGAAAAGCGTGCCGGATATT
TCGGAAACCGAAACGAAAGACACCGGTCAGACGATTTCGCAAACGCGCAAGCAATTGGTGCCGCGCGCAGCGGACAACTT
CAGCTACTTCGCCGAAATGTGCACCCGTGTGGACGGCCACACTTATCCGACCGACACGCACCTGAATTACACGCTGTTCC
AGCCGGTGGGCGTGTGTGCGCTGATTTCGCCGTGGAATGTGCCGTTCATGACGGCGACATGGAAAGTCGCGCCATGCCTC
GCGTTCGGCAACACTGCGGTGTTGAAAATGAGCGAGCTGTCGCCGCTCACTGCGTCCATGCTCGGCGCGCTTGCGCTGGA
AGCCGGCATTCCTGCGGGCGTACTCAACGTGGTGCACGGCTTCGGCAAAGAGACCGGCGAGCCGCTCGTCGCGCACCCGG
ACGTGCGCGCGGTGTCGTTCACCGGCTCCACTGCCACGGGCAACCGGATCGTGCAAACCGCCGGGCTTAAGAAGTTCTCG
ATGGAGCTGGGCGGCAAGTCGCCATTCGTGGTTTTCGACAACGCCGATCTCGAACGCGCGCTCGACGCCGCCCTCTTCAT
GATCTTCTCGAACAACGGCGAGCGCTGCACCGCCGGCTCGCGCATTCTGGTGCAAAAGTCCATCTACGCAGAGTTCGCGC
AGCGCTTCGTGGAGCGAGCAAAGCGCCTCACGGTGGGCGATCCGCTGGCCGAGAGCACGATCATTGGGCCGATGATCAGT
CAGGGGCATCTGGCCAAAGTGCGCAGCTATATCGAGCTCGGGCCCAAGGAAGGCGCCACGCTCGCGTGCGGCGGCCTCGA
CGCACCGGATTTGCCCGACGCCCTGCGCAAAGGCAACTTCGTCATGCCAACGGTTTTCGTGGACGTGGATAACCGCATGC
GCATTGCGCAGGAAGAGATCTTCGGCCCGGTCGCCTGCCTGATTCCATTCGACGACGAAGCTCACGCGATCCGGCTCGCA
AACGACATCTCTTACGGTCTGTCGAGCTACATCTGGACGGAGAACACCGGACGCGCGCATCGCGTCGCAGCCGCAGTAGA
AGCCGGCATGTGCTTCGTCAACAGTCAGAACGTGCGCGATCTGCGCCAGCCGTTCGGCGGCACCAAGGCATCGGGCGTGG
GACGCGAAGGCGGCACGTGGAGCTACGAGGTATTCCTCGAGCCGAAGAACGTGTGCGTGTCGCTCGGCTCACATCACATT
CCGCGCTGGGGCGTCTGA

Protein sequence :
MRIDHLINGKTAGAKDYFETVNPATQQVLAEVARGGAEEIHAAVRAAKEAFPAWAAKSASERAKLVRKLGELIAKSVPDI
SETETKDTGQTISQTRKQLVPRAADNFSYFAEMCTRVDGHTYPTDTHLNYTLFQPVGVCALISPWNVPFMTATWKVAPCL
AFGNTAVLKMSELSPLTASMLGALALEAGIPAGVLNVVHGFGKETGEPLVAHPDVRAVSFTGSTATGNRIVQTAGLKKFS
MELGGKSPFVVFDNADLERALDAALFMIFSNNGERCTAGSRILVQKSIYAEFAQRFVERAKRLTVGDPLAESTIIGPMIS
QGHLAKVRSYIELGPKEGATLACGGLDAPDLPDALRKGNFVMPTVFVDVDNRMRIAQEEIFGPVACLIPFDDEAHAIRLA
NDISYGLSSYIWTENTGRAHRVAAAVEAGMCFVNSQNVRDLRQPFGGTKASGVGREGGTWSYEVFLEPKNVCVSLGSHHI
PRWGV

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
hpaE AAO17179.1 HpaE Not tested tcd island Protein 4e-158 69
orf17 AAO17183.1 Orf17 Not tested tcd island Protein 2e-60 41
ORF SG19 AAN62241.1 putative aldehyde dehydrogenase Not tested PAGI-3(SG) Protein 3e-64 41