Gene Information

Name : hpaE (AAur_3915)
Accession : YP_949590.1
Strain : Arthrobacter aurescens TC1
Genome accession: NC_008711
Putative virulence/resistance : Virulence
Product : 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Function : -
COG functional category : C : Energy production and conversion
COG ID : COG1012
EC number : 1.2.1.60
Position : 4291265 - 4292776 bp
Length : 1512 bp
Strand : -
Note : identified by match to protein family HMM PF00171; match to protein family HMM TIGR02299

DNA sequence :
ATGACGACCTCAGTAGAGACCGCCAAGCATTACATCCCCGAAAACCTGCCGTCCCACATCCAGCACTTCATCAACGGCGA
GTTCGTTGACTCTCTGTCCGGCAAGACGTTTGATGTCCTGGACCCTGTATCCAGCGGCAACTACGCCACTGCTGCGGCCG
GCCAGAAGGAAGACATCGATCTTGCTGTCGCCGCTGCCCGCGAAGCGTTCGTCAACGGTCCGTGGCCGAAGATGAAGCCG
CGTGAGCGTGCCCGGGTGTTGAACAAGATCGCTGATGCGGTGGAGGCACAGGAAGCCCGCCTCGCCGAGCTGGAAACGTT
CGATACCGGCCTGCCGATCACTCAGGCCAAGGGCCAGGCGCTGCGCGCTGCCGAGAACTTCCGTTTCTTCGCGGACCTGA
TCGTGGCCCAGTTCGATGACGCCATGAAGGTCCCCGGTTCGCAGATCAACTACGTGAACCGCAAGCCGATCGGCGTTGCG
GGCCTGATCACGCCGTGGAACACCCCGTTCATGCTGGAGTCCTGGAAGCTCGCCCCGGCTCTGGCCACCGGCAACACTGT
GGTCCTCAAGCCGGCCGAGTTCACGCCGTTGTCCGCGTCTCTCTGGGCGCAGATCTTCAAGGACGCCGGCCTCCCTGACG
GTGTGTTCAACCTGGTCAACGGCTTGGGCGAAGAAGCCGGCGACGCATTGGTGAAGCACCCGGACGTTCCGCTGATTTCC
TTCACCGGCGAGACCACCACGGGCCAGACGATCTTCCGCAACGCAGCCGCCAACCTCAAGGGCCTGTCCATGGAGCTCGG
CGGCAAGTCCCCGTGCGTCGTGTTCGCCGATGCCGACCTGGACGCAGCCATTGATTCGGCTCTGTTCGGCGTGTTCTCCC
TCAACGGCGAACGCTGCACGGCCGGCTCCCGCATCCTGGTGGAACGGGCAATCTACGACGAATTCTGCGAAAAGTACGCC
GCCCGGGCCAAGAACATCGTGGTGGGCGATCCCCACGATCCCAAGACCCAAGTGGGCGCGCTGGTCCACCCCGAGCACTA
CCAAAAGGTGGCGTCGTACGTGGAGATCGGCAAGTCCGAGGGTCGCCTCCTGGCCGGCGGCGGCCGACCCGACCACCTGC
CCGAAGGCAACTACATCGCACCTACGGTGTTTGCCGACGTCGCGCCTGATGCCCGGATCTTCCAGGAGGAAATCTTCGGT
CCCGTCGTGGCCATCACGCCTTTCGAGAACGACGACGAAGCCCTCGCCTTGGCGAACAACACCAAGTACGGCCTGGCGGC
CTACATCTGGACCCAGAACCTGACACGGGCCCACAACTTCTCCCAGAACGTGGAGGCCGGCATGGTGTGGCTCAACAGCC
ACAACGTCCGCGACCTCCGCACCCCTTTCGGCGGCGTCAAAGCTTCCGGCCTGGGCCACGAGGGCGGCTACCGCTCCATC
GACTTCTACACCGACCAGCAGGCCGTGCACATCACGCTCGGCTCAGTCCACACGCCCAAATTCGGCGCCTAA

Protein sequence :
MTTSVETAKHYIPENLPSHIQHFINGEFVDSLSGKTFDVLDPVSSGNYATAAAGQKEDIDLAVAAAREAFVNGPWPKMKP
RERARVLNKIADAVEAQEARLAELETFDTGLPITQAKGQALRAAENFRFFADLIVAQFDDAMKVPGSQINYVNRKPIGVA
GLITPWNTPFMLESWKLAPALATGNTVVLKPAEFTPLSASLWAQIFKDAGLPDGVFNLVNGLGEEAGDALVKHPDVPLIS
FTGETTTGQTIFRNAAANLKGLSMELGGKSPCVVFADADLDAAIDSALFGVFSLNGERCTAGSRILVERAIYDEFCEKYA
ARAKNIVVGDPHDPKTQVGALVHPEHYQKVASYVEIGKSEGRLLAGGGRPDHLPEGNYIAPTVFADVAPDARIFQEEIFG
PVVAITPFENDDEALALANNTKYGLAAYIWTQNLTRAHNFSQNVEAGMVWLNSHNVRDLRTPFGGVKASGLGHEGGYRSI
DFYTDQQAVHITLGSVHTPKFGA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
hpaE AAO17179.1 HpaE Not tested tcd island Protein 1e-101 49
ORF SG19 AAN62241.1 putative aldehyde dehydrogenase Not tested PAGI-3(SG) Protein 3e-71 42
VC0819 NP_230467.2 aldehyde dehydrogenase Not tested VPI-1 Protein 3e-70 41

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
hpaE YP_949590.1 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase VFG0082 Protein 1e-70 41