Gene Information

Name : GYMC52_3134 (GYMC52_3134)
Accession : YP_004133637.1
Strain : Geobacillus sp. Y412MC52
Genome accession : NC_014915
Putative virulence/resistance : Unknown
Product : 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Function : -
COG functional category : C : Energy production and conversion
COG ID : COG1012
EC number : -
Position : 3162739 - 3164250 bp
Length : 1512 bp
Strand : -
Note : KEGG: gyc:GYMC61_3107 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase; TIGRFAM: 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase; PFAM: aldehyde dehydrogenase
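
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption; the record does not state the convention) and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Coordinates copied from the "Position" field of this record.
start, end = 3162739, 3164250

# Assumption: 1-based, inclusive coordinates, so both end points count.
length_bp = end - start + 1

# A coding sequence is read in triplets; the last codon is assumed to be a stop.
codons = length_bp // 3
residues = codons - 1

print(length_bp, codons, residues)
```

Under these assumptions the computed length agrees with the Length field (1512 bp), and the residue count agrees with the length of the protein sequence listed in this record.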

DNA sequence :
ATGGCAGTGCAACCAAAAGTGCTTCATTATATCAATGGGCAGTTGATGGAAGGGGCAGCCGGTGCGTATTTTGACAACAT
CAATCCGTTTACGAACGAACGGATCAACGAAGTGGCCGAAGGGCGGAAAGAAGACATCGACGCAGCGGTCCGGGCGGCAA
AGGAGGCGTTTGATCACGGGCCGTGGCGGACAATGCCGGTCGAACGGCGTCTTCGTTACCTTTTCCGCATTGCTGACTTG
ATTGAGCAGTATGCGGACGACATCGCCTATTTAGAAGCGCTTGACACCGGCATTCCGATCAGCCAGGCGAAAAAGCAAGC
CGCCCGCGCGGCGGAAAACTTCCGCTTTTACGCGGAAATGGTGAAGACGCGCCTTGTCGGCGAGGCGTACCATGTGAATG
GACAGTTTTTAAACTATACCGTTTATAAACCAGTCGGCGTCGCCGGGCTCATCACGCCGTGGAATACGCCATTTATGCTG
GAAACGTGGAAGGTGGCTCCCGCGCTGGCGACCGGCAACACCGTCGTCTTGAAGCCGGCCGAATGGTCGCCGTTGACGGC
GAATAAACTGGCGGAAATCATCGATGAAGCCGGGCTGCCTCGCGGGGTGTTCAACGTCGTGCACGGGTTTGGCGAAACGG
CGGGCGCCGCATTGGTTGCCCACCCGGATGTGCGCCTCATCTCGTTTACCGGCGAGACGACGACCGGCATGGAAATCATC
CGCAACAGCGCTGCGACGTTGAAAAAAACATCGATGGAGCTCGGCGGCAAGTCGCCGCTCATTGTGTTCGCCGATGCGGA
TCTCGAACGGGCGCTCGATGCGGCGGTTTGGGGCGTGTTTTCGCTCAATGGCGAACGGTGCACGGCCAACTCGCGGCTTT
TGCTTGAACAGTCGATTTACGACGAATTTGTCGCCCGGCTCAAAGAGCGCGTCGACCGCATCGTCATCGGCGACCCGATG
AACCCGGCGACTGAACTCGGTCCGCTCATTCACCGCGATCATTGGGAGAGGGTGAACCGCTATATTGACATCGCCAAGCA
AGAAGGGGCGGACGTCTATGCCCCCAGCGTTCCAACAGGATTGGAAAAAGGCAATTTTGTGCCGCCAACGTTGCTGCTTG
GTTGCCATAACGGCATGAGGGTGGCGCAGGAAGAGATTTTCGGACCGGTCATGGCGGTCATGTCCTTTGCGGATGAAGAA
GAGGCGATACGGCTGGCGAACGATGTGAAATACGGGCTGGCGGCATACGTCTGGACGAACGATATGAAGCGCGGCCACCG
CGTCGCCCAAGCGATCGAAAGCGGGATGGCGTGGGTCAACTCGCCGAACGTCCGCGATTTGCGCATCCCGTTTGGCGGGA
CGAAATACAGCGGCATCGGCCGCGAAGGCGGGCATTACAGCTTTGATTTCTATACGGAAGTGCAAGTCGTCCACGTCGCC
GTCGGCGATCCGCCGATCCCCGCGTTCGGCAAGGGGGAGAAACCGACCGCCTTGTCTGCCGAACAGGCATAA

Protein sequence :
MAVQPKVLHYINGQLMEGAAGAYFDNINPFTNERINEVAEGRKEDIDAAVRAAKEAFDHGPWRTMPVERRLRYLFRIADL
IEQYADDIAYLEALDTGIPISQAKKQAARAAENFRFYAEMVKTRLVGEAYHVNGQFLNYTVYKPVGVAGLITPWNTPFML
ETWKVAPALATGNTVVLKPAEWSPLTANKLAEIIDEAGLPRGVFNVVHGFGETAGAALVAHPDVRLISFTGETTTGMEII
RNSAATLKKTSMELGGKSPLIVFADADLERALDAAVWGVFSLNGERCTANSRLLLEQSIYDEFVARLKERVDRIVIGDPM
NPATELGPLIHRDHWERVNRYIDIAKQEGADVYAPSVPTGLEKGNFVPPTLLLGCHNGMRVAQEEIFGPVMAVMSFADEE
EAIRLANDVKYGLAAYVWTNDMKRGHRVAQAIESGMAWVNSPNVRDLRIPFGGTKYSGIGREGGHYSFDFYTEVQVVHVA
VGDPPIPAFGKGEKPTALSAEQA
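
The relationship between the DNA and protein sequences above can be spot-checked by translating the coding strand with the standard genetic code. This is a minimal sketch, not the method used by the database: the `translate` helper and the `CODON_TABLE` name are hypothetical, and use of the standard codon table for this organism is an assumption. Only the first seven codons and the last five codons of the DNA sequence are translated here.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (third base varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 21 bases and last 15 bases of the DNA sequence in this record.
first_codons = "ATGGCAGTGCAACCAAAAGTG"
last_codons = "GCCGAACAGGCATAA"

print(translate(first_codons))  # MAVQPKV
print(translate(last_codons))   # AEQA
```

The two fragments match the start (MAVQPKV) and end (AEQA) of the listed protein sequence. Passing the full DNA sequence to the same helper would allow a complete comparison.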

Homologs from PAI DB

Gene      GenBank accession  Product                          Virulence or resistance  PAI or REI  Alignment type  E-value  Identity (%)
hpaE      AAO17179.1         HpaE                             Not tested               tcd island  Protein         3e-113   53
ORF SG19  AAN62241.1         putative aldehyde dehydrogenase  Not tested               PAGI-3(SG)  Protein         2e-67    43