Gene Information

Name : GY4MC1_0470 (GY4MC1_0470)
Accession : YP_003987914.1
Strain : Geobacillus sp. Y4.1MC1
Genome accession : NC_014650
Putative virulence/resistance : Unknown
Product : 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Function : -
COG functional category : C : Energy production and conversion
COG ID : COG1012
EC number : -
Position : 507657 - 509171 bp
Length : 1515 bp
Strand : +
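
The Position and Length fields above agree with each other if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal sketch of the arithmetic, with illustrative variable names:

```python
# Coordinates copied from the Position field of this record.
start, end = 507657, 509171

# Assumption: 1-based, inclusive coordinates (the record does not say so explicitly).
length_bp = end - start + 1

# A complete coding sequence is a whole number of codons; the last one is the stop codon.
codons, remainder = divmod(length_bp, 3)
residues = codons - 1  # amino acids encoded, excluding the stop codon

print(length_bp, codons, remainder, residues)  # prints: 1515 505 0 504
```

The 504 residues match the length of the protein sequence listed in this record.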
Note : KEGG: gtn:GTNG_2986 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase; TIGRFAM: 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase; PFAM: aldehyde dehydrogenase

DNA sequence :
ATGGCAGGCAAAATTGGCAATGTGCTTCACTATATCAATGGAGAATTTGTTGAAGGCGCTTCAGGCAACTATTTTGAAAA
TGTAAACCCGTTTACGAACGAAATCATTAATGAAGTGGCCGAAGGATGGAAAGAGGATATTGACGCCGCTGTCCGTGCGG
CGAAAGAAGCATTTGACAATGGCCCGTGGCGGACGATGTCTGTCCAGGAACGAATGACGTATATTTTGCGGATTGCCGAT
TTAATTGAGAAATATGCGGAGGAAATTTCGTATTTGGAATCGCTTGATACAGGGCTTCCAATCAGCCAAACGAAAAAGCA
GGCGGCCCGCGCTGCTGAAAATTTCCGGTTTTATGCAGAAATGGTAAAAAGCCGCATGGTTGGGGAAGCGTATCACGTAG
ACGGGCAATTTCTCAATTACACCATTTATAAACCAGTCGGTGTGGCAGGATTGATTACGCCGTGGAATGCGCCTTTTATG
CTAGAAACATGGAAAGTTGCACCGGCGCTGGCTACTGGCAATACTGTCGTGTTGAAGCCAGCGGAATGGTCGCCGCTGAC
GGCAAATAAACTGGCAGAAATTATCCATGAAGCGGGGCTGCCCCAAGGGGTATTTAACGTTGTGCACGGCTTTGGAGAAA
CCGCGGGAGCTTCCCTTGTCGCCCATCCGAATGTACGCCTCATTTCCTTTACTGGTGAAACAACAACAGGATCGGAAATT
ATCAAAAACAGTGCTGATACGTTAAAAAAGACGTCGATGGAATTAGGCGGAAAATCTCCGGTTATCGTCTTTGCGGACGC
GGACGTAGAAAAAGCGCTTGATGCGGTAGTATGGGGCATTTTCTCCTTCAATGGCGAAAGATGTACCGCTAATTCCCGTT
TGTTTTTAGAAAAATCGATTTACGATTCGTTTGTCGAAAAGCTAAAGGAACGAGTGAACAACATTTCCATCGGAGATCCA
ATGGACCCGGCGACGGAAGTAGGGCCGCTTATTCACCGAAAACATTGGGAAACGGTAATGAACTATATTGAAATCGCGAA
ACAAGAAGGAGCAGAAGTCTATTCGGCAGACGTTCCAGAGGAACTGAAAAAAGGAAACTTTGTGCCGCCTACGCTGTTAT
TGAATTGTCATAACAGCATGAAAGTAGCGCAGGAAGAAATCTTTGGCCCAGTTATGGCGGTTATGCCGTTTGAAACAGAA
GAAGAAGTGATCCGGATGGCTAATGACGTTAAATACGGATTAGCAGCATATGTATGGACGAATGATATAAAACGGGGCCA
CCGTGTCGCCCAATCGATCGAAAGCGGAATGGCGTGGGTCAACTCGCAAAATGTCAGGGATTTGCGCATTCCATTTGGAG
GCACGAAATACAGTGGCATCGGCCGAGAAGGCGGGCATTACAGCTTCGATTTTTATACGGAAGTGCAAGTGATTCATGTT
TCTATCGGAGATCACTCTATACCAAAATTCGGGAAAATAAAGCAGCATTCTGCTGCCTCTGCCCAGCGCGGATAA

Protein sequence :
MAGKIGNVLHYINGEFVEGASGNYFENVNPFTNEIINEVAEGWKEDIDAAVRAAKEAFDNGPWRTMSVQERMTYILRIAD
LIEKYAEEISYLESLDTGLPISQTKKQAARAAENFRFYAEMVKSRMVGEAYHVDGQFLNYTIYKPVGVAGLITPWNAPFM
LETWKVAPALATGNTVVLKPAEWSPLTANKLAEIIHEAGLPQGVFNVVHGFGETAGASLVAHPNVRLISFTGETTTGSEI
IKNSADTLKKTSMELGGKSPVIVFADADVEKALDAVVWGIFSFNGERCTANSRLFLEKSIYDSFVEKLKERVNNISIGDP
MDPATEVGPLIHRKHWETVMNYIEIAKQEGAEVYSADVPEELKKGNFVPPTLLLNCHNSMKVAQEEIFGPVMAVMPFETE
EEVIRMANDVKYGLAAYVWTNDIKRGHRVAQSIESGMAWVNSQNVRDLRIPFGGTKYSGIGREGGHYSFDFYTEVQVIHV
SIGDHSIPKFGKIKQHSAASAQRG
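
The protein sequence can be checked against the DNA sequence by translating the latter codon by codon. The sketch below is only a spot check under two assumptions: the gene is read in frame from its first base on the + strand, and the standard codon assignments apply. The helper name translate is illustrative and not part of any library. It translates the first 18 bases and the final 75-base line of the DNA sequence, which starts at offset 1440 and is therefore in frame.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 18 bases and last line of the DNA sequence, copied from this record.
first_bases = "ATGGCAGGCAAAATTGGC"
last_line = (
    "TCTATCGGAGATCACTCTATACCAAAATTCGGGAAAATAAAGCAGCATTCTGCTGCCTCTGCCCAGCGCGGATAA"
)

print(translate(first_bases))  # prints: MAGKIG
print(translate(last_line))    # prints: SIGDHSIPKFGKIKQHSAASAQRG*
```

Both results match the start and end of the listed protein sequence, with the trailing * corresponding to the TAA stop codon.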

Homologs from PAI DB

Gene      GenBank Accn  Product                          Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity (%)
hpaE      AAO17179.1    HpaE                             Not tested               tcd island  Protein         1e-122  54
ORF SG19  AAN62241.1    putative aldehyde dehydrogenase  Not tested               PAGI-3(SG)  Protein         2e-72   43