Gene Information

Name : MSMEG_0900 (MSMEG_0900)
Accession : YP_885303.1
Strain : Mycobacterium smegmatis MC2 155
Genome accession: NC_008596
Putative virulence/resistance : Virulence
Product : eptc-inducible aldehyde dehydrogenase
Function : -
COG functional category : C : Energy production and conversion
COG ID : COG1012
EC number : 1.2.1.3
Position : 986441 - 987964 bp
Length : 1524 bp
Strand : +
Note : identified by match to protein family HMM PF00171
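
The Position and Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the Position field (assumed 1-based, inclusive).
start, end = 986441, 987964

gene_length = end - start + 1       # inclusive interval length in bp
codons = gene_length // 3           # total codons, stop codon included
protein_length = codons - 1         # residues, assuming one terminal stop codon

print(gene_length, gene_length % 3, protein_length)
```

Under these assumptions the computed length agrees with the Length field (1524 bp), is a multiple of three, and implies a 507-residue protein.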

DNA sequence :
ATGACAGTCTTTTCCCGCCCGGGTGCCGCAGACGCTCGCATGTCATTTGAGTCCCGTTACGACAATTTCATCGGCGGTGA
GTGGGTGGCGCCCGTCGACGGCCGGTACTTCGAGAATCCGACGCCGGTCACGGGCCAGGTGTTCTGCGAGGTGGCCCGCT
CCAGCTCGGCCGACATCGAGAAGGCCCTCGACGCCGCCCACGCGGCGGCGCCGGCGTGGGGCAAGACCGCTCCGGCCGAG
CGTGCGCTGATCCTCAACCGCATCGCCGACCGCATGGAGCAGAACCTGGAGGCACTCGCGCTCGCGGAGGCGTGGGACAA
CGGCAAGCCGATCCGCGAGACGCTCAACGCCGACATCCCGCTGGCGATCGACCACTTCCGCTACTTCGCCTCCGCGATCC
GCGCGCAGGAGGGTTCCCTGAGCCAGGTCGACGAGGACACCGTGGCCTACCACTTCCACGAGCCGCTGGGTGTGGTCGGG
CAGATCATCCCGTGGAACTTCCCGATCCTGATGGCCGTGTGGAAGCTCGCCCCGGCACTGGCCGCGGGCAACGCCGTTGT
GCTCAAGCCCGCCGAGCAGACCCCTGCCTCGGTGCTGTACCTCATGTCGCTGATCGCGGACCTCGTGCCTGCAGGCGTGG
TGAACATCGTCAACGGGTTCGGTGTGGAGGCCGGCAAGCCGCTGGCGTCGAGCAATCGCATCGCCAAGATCGCGTTCACC
GGTGAGACCACCACGGGCCGGCTCATCATGCAGTATGCGTCGCAGAACCTGATCCCGGTCACGTTGGAGCTGGGCGGCAA
GAGCCCCAACATCTTCTTCTCGGATGTCATGGCCGCCGCCGACGACTTCCAGGACAAGGCGTTGGAAGGGTTCACGATGT
TCGCCCTCAACCAGGGCGAGGTGTGCACGTGCCCGTCGCGCTCGCTGATCCAGTCCGACATCTTCGACGAGTTCCTCGAG
CTCGCCGCGATCCGCACCAAGGCCGTGCGTCAGGGCGACCCGCTGGACACCGAGACCATGATCGGCTCCCAGGCGTCCAA
CGATCAGTTCGAGAAGATCCTGTCCTACATCGAGATCGGCAAGAGCGAGGGCGCACGCATCGTGACCGGCGGCGAGCGCG
CCGACCTCGGCGGTGACCTCTCGGGTGGTTACTACATCCAGCCGACGATCTTCACCGGCCACAACAAGATGCGGATCTTC
CAGGAGGAGATCTTCGGCCCGGTCGTGGCGGTCACGTCGTTCAGCGACTACGACGACGCGATCTCGATCGCCAACGACAC
GCTGTACGGTCTCGGCGCCGGAGTGTGGAGCCGCGACGGGAACACCGCGTATCGTGCCGGGCGCGACATCAAGGCCGGCC
GGGTGTGGACCAACTGCTACCACCAGTACCCGGCGCACGCCGCGTTCGGTGGGTACAAGCAGTCCGGCATCGGCAGGGAG
AACCACAAGATGATGCTCGACCACTACCAGCAGACCAAGAACCTGCTGGTGAGCTACAGCGACAAGGCGCAGGGATTCTT
CTGA

Protein sequence :
MTVFSRPGAADARMSFESRYDNFIGGEWVAPVDGRYFENPTPVTGQVFCEVARSSSADIEKALDAAHAAAPAWGKTAPAE
RALILNRIADRMEQNLEALALAEAWDNGKPIRETLNADIPLAIDHFRYFASAIRAQEGSLSQVDEDTVAYHFHEPLGVVG
QIIPWNFPILMAVWKLAPALAAGNAVVLKPAEQTPASVLYLMSLIADLVPAGVVNIVNGFGVEAGKPLASSNRIAKIAFT
GETTTGRLIMQYASQNLIPVTLELGGKSPNIFFSDVMAAADDFQDKALEGFTMFALNQGEVCTCPSRSLIQSDIFDEFLE
LAAIRTKAVRQGDPLDTETMIGSQASNDQFEKILSYIEIGKSEGARIVTGGERADLGGDLSGGYYIQPTIFTGHNKMRIF
QEEIFGPVVAVTSFSDYDDAISIANDTLYGLGAGVWSRDGNTAYRAGRDIKAGRVWTNCYHQYPAHAAFGGYKQSGIGRE
NHKMMLDHYQQTKNLLVSYSDKAQGFF
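
As a consistency check between the DNA and protein sequences above, the two ends of the coding sequence can be translated and compared with the protein. This is a minimal sketch, assuming the standard codon assignments (which bacterial translation table 11 shares for elongation codons); `translate` is an illustrative helper, not a function from any library, and the two fragments are copied from the first 24 and last 12 nucleotides of the DNA sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))

head = "ATGACAGTCTTTTCCCGCCCGGGT"   # first 24 nt of the DNA sequence
tail = "GGATTCTTCTGA"               # last 12 nt, including the stop codon

print(translate(head))  # MTVFSRPG
print(translate(tail))  # GFF*
```

Both fragments agree with the start (MTVFSRPG) and end (GFF) of the protein sequence, with TGA as the stop codon.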

• Homologs from PAI DB

Gene      GenBank Accn    Product                          Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity
VC0819    NP_230467.2     aldehyde dehydrogenase           Not tested               VPI-1       Protein         3e-141  62
aldA      AAC12273.1      aldehyde dehydrogenase           Virulence                VPI         Protein         2e-148  61
aldA      AAK20747.1      aldehyde dehydrogenase           Virulence                VPI         Protein         2e-148  61
aldA      AAK20776.1      aldehyde dehydrogenase           Virulence                VPI         Protein         2e-148  61
aldA-1    YP_001216300.1  aldehyde dehydrogenase           Not tested               VPI-1       Protein         1e-147  61
ORF SG19  AAN62241.1      putative aldehyde dehydrogenase  Not tested               PAGI-3(SG)  Protein         2e-68   44

• Homologs from VFDB (virulence genes)

Gene        GenBank Accn  Product                                ID of source DB  Alignment Type  E-val   Identity
MSMEG_0900  YP_885303.1   eptc-inducible aldehyde dehydrogenase  VFG0082          Protein         1e-141  62