Gene Information

Name : dhaS (BCG9842_B1655)
Accession : YP_002447056.1
Strain : Bacillus cereus G9842
Genome accession : NC_011772
Putative virulence/resistance : Virulence
Product : aldehyde dehydrogenase
Function : -
COG functional category : C : Energy production and conversion
COG ID : COG1012
EC number : 1.2.1.3
Position : 3491462 - 3492946 bp
Length : 1485 bp
Strand : +
Note : identified by match to protein family HMM PF00171
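
The Position and Length fields above agree if the coordinates are read as 1-based and inclusive, so that length = end - start + 1. A minimal Python sketch of that check follows; the function name `feature_length` is hypothetical, and the inclusive 1-based convention is an assumption about how this record numbers positions.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(3491462, 3492946)

# A complete coding sequence should be a whole number of codons.
codons, remainder = divmod(length_bp, 3)

print(length_bp, codons, remainder)  # 1485 495 0
```

Under the same assumption, 495 codons correspond to 494 amino acids plus one stop codon.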

DNA sequence :
ATGAGTCAACTAGCTGTAAATCTTCATGAAAAGGTAGAAAATTTTCTTCAAGGTACAAAAAAGTTATATGTGAATGGATC
TTTCATTGAAAGCGCTTCCGGAAAAACATTTAAAACACCTAACCCAGCAACTGGTGAAACACTTGCCGTCGTTTCTGAAG
CTGGTCGTGAAGATATTCATAAAGCTGTAGTCGCAGCTCGTATGGCTTTTGACGAAGGTCCTTGGTCTCGTATGAGCACT
GCTGAGCGAAGCCGTCTCATGTACAAGTTAGCTGATTTAATGGAAGAACATAAAGAAGAGCTTGCACAGCTCGAAACATT
AGATAACGGAAAGCCAATCCGTGAAACAATGGCAGCAGACATACCACTTGCAATTGAGCATATGCGCTATTACGCTGGCT
GGGCTACGAAAATCGTTGGTCAAACAATTCCTGTTTCCGGTGATTACTTTAACTATACACGCCATGAAGCTGTTGGTGTC
GTTGGTCAAATTATCCCTTGGAACTTCCCGCTTCTTATGGCAATGTGGAAAATGGGAGCAGCGCTTGCTACAGGATGTAC
AATCGTTTTAAAACCTGCAGAACAAACTCCACTATCTGCTCTATACTTAGCTGAATTAATTGAAGAAGCTGGATTCCCGA
AAGGTGTTATTAATATCGTACCTGGATTCGGTGAATCAGCTGGACAAGCTCTCGTTAATCATCCACTCGTTGATAAAATT
GCATTTACCGGTTCTACTCCTGTCGGTAAACAAATTATGCGACAAGCATCCGAATCATTAAAACGCGTTACACTTGAGTT
AGGCGGTAAATCACCAAATATCATCTTGCCAGATGCTGATTTATCTCGCGCGATTCCTGGTGCACTTTCTGGTGTTATGT
TTAACCAAGGACAAGTATGCTCTGCTGGATCACGCTTATTTGTTCCGAAGAAAATGTATGATAATGTCATGGCTGATCTC
GTCCTTTATTCTAAAAAATTAAATCAAGGCGCTGGTCTAAGTCCAGAAACTACAATCGGTCCTCTCGTTTCCGAAGAACA
ACAAAAACGTGTAATGGGCTTCATTGAAAAAGGGATTGAAGAAGGCGCTGAAGTACTTTGCGGAGGAAATAATCCATTCG
ATCAAGGCTACTTCGTTTCTCCTACAGTATTCGCTGACGTAAATGACGAAATGACGATCGCAAAAGAAGAAATTTTCGGT
CCAGTTATTTCTGCAATACCGTTTAACGATATTGATGAAGTAATTGAACGTGCGAATAAATCTCAATTTGGCTTAGCTGC
TGGTGTATGGACAGAAAATGTTAAAACTGCACACTATGTTGCAAGTAAAGTACGTGCAGGTACAGTATGGGTAAACTGTT
ATAACGTCTTTGATGCAGCATCTCCATTTGGAGGATTTAAACAATCTGGTCTCGGCCGTGAAATGGGATCTTACGCATTA
AATAACTATACAGAAGTGAAGAGCGTTTGGCTTAACTTAAATTAA

Protein sequence :
MSQLAVNLHEKVENFLQGTKKLYVNGSFIESASGKTFKTPNPATGETLAVVSEAGREDIHKAVVAARMAFDEGPWSRMST
AERSRLMYKLADLMEEHKEELAQLETLDNGKPIRETMAADIPLAIEHMRYYAGWATKIVGQTIPVSGDYFNYTRHEAVGV
VGQIIPWNFPLLMAMWKMGAALATGCTIVLKPAEQTPLSALYLAELIEEAGFPKGVINIVPGFGESAGQALVNHPLVDKI
AFTGSTPVGKQIMRQASESLKRVTLELGGKSPNIILPDADLSRAIPGALSGVMFNQGQVCSAGSRLFVPKKMYDNVMADL
VLYSKKLNQGAGLSPETTIGPLVSEEQQKRVMGFIEKGIEEGAEVLCGGNNPFDQGYFVSPTVFADVNDEMTIAKEEIFG
PVISAIPFNDIDEVIERANKSQFGLAAGVWTENVKTAHYVASKVRAGTVWVNCYNVFDAASPFGGFKQSGLGREMGSYAL
NNYTEVKSVWLNLN
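
The protein sequence above can be cross-checked against the DNA sequence by translating it codon by codon. The sketch below is illustrative only: it assumes the standard genetic code (whose sense codons match the bacterial table), the function name `translate` is hypothetical, and only the first 15 bases and the final line of the DNA sequence are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 15 bases of the DNA sequence in this record.
print(translate("ATGAGTCAACTAGCT"))  # MSQLA

# Final 45 bases (last line) of the DNA sequence in this record.
print(translate("AATAACTATACAGAAGTGAAGAGCGTTTGGCTTAACTTAAATTAA"))  # NNYTEVKSVWLNLN*
```

Both fragments match the start and the end of the listed protein sequence, with the trailing `*` corresponding to the TAA stop codon.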

• Homologs from PAI DB

Gene      GenBank Accn    Product                          Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity (%)
ORF SG19  AAN62241.1      putative aldehyde dehydrogenase  Not tested               PAGI-3(SG)  Protein         5e-96  47
VC0819    NP_230467.2     aldehyde dehydrogenase           Not tested               VPI-1       Protein         4e-93  45
aldA      AAC12273.1      aldehyde dehydrogenase           Virulence                VPI         Protein         2e-94  45
aldA      AAK20747.1      aldehyde dehydrogenase           Virulence                VPI         Protein         2e-94  45
aldA      AAK20776.1      aldehyde dehydrogenase           Virulence                VPI         Protein         2e-94  45
aldA-1    YP_001216300.1  aldehyde dehydrogenase           Not tested               VPI-1       Protein         7e-94  45
hpaE      AAO17179.1      HpaE                             Not tested               tcd island  Protein         4e-80  44

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn    Product                 ID of source DB  Alignment Type  E-val  Identity (%)
dhaS  YP_002447056.1  aldehyde dehydrogenase  VFG0082          Protein         2e-93  45