Gene Information

Name : thcA (BCAN_A0206)
Accession : YP_001592074.1
Strain :
Genome accession: NC_010103
Putative virulence/resistance : Virulence
Product : EPTC-inducible aldehyde dehydrogenase
Function : -
COG functional category : C : Energy production and conversion
COG ID : COG1012
EC number : -
Position : 213599 - 215116 bp
Length : 1518 bp
Strand : +
Note : -

DNA sequence :
ATGAACAAGGTTGAATTCAGCCGGACCGTGAAACCGGCTTTTGCAAAACGCTATGGCAATTTCATCGGCGGCAAATGGGT
GGAACCCAGGTCAGGCCGCTATTTTGAGAATACATCGCCCGTAAACGGCCAGGTTCTGTGTGAAGTGGCCCGTTCCGATG
CTGCCGATGTGGAAGCGGCGCTCGATGCAGCGCACGCCGCCAGGGAGCTATGGGGCAGGACAAGCGTGGCTGAACGCGCG
CTTATCCTCAATCGCATCGCAGACCGGATTGAGGAAAACCTGCCTGCATTGGCGGCTGCCGAAACATGGGACAACGGCAA
GCCGATCCGCGAAACCACCAATGCCGACCTGCCGCTGGCGGTTGATCATTTCCGCTATTTCGCGGGCGTCATCCGCGCTC
AGGAAGGCGGTATTTCTGAAATCGATCACGATACGGTCGCCTATCACTTCCACGAACCGCTGGGCGTGGTGGGGCAGATC
ATTCCGTGGAACTTCCCGCTTCTCATGGCGACATGGAAGCTTGCGCCAGCCCTTGCCGCAGGCAATTGCGTGGTGCTGAA
ACCCGCCGAACAAACTCCGGCTTCCATCCTCGTGCTGATGGAGCTCATTGCCGATATTCTGCCGCCGGGTGTGGTCAATA
TCGTCAATGGTTTCGGCCTTGAGGCCGGTAAGCCGCTGGCATCCAGCCCCCGCATCGCCAAGATCGCCTTCACGGGTGAG
ACGACGACCGGCCGCCTCATCATGCAATATGCCAGCCAGAACCTCATCCCGGTCACGCTGGAGCTGGGCGGCAAGTCACC
CAACATCTTCTTCAAGGATGTCGCGGCTGAAGATGATGATTTTCTGGACAAAGCCATTGAAGGCTTCGTGATGTTCGCGC
TCAATCAGGGCGAGGTCTGTACTTGCCCCAGCCGCGCGCTCATTCAGGAATCGATCTATGACAGGTTCATGAAAAAGGCG
CTGAAGCGTGTTGAAGCCATCGTGCAGGGCGATCCGCTTGACCCGGCCACGATGATTGGCGCGCAGGCATCGAGCGAACA
ACTCGAAAAAATCCTGAGCTATCTCGATATTGGCCGTCAGGAAGGTGCGGAAGTGCTGGCAGGTGGCGAACGCAACATGC
TGCCGGGCGATCTTGCTGGCGGCTATTATGTGAAGCCGACCGTCTTCAAGGGCCATAACAAGATGCGCATTTTCCAGGAG
GAAATCTTCGGGCCGGTCGTGTCCGTCGCAACCTTCAAGGATGATGCGGAGGCGCTATCGATTGCCAATGACACACTCTA
TGGCCTCGGTGCGGGCATATGGACGCGCGATGGCACACGCGCCTATCGCTTCGGTCGCGCCATTAAGGCAGGCCGCGTCT
GGACCAACTGCTACCACGCCTACCCGGCCCATGCGGCTTTCGGTGGTTACAAGCAGTCGGGGATCGGGCGCGAGAACCAT
CTGAAGATGCTCGATCACTACCAGAATACCAAGAACATGCTGGTGAGCTACAGCCCGAAGAAGCTTGGCTTCTTCTAA

Protein sequence :
MNKVEFSRTVKPAFAKRYGNFIGGKWVEPRSGRYFENTSPVNGQVLCEVARSDAADVEAALDAAHAARELWGRTSVAERA
LILNRIADRIEENLPALAAAETWDNGKPIRETTNADLPLAVDHFRYFAGVIRAQEGGISEIDHDTVAYHFHEPLGVVGQI
IPWNFPLLMATWKLAPALAAGNCVVLKPAEQTPASILVLMELIADILPPGVVNIVNGFGLEAGKPLASSPRIAKIAFTGE
TTTGRLIMQYASQNLIPVTLELGGKSPNIFFKDVAAEDDDFLDKAIEGFVMFALNQGEVCTCPSRALIQESIYDRFMKKA
LKRVEAIVQGDPLDPATMIGAQASSEQLEKILSYLDIGRQEGAEVLAGGERNMLPGDLAGGYYVKPTVFKGHNKMRIFQE
EIFGPVVSVATFKDDAEALSIANDTLYGLGAGIWTRDGTRAYRFGRAIKAGRVWTNCYHAYPAHAAFGGYKQSGIGRENH
LKMLDHYQNTKNMLVSYSPKKLGFF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
VC0819 NP_230467.2 aldehyde dehydrogenase Not tested VPI-1 Protein 2e-147 65
aldA AAC12273.1 aldehyde dehydrogenase Virulence VPI Protein 2e-152 65
aldA AAK20747.1 aldehyde dehydrogenase Virulence VPI Protein 2e-152 65
aldA AAK20776.1 aldehyde dehydrogenase Virulence VPI Protein 2e-152 65
aldA-1 YP_001216300.1 aldehyde dehydrogenase Not tested VPI-1 Protein 8e-152 65
ORF SG19 AAN62241.1 putative aldehyde dehydrogenase Not tested PAGI-3(SG) Protein 2e-69 42
hpaE AAO17179.1 HpaE Not tested tcd island Protein 9e-70 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
thcA YP_001592074.1 EPTC-inducible aldehyde dehydrogenase VFG0082 Protein 9e-148 65