Gene Information

Name : thcA (BSUIS_A0203)
Accession : YP_001626873.1
Strain :
Genome accession: NC_010169
Putative virulence/resistance : Virulence
Product : EPTC-inducible aldehyde dehydrogenase
Function : -
COG functional category : C : Energy production and conversion
COG ID : COG1012
EC number : -
Position : 210305 - 211822 bp
Length : 1518 bp
Strand : +
Note : -

DNA sequence :
ATGAACAAGGTTGAATTCAGCCGGACCGTGAAACCGGCTTTTGCAAAACGCTATGGCAATTTCATCGGCGGCAAATGGGT
GGAACCCAGGTCAGGCCGCTATTTTGAGAATACATCGCCCGTAAACGGCCAGGTTCTGTGTGAAGTGGCCCGTTCCGATG
CTGCCGATGTGGAAGCGGCGCTCGATGCAGCGCACGCCGCCAGGGAGCTATGGGGCAGGACAAGCGTGGCTGAACGCGCG
CTTATCCTCAATCGCATCGCAGACCGGATTGAGGAAAACCTGCCTGCATTGGCGGCTGCCGAAACATGGGACAACGGCAA
GCCGATCCGCGAAACCACCAATGCCGACCTGCCGCTGGCGGTTGATCATTTCCGCTATTTCGCGGGCGTCACCCGCGCTC
AGGAAGGCGGTATTTCTGAAATCGATCACGATACGGTCGCCTATCACTTCCACGAACCGCTGGGCGTGGTGGGGCAGATC
ATTCCGTGGAACTTCCCGCTTCTCATGGCGACATGGAAGCTTGCGCCAGCCCTTGCCGCAGGCAATTGCGTGGTGCTGAA
ACCCGCCGAACAAACTCCGGCTTCCATCCTCGTGCTGATGGAGCTCATTGCCGATATTCTGCCGCCGGGTGTGGTCAATA
TCGTCAATGGTTTCGGCCTTGAGGCCGGTAAGCCGCTGGCATCCAGCCCCCGCATCGCCAAGATCGCCTTCACGGGTGAG
ACGACGACCGGCCGCCTCATCATGCAATATGCCAGCCAGAACCTCATCCCGGTCACGCTGGAGCTGGGCGGCAAGTCACC
CAACATCTTCTTCAAGGATGTCGCGGCTGAAGATGATGATTTTCTGGACAAGGCCATTGAAGGCTTCGTGATGTTCGCGC
TCAATCAGGGCGAGGTCTGTACTTGCCCCAGCCGCGCGCTCATTCAGGAATCGATCTATGACAGGTTCATGGAAAAGGCG
CTGAAGCGTGTTGAAGCCATCGTGCAGGGCGATCCGCTTGACCCGGCCACGATGATTGGCGCGCAGGCATCGAGCGAACA
ACTCGAAAAAATCCTGAGCTATCTCGATATTGGCCGTCAGGAAGGTGCGGAAGTGCTGGCAGGTGGCGAACGCAACATGC
TGCCGGGCGATCTTGCTGGCGGCTATTATGTGAAGCCGACCGTCTTCAAGGGCCATAACAAGATGCGCATTTTCCAGGAG
GAAATCTTCGGGCCGGTCGTGTCCGTCGCAACCTTCAAGGATGATGCGGAGGCGCTATCGATTGCCAATGACACACTCTA
TGGCCTCGGTGCGGGCATATGGACGCGCGATGGCACACGCGCCTATCGCTTCGGTCGCGCCATTAAGGCAGGCCGCGTCT
GGACCAACTGCTACCACGCCTACCCGGCCCATGCGGCTTTCGGTGGTTACAAGCAGTCGGGGATCGGGCGCGAGAACCAT
CTGAAGATGCTCGATCACTACCAGAATACCAAGAACATGCTGGTGAGCTACAGCCCGAAGAAGCTTGGCTTCTTCTAA

Protein sequence :
MNKVEFSRTVKPAFAKRYGNFIGGKWVEPRSGRYFENTSPVNGQVLCEVARSDAADVEAALDAAHAARELWGRTSVAERA
LILNRIADRIEENLPALAAAETWDNGKPIRETTNADLPLAVDHFRYFAGVTRAQEGGISEIDHDTVAYHFHEPLGVVGQI
IPWNFPLLMATWKLAPALAAGNCVVLKPAEQTPASILVLMELIADILPPGVVNIVNGFGLEAGKPLASSPRIAKIAFTGE
TTTGRLIMQYASQNLIPVTLELGGKSPNIFFKDVAAEDDDFLDKAIEGFVMFALNQGEVCTCPSRALIQESIYDRFMEKA
LKRVEAIVQGDPLDPATMIGAQASSEQLEKILSYLDIGRQEGAEVLAGGERNMLPGDLAGGYYVKPTVFKGHNKMRIFQE
EIFGPVVSVATFKDDAEALSIANDTLYGLGAGIWTRDGTRAYRFGRAIKAGRVWTNCYHAYPAHAAFGGYKQSGIGRENH
LKMLDHYQNTKNMLVSYSPKKLGFF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
VC0819 NP_230467.2 aldehyde dehydrogenase Not tested VPI-1 Protein 3e-147 65
aldA AAC12273.1 aldehyde dehydrogenase Virulence VPI Protein 3e-152 65
aldA AAK20747.1 aldehyde dehydrogenase Virulence VPI Protein 3e-152 65
aldA AAK20776.1 aldehyde dehydrogenase Virulence VPI Protein 3e-152 65
aldA-1 YP_001216300.1 aldehyde dehydrogenase Not tested VPI-1 Protein 1e-151 65
ORF SG19 AAN62241.1 putative aldehyde dehydrogenase Not tested PAGI-3(SG) Protein 2e-69 42
hpaE AAO17179.1 HpaE Not tested tcd island Protein 3e-69 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
thcA YP_001626873.1 EPTC-inducible aldehyde dehydrogenase VFG0082 Protein 1e-147 65