Gene Information

Name : thcA (B005_2471)
Accession : YP_006641563.1
Strain : Nocardiopsis alba ATCC BAA-2165
Genome accession: NC_018524
Putative virulence/resistance : Virulence
Product : EPTC-inducible aldehyde dehydrogenase
Function : -
COG functional category : -
COG ID : -
EC number : 1.2.1.3
Position : 2581514 - 2583037 bp
Length : 1524 bp
Strand : +
Note : -

DNA sequence :
ATGACGATCTACGCTCCGCCCGGCACGCCCGGAAGCGTCGTCGAGTACGCGTCGCGCTACGAGAACTGGATCGGCGGCGC
GTGGGTCGCGCCGGTCGAGGGACGCTACTTCGAGAACCCCACCCCCGTCACCGGACGGGTCTTCACCGAGGCCGCCCGCA
GCGGGGCCGAGGACGTCGAACTCGCCCTCGACGCCGCCCACGCGGCCGCGCCGGCCTGGGGACGCACCTCGCCGGCGGAG
CGGGCGCTGATCCTCAACCGGATCGCCGATCGGATGGAGGAGAACCTGGAGCGGCTCGCGGTCGCCGAGTCCTGGGAGAA
CGGCAAGCCGGTCCGCGAGGCGCTGGCCGCCGACATCCCGCTGGCCATCGACCACTTCCGCTACTTCGCCGGCGCGATCC
GGGCCCAGGAGGGCCACACCTCCCAGATCGACGGGGACACCGTCGCGTACCACTTCCAGGAGCCCTTGGGCGTGGTCGCC
CAGATCATCCCGTGGAACTTCCCGATCCTCATGGCGGTCTGGAAGCTCGCCCCCGCGCTGGCGGCGGGCAACGCCGTGGT
CCTCAAGCCCGCCGAGCAGACGCCCACCTCGATCATGGTGCTGATCGACCTCATCGCCGACCTGCTGCCGCCCGGCGTGG
TCAACGTCGTCAACGGGTTCGGGGTCGAGGCCGGCAAGCCGCTGGCGGCCAACCCGCGCGTGCGCAAGGTCGCCTTCACC
GGTGAGACGACCACGGGGCGGCTGATCATGCAGTACGCCTCCGAGAACCTCATCCCGGTCACCCTGGAGTTGGGCGGCAA
GAGCCCCAACCTCTTCTTCGCCGACGTGGCGGCCGAACGCGACGCGTTCTACGACAAGGCGTTGGAGGGCTTCACCCTCT
TCGCCCTCAACCAGGGCGAGGTGTGCACCTGTCCCTCGCGCGCGTTGATCCAGGACTCCCTCTACGACGGCTTCATGGCC
GACGCCCTGGCCCGGGTCGGTCGGATCCGGCAGGGGGACCCGTTGGACACCGAGACCATGATCGGCGCGCAGGCGAGCAA
CGACCAGTTGGAGAAGGTGCTGTCCTACATCGACATCGGCCGCAAGGAGGGGGCGAAGGTGCTCTCCGGCGGGGAGCGGG
CCGAGATGGAGGGCGACCTGGCCGGCGGCTACTACGTCACGCCGACCGTCTTCGAGGGTCGGAACACTATGCGGATCTTC
CAGGAGGAGATCTTCGGCCCGGTGGTGTCGGTGTCCCGTTTCGCCGACTACGCGGACGCCATCGGTATCGCCAACGACAC
CCTGTACGGGCTCGGCGCGGGGGTGTGGTCCCGGGACGGCAACACCGCCTACCGGGCCGGACGCGACATCCAGGCCGGTC
GGGTGTGGGTGAACAACTACCACGCCTACCCCGCGCACGCGGCCTTCGGCGGTTATAAGCAGTCGGGGATCGGCCGGGAG
AACCACAAGATGATGCTCGACCATTACCAGCAGACCAAGAACCTTCTGGTCAGCTACTCGGACCAGGCGATGGGGCTGTT
CTGA

Protein sequence :
MTIYAPPGTPGSVVEYASRYENWIGGAWVAPVEGRYFENPTPVTGRVFTEAARSGAEDVELALDAAHAAAPAWGRTSPAE
RALILNRIADRMEENLERLAVAESWENGKPVREALAADIPLAIDHFRYFAGAIRAQEGHTSQIDGDTVAYHFQEPLGVVA
QIIPWNFPILMAVWKLAPALAAGNAVVLKPAEQTPTSIMVLIDLIADLLPPGVVNVVNGFGVEAGKPLAANPRVRKVAFT
GETTTGRLIMQYASENLIPVTLELGGKSPNLFFADVAAERDAFYDKALEGFTLFALNQGEVCTCPSRALIQDSLYDGFMA
DALARVGRIRQGDPLDTETMIGAQASNDQLEKVLSYIDIGRKEGAKVLSGGERAEMEGDLAGGYYVTPTVFEGRNTMRIF
QEEIFGPVVSVSRFADYADAIGIANDTLYGLGAGVWSRDGNTAYRAGRDIQAGRVWVNNYHAYPAHAAFGGYKQSGIGRE
NHKMMLDHYQQTKNLLVSYSDQAMGLF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
VC0819 NP_230467.2 aldehyde dehydrogenase Not tested VPI-1 Protein 7e-139 61
aldA AAC12273.1 aldehyde dehydrogenase Virulence VPI Protein 2e-145 60
aldA AAK20747.1 aldehyde dehydrogenase Virulence VPI Protein 2e-145 60
aldA AAK20776.1 aldehyde dehydrogenase Virulence VPI Protein 2e-145 60
aldA-1 YP_001216300.1 aldehyde dehydrogenase Not tested VPI-1 Protein 5e-145 60
ORF SG19 AAN62241.1 putative aldehyde dehydrogenase Not tested PAGI-3(SG) Protein 8e-63 41

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
thcA YP_006641563.1 EPTC-inducible aldehyde dehydrogenase VFG0082 Protein 3e-139 61