Gene Information

Name : thmS2 (azo3021)
Accession : YP_934524.1
Strain : Azoarcus sp. BH72
Genome accession: NC_008702
Putative virulence/resistance : Unknown
Product : succinate semialdehyde dehydrogenase
Function : -
COG functional category : C : Energy production and conversion
COG ID : COG1012
EC number : 1.2.1.16
Position : 3319525 - 3320985 bp
Length : 1461 bp
Strand : +
Note : Probable succinate semialdehyde dehydrogenase [NAD(P)+]. 46% homology to thmS of Pseudonocardia sp. K1 (trembl|Q9F3V7). Is capable of oxidizing substrates using NADP as cofactor. Pfam: Aldehyde dehydrogenase family; no signal peptide; no TMHs; Family mem
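
The Position and Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon; the function name `feature_length` is illustrative and not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the Position field of this record.
start, end = 3319525, 3320985

length_bp = feature_length(start, end)   # expected to equal the Length field
codons = length_bp // 3                  # whole codons in the reading frame
residues = codons - 1                    # assumption: the last codon is a stop

print(f"length: {length_bp} bp, codons: {codons}, residues: {residues}")
```

Under these assumptions the computed length agrees with the Length field (1461 bp), and the coding region corresponds to 486 amino acid residues plus a stop codon.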

DNA sequence :
ATGTCCGCACCCACCTCCTACCCCAGCCGCAACTTCATCGCCGGCGAGTGGCGCGCCGCCATCTCCGGCGCCACCTTCGC
CAAGCTCGCCCCGACCTCCGGCGCGGTGCTGGCCGAGGTCGCCAACTCCAGCGCCGCCGATGTGGATGCGGCGGTGGCCG
CCGCCCGCGTCCAGTTCGACGGCGGCGAGTGGTCACGCCTGCCAGGCGCCGAACGCGGCCGCCTGCTCAACAAGCTGGCC
GACCTGCTGGCGCGCGACGCCGAGCGCTTCGCCCACATCCTGGCGATGGAACAGGGCCGGCCGCTGATGGAAATGCGCAT
GCTGGACCTGCCGATGTCCATCGACACCCTGCGCTACTTCGCCGGCTGGGCCGACAAGCTGGAAGGCCGCCAGATTCCCA
CCGCCGGCTTCATGGGCCGGCCGACGCTCAACTACACCATTCGCGAGGCCATCGGCGTGGCCGCGCTGATCGTGCCGTGG
AACGCGCCGCTGATGATCGGCATCTGGAAGCTGGCCCCGGCGCTGGCCGCCGGCTGCACCGTGGTGCTCAAGCCTTCTGA
AGACGCGCCGCTGGCGCTCACCGCGCTGGCCGGGCTGGCCGCAGAGGCCGGTTTCCCGGCCGGCGTGTTCAACCTGGTGA
ATGGCATGGGCCCGGAGGCCGGCGCCACGCTGGTGAAGCACCCCGGCGTGGACAAGATCAGCTTCACCGGCAGCACCGAG
GTGGGCCGCATCATCGCCCGCGAGGCGGCGCCGCTGTTCAAGCGCCTGACGCTGGAGCTGGGCGGCAAGGCGCCGCAGAT
CATCTGCGCCGACGCCAACCTGGACGCCGCCATCATGGGCGTCGCCATGGGCCTGTTCGTGAACCAGGGCCAGACCTGCG
CCGCCGGCACCCGCATCCTGGTGCATCGCAGCCGCTACGACGACGTGGTCGGCGCACTCGCCGGCGCGGCCAAGTCGGTC
ACGCTGGGCGACCCGCTGGACGCCAACACCCGCATGGGCGCCCTGATCAACGCCCGCCACCGCGACCGCGTCGCCGCCCT
GATCCAGAGCGGCATCGCCGAAGGCGCCGCGCTGGTGGCCGGCGGCGAGGCGCTGCCCGAGAACGGCTTCTTCGTCCGCC
CCACGGTGTTCGCCGGCGGCACGCCGCAGATGCGCATCATGCGCGAGGAGATCTTCGGCCCGGTCGGCGTGGTGGTGCCC
TTCGACAGCGACGAGGAAGCCGTGCAACTCGCCAACGACACCCCCTTCGGCCTCTCCGCCTCGCTGTGGACGCAGGACAT
CGCCCGTGCGCACACGCTGGCGCCCAAGCTGCGCGTCGGCGCGGTCGCCATCAACGGCTGGAGCCCGCTCGACGCGCGCC
TGCCGTGGGGCGGCTACAAGGATTCCGGCGTGGGGCGCGATCTGTCTCGTACCGCGCTGGATGCTTATACGGAAGAGAAG
GTGGTGTCGGTGGTGATGTGA

Protein sequence :
MSAPTSYPSRNFIAGEWRAAISGATFAKLAPTSGAVLAEVANSSAADVDAAVAAARVQFDGGEWSRLPGAERGRLLNKLA
DLLARDAERFAHILAMEQGRPLMEMRMLDLPMSIDTLRYFAGWADKLEGRQIPTAGFMGRPTLNYTIREAIGVAALIVPW
NAPLMIGIWKLAPALAAGCTVVLKPSEDAPLALTALAGLAAEAGFPAGVFNLVNGMGPEAGATLVKHPGVDKISFTGSTE
VGRIIAREAAPLFKRLTLELGGKAPQIICADANLDAAIMGVAMGLFVNQGQTCAAGTRILVHRSRYDDVVGALAGAAKSV
TLGDPLDANTRMGALINARHRDRVAALIQSGIAEGAALVAGGEALPENGFFVRPTVFAGGTPQMRIMREEIFGPVGVVVP
FDSDEEAVQLANDTPFGLSASLWTQDIARAHTLAPKLRVGAVAINGWSPLDARLPWGGYKDSGVGRDLSRTALDAYTEEK
VVSVVM
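
The protein sequence above can be checked against the DNA sequence by translating the plus-strand reading frame. The following is a minimal sketch that assumes the standard codon assignments (which bacterial translation table 11 shares for all sense and stop codons) and builds the table by hand rather than relying on an external library; `translate` is an illustrative helper, and only the first 36 and last 21 bases of the DNA sequence are used here.

```python
from itertools import product

# Standard genetic code, laid out in the conventional TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases and last 21 bases of the DNA sequence in this record.
head = "ATGTCCGCACCCACCTCCTACCCCAGCCGCAACTTC"
tail = "GTGGTGTCGGTGGTGATGTGA"

print(translate(head))  # start of the protein sequence
print(translate(tail))  # end of the protein sequence (stop codon dropped)
```

The two fragments translate to MSAPTSYPSRNF and VVSVVM, matching the start and end of the protein sequence listed above. Passing the full 1461 bp sequence to the same function would be the natural way to check the whole record.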

Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
ORF SG19 | AAN62241.1 | putative aldehyde dehydrogenase | Not tested | PAGI-3(SG) | Protein | 2e-60 | 43