Gene Information

Name : SMa1415 (SMa1415)
Accession : NP_436020.1
Strain :
Genome accession: NC_003037
Putative virulence/resistance : Unknown
Product : aldehyde dehydrogenase
Function : -
COG functional category : C : Energy production and conversion
COG ID : COG1012
EC number : -
Position : 779216 - 780712 bp
Length : 1497 bp
Strand : -
Note : glimmer prediction, start codon changed based on codon usage and homology; most similar to L-sorbosone dehydrogenases; probable succinate semialdehyde dehydrogenase; glimmer prediction; shows significant global identity to L-sorbosone dehydrogenase NAD(P)

DNA sequence :
ATGGACCAGCTGAACAACTTCCTTTCCCCTCCGGCCGCCCCGCGGGATTTCGGTTTCTTCGTGGACGGCAAATGGCAAAG
CGGGCATGACTTCTTCGTGCGGCACTCGCCCGGACATGGCGTTGCGGTCACCCGCACGGCAAAATGCAGCGTCGACGACC
TGAATGCCGCAGTCGCCGCCGCACGCCGGGCTTTCGAGGACCGGCGTTGGTCCGGTCTCCCGGGGGGCTCTCGTGCCTCG
GTACTGCTGCGGGTGGCCGAGATCCTGCGCACCCGCCGCGATGAGCTCGCCTATTGGGAGACGCTGGAAAATGGCAAACC
GATCGCTCAGGCGCGCGGCGAGATCGACCACTGCATCGCCTGCTTCGAAGTAGGCGCGGGTGCAGCCCGCCTTCTGCACG
GCGACAGCTTCAATTCGCTCGGAGACGGATTGTTCGGCATGGTTCTGCGCGAGCCGATCGGGGTCGTCGGGCTCATCACG
CCATGGAACTTTCCTTTCCTTATCCTGTGCGAGCGCGTTCCCTTCATTCTCGCATCGGGTTGCACGATGGTCGTGAAACC
GTCGGAGGTGACTTCCGCGACGACTCTCATCCTTGCCGAGGTGCTCGCCGAAGCCGGACTGCCCGACGGCGTGTATAACG
TCATCACCGGCTCGGGCCGGACGATCGGCCAGGCCATGAGTGAGCATCCGGACATCGACATGCTGTCGTTCACGGGTTCG
ACCGCCGTTGGCCGTTCCTGCGTGCATGCGGCCGCCGACAGCAATTTCAAGAAGCTCGGCCTGGAGCTTGGAGGCAAGAA
CCCGATCATCGTCTTTGCCGACTCCGATCTAGAGGATGCCGCCGATGGCGCAGCTTTCGGCATCAGCTTCAATACGGGAC
AATGCTGCGTGTCCTCATCGCGCTTGATCGTCGAGCGATCGGTGGCGCGCGAATTCGAAGCGCTGCTTGCAGAGAAGATG
AAACGCATACGCGTCGGGGACCCGCTGGACGAGACGACCCAGGTCGGCGCGATCACGACCGAAGCGCAGAACACGACCAT
TCTCGACTATATCGCCAAGGGCAAGACGGAAGGAGCGGAGCTCGTGACCGGGGGGACCGCCATCGATCTCGGCCGCGGGC
AATACATCGCGCCCACGCTGTTTTCGGGCGTATCGCGTGAGATGGCGATCGCGCGTGACGAGATCTTCGGGCCGGTGCTT
TGCTCGATGACCTTCGACACGGTCGAGCAGGCGGTCGAACTGGCAAACGACACGGTTTACGGACTGGCTGCAAGCGTCTG
GACCAAGAATATCGACAAGGCCCTGACCGTCACGCGCCGCGTTCGTGCGGGCCGCTTCTGGGTCAATACCATGATGGCCG
GCGGGCCGGAGATGCCGCTCGGAGGCTTCAAGCAGTCCGGTTGGGGCCGCGAAGCCGGCATGTACGGGGTCGAGGAATAT
ACGCAGGTGAAATCGGTCCATGTCGAGATCGGCAAGCGCACGCATTGGATCTCCTGA

Protein sequence :
MDQLNNFLSPPAAPRDFGFFVDGKWQSGHDFFVRHSPGHGVAVTRTAKCSVDDLNAAVAAARRAFEDRRWSGLPGGSRAS
VLLRVAEILRTRRDELAYWETLENGKPIAQARGEIDHCIACFEVGAGAARLLHGDSFNSLGDGLFGMVLREPIGVVGLIT
PWNFPFLILCERVPFILASGCTMVVKPSEVTSATTLILAEVLAEAGLPDGVYNVITGSGRTIGQAMSEHPDIDMLSFTGS
TAVGRSCVHAAADSNFKKLGLELGGKNPIIVFADSDLEDAADGAAFGISFNTGQCCVSSSRLIVERSVAREFEALLAEKM
KRIRVGDPLDETTQVGAITTEAQNTTILDYIAKGKTEGAELVTGGTAIDLGRGQYIAPTLFSGVSREMAIARDEIFGPVL
CSMTFDTVEQAVELANDTVYGLAASVWTKNIDKALTVTRRVRAGRFWVNTMMAGGPEMPLGGFKQSGWGREAGMYGVEEY
TQVKSVHVEIGKRTHWIS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
ORF SG19 AAN62241.1 putative aldehyde dehydrogenase Not tested PAGI-3(SG) Protein 2e-75 42