Gene Information

Name : BURPS668_A0734 (BURPS668_A0734)
Accession : YP_001061733.1
Strain :
Genome accession: NC_009075
Putative virulence/resistance : Virulence
Product : aldehyde dehydrogenase (NAD) family protein
Function : -
COG functional category : C : Energy production and conversion
COG ID : COG1012
EC number : -
Position : 698186 - 699772 bp
Length : 1587 bp
Strand : +
Note : identified by match to protein family HMM PF00171
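
Consistency check (illustrative): the position and length fields above can be cross-checked with a few lines of arithmetic. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the gene is a complete coding sequence ending in one stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Position and Length fields of this record.
start, end = 698186, 699772
reported_length_bp = 1587

# With 1-based inclusive coordinates the span is end - start + 1.
span_bp = end - start + 1

# A complete coding sequence is a whole number of codons;
# the final codon is the stop codon and encodes no residue.
codons, remainder = divmod(span_bp, 3)
protein_residues = codons - 1

print(span_bp == reported_length_bp)  # True
print(remainder)                      # 0
print(protein_residues)               # 528
```

The result of 528 residues agrees with the length of the protein sequence listed in this record.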

DNA sequence :
GTGCGCGCGCCCGCCCGCCCGCGCGCGGCGCCGCGCGTCACATGCGCTCGCGCATGCGGCGCCGCGCGGCGAATCCCACT
CTACGAGGCTATTCCGATGGACAAGACCACTTTGGCTGACTGGCAGGACAAGGCCGCGACGCTCGCGATCGAGGGGCGCG
CATTCATCGACGGCGCGTATCGCGACGCGCACGGCGGCAAGACCTTCGATTGCGTGAGCCCGATCGACGGGCGCGTGCTC
GCGAAGGTCGCCGATTGCGGCGCGGCCGATGTCGACGCGGCGGTGGCCGCCGCGCGGCGCGCGTTCGACGCGCAGGCGTG
GGCGGGCCTGAACCCGCGCGAGCGCAAGGCGATCCTGCTGCGCTGGGCCGCGCTGATGCGCGCGCATCTCGACGAGCTGG
CGCTGCTCGAGACGCTCGACGCGGGCAAGCCGATCGGCGACACGACGAGCGTCGACGTGCCGGGCGCCGCGTACTGCGTC
GAATGGTTCGCCGAGGCGATCGACAAGGTGGGCGGCGAAGTGGTGCCCGCCGATCATCATCTCGTCGGCCTCGTCACGCG
CGAGCCGCTCGGCGTCGTCGCCGCCGTCGTGCCGTGGAATTTTCCGATCCTGATGGCGTCGTGGAAGTTCGGCCCGGCGC
TCGCCGCGGGCAACAGCGTCGTGCTCAAGCCGTCGGAGAAATCGCCGCTCACGGCGATCAGGGTCGCGCGGCTCGCGCAC
GAGGCGGGGATTCCGGCCGGCGTGTTCAACGTCGTGCCGGGCGGCGGCGAGCCGGGCAAGCTGCTCGCGCTGCATCGCGA
CGTCGACTGTCTCGCGTTCACCGGCTCCACGGGTGTCGGCAAGCTGATCATGCAGTACGCGGGGCAATCGAACCTGAAGC
GCGTGTGGCTCGAGCTGGGCGGCAAGTCGCCGAACATCGTGCTGCCCGACTGCCCGGATCTCGACCGCGCGGCGAAGGCG
GCGGCGGGCGCGATCTTCTACAACATGGGCGAGATGTGCACGGCGGGATCGCGCCTGCTCGTGCACCGCGAGATCAAGGA
CGCGTTCGTCGAAAAGCTCGTCGCCGCGGCGCGCGCGTACAAGCCGGGCAATCCGCTCGATCCGAACGTGTCGATGGGCG
CGATCGTCGACGCGATCCAGCTCGAGCGCGTGCTCGGCTACATCGAGGCGGGCCGCGCCGAAGCGCGGCTGCTGCTCGGC
GGCGCGCGCGTGAACGAGGCGAGCGGCGGCTTCTACATCGAGCCGACCGTGTTCGACACCGCGCCCGACACACGGATCGC
GCGCGAGGAAATCTTCGGCCCGGTGCTGTCGATGATCACGTTCGATTCGGTCGACGAAGCGGTGAGGATCGCGAACGACA
GCGAATACGGGCTCGGCGCGGCCGTGTGGACCGCGAACCTGACGACCGCGCACGAACTCGCGCGGCGGTTGCGCGCGGGC
ACCGTGTGGGTCAACTGCTACGACGAAGGGGGCGACATGAACTTCCCGTTCGGCGGCTACAAGCAGTCGGGCAACGGCCG
CGACAAGTCGTTGCATGCACTGGAGAAGTACACCGAGCTGAAGTCCACGCTCGTGCGGCTGCGCTAA

Protein sequence :
MRAPARPRAAPRVTCARACGAARRIPLYEAIPMDKTTLADWQDKAATLAIEGRAFIDGAYRDAHGGKTFDCVSPIDGRVL
AKVADCGAADVDAAVAAARRAFDAQAWAGLNPRERKAILLRWAALMRAHLDELALLETLDAGKPIGDTTSVDVPGAAYCV
EWFAEAIDKVGGEVVPADHHLVGLVTREPLGVVAAVVPWNFPILMASWKFGPALAAGNSVVLKPSEKSPLTAIRVARLAH
EAGIPAGVFNVVPGGGEPGKLLALHRDVDCLAFTGSTGVGKLIMQYAGQSNLKRVWLELGGKSPNIVLPDCPDLDRAAKA
AAGAIFYNMGEMCTAGSRLLVHREIKDAFVEKLVAAARAYKPGNPLDPNVSMGAIVDAIQLERVLGYIEAGRAEARLLLG
GARVNEASGGFYIEPTVFDTAPDTRIAREEIFGPVLSMITFDSVDEAVRIANDSEYGLGAAVWTANLTTAHELARRLRAG
TVWVNCYDEGGDMNFPFGGYKQSGNGRDKSLHALEKYTELKSTLVRLR
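
Translation note (illustrative): the DNA sequence begins with GTG rather than ATG, while the protein begins with M. The sketch below assumes the bacterial genetic code (NCBI translation table 11), in which an alternative start codon such as GTG is read as methionine at the initiation position. The helper `translate_cds` and the start-codon subset are hypothetical names chosen for this example, and only the first 30 and last 15 nucleotides of the DNA sequence above are translated.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino acid assignments
# are the same for the standard code and for bacterial table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a common subset of the table 11 initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna, has_start=True):
    """Translate an in-frame fragment, stopping at the first stop codon."""
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and has_start and codon in START_CODONS:
            # An initiation codon is read as methionine.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


five_prime = "GTGCGCGCGCCCGCCCGCCCGCGCGCGGCG"  # first 30 nt of the gene
three_prime = "GTGCGGCTGCGCTAA"                # last 15 nt of the gene

print(translate_cds(five_prime))                    # MRAPARPRAA
print(translate_cds(three_prime, has_start=False))  # VRLR
```

Both fragments match the ends of the protein sequence above (MRAPARPRAA... and ...VRLR). The internal GTG in the second fragment is read as valine because it is not at the initiation position.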

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
ORF SG19 | AAN62241.1 | putative aldehyde dehydrogenase | Not tested | PAGI-3(SG) | Protein | 3e-110 | 58
VC0819 | NP_230467.2 | aldehyde dehydrogenase | Not tested | VPI-1 | Protein | 1e-65 | 44
aldA | AAC12273.1 | aldehyde dehydrogenase | Virulence | VPI | Protein | 1e-65 | 43
aldA | AAK20747.1 | aldehyde dehydrogenase | Virulence | VPI | Protein | 1e-65 | 43
aldA | AAK20776.1 | aldehyde dehydrogenase | Virulence | VPI | Protein | 1e-65 | 43
aldA-1 | YP_001216300.1 | aldehyde dehydrogenase | Not tested | VPI-1 | Protein | 1e-65 | 43
hpaE | AAO17179.1 | HpaE | Not tested | tcd island | Protein | 3e-57 | 41

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
BURPS668_A0734 | YP_001061733.1 | aldehyde dehydrogenase (NAD) family protein | VFG0082 | Protein | 4e-66 | 44