Gene Information

Name : ETA_30000 (ETA_30000)
Accession : YP_001908915.1
Strain : Erwinia tasmaniensis Et1/99
Genome accession: NC_010694
Putative virulence/resistance : Unknown
Product : membrane-bound sugar-binding protein
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3209
EC number : -
Position : 3344233 - 3348486 bp
Length : 4254 bp
Strand : -
Note : silverDB:etchr02987
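
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the stated length of 4254 bp); the function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the Position field of this record.
start, end = 3344233, 3348486

length_bp = feature_length(start, end)  # should agree with the Length field
codons = length_bp // 3                 # total codons, including the stop codon
residues = codons - 1                   # expected protein length without the stop

print(length_bp, codons, residues)
```

Under that assumption the gene spans a whole number of codons, and the residue count derived here can be compared with the length of the protein sequence listed further down.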

DNA sequence :
ATGAGTGAAGCCGCACGCGTCGGCGACGCCATCGGCCATTCCGCCGCGCTGGCCGGGATGATTAGCGGTACGCTGGTCGG
CGGGCTGATTGCCGCCGCGGGGTCCGTGGCCGCCGGCGCGCTGTTCGTCGCCGGGCTGGCCTCGTCCTGTCTTGGTATCG
GCGTGCTGCTGATTGGCGCGAGCCTGGCGCTGGGGTATCTCGCCGGGGAGGCGGGCACGGCGGCGCGCGACGGCATTGCC
GCGGCCGGGGCGGGCAGCGTGTCGGCCTCGGGGCAGATACTGACCGGCTCGCCGGACGTGTTTATCAACGGCAAGCCGGC
GGCCATCGCCACGGTCAGCCAGGCGGGCTGCGATAAGGACGGGCCGTCGATGCAGATGGCCCAGGGCTCCGACCGGGTGT
TTATCAACGGCCGGCCCGCCGCGCGCGTCGGCGATAAAACCAACTGTGACGCCACGGTGATGGCCGGCTCGCCCAGCGTG
CGCATCGGCGGCGGCACCGTCACCACGCTGGCGATAAAGCCCGAGGTGCCGGAGTGGGCCTACAAGGCCTCCGACCTGAC
GCTGCTGCTGACCGGGCTGCTCGGCGGCGCGGGCGGCGCGCTCAGCAAGGTGGGTAAGGTGGGCAAACTGCTGGGCAGGC
TGCCCGGCATCAACAGGCTGGGGCAGATTGCCTGCCGCTTTGGCACGCTGATGACCGCCGGCGCGGCGGCGGGCATTATC
GCCCGCCCGGTGGATATCATCAGCGGGCAGAAGTTTCTCTCCGGCGACGACGAGCTGGACTTCGTGCTGCCGTCGCGCCT
GCCGGTCGAATGGCAGCGCTACTGGCGCAGCGGCAACCCGGCAGAGGGCGTGCTGGGGCGCGGCTGGAGCCTGTTCTGGG
AAAGCAGCCTGCAGCGTCACGGCGACGGGCTGGCGTGGCGCGCGCCGTCCGGCGACCTCGTCTCATTCCCGATGGTGCCG
TGCGGCCATAAAACCTACTGCGAGGCGGAAAAGTGCTGGCTGATGCACAACGCCGACGGCAGCTGGCGGCTGTACGACGT
CGGCGAACAGTCCTGGCACTATCCGCTGCTGGACGGGGAGCATCCCGCCCGGCTGAACATGCTGACGGACGCCGCCGGCA
ACGCCACCTCGCTGTTTTATGACGAGCAGGGACGGCTGGGCGAACTGGTGGACGGCGCGGGCCAGCGCCTGGGCTGCCGC
TATCTGACCACCGCGGCCGGGCGGTCACGCCTGAGCGCGGTGCTGCTGCACACCCCGGACGGGGAGCGCACGCTGGTCAG
CTACGCGTATGACGACGAAGGGCAGCTTGTCACCGTGCGCAACCGCGCGGGCGAGGTCACGCGCCGCTTCACCTGGCGCG
ACGGGCTGATGGCCAGCCACCAGGACGCCAACGGGCTGCTGAACGAATATCAGTGGCGGGAAATTGACGGCCTGCCGCGC
GTCACCGCCTGGCGACACAGCGCCGGGGAAGAGCTGGCGCTGCACTACGACTTTAACGGCGGCACGCGCCGGGCGGTGCG
CGACGACGGCATGCAGGCCAGCTGGCAGCTGGACGACGACGACAGCGTGGCGCAGTTCACCGACTTCGACGGCAGCCGGC
TGGCGTTCGTCTACGCGCGCGGCGAGCTGTGCGGCGTGCTGCTGCCGGGCGGCGGCCAGCGCCGGAGCGAGTGGGACCGC
TACGGGCGGCTGCTGAGCGAGACCGACCCGACCGGGCGCAAAACCCTTTACCAGTACGACCGTAACGGCGACCGCCTGGT
CTGCGTCACCCACCCGGACGGCGGCCGCGAGTATCAGCAGTGGGATGACCGGGGCCGCCTGGTTAAACAGACCGACGCGG
CGGAAAACAGCACGCTTTATCACTATCCGGATGAAGAAGAGAGCCTGCCGACGCGCATCACCGACGCCCTCGGCGGCGTG
GCCCGGCCTGAGTGGAACGGCCGGGGGCTGCTGACGCGCTATACCGACTGCTCCGGCAGCGTCACCGCGTACGGCTATGA
CGTTTTTGGCCAGCTCACCGACCGCACCGATGCGGAAGGCAACGTGACCCGGTATCTCTGGGACGCCGCCGGGCGTCTGC
AAACCCTGCGCCACGCGGACGGCAGCGAGGAGCACTTCGCCTGGAACGAACGCGGGCAGCTGGCGCGCCATCAGGACCCG
CTCGGCGGCGAGACGCGCTGGCGCTACAACCTGCTGGGCCAGCCGCTCAGCGTCACCGACCGCATCGACCGCACGCGCAG
CTGGCACTACAGCCCGCGCGGCTGGCTGACGCGGCTGGAGAACGGCAACGGCGGCGAGTACCGGTTCAGCTACGACGCCG
CCGGGCGCCTGACCGGCGAACGCCGCCCGGACAACACCGACCGCCTGTACCGCTACGGCCCGGACGGCCAGCTTGCCGAA
CGCCGGGAAACCAGCCCGCAGGACGGCCTTACGCCGCCGCCTCACCGCCTGCACCGCTTCCGCTATGACGAGGCGGGGCG
GCTGGAATGGCGCGGCAACGACAGCGCCGAATGGCGGTATCACTACGACGCGGCGGGCAGGCTGAACGCGCTGGCGCGTA
CGCCGACCGCCGCCGGGGCGGCGCTGGGGATTGAGGCGGACCGCGTTGAGCTGAAGTACGACGGGGCGGGCAACCTGCCG
TGCGAGCGCGGCGTGAACGGCGAGCTGGGCTACCGGTGGGACGCGCTGGCCAACCTGCAGGCATTGACGCTGCCGCAGGG
CGACGGCCTGCAGTGGCTGCACTACGGCTCCGGCCACGTCAGCGCGGTGAAATTTAACCGGCAGCTGGTCAGCGAATTTA
CCCGCGACCGCCTGCACCGCGAAACCGGGCGCAGCCAGGGCGCGCTGCACCAGCAGCGCCGCTACGACGCGCTGGGCAGG
CGCAGCTGGCAGAGCAGCGCCTTCAGTGACGGCAGGATAACCCGGCCGGAAGAGGGCATCCTGTGGCGGGCGTTCCGCTA
TACCGGGCGTGGCGAGCTGGCGGGCGTCAGCGACGCGCTGCGCGGCGAGGTGCACTACGGCTACGACGCCGAAGGCCGCC
TGTTGCAGCACCGCGAGCTGAAGTCCGGCAGGGTGGGGGGCAGGCTGCTGTATGACGCCGCCGACAACCTGCTGGGCGAG
CAGGCCCCGCACGACGACCCGGAACAGCACCCGCCGCCGCGGCCGCTGGGCGACAACCGCCTGGCGCACTGGCGGCGGCT
GTTCTACCGCTACGACGCCTGGGGCAACCTGGTCAGCCGCCGCCACGGCGTCAGCGAGCAACATTACACCTATGACGCCG
ACAACCGCCTGATACGGGCCCGGGGCTTCGGCCCGCAGGGGGAGTTCAGCGCGCGGTATCACTACGACGCGCTGGGCAGG
CGCACCCGCAAGGAGGTGACGTACGCAGGAAAATCCGCGCAGACCACGCGCTTCCTGTGGCAGGGCTACCGCTTATTGCA
GGAGCAGCGGGCCAACGGCACGCGCCGCACCTGGAGCTACGACCCGGAAAGCCCGTGGACGCCGCTGGCGGCCATCGAAC
AGGCGGGAGAGGGGCCGGAGGCGGACATTTACTGGCTGAACAGCGACCTGAACGGCGCGCCGCTGGAGGTCACTGACGCC
GAAGGTAATCTGCGCTGGTCGGGTCAGTACGACACCTTCGGCAGGCTGCTGGGCCAGACGGTGGCGGGTTCAGCACAACG
CACGGGGCCGGTGTACGACCAGCCGCTGCGTTATGCGGGGCAGTACCAGGACAACGAGAGCGGACTGCACTATAATCTGT
TCCGTTACTACGAGCCGGAGGTGGGCAGGTTTACCACCCAGGACCCGGTGGGGCTGGCGGGGGGGATGAACCTGTATGCG
TATGCGCCGAATCCGCTTAGCTGGATCGATCCTCTTGGTTTAACGAAATGTTCGCCGAACAAGAAAACGACTTATGAAGG
TGTCAGCCGCAGAGATGCGCTCAGGCAGGCTAAACGTGATGCTGGCATACCAAATAATCAACATCCCAAATCAATAAATC
GTCCTGACTTGATGGATGGTTATGGTAAAAGATTTTTGGATGATAATGGAAAAGTAGTAAGAACAAGAGAATATGAGTTT
ACCAATATTGACGGCAAGGCTGTTTATATTCAGGAACATAGCTTAGGTCATGCTAAAGCAACACCTTTACATGGGGCTGA
ACCTCACTTTAATGTGAGGCCAATAGATGATTTAAGTGGCGAAGTATTAAATACTGGTAGTGTTCTTGGTACTCATGGTC
ACTATAATTTTTAA

Protein sequence :
MSEAARVGDAIGHSAALAGMISGTLVGGLIAAAGSVAAGALFVAGLASSCLGIGVLLIGASLALGYLAGEAGTAARDGIA
AAGAGSVSASGQILTGSPDVFINGKPAAIATVSQAGCDKDGPSMQMAQGSDRVFINGRPAARVGDKTNCDATVMAGSPSV
RIGGGTVTTLAIKPEVPEWAYKASDLTLLLTGLLGGAGGALSKVGKVGKLLGRLPGINRLGQIACRFGTLMTAGAAAGII
ARPVDIISGQKFLSGDDELDFVLPSRLPVEWQRYWRSGNPAEGVLGRGWSLFWESSLQRHGDGLAWRAPSGDLVSFPMVP
CGHKTYCEAEKCWLMHNADGSWRLYDVGEQSWHYPLLDGEHPARLNMLTDAAGNATSLFYDEQGRLGELVDGAGQRLGCR
YLTTAAGRSRLSAVLLHTPDGERTLVSYAYDDEGQLVTVRNRAGEVTRRFTWRDGLMASHQDANGLLNEYQWREIDGLPR
VTAWRHSAGEELALHYDFNGGTRRAVRDDGMQASWQLDDDDSVAQFTDFDGSRLAFVYARGELCGVLLPGGGQRRSEWDR
YGRLLSETDPTGRKTLYQYDRNGDRLVCVTHPDGGREYQQWDDRGRLVKQTDAAENSTLYHYPDEEESLPTRITDALGGV
ARPEWNGRGLLTRYTDCSGSVTAYGYDVFGQLTDRTDAEGNVTRYLWDAAGRLQTLRHADGSEEHFAWNERGQLARHQDP
LGGETRWRYNLLGQPLSVTDRIDRTRSWHYSPRGWLTRLENGNGGEYRFSYDAAGRLTGERRPDNTDRLYRYGPDGQLAE
RRETSPQDGLTPPPHRLHRFRYDEAGRLEWRGNDSAEWRYHYDAAGRLNALARTPTAAGAALGIEADRVELKYDGAGNLP
CERGVNGELGYRWDALANLQALTLPQGDGLQWLHYGSGHVSAVKFNRQLVSEFTRDRLHRETGRSQGALHQQRRYDALGR
RSWQSSAFSDGRITRPEEGILWRAFRYTGRGELAGVSDALRGEVHYGYDAEGRLLQHRELKSGRVGGRLLYDAADNLLGE
QAPHDDPEQHPPPRPLGDNRLAHWRRLFYRYDAWGNLVSRRHGVSEQHYTYDADNRLIRARGFGPQGEFSARYHYDALGR
RTRKEVTYAGKSAQTTRFLWQGYRLLQEQRANGTRRTWSYDPESPWTPLAAIEQAGEGPEADIYWLNSDLNGAPLEVTDA
EGNLRWSGQYDTFGRLLGQTVAGSAQRTGPVYDQPLRYAGQYQDNESGLHYNLFRYYEPEVGRFTTQDPVGLAGGMNLYA
YAPNPLSWIDPLGLTKCSPNKKTTYEGVSRRDALRQAKRDAGIPNNQHPKSINRPDLMDGYGKRFLDDNGKVVRTREYEF
TNIDGKAVYIQEHSLGHAKATPLHGAEPHFNVRPIDDLSGEVLNTGSVLGTHGHYNF
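
The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the standard codon assignments (which NCBI translation table 11, commonly used for bacteria, shares with table 1) and that the DNA is listed in coding orientation, which is consistent with it starting with ATG and ending with TAA even though the gene lies on the minus strand. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last five codons of the DNA sequence in this record.
print(translate("ATGAGTGAAGCCGCACGC"))
print(translate("CACTATAATTTTTAA"))
```

Passing the full 4254 bp sequence to `translate` in the same way would allow a comparison against the complete protein sequence listed in the record.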

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
YpsIP31758_3692 | YP_001402646.1 | RHS/YD repeat-containing protein | Not tested | YAPI | Protein | 0.0 | 48
api89 | CAF28563.1 | putative membrane-bound sugar-binding protein | Not tested | YAPI | Protein | 0.0 | 48