Gene Information

Name : EAM_0799 (EAM_0799)
Accession : YP_003537889.1
Strain : Erwinia amylovora ATCC 49946
Genome accession: NC_013971
Putative virulence/resistance : Unknown
Product : Rhs family protein
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3209
EC number : -
Position : 898987 - 903234 bp
Length : 4248 bp
Strand : +
Note : -

DNA sequence :
ATGAGTGAAGCCGCACGCGTCGGCGACGCCATCGGCCATTCCTCCGCGCTGGCCGGGATGACGGGCGGCACCATAGTCGG
CGGGCTGATTGCCGCCGCCGGCGCCGTGGCCGCCGGGGCGCTGTTTGTCGCCGGGCTGGCCGCCTCCTGTCTTGGCGTTG
GCGTCTTGCTGATGGGCGCCAGCCTGGCGGTGGGCTATCTCACCGGGGAGGCGGCCACCGCGGCGCGCGACGGCATGGCC
GCTGCCGGGGCGGACAGACGGTCCGCTTCCGGCCAGATACTGACCGGCTCACCGAACGTGTTTATCAACGGCAAACCGGC
GGCCATCGCCACCGTCAGCCAGGCGGGCTGTGACCGGGACGGGCCGACGATGCAGATGGCGCAGGGCTCCGCCCGGGTGT
TTATCAACGGCCAGCCCGCCGCGCGCGTCGGCGACAAAACCAACTGCGGTGCCACGGTGATGGCAGGCTCGCCCAGCGTG
CGCATCGGCGGCGGCACCGCCACCACGCTGACGATAAAACCCGAAGTGCCGGACCGGGCCTATAAGGCCTCGGACCTGAC
GCTGCTGTTTGCCGGGCTGCTTGGCGGCGCGGGCGGCGCGGCCGGCAAGGCGGGCAAACTGGCTGAACTGCTGAGCAGGC
TGCCCGGCATCAACAGGCTTGGCCAGGTGGCCTGCCGCTTCGGCGTGCTGATGACCGCCAGCGCGGCGGCGGGCATCATC
GCCCGCCCGGTGGATATCATCAGCGGGCAGAAGTTTCTCTCCGGCGACGACGAGCTGGACTTTGTGCTGCCGTCGCGCCT
GCCGGTTGAATGGCAGCGCTACTGGCGCAGCGGCAACCCGGCGGAAAGCGTGCTGGGGCGCGGCTGGAGCCTGTTCTGGG
AAAGTACCCTCCAGCCTTACGCCGACGGGCTGGTGTGGCGCGCGCCGTCCGGCGACCTGGTTTCGTTCCCGATGGTGCCG
CGCGGCCATAAAACCTGGTGTGAAGCCGAAAAGTGCTGGCTGATGCACAACGCCGACGACAGCTGGCAGCTGTTCGACGT
CAGTGAACAGGCCTGGCACTATCCGCCGCTGGACGCGCAGTATCCCGCCCGCCTGAGCATGGTGACCGACGCCGGCGGCA
ACGCCACCTCGCTGTTTTACGACGAGCAGGGGCGGCCGGGCGAACTGGTGGACAGCGCCGGCCAGCGCCTGAGCTGCCGC
TACCTGACGACCGCCGGCGGGCATTGCCGCCTGAGCGCGGTGCTGCTGCATACCGCGGACGGGGAGCACACGCTGGTCAG
CTACGGGTATGACGACGACGGGCAGCTCGCCAGCGTGCGCAACCGCGCCGGCGAGGTCACGCGCCGCTTCACCTGGCATG
ACGGGCTGATGGCCAGCCACGAGGATGCCAACGGGCTGCGGAACGAATACCGCTGGCAGGAGATTGACGGCCTGCCGCGC
GTCACCGCCTGGCGGCACGGCGCCGGGGAAGCGCTGGCGCTGCACTACGACATTAACGGCGGCACGCGCCGGGCGGTGCG
CGACGACGGCATGCAGGCGTGCTGGCAGCTGGACGACGACGACAGCGTGGCGCAGTTCACCGACTTTGACGGCCGCAGGC
TGGCGTTTATCTACGCGCGCGGCGAGCTGTGCAGCGTGCTGCTGCCGGGCGGCGGCCAGCGGCACAGCGAGTGGGACCGC
TACGGGCGACTGCTGAACGAAACCGACCCGTCAGGGCGTAAAACCACCTGTCAGTATGCGCGTAACAGCGACCGTCTGGT
TTCGGTCACCCATCCCGACGGCAGCCGTGAGTGCCAGTCATGGGATGACAGGGGGCGGCTGATTACACAGAGCGACGCGC
TGGGAAACACCACGCTTTACCACTACCCGGACGGGGAAGAAAGCTTACCGGCGCGCATCACCGATGCCCTCGGCGGCGTG
GCGCGGCTTGAGTGGGACGGCCGGGGGCTGCTGACGCGCTATACCGACTGTTCCGGCAGCGTCACCGCGTACGACTATGA
CATTTTCGGCCAGCTCACCGGGCGCACCGATGCGGAAGGCAACGTGACCCGCTACCGCCGGGATACCGCCGGTCGCCTGC
AAACCCTGCAGCACGCGGACGGCAGCGAAGAGCACTTCGTCTGGAACGAACGCGGGCAGCTGGCGCGCCATCAGGACCCG
TCCGGCAGCGAAACGCAGTGGCGCTACAACCTGCTGGGCCAGCCGGTCAGCGTCACCGACCGCATCAACCGCACGCGCCA
CTACCACTACGGCCCGCGCGGCTGGCTGACGCGGCTGGAGAACGGCAACGGCGGCGAGTATCAGTTCAGCTACGATGCTG
CCGGGCGCATCACCGCCGAACGCCGCCCGGACAACACCGACCACCTCTATCGCTACGGCGCGGACGGCCAGCTTGCCGAA
CACCGGGAAACCGGCCCGCAGAACAGCCTTGCGCCGCCCGCGCACCGCCTGCACCGCTTCCGCTTTGACGAGGCGGGCCG
CCTGGCGTGGCGCGGCAACGACAGCGCCGAATGGCAGTATCACTACGATGCCGCAGGCAGGCTGACCCGGCTTGTGCGTA
CCCCCACGGCCGCCGGGGCGGAGCTGGGGATTGAGGCGGACAGCGTTGAGCTGCAGTACGACAAAGCGGGTCACCTGCTG
TGCGAGCGCGGCGTGAACGGCGCGCCGGTCTACAGCCGGGACGCGCTCGGCAACCTGCAGGCGCTGACGCTGCCGCAGGG
CGACCGCCTGCAGTGGCTGCACTACGGCTCCGGCCATGCCGGCGCGCTGAAATTCAACCGGCAGGCGGTGAGCGAATTCA
CCCGTGACCGCCTGCACCGTGAAACCGGGCGCAGCCAGGGCGCGCTGCACCAGCAGCGCCGCTACGATGCGTCCGGCAGG
CGCAGCTGGCAGAGCAGCACTTTCGGTGACGGCCAGATAACCCGGCCGGAAGACGGCATGCTGTGGCGGGCGTTCCGCTA
CACCGGGCGCGGCGAGCTGGCGGGCGTCAGCGACGCGCTGCGCGGCGAAGTGCACTACGGCTACGACGCCGAAGGCCGCC
TGCTGCAGCACCGCGAGCTGCAGTCCGGCAGGACGGGCAGCCGGCTGGTGTATGACGCCGCCGACAACCTGCTGGGCGGG
CAAAGCCCGCACGACGACCCGGAACGGCCGCCGCCGCCGCCGCAGAGCAGCAACCGTCTGCCGCACTGGCAGCGGCTGTT
CTACCGCTACGACGTCTGGGGCAACCTGGTCAGCCGCCGCCACGGCCTCAACGAGCAGCATTACACTTACGACGCCGACA
ACCGCCTGATACGGGCGCGTGGCTCCGGTCCTCAGGGCGAGTTCAGCGCGCAGTACCATTATGACGCGCTGGGCCGGCGC
AGCCGCAAGGAGGTCACCTTCGCGGGCAAAGCCCCGCAGACCACGCGCTTCCTGTGGCAGGGCTACCGGCTGCTGCAGGA
GCAGCGCGCCAACGGCACACGGCGTACCTGGAGCTATGACCCGGAAAGCCCGTGGACGCCGCTGGCGGCCATCGAGCAGG
CCGGGGAAGGGCCACAGGCGGATATTTACTGGCTGAACACCGACCTCAACGGCGCGCCGCTGGAGGTGACCGACGCCGAT
GGCAGGCTGCGCTGGTCGGGACAGTACGACACCTTCGGCAGGCTGCAGGGCCAGACGACGGCCGGTGCGGCACAACGCAC
GGGGCCGGTTTACGACCAGCCGCTGCGCTACGCCGGGCAGTATGCTGACAGTGAAACGGGACTGCACTATAATCTGTTCC
GGTACTACGAGCCTGACGTTGGCAGGTTTACGACCCAGGACCCTGTGGGGCTGGCGGGGGGCCTGAACCTGTATGCGTAT
GCGCCGAATCCGTACGGGTGGGTGGATCCTCTTGGTTTAACGAAATGTTCGCCGAACAAGAAAACGACTTATGAAGGTGT
CAGCCGCAGAGATGCACTCAGGCAGGCTAAACGTGATGCGGGCATACCTAATAACCAGCAGCCTTCAAAGATTGTCAGAC
CAGAGCTAAGAGATGGTAACGGCAACATAATGATTGGCAAAAATAATCAACCAATCAGGACTAGAGAATACCATTTTGTT
AATAAAGACAATAAAACTGTGTTGATTCAAGAGCATAGTTTAGGCCATCAAAAAGCTGTTCCCGGACACGGTGCAGAGCC
GCATTTTAATACCAGAAGTATTGATAGGCCAGATGCAGGAAACTTTCCCGAAACACACGGGCACTACAATTTTCCGTGGA
GTTATTAG

Protein sequence :
MSEAARVGDAIGHSSALAGMTGGTIVGGLIAAAGAVAAGALFVAGLAASCLGVGVLLMGASLAVGYLTGEAATAARDGMA
AAGADRRSASGQILTGSPNVFINGKPAAIATVSQAGCDRDGPTMQMAQGSARVFINGQPAARVGDKTNCGATVMAGSPSV
RIGGGTATTLTIKPEVPDRAYKASDLTLLFAGLLGGAGGAAGKAGKLAELLSRLPGINRLGQVACRFGVLMTASAAAGII
ARPVDIISGQKFLSGDDELDFVLPSRLPVEWQRYWRSGNPAESVLGRGWSLFWESTLQPYADGLVWRAPSGDLVSFPMVP
RGHKTWCEAEKCWLMHNADDSWQLFDVSEQAWHYPPLDAQYPARLSMVTDAGGNATSLFYDEQGRPGELVDSAGQRLSCR
YLTTAGGHCRLSAVLLHTADGEHTLVSYGYDDDGQLASVRNRAGEVTRRFTWHDGLMASHEDANGLRNEYRWQEIDGLPR
VTAWRHGAGEALALHYDINGGTRRAVRDDGMQACWQLDDDDSVAQFTDFDGRRLAFIYARGELCSVLLPGGGQRHSEWDR
YGRLLNETDPSGRKTTCQYARNSDRLVSVTHPDGSRECQSWDDRGRLITQSDALGNTTLYHYPDGEESLPARITDALGGV
ARLEWDGRGLLTRYTDCSGSVTAYDYDIFGQLTGRTDAEGNVTRYRRDTAGRLQTLQHADGSEEHFVWNERGQLARHQDP
SGSETQWRYNLLGQPVSVTDRINRTRHYHYGPRGWLTRLENGNGGEYQFSYDAAGRITAERRPDNTDHLYRYGADGQLAE
HRETGPQNSLAPPAHRLHRFRFDEAGRLAWRGNDSAEWQYHYDAAGRLTRLVRTPTAAGAELGIEADSVELQYDKAGHLL
CERGVNGAPVYSRDALGNLQALTLPQGDRLQWLHYGSGHAGALKFNRQAVSEFTRDRLHRETGRSQGALHQQRRYDASGR
RSWQSSTFGDGQITRPEDGMLWRAFRYTGRGELAGVSDALRGEVHYGYDAEGRLLQHRELQSGRTGSRLVYDAADNLLGG
QSPHDDPERPPPPPQSSNRLPHWQRLFYRYDVWGNLVSRRHGLNEQHYTYDADNRLIRARGSGPQGEFSAQYHYDALGRR
SRKEVTFAGKAPQTTRFLWQGYRLLQEQRANGTRRTWSYDPESPWTPLAAIEQAGEGPQADIYWLNTDLNGAPLEVTDAD
GRLRWSGQYDTFGRLQGQTTAGAAQRTGPVYDQPLRYAGQYADSETGLHYNLFRYYEPDVGRFTTQDPVGLAGGLNLYAY
APNPYGWVDPLGLTKCSPNKKTTYEGVSRRDALRQAKRDAGIPNNQQPSKIVRPELRDGNGNIMIGKNNQPIRTREYHFV
NKDNKTVLIQEHSLGHQKAVPGHGAEPHFNTRSIDRPDAGNFPETHGHYNFPWSY

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
YpsIP31758_3692 YP_001402646.1 RHS/YD repeat-containing protein Not tested YAPI Protein 0.0 47
api89 CAF28563.1 putative membrane-bound sugar-binding protein Not tested YAPI Protein 0.0 47