Gene Information

Name : EAM_2423 (EAM_2423)
Accession : YP_003539494.1
Strain : Erwinia amylovora ATCC 49946
Genome accession: NC_013971
Putative virulence/resistance : Unknown
Product : Rhs family protein
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3209
EC number : -
Position : 2642366 - 2646646 bp
Length : 4281 bp
Strand : +
Note : -

DNA sequence :
ATGAGTGAAGCCGCACGCGTCGGCGACGCCATCGGCCATTCCTCCGCGCTGGCCGGGATGACGGGCGGCACCATAGTCGG
CGGGCTGATTGCCGCCGCCGGCGCCGTGGCCGCCGGGGCGCTGTTTGTCGCCGGGCTGGCCGCCTCCTGTCTTGGCGTTG
GCGTCTTGCTGATGGGCGCCAGCCTGGCGGTGGGCTATCTCACCGGGGAGGCGGCCACCGCGGCGCGCGACGGCATGGCC
GCTGCCGGGGCGGACAGACGGTCCGCTTCCGGCCAGATACTGACCGGCTCACCGAACGTGTTTATCAACGGCAAACCGGC
GGCCATCGCCACCGTCAGCCAGGCGGGCTGTGACCGGGACGGGCCGACGATGCAGATGGCGCAGGGCTCCGCCCGGGTGT
TTATCAACGGCCAGCCCGCCGCGCGCGTCGGCGACAAAACCAACTGCGGTGCCACGGTGATGGCAGGCTCGCCCAGCGTG
CGCATCGGCGGCGGCACCGCCACCACGCTGACGATAAAACCCGAAGTGCCGGACCGGGCCTATAAGGCCTCGGACCTGAC
GCTGCTGTTTGCCGGGCTGCTTGGCGGCGCGGGCGGCGCGGCCGGCAAGGCGGGCAAACTGGCTGAACTGCTGAGCAGGC
TGCCCGGCATCAACAGGCTTGGCCAGGTGGCCTGCCGCTTCGGCGTGCTGATGACCGCCAGCGCGGCGGCGGGCATCATC
GCCCGCCCGGTGGATATCATCAGCGGGCAGAAGTTTCTCTCCGGCGACGACGAGCTGGACTTTGTGCTGCCGTCGCGCCT
GCCGGTTGAATGGCAGCGCTACTGGCGCAGCGGCAACCCGGCGGAAAGCGTGCTGGGGCGCGGCTGGAGCCTGTTCTGGG
AAAGTACCCTCCAGCCTTACGCCGACGGGCTGGTGTGGCGCGCGCCGTCCGGCGACCTGGTTTCGTTCCCGATGGTGCCG
CGCGGCCATAAAACCTGGTGTGAAGCCGAAAAGTGCTGGCTGATGCACAACGCCGACGACAGCTGGCAGCTGTTCGACGT
CAGTGAACAGGCCTGGCACTATCCGCCGCTGGACGCGCAGTATCCCGCCCGCCTGAGCATGGTGACCGACGCCGGCGGCA
ACGCCACCTCGCTGTTTTACGACGAGCAGGGGCGGCCGGGCGAACTGGTGGACAGCGCCGGCCAGCGCCTGAGCTGCCGC
TACCTGACGACCGCCGGCGGGCATTGCCGCCTGAGCGCGGTGCTGCTGCATACCGCGGACGGGGAGCACACGCTGGTCAG
CTACGGGTATGACGACGACGGGCAGCTCGCCAGCGTGCGCAACCGCGCCGGCGAGGTCACGCGCCGCTTCACCTGGCATG
ACGGGCTGATGGCCAGCCACGAGGATGCCAACGGGCTGCGGAACGAATACCGCTGGCAGGAGATTGACGGCCTGCCGCGC
GTCACCGCCTGGCGGCACGGCGCCGGGGAAGCGCTGGCGCTGCACTACGACATTAACGGCGGCACGCGCCGGGCGGTGCG
CGACGACGGCATGCAGGCGTGCTGGCAGCTGGACGACGACGACAGCGTGGCGCAGTTCACCGACTTTGACGGCCGCAGGC
TGGCGTTTATCTACGCGCGCGGCGAGCTGTGCAGCGTGCTGCTGCCGGGCGGCGGCCAGCGGCACAGCGAGTGGGACCGC
TACGGGCGACTGCTGAACGAAACCGACCCGTCAGGGCGTAAAACCACCTGTCAGTATGCGCGTAACAGCGACCGTCTGGT
TTCGGTCACCCATCCCGACGGCAGCCGTGAGTGCCAGTCATGGGATGACAGGGGGCGGCTGATTACACAGAGCGACGCGC
TGGGAAACACCACGCTTTACCACTACCCGGACGGGGAAGAAAGCTTACCGGCGCGCATCACCGATGCCCTCGGCGGCGTG
GCGCGGCTTGAGTGGGACGGCCGGGGGCTGCTGACGCGCTATACCGACTGTTCCGGCAGCGTCACCGCGTACGACTATGA
CATTTTCGGCCAGCTCACCGGGCGCACCGATGCGGAAGGCAACGTGACCCGCTACCGCCGGGATACCGCCGGTCGCCTGC
AAACCCTGCAGCACGCGGACGGCAGCGAAGAGCACTTCGTCTGGAACGAACGCGGGCAGCTGGCGCGCCATCAGGACCCG
TCCGGCAGCGAAACGCAGTGGCGCTACAACCTGCTGGGCCAGCCGGTCAGCGTCACCGACCGCATCAACCGCACGCGCCA
CTACCACTACGGCCCGCGCGGCTGGCTGACGCGGCTGGAGAACGGCAACGGCGGCGAGTATCAGTTCAGCTACGATGCTG
CCGGGCGCATCACCGCCGAACGCCGCCCGGACAACACCGACCACCTCTATCGCTACGGCGCGGACGGCCAGCTTGCCGAA
CACCGGGAAACCGGCCCGCAGAACAGCCTTGCGCCGCCCGCGCACCGCCTGCACCGCTTCCGCTTTGACGAGGCGGGCCG
CCTGGCGTGGCGCGGCAACGACAGCGCCGAATGGCAGTATCACTACGATGCCGCAGGCAGGCTGACCCGGCTTGTGCGTA
CCCCCACGGCCGCCGGGGCGGAGCTGGGGATTGAGGCGGACAGCGTTGAGCTGCAGTACGACAAAGCGGGTCACCTGCTG
TGCGAGCGCGGCGTGAACGGCGCGCCGGTCTACAGCCGGGACGCGCTCGGCAACCTGCAGGCGCTGACGCTGCCGCAGGG
CGACCGCCTGCAGTGGCTGCACTACGGCTCCGGCCATGCCGGCGCGCTGAAATTCAACCGGCAGGCGGTGAGCGAATTCA
CCCGTGACCGCCTGCACCGTGAAACCGGGCGCAGCCAGGGCGCGCTGCACCAGCAGCGCCGCTACGATGCGTCCGGCAGG
CGCAGCTGGCAGAGCAGCACCTTCGGGGACGGCCAGATAACCCGGCCGGAAGACGGCATGCTGTGGCGGGCGTTCCGCTA
CACCGGGCGCGGCGAGCTGGCGGGCGTCAGCGACGCGCTGCGCGGCGAAGTGCACTACGGCTACGACGCCGAAGGCCGCC
TGCTGCAGCACCGCGAGCTGCAGTCCGGCAGGACGGGCAGCCGGCTGGTGTATGACGCCGCCGACAACCTGCTGGGCGGG
CAAAGCCCGCACGACGACCCGGAACGGCCGCCGCCGCCGCCGCAGAGCAGCAACCGTCTGCCGCACTGGCAGCGGCTGTT
CTACCGCTACGACGTCTGGGGCAACCTGGTCAGCCGCCGCCACGGCCTCAACGAGCAGCATTACACTTACGATGCCGACA
ACCGCCTGATACGGGCGCGTGGCTCCGGTCCTCAGGGCGAGTTCAGCGCGCAGTACCATTATGACGCGCTGGGCCGGCGC
AGCCGCAAGGAGGTCACCTTCGCGGGCAAAGCCCCGCAGACCACGCGCTTCCTGTGGCAGGGCTACCGGCTGCTGCAGGA
GCAGCGCGCCAACGGCACGCGGCGTACCTGGAGCTATGACCCGGAAAGCCCGTGGACGCCGCTGGCGGCCATCGAGCAGG
CCGGGGAAGGGCCACAGGCGGATATTTACTGGCTGAACACCGACCTCAACGGCGCGCCGCTGGAGGTGACCGACGCCGAT
GGCAGGCTGCGCTGGTCGGGACAGTACGACACCTTCGGCAGGCTGCAGGGCCAGACGACGGCCGGTGCGGCACAACGCAC
GGGGCCGGTGTACGACCAGCCGCTGCGCTACGCCGGGCAGTATGCTGACAGTGAAACGGGACTGCACTATAATCTGTTCC
GTTACTACGAGCCTGACGTTGGCAGGTTTACGACCCAGGACCCTGTGGGGCTGGCGGGGGGGCTGAACCTGTATGCGTAT
GCGCCGAATCCGTACGGGTGGGTGGATCCGCTGGGGCTGGCGAAGTGCGGTAATAATGAGAAATCATCTTATAAAGGTCC
TGAACTTCCCGGCAGTATTGCTGAAACCTTTGACAAAGGGATTTATAAAAACAGGCAACTTTACAAGTCTGAAACGTTCT
ATAAATATCATGGGTTAAATAATAGAACTGGCAGAAAATATTCATGGCTAACTAATGAGCGATATGGTTCTGAAGAAATG
TTAAGACAAAAGCTTGCGATTCGACATGATTGGGGCGTTGTCATTACAAAAGTATCTGAATTTAAAGTTCCTCAAGGTAC
ATGGATCAGTGAGGGGCCAGCCGCAGCTCAAGGGGCTGGTTATCCTGGGTTAGGATATCAAGCAGTAGTATCTAATTTAC
CTAAATCATGGATTATTAACACACTAAAGGTTCCTTGGTAA

Protein sequence :
MSEAARVGDAIGHSSALAGMTGGTIVGGLIAAAGAVAAGALFVAGLAASCLGVGVLLMGASLAVGYLTGEAATAARDGMA
AAGADRRSASGQILTGSPNVFINGKPAAIATVSQAGCDRDGPTMQMAQGSARVFINGQPAARVGDKTNCGATVMAGSPSV
RIGGGTATTLTIKPEVPDRAYKASDLTLLFAGLLGGAGGAAGKAGKLAELLSRLPGINRLGQVACRFGVLMTASAAAGII
ARPVDIISGQKFLSGDDELDFVLPSRLPVEWQRYWRSGNPAESVLGRGWSLFWESTLQPYADGLVWRAPSGDLVSFPMVP
RGHKTWCEAEKCWLMHNADDSWQLFDVSEQAWHYPPLDAQYPARLSMVTDAGGNATSLFYDEQGRPGELVDSAGQRLSCR
YLTTAGGHCRLSAVLLHTADGEHTLVSYGYDDDGQLASVRNRAGEVTRRFTWHDGLMASHEDANGLRNEYRWQEIDGLPR
VTAWRHGAGEALALHYDINGGTRRAVRDDGMQACWQLDDDDSVAQFTDFDGRRLAFIYARGELCSVLLPGGGQRHSEWDR
YGRLLNETDPSGRKTTCQYARNSDRLVSVTHPDGSRECQSWDDRGRLITQSDALGNTTLYHYPDGEESLPARITDALGGV
ARLEWDGRGLLTRYTDCSGSVTAYDYDIFGQLTGRTDAEGNVTRYRRDTAGRLQTLQHADGSEEHFVWNERGQLARHQDP
SGSETQWRYNLLGQPVSVTDRINRTRHYHYGPRGWLTRLENGNGGEYQFSYDAAGRITAERRPDNTDHLYRYGADGQLAE
HRETGPQNSLAPPAHRLHRFRFDEAGRLAWRGNDSAEWQYHYDAAGRLTRLVRTPTAAGAELGIEADSVELQYDKAGHLL
CERGVNGAPVYSRDALGNLQALTLPQGDRLQWLHYGSGHAGALKFNRQAVSEFTRDRLHRETGRSQGALHQQRRYDASGR
RSWQSSTFGDGQITRPEDGMLWRAFRYTGRGELAGVSDALRGEVHYGYDAEGRLLQHRELQSGRTGSRLVYDAADNLLGG
QSPHDDPERPPPPPQSSNRLPHWQRLFYRYDVWGNLVSRRHGLNEQHYTYDADNRLIRARGSGPQGEFSAQYHYDALGRR
SRKEVTFAGKAPQTTRFLWQGYRLLQEQRANGTRRTWSYDPESPWTPLAAIEQAGEGPQADIYWLNTDLNGAPLEVTDAD
GRLRWSGQYDTFGRLQGQTTAGAAQRTGPVYDQPLRYAGQYADSETGLHYNLFRYYEPDVGRFTTQDPVGLAGGLNLYAY
APNPYGWVDPLGLAKCGNNEKSSYKGPELPGSIAETFDKGIYKNRQLYKSETFYKYHGLNNRTGRKYSWLTNERYGSEEM
LRQKLAIRHDWGVVITKVSEFKVPQGTWISEGPAAAQGAGYPGLGYQAVVSNLPKSWIINTLKVPW

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
YpsIP31758_3692 YP_001402646.1 RHS/YD repeat-containing protein Not tested YAPI Protein 0.0 47
api89 CAF28563.1 putative membrane-bound sugar-binding protein Not tested YAPI Protein 0.0 47