Gene Information

Name : EAMY_2782 (EAMY_2782)
Accession : YP_003532137.1
Strain : Erwinia amylovora CFBP1430
Genome accession: NC_013961
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3209
EC number : -
Position : 2867965 - 2872212 bp
Length : 4248 bp
Strand : -
Note : YD repeat, RHS protein

DNA sequence :
ATGAGTGAAGCCGCACGCGTCGGCGACGCCATCGGCCATTCCTCCGCGCTGGCCGGGATGACGGGCGGCACCATAGTCGG
CGGGCTGATTGCCGCCGCCGGCGCCGTGGCCGCCGGGGCGCTGTTTGTCGCCGGGCTGGCCGCCTCCTGTCTTGGCGTTG
GCGTCTTGCTGATGGGCGCCAGCCTGGCGGTGGGCTATCTCACCGGGGAGGCGGCCACCGCGGCGCGCGACGGCATGGCC
GCTGCCGGGGCGGACAGACGGTCCGCTTCCGGCCAGATACTGACCGGCTCACCGAACGTGTTTATCAACGGCAAACCGGC
GGCCATCGCCACCGTCAGCCAGGCGGGCTGTGACCGGGACGGGCCGACGATGCAGATGGCGCAGGGCTCCGCCCGGGTGT
TTATCAACGGCCAGCCCGCCGCGCGCGTCGGCGACAAAACCAACTGCGGTGCCACGGTGATGGCAGGCTCGCCCAGCGTG
CGCATCGGCGGCGGCACCGCCACCACGCTGACGATAAAACCCGAAGTGCCGGACCGGGCCTATAAGGCCTCGGACCTGAC
GCTGCTGTTTGCCGGGCTGCTTGGCGGCGCGGGCGGCGCGGCCGGCAAGGCGGGCAAACTGGCTGAACTGCTGAGCAGGC
TGCCCGGCATCAACAGGCTTGGCCAGGTGGCCTGCCGCTTCGGCGTGCTGATGACCGCCAGCGCGGCGGCGGGCATCATC
GCCCGCCCGGTGGATATCATCAGCGGGCAGAAGTTTCTCTCCGGCGACGACGAGCTGGACTTTGTGCTGCCGTCGCGCCT
GCCGGTTGAATGGCAGCGCTACTGGCGCAGCGGCAACCCGGCGGAAAGCGTGCTGGGGCGCGGCTGGAGCCTGTTCTGGG
AAAGTACCCTCCAGCCTTACGCCGACGGGCTGGTGTGGCGCGCGCCGTCCGGCGACCTGGTTTCGTTCCCGATGGTGCCG
CGCGGCCATAAAACCTGGTGTGAAGCCGAAAAGTGCTGGCTGATGCACAACGCCGACGACAGCTGGCAGCTGTTCGACGT
CAGTGAACAGGCCTGGCACTATCCGCCGCTGGACGCGCAGTATCCCGCCCGCCTGAGCATGGTGACCGACGCCGGCGGCA
ACGCCACCTCGCTGTTTTACGACGAGCAGGGGCGGCCGGGCGAACTGGTGGACAGCGCCGGCCAGCGCCTGAGCTGCCGC
TACCTGACGACCGCCGGCGGGCATTGCCGCCTGAGCGCGGTGCTGCTGCATACCGCGGACGGGGAGCACACGCTGGTCAG
CTACGGGTATGACGACGACGGGCAGCTCGCCAGCGTGCGCAACCGCGCCGGCGAGGTCACGCGCCGCTTCACCTGGCATG
ACGGGCTGATGGCCAGCCACGAGGATGCCAACGGGCTGCGGAACGAATACCGCTGGCAGGAGATTGACGGCCTGCCGCGC
GTCACCGCCTGGCGGCACGGCGCCGGGGAAGCGCTGGCGCTGCACTACGACATTAACGGCGGCACGCGCCGGGCGGTGCG
CGACGACGGCATGCAGGCGTGCTGGCAGCTGGACGACGACGACAGCGTGGCGCAGTTCACCGACTTTGACGGCCGCAGGC
TGGCGTTTATCTACGCGCGCGGCGAGCTGTGCAGCGTGCTGCTGCCGGGCGGCGGCCAGCGGCACAGCGAGTGGGACCGC
TACGGGCGACTGCTGAGCGAAACCGACCCGTCAGGGCGTAAAACCACCTGTCAGTATGCGCGTAACAGCGACCGTCTGGT
TTCGGTCACCCATCCCGACGGCAGCCGTGAGTGCCAGTCATGGGATGACAGGGGGCGGCTGATTACACAGAGCGACGCGC
TGGGAAACACCACGCTTTACCACTACCCGGACGGGGAAGAAAGCTTACCGGCGCGCATCACCGATGCCCTCGGCGGCGTG
GCGCGGCTTGAGTGGGACGGCCGGGGGCTGCTGACGCGCTATACCGACTGTTCCGGCAGCGTCACCGCGTACGACTATGA
CATTTTCGGCCAGCTCACCGGGCGCACCGATGCGGAAGGCAACGTGACCCGCTACCGCCGGGATACCGCCGGTCGCCTGC
AAACCCTGCAGCACGCGGACGGCAGCGAAGAGCACTTCGTCTGGAACGAACGCGGGCAGCTGGCGCGCCATCAGGACCCG
TCCGGCAGCGAAACGCAGTGGCGCTACAACCTGCTGGGCCAGCCGGTCAGCGTCACCGACCGCATCAACCGCACGCGCCA
CTACCACTACGGCCCGCGCGGCTGGCTGACGCGGCTGGAGAACGGCAACGGCGGCGAGTATCAGTTCAGCTACGATGCTG
CCGGGCGCATCACCGCCGAACGCCGCCCGGACAACACCGACCACCTCTATCGCTACGGCGCGGACGGCCAGCTTGCCGAA
CACCGGGAAACCGGCCCGCAGAACAGCCTTGCGCCGCCCGCGCACCGCCTGCACCGCTTCCGCTTTGACGAGGCGGGCCG
CCTGGCGTGGCGCGGCAACGACAGCGCCGAATGGCAGTATCACTACGATGCCGCAGGCAGGCTGACCCGGCTTGTGCGTA
CCCCCACGGCCGCCGGGGCGGAGCTGGGGATTGAGGCGGACAGCGTTGAGCTGCAGTACGACAAAGCGGGTCACCTGCTG
TGCGAGCGCGGCGTGAACGGCGCGCCGGTCTACAGCCGGGACGCGCTCGGCAACCTGCAGGCGCTGACGCTGCCGCAGGG
CGACCGCCTGCAGTGGCTGCACTACGGCTCCGGCCATGCCGGCGCGCTGAAATTCAACCGGCAGGCGGTGAGCGAATTCA
CCCGTGACCGCCTGCACCGTGAAACCGGGCGCAGCCAGGGCGCGCTGCACCAGCAGCGCCGCTACGATGCGTCCGGCAGG
CGCAGCTGGCAGAGCAGCACTTTCGGTGACGGCCAGATAACCCGGCCGGAAGACGGCATGCTGTGGCGGGCGTTCCGCTA
CACCGGGCGCGGCGAGCTGGCGGGCGTCAGCGACGCGCTGCGCGGCGAAGTGCACTACGGCTACGACGCCGAAGGCCGCC
TGCTGCAGCACCGCGAGCTGCAGTCCGGCAGGACGGGCAGCCGGCTGGTGTATGACGCCGCCGACAACCTGCTGGGCGGG
CAAAGCCCGCACGACGACCCGGAACGGCCGCCGCCGCCGCCGCAGAGCAGCAACCGTCTGCCGCACTGGCAGCGGCTGTT
CTACCGCTACGACGTCTGGGGCAACCTGGTCAGCCGCCGCCACGGCCTCAACGAGCAGCATTACACTTACGACGCCGACA
ACCGCCTGATACGGGCGCGTGGCTCCGGTCCTCAGGGCGAGTTCAGCGCGCAGTACCATTATGACGCGCTGGGCCGGCGC
AGCCGCAAGGAGGTCACCTTCGCGGGCAAAGCCCCGCAGACCACGCGCTTCCTGTGGCAGGGCTACCGGCTGCTGCAGGA
GCAGCGCGCCAACGGCACGCGGCGTACCTGGAGCTATGACCCGGAAAGCCCGTGGACGCCGCTGGCGGCCATCGAGCAGG
CCGGGGAAGGGCCACAGGCGGATATTTACTGGCTGAACACCGACCTCAACGGCGCGCCGCTGGAGGTGACCGACGCCGAT
GGCAGGCTGCGCTGGTCGGGACAGTACGACACCTTCGGCAGGCTGCAGGGCCAGACGACGGCCGGTGCGGCACAACGCAC
GGGGCCGGTTTACGACCAGCCGCTGCGCTACGCCGGGCAGTATGCTGACAGTGAAACGGGACTGCACTATAATCTGTTCC
GTTACTACGAGCCTGACGTTGGCAGGTTTACGACCCAGGACCCTGTGGGGCTGGCGGGGGGCCTGAACCTGTATGCGTAT
GCGCCGAATCCGTACGGGTGGGTGGATCCTCTTGGTTTAACGAAATGTTCGCCGAACAAGAAAACGACTTATGAAGGTGT
CAGCCGCAGAGATGCACTCAGGCAGGCTAAACGTGATGCGGGCATACCTAATAACCAGCAGCCTTCAAAGATTGTCAGAC
CAGAGCTAAGAGATGGTAACGGCAACATAATGATTGGCAAAAATAATCAACCAATCAGGACTAGAGAATACCATTTTGTT
AATAAAGACAATAAAACTGTGTTGATTCAAGAGCATAGTTTAGGCCATCAAAAAGCTGTTCCCGGACACGGTGCAGAGCC
GCATTTTAATACCAGAAGTATTGATAGGCCAGATGCAGGAAACTTTCCCGAAACACACGGGCACTACAATTTTCCGTGGA
GTTATTAG

Protein sequence :
MSEAARVGDAIGHSSALAGMTGGTIVGGLIAAAGAVAAGALFVAGLAASCLGVGVLLMGASLAVGYLTGEAATAARDGMA
AAGADRRSASGQILTGSPNVFINGKPAAIATVSQAGCDRDGPTMQMAQGSARVFINGQPAARVGDKTNCGATVMAGSPSV
RIGGGTATTLTIKPEVPDRAYKASDLTLLFAGLLGGAGGAAGKAGKLAELLSRLPGINRLGQVACRFGVLMTASAAAGII
ARPVDIISGQKFLSGDDELDFVLPSRLPVEWQRYWRSGNPAESVLGRGWSLFWESTLQPYADGLVWRAPSGDLVSFPMVP
RGHKTWCEAEKCWLMHNADDSWQLFDVSEQAWHYPPLDAQYPARLSMVTDAGGNATSLFYDEQGRPGELVDSAGQRLSCR
YLTTAGGHCRLSAVLLHTADGEHTLVSYGYDDDGQLASVRNRAGEVTRRFTWHDGLMASHEDANGLRNEYRWQEIDGLPR
VTAWRHGAGEALALHYDINGGTRRAVRDDGMQACWQLDDDDSVAQFTDFDGRRLAFIYARGELCSVLLPGGGQRHSEWDR
YGRLLSETDPSGRKTTCQYARNSDRLVSVTHPDGSRECQSWDDRGRLITQSDALGNTTLYHYPDGEESLPARITDALGGV
ARLEWDGRGLLTRYTDCSGSVTAYDYDIFGQLTGRTDAEGNVTRYRRDTAGRLQTLQHADGSEEHFVWNERGQLARHQDP
SGSETQWRYNLLGQPVSVTDRINRTRHYHYGPRGWLTRLENGNGGEYQFSYDAAGRITAERRPDNTDHLYRYGADGQLAE
HRETGPQNSLAPPAHRLHRFRFDEAGRLAWRGNDSAEWQYHYDAAGRLTRLVRTPTAAGAELGIEADSVELQYDKAGHLL
CERGVNGAPVYSRDALGNLQALTLPQGDRLQWLHYGSGHAGALKFNRQAVSEFTRDRLHRETGRSQGALHQQRRYDASGR
RSWQSSTFGDGQITRPEDGMLWRAFRYTGRGELAGVSDALRGEVHYGYDAEGRLLQHRELQSGRTGSRLVYDAADNLLGG
QSPHDDPERPPPPPQSSNRLPHWQRLFYRYDVWGNLVSRRHGLNEQHYTYDADNRLIRARGSGPQGEFSAQYHYDALGRR
SRKEVTFAGKAPQTTRFLWQGYRLLQEQRANGTRRTWSYDPESPWTPLAAIEQAGEGPQADIYWLNTDLNGAPLEVTDAD
GRLRWSGQYDTFGRLQGQTTAGAAQRTGPVYDQPLRYAGQYADSETGLHYNLFRYYEPDVGRFTTQDPVGLAGGLNLYAY
APNPYGWVDPLGLTKCSPNKKTTYEGVSRRDALRQAKRDAGIPNNQQPSKIVRPELRDGNGNIMIGKNNQPIRTREYHFV
NKDNKTVLIQEHSLGHQKAVPGHGAEPHFNTRSIDRPDAGNFPETHGHYNFPWSY

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
YpsIP31758_3692 YP_001402646.1 RHS/YD repeat-containing protein Not tested YAPI Protein 0.0 47
api89 CAF28563.1 putative membrane-bound sugar-binding protein Not tested YAPI Protein 0.0 47