Gene Information

Name : EAMY_2522 (EAMY_2522)
Accession : YP_003531877.1
Strain : Erwinia amylovora CFBP1430
Genome accession: NC_013961
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3209
EC number : -
Position : 2604838 - 2609118 bp
Length : 4281 bp
Strand : +
Note : HNH endonuclease
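
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration. It assumes the coordinates are 1-based and inclusive, which the stated length is consistent with; the variable names are hypothetical and not part of the record.

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
start, end = 2604838, 2609118

length_bp = end - start + 1               # inclusive span in base pairs
codons, remainder = divmod(length_bp, 3)  # whole codons and leftover bases

print(length_bp)   # 4281, matching the Length field
print(remainder)   # 0, so the span is a whole number of codons
print(codons - 1)  # residues expected if the final codon is a stop
```

Under these assumptions the reading frame is complete, and the residue count can be compared against the protein sequence listed further down.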

DNA sequence :
ATGAGTGAAGCCGCACGCGTCGGCGACGCCATCGGCCATTCCTCCGCGCTGGCCGGGATGACGGGCGGCACCATAGTCGG
CGGGCTGATTGCCGCCGCCGGCGCCGTGGCCGCCGGGGCGCTGTTTGTCGCCGGGCTGGCCGCCTCCTGTCTTGGCGTTG
GCGTCTTGCTGATGGGCGCCAGCCTGGCGGTGGGCTATCTCACCGGGGAGGCGGCCACCGCGGCGCGCGACGGCATGGCC
GCTGCCGGGGCGGACAGACGGTCCGCTTCCGGCCAGATACTGACCGGCTCACCGAACGTGTTTATCAACGGCAAACCGGC
GGCCATCGCCACCGTCAGCCAGGCGGGCTGTGACCGGGACGGGCCGACGATGCAGATGGCGCAGGGCTCCGCCCGGGTGT
TTATCAACGGCCAGCCCGCCGCGCGCGTCGGCGACAAAACCAACTGCGGTGCCACGGTGATGGCAGGCTCGCCCAGCGTG
CGCATCGGCGGCGGCACCGCCACCACGCTGACGATAAAACCCGAAGTGCCGGACCGGGCCTATAAGGCCTCGGACCTGAC
GCTGCTGTTTGCCGGGCTGCTTGGCGGCGCGGGCGGCGCGGCCGGCAAGGCGGGCAAACTGGCTGAACTGCTGAGCAGGC
TGCCCGGCATCAACAGGCTTGGCCAGGTGGCCTGCCGCTTCGGCGTGCTGATGACCGCCAGCGCGGCGGCGGGCATCATC
GCCCGCCCGGTGGATATCATCAGCGGGCAGAAGTTTCTCTCCGGCGACGACGAGCTGGACTTTGTGCTGCCGTCGCGCCT
GCCGGTTGAATGGCAGCGCTACTGGCGCAGCGGCAACCCGGCGGAAAGCGTGCTGGGGCGCGGCTGGAGCCTGTTCTGGG
AAAGTACCCTCCAGCCTTACGCCGACGGGCTGGTGTGGCGCGCGCCGTCCGGCGACCTGGTTTCGTTCCCGATGGTGCCG
CGCGGCCATAAAACCTGGTGTGAAGCCGAAAAGTGCTGGCTGATGCACAACGCCGACGACAGCTGGCAGCTGTTCGACGT
CAGTGAACAGGCCTGGCACTATCCGCCGCTGGACGCGCAGTATCCCGCCCGCCTGAGCATGGTGACCGACGCCGGCGGCA
ACGCCACCTCGCTGTTTTACGACGAGCAGGGGCGGCCGGGCGAACTGGTGGACAGCGCCGGCCAGCGCCTGAGCTGCCGC
TACCTGACGACCGCCGGCGGGCATTGCCGCCTGAGCGCGGTGCTGCTGCATACCGCGGACGGGGAGCACACGCTGGTCAG
CTACGGGTATGACGACGACGGGCAGCTCGCCAGCGTGCGCAACCGCGCCGGCGAGGTCACGCGCCGCTTCACCTGGCATG
ACGGGCTGATGGCCAGCCACGAGGATGCCAACGGGCTGCGGAACGAATACCGCTGGCAGGAGATTGACGGCCTGCCGCGC
GTCACCGCCTGGCGGCACGGCGCCGGGGAAGCGCTGGCGCTGCACTACGACATTAACGGCGGCACGCGCCGGGCGGTGCG
CGACGACGGCATGCAGGCGTGCTGGCAGCTGGACGACGACGACAGCGTGGCGCAGTTCACCGACTTTGACGGCCGCAGGC
TGGCGTTTATCTACGCGCGCGGCGAGCTGTGCAGCGTGCTGCTGCCGGGCGGCGGCCAGCGGCACAGCGAGTGGGACCGC
TACGGGCGACTGCTGAGCGAAACCGACCCGTCAGGGCGTAAAACCACCTGTCAGTATGCGCGTAACAGCGACCGTCTGGT
TTCGGTCACCCATCCCGACGGCAGCCGTGAGTGCCAGTCATGGGATGACAGGGGGCGGCTGATTACACAGAGCGACGCGC
TGGGAAACACCACGCTTTACCACTACCCGGACGGGGAAGAAAGCTTACCGGCGCGCATCACCGATGCCCTCGGCGGCGTG
GCGCGGCTTGAGTGGGACGGCCGGGGGCTGCTGACGCGCTATACCGACTGTTCCGGCAGCGTCACCGCGTACGACTATGA
CATTTTCGGCCAGCTCACCGGGCGCACCGATGCGGAAGGCAACGTGACCCGCTACCGCCGGGATACCGCCGGTCGCCTGC
AAACCCTGCAGCACGCGGACGGCAGCGAAGAGCACTTCGTCTGGAACGAACGCGGGCAGCTGGCGCGCCATCAGGACCCG
TCCGGCAGCGAAACGCAGTGGCGCTACAACCTGCTGGGCCAGCCGGTCAGCGTCACCGACCGCATCAACCGCACGCGCCA
CTACCACTACGGCCCGCGCGGCTGGCTGACGCGGCTGGAGAACGGCAACGGCGGCGAGTATCAGTTCAGCTACGATGCTG
CCGGGCGCATCACCGCCGAACGCCGCCCGGACAACACCGACCACCTCTATCGCTACGGCGCGGACGGCCAGCTTGCCGAA
CACCGGGAAACCGGCCCGCAGAACAGCCTTGCGCCGCCCGCGCACCGCCTGCACCGCTTCCGCTTTGACGAGGCGGGCCG
CCTGGCGTGGCGCGGCAACGACAGCGCCGAATGGCAGTATCACTACGATGCCGCAGGCAGGCTGACCCGGCTTGTGCGTA
CCCCCACGGCCGCCGGGGCGGAGCTGGGGATTGAGGCGGACAGCGTTGAGCTGCAGTACGACAAAGCGGGTCACCTGCTG
TGCGAGCGCGGCGTGAACGGCGCGCCGGTCTACAGCCGGGACGCGCTCGGCAACCTGCAGGCGCTGACGCTGCCGCAGGG
CGACCGCCTGCAGTGGCTGCACTACGGCTCCGGCCATGCCGGCGCGCTGAAATTCAACCGGCAGGCGGTGAGCGAATTCA
CCCGTGACCGCCTGCACCGTGAAACCGGGCGCAGCCAGGGCGCGCTGCACCAGCAGCGCCGCTACGATGCGTCCGGCAGG
CGCAGCTGGCAGAGCAGCACTTTCGGTGACGGCCAGATAACCCGGCCGGAAGACGGCATGCTGTGGCGGGCGTTCCGCTA
CACCGGGCGCGGCGAGCTGGCGGGCGTCAGCGACGCGCTGCGCGGCGAAGTGCACTACGGCTACGACGCCGAAGGCCGCC
TGCTGCAGCACCGCGAGCTGCAGTCCGGCAGGACGGGCAGCCGGCTGGTGTATGACGCCGCCGACAACCTGCTGGGCGGG
CAAAGCCCGCACGACGACCCGGAACGGCCGCCGCCGCCGCCGCAGAGCAGCAACCGTCTGCCGCACTGGCAGCGGCTGTT
CTACCGCTACGACGTCTGGGGCAACCTGGTCAGCCGCCGCCACGGCCTCAACGAGCAGCATTACACTTACGACGCCGACA
ACCGCCTGATACGGGCGCGTGGCTCCGGTCCTCAGGGCGAGTTCAGCGCGCATTACCATTATGACGCGCTGGGCCGGCGC
AGCCGCAAGGAGGTCACCTTCGCGGGCAAAGCCCCGCAGACCACGCGCTTCCTGTGGCAGGGCTACCGGCTGCTGCAGGA
GCAGCGCGCCAACGGCACGCGGCGTACCTGGAGCTATGACCCGGAAAGCCCGTGGACGCCGCTGGCGGCCATCGAGCAGG
CCGGAGAAGGGCCACAGGCGGATATTTACTGGCTGAACACCGACCTCAACAGCGCGCCGCTGGAGGTGACCGACGCCGAT
GGCAGGCTGCGCTGGTCGGGACAGTACGACACCTTCGGCAGGCTGCAGGGCCAGACGACGGCCGGTGCGGCACAACGCAC
GGGGCCGGTTTACGACCAGCCGCTGCGCTACGCCGGGCAGTATGCTGACAGTGAAACGGGACTGCACTATAATCTGTTCC
GTTACTACGAGCCTGACGTTGGCAGGTTCACGACCCAGGACCCTGTGGGGCTGGCGGGGGGCCTGAACCTGTATGCGTAT
GCGCCGAATCCGTACGGGTGGGTGGATCCGCTGGGGTTGAGCAGATGTAAACCTGGCACTGCTTCAGGTGAAGGTAGCAA
GATTGAAGGGAAATGGTTGAGAGGGACTCATGGTAATGCTGGCTTATTCCCATCCTCAGTAGCTGATAAGCTACGCGGTA
GGCAATTTAAATCGTTTGACGATTTCAGGGAAAATGTCTGGAAAGAGGTAGGAAATGATTCTCATCTTTCTCAACAATTT
AGACCTTCAAATATTACCAGGATGAAAAGTGGTAAAGCCCCAATTTCCCACAACTCTCAATGGAATGGGAAAAATAAATC
TTATGTTCTTCATCACCGTACACCGATTCAACATGGTGGTGGTATCTATGATGTTGATAATCTAATAGTTGTTACTCCGA
GATATCATCTTGATGTGTTGGATCGGGCCTATCATTTTTGA

Protein sequence :
MSEAARVGDAIGHSSALAGMTGGTIVGGLIAAAGAVAAGALFVAGLAASCLGVGVLLMGASLAVGYLTGEAATAARDGMA
AAGADRRSASGQILTGSPNVFINGKPAAIATVSQAGCDRDGPTMQMAQGSARVFINGQPAARVGDKTNCGATVMAGSPSV
RIGGGTATTLTIKPEVPDRAYKASDLTLLFAGLLGGAGGAAGKAGKLAELLSRLPGINRLGQVACRFGVLMTASAAAGII
ARPVDIISGQKFLSGDDELDFVLPSRLPVEWQRYWRSGNPAESVLGRGWSLFWESTLQPYADGLVWRAPSGDLVSFPMVP
RGHKTWCEAEKCWLMHNADDSWQLFDVSEQAWHYPPLDAQYPARLSMVTDAGGNATSLFYDEQGRPGELVDSAGQRLSCR
YLTTAGGHCRLSAVLLHTADGEHTLVSYGYDDDGQLASVRNRAGEVTRRFTWHDGLMASHEDANGLRNEYRWQEIDGLPR
VTAWRHGAGEALALHYDINGGTRRAVRDDGMQACWQLDDDDSVAQFTDFDGRRLAFIYARGELCSVLLPGGGQRHSEWDR
YGRLLSETDPSGRKTTCQYARNSDRLVSVTHPDGSRECQSWDDRGRLITQSDALGNTTLYHYPDGEESLPARITDALGGV
ARLEWDGRGLLTRYTDCSGSVTAYDYDIFGQLTGRTDAEGNVTRYRRDTAGRLQTLQHADGSEEHFVWNERGQLARHQDP
SGSETQWRYNLLGQPVSVTDRINRTRHYHYGPRGWLTRLENGNGGEYQFSYDAAGRITAERRPDNTDHLYRYGADGQLAE
HRETGPQNSLAPPAHRLHRFRFDEAGRLAWRGNDSAEWQYHYDAAGRLTRLVRTPTAAGAELGIEADSVELQYDKAGHLL
CERGVNGAPVYSRDALGNLQALTLPQGDRLQWLHYGSGHAGALKFNRQAVSEFTRDRLHRETGRSQGALHQQRRYDASGR
RSWQSSTFGDGQITRPEDGMLWRAFRYTGRGELAGVSDALRGEVHYGYDAEGRLLQHRELQSGRTGSRLVYDAADNLLGG
QSPHDDPERPPPPPQSSNRLPHWQRLFYRYDVWGNLVSRRHGLNEQHYTYDADNRLIRARGSGPQGEFSAHYHYDALGRR
SRKEVTFAGKAPQTTRFLWQGYRLLQEQRANGTRRTWSYDPESPWTPLAAIEQAGEGPQADIYWLNTDLNSAPLEVTDAD
GRLRWSGQYDTFGRLQGQTTAGAAQRTGPVYDQPLRYAGQYADSETGLHYNLFRYYEPDVGRFTTQDPVGLAGGLNLYAY
APNPYGWVDPLGLSRCKPGTASGEGSKIEGKWLRGTHGNAGLFPSSVADKLRGRQFKSFDDFRENVWKEVGNDSHLSQQF
RPSNITRMKSGKAPISHNSQWNGKNKSYVLHHRTPIQHGGGIYDVDNLIVVTPRYHLDVLDRAYHF
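
As a quick consistency check between the two sequences, the first and last codons of the DNA sequence can be translated and compared with the two ends of the protein sequence. This is a minimal sketch that assumes the standard codon assignments; the helper name `translate` and the two short fragments are chosen for illustration only, and a full check would run over the complete sequence.

```python
from itertools import product

# Standard codon table built in the conventional TCAG order (assumption:
# the record uses the standard codon assignments).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 nucleotides of the DNA sequence in this record.
head = "ATGAGTGAAGCCGCACGCGTCGGCGACGCC"
tail = "GATCGGGCCTATCATTTTTGA"

print(translate(head))  # MSEAARVGDA, the start of the protein sequence
print(translate(tail))  # DRAYHF, the end of the protein sequence
```

Both fragments agree with the listed protein sequence, and the tail ends in the stop codon TGA.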

Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
YpsIP31758_3692 | YP_001402646.1 | RHS/YD repeat-containing protein | Not tested | YAPI | Protein | 0.0 | 47
api89 | CAF28563.1 | putative membrane-bound sugar-binding protein | Not tested | YAPI | Protein | 0.0 | 47