Gene Information

Name : EpC_11140 (EpC_11140)
Accession : YP_002648139.1
Strain : Erwinia pyrifoliae Ep1/96
Genome accession: NC_012214
Putative virulence/resistance : Unknown
Product : Rhs family protein
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3209
EC number : -
Position : 1272737 - 1277023 bp
Length : 4287 bp
Strand : -
Note : silverDB:cEP01115

DNA sequence :
ATGAGTGAAGCCGCACGCGTTGGCGATGCCACCGGCCATTCCTCCGCGCTGGCCGGGATGACTGGCGGTACCATAGTCGG
CGGGCTGATTGCCGCCGCCGGTGCCGTGGCCGCCGGTGCGCTGTTTGTCGCCGGGCTGGCCTCGGCCTGTCTCGGCGTTG
GCGTGCTGCTGATGGGTGCCAGCCTGGCGGTGGGTTATCTCACCGGGGAGGCGGCCACGGCGGCGCGCGACGGCATGGCC
GCCGCCGGGGCAGCCAGCCTGTCCGCTTCGGGGCAGATACTGACCGGCTCGCCGGACGTGTTGATCAACGGTAAACCGGC
GGCCATCGCCACGGTCAGCCAGGCAGGTTGCGATAAGGACGGGCCGTCGATGCAGATGGCGCAAGGCTCCGACCGGGTGT
TTATCAACGGCCAGCCCGCTTCCCGCGTCGGCGACAAAACCAACTGCGGTGCCACGGTGATGGCCGGCTCGCCCAGCGTG
CGCATCGGCGGCGGCACCGCCACCACGCTGGCGATAAAACCCGAAGTGCCGGAGTGGGCCTGCAAGGCCTCTGACCTGAC
GCTGCTGTTTACCGGGCTGCTCGGCGGTGCCGGCGGCGCGGCCGGTAAGGCTGGCAGGCTGGGTAAACTGCTGAGCAGGC
TGCCCGGCATCAGTAAGCTTGCGCAGGTGGCCTGCCGCTTCGGCACCCTGATGACCGCCAGCGCCGCAGCGGGCATCATC
GCCCGCCCGGTGGATATCATCAGCGGGCAGAAATTTCTCTCCGGCGACGACGAGCTGGACTTCGTGCTGCCCTCACGTCT
GCCGGTCGAATGGCAGCGCTACTGGCGCAGCGGCAACCCGGCGGAAAGCGTGCTGGGGCGCGGCTGGAGCCTGTTCTGGG
AAAGCCGCCTGCAGCATTATGATGACGGCCTGGTGTGGCGCGCGCCGTCCGGTGACTTTGTCCCGTTCCCGATGGTGCCA
CGCGGCCGCAAAAGCTGGTGCGAAGCGGAAAAATGCTGGCTGATGCACAATGCCGACGGCAGCTGGCAGGTGTCCGACGT
CAGTGAACAGGTCTGGCACTATCCGCCGCCCGAGGGTAAGCATCCCGCCCGGCTGAACATGCTGACGGACGCCGGCGGTA
ACGCCACCTCGCTGTTTTACGATGAGCAGGGACGGCTGAGCGAACTGGTGGACAGCGCCGGTCAGCGCCTGAGCTGCCGC
TATCTGACCCGCGCCGCCGGGCATGACCGCCTGAGCGCGGTGCTGCTGCACACCCCGGACGGGGAGCGCACGCTGGTCAG
CTACGATTATGACGACGAGGGGCAGCTTGTCACCGTGCGCAACCGCGCCGGCGAGGTGACGCGCCGCTTCAGCTGGCGCG
ACGGGCTGATGGCCAGCCACCAGGACGCCAACGGGCTGCTGAACGAATATCTGTGGCAGGAGATTGACGGCCTGCCGCGC
GTCACCGGCTGGCGGCACAGCGCCGGGGAAGAGCTGGCGCTGCACTACGACTTTAGCGGCGGCACGCGCCGGGCGGTGCG
CGACGACGGCATGCAGGCGTGGTGGCAGCTGGACGACGACGACAGCGTGGCGCAGTTCACCGACTTTGACGGCCGCCGGC
TGGCGTTTGTCTGCGCACGCGGCGAGCTGTGCAGCGTGCTGCTGCCGGACGGCGGCCAGCGTCAGAGCGAGTGGGACCGC
TACGGGCGACTGCTGAGCGAAACCGACCCGACCGGGCGCAAAACCCTTTACCAGTACCAGCGTAACAGCGACCGGCTGGT
CTGTGTCACCCACCCCGACGGCAGCCGCGAGAGCCGGTCATGGGACCGCCAGGGGCGCCTGATTAAACAGACTGACGCGG
CAGAAAACACCACGCTTTACCACTACCCGGACGAAGAAGAGAGCCTGCCGGCGCGCATCACCGACGCCTCCGGCGGCGTG
GTGCAGCTTGAGTGGAACGGCCGGGGGCTGCTGACGCGCCATACCGACTGTTCCGGCAGCGTCACCGCCTATGGCTATGA
CGTTTTCGGCCAGCTCACCGACCGTACCGATGCGGAAGGCAACGTGACCCGCTACCGCCGGGATGCCGCCGGTCGCCTGC
ACACCCTGCACCACGCGGACGGCAGCGAAGAGCATTTCACCTGGAACGAACGCGGGCAGCTGGTGCGGCATCAGGACCCG
CCCGGCAGCGAGACGCACTGGCGCTACAACCTGCTGGGCCAGCCGGTCAGCATCACCGACCGCATCAACCGCACGCGAAA
CTGGCACTACAACCCGCGCGGCTGGCTGACGCGGCTGGAGAACGGCAACGGCGGCGAGTATCAGTTCAGCCACGATGCCG
CCGGGCGCATCACCGCCGAACGGCGTCCGGACAACACCGACCACCTGTACCGCTACGGCCCGGACGGCCAGCTGGCCGAA
CACCGGGAAACCGGCCCGCAGAACAGCCTTGCGCCGCCCGCGCACCGCCTGCACCGCTTCCGCTTTGACGGGGCGGGTCG
CCCGGCATGGCGCGGCAACGACAGCGCCGAATGGCAGTATCACTACGATGCCGCCGGCAGGCTGAGCCTGCTCACGCGTA
CCCCCACCGCCGCCGGGGCGGAGGCGGGGATTGAAGCGGACCGCATTGAGCTGCAGTACGACCGGGCGGGCAACCTGCTG
TGCGAGCGCGGCGTGAACGGCGGGCTGCACTACCAGTGGGACGCGCTGGCTAACCTGCAGGCGCTGACGCTGCCGCAGGG
CGACAGCCTGCAGTGGCTGCACTACGGCTCCGGCCACGTCAGCGCGCTGAAGTTCAACCGGCAGCGGGTCAGTGAATTTA
CCCGTGACCGCCTGCACCGCGAAACCGGGCGCAGCCAGGGCGCGCTGCACCAGCAGCGGCGCTACGATGCGCTGGGCAGG
CGCAGCTGGCAGAGCAGCGCCTTCAGTGACGGGAAGATAACCCGGCCGGAGGACGGTATTCTGTGGCGGGCATTCCGCTA
TACCGGGCGCGGCGAGCTGGCGGGCATCAGCGATGCGCTGCGCGGCGAGGTGCACTACGGCTACGACGCCGAAGGCCGCC
TGTTGCAGCACCGCGAGCTGAAGTCCGGCAGGGTTGGCAACCGGCTGCTGTATGACGCCGCCGATAACCTGCTGGGCGGG
CAAAGCCCGCACGACGACCCGGCACAGCCGCCGCCGCCGCCGCTGAGCAGCAACCGCCTGCCGCACTGGCAGCGGTTGTT
CTACCGCTACGACGTCTGGGGCAATCTGGTCAGCCGCCGCCACGGCGTCAACGAACAGCATTACACCTACGACGCCGACA
ACCGCCTGATACACGCGCGCGGCTTCGGTCCGCAGGGCGAATTCAGCGCGCGGTATCACTATGACGCGCTGGGCAGGCGC
AGCCGCAAGGAGGTCACCTTCGCGGCTAAAGCGCCGCAGACCACGCGCTTCCTGTGGCAGGGCTACCGGCTGCTGCAGGA
GCAGCGCGGCAACGGCACGCGCCGCACCTGGAGCTATGACCCGGCCAGCCCGTGGACGCCGCTGGCGGCCATCGAACAGG
CGGGTGACGCTGAGCAGGCCGATATTTACTGGCTGAACGCCGACCTCAACAGCGCGCCGCTGGAGGTCACCGACGCAGGG
GGTAATCTGCGCTGGTCGGGACAGTACGACACCTTCGGCAAACTGCTGGGCCAGACGGTCGCCGGGGCAGCACAGCGCAC
CGGGCCGGTCTACGACCAGCCGCTGCGCTACGCCGGGCAGTACCAGGACAACGAGAGCGGACTGCACTATAATCTGTTCC
GTTTTTACGAGCCTGATGTAGGAAGATTCACGACCCAGGACCCGGTGGGGCTGGCGGGAGGGATGAACCTGTATGCTTAT
GCGCCGAATCCGTATGGGTGGGTTGATCCGCTGGGGTTAAGTAAGTGTGCACTGGAAGGAAAATATAAAGAAGTCGATAA
GGCTAATTTACCTGATTGGATTAAAGATTCTTTCAAGAATGGCGAATATAAAACGGTAAGAACAACTGATGAAGTGAATT
TATATCGTGTGTTCGGTGGTAATGCGAAAATAGACGGATCATTTGTTAGTACATCACCAGCGTTGAATAAAATACAAGCC
AAAATTGATTCGGCACTTTTACCAGAATGGAAAAATACGCGACAGTTTGAAGCTACTATTACTGTACCTAAAGGAACAAT
CCTTCAGGTCGGCAAGGTTGAACAGCAAGTTATGCTCTCTGGTGCAAAACTCCAGGGAGGGGCTGACCAAATATTGTTAC
CACATGGCTATCCTACAAGTTGGATAAGTGATGTCAGATTTTTATAA

Protein sequence :
MSEAARVGDATGHSSALAGMTGGTIVGGLIAAAGAVAAGALFVAGLASACLGVGVLLMGASLAVGYLTGEAATAARDGMA
AAGAASLSASGQILTGSPDVLINGKPAAIATVSQAGCDKDGPSMQMAQGSDRVFINGQPASRVGDKTNCGATVMAGSPSV
RIGGGTATTLAIKPEVPEWACKASDLTLLFTGLLGGAGGAAGKAGRLGKLLSRLPGISKLAQVACRFGTLMTASAAAGII
ARPVDIISGQKFLSGDDELDFVLPSRLPVEWQRYWRSGNPAESVLGRGWSLFWESRLQHYDDGLVWRAPSGDFVPFPMVP
RGRKSWCEAEKCWLMHNADGSWQVSDVSEQVWHYPPPEGKHPARLNMLTDAGGNATSLFYDEQGRLSELVDSAGQRLSCR
YLTRAAGHDRLSAVLLHTPDGERTLVSYDYDDEGQLVTVRNRAGEVTRRFSWRDGLMASHQDANGLLNEYLWQEIDGLPR
VTGWRHSAGEELALHYDFSGGTRRAVRDDGMQAWWQLDDDDSVAQFTDFDGRRLAFVCARGELCSVLLPDGGQRQSEWDR
YGRLLSETDPTGRKTLYQYQRNSDRLVCVTHPDGSRESRSWDRQGRLIKQTDAAENTTLYHYPDEEESLPARITDASGGV
VQLEWNGRGLLTRHTDCSGSVTAYGYDVFGQLTDRTDAEGNVTRYRRDAAGRLHTLHHADGSEEHFTWNERGQLVRHQDP
PGSETHWRYNLLGQPVSITDRINRTRNWHYNPRGWLTRLENGNGGEYQFSHDAAGRITAERRPDNTDHLYRYGPDGQLAE
HRETGPQNSLAPPAHRLHRFRFDGAGRPAWRGNDSAEWQYHYDAAGRLSLLTRTPTAAGAEAGIEADRIELQYDRAGNLL
CERGVNGGLHYQWDALANLQALTLPQGDSLQWLHYGSGHVSALKFNRQRVSEFTRDRLHRETGRSQGALHQQRRYDALGR
RSWQSSAFSDGKITRPEDGILWRAFRYTGRGELAGISDALRGEVHYGYDAEGRLLQHRELKSGRVGNRLLYDAADNLLGG
QSPHDDPAQPPPPPLSSNRLPHWQRLFYRYDVWGNLVSRRHGVNEQHYTYDADNRLIHARGFGPQGEFSARYHYDALGRR
SRKEVTFAAKAPQTTRFLWQGYRLLQEQRGNGTRRTWSYDPASPWTPLAAIEQAGDAEQADIYWLNADLNSAPLEVTDAG
GNLRWSGQYDTFGKLLGQTVAGAAQRTGPVYDQPLRYAGQYQDNESGLHYNLFRFYEPDVGRFTTQDPVGLAGGMNLYAY
APNPYGWVDPLGLSKCALEGKYKEVDKANLPDWIKDSFKNGEYKTVRTTDEVNLYRVFGGNAKIDGSFVSTSPALNKIQA
KIDSALLPEWKNTRQFEATITVPKGTILQVGKVEQQVMLSGAKLQGGADQILLPHGYPTSWISDVRFL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
YpsIP31758_3692 YP_001402646.1 RHS/YD repeat-containing protein Not tested YAPI Protein 0.0 47
api89 CAF28563.1 putative membrane-bound sugar-binding protein Not tested YAPI Protein 0.0 46