Gene Information

Name : EPYR_03396 (EPYR_03396)
Accession : YP_005804147.1
Strain : Erwinia pyrifoliae DSM 12163
Genome accession: NC_017390
Putative virulence/resistance : Unknown
Product : protein rhsB
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3437353 - 3441639 bp
Length : 4287 bp
Strand : +
Note : HNH endonuclease

DNA sequence :
ATGAGTGAAGCCGCACGCGTTGGCGATGCCACCGGCCATTCCTCCGCGCTGGCCGGGATGATCGGCGGTACGATTGTCGG
CGGGCTGATTGCCGCCGCCGGTGCCGTGGCCGCCGGTGCGCTGTTTGTCGCCGGGCTGGCCTCGGCCTGTCTCGGCGTTG
GCGTGCTGCTGATGGGTGCCAGCCTGGCGGTGGGTTATCTCACCGGGGAGGCGGCCACGGCGGCGCGCGACGGCATGGCC
GCCGCCGGGGCAGCCAGCCTGTCCGCTTCGGGGCAGATACTGACCGGCTCGCCGGACGTGTTTATCAACGGCAAACCGGC
GGCCATCGCCACGGTCAGCCAGGCGGGCTGCGATAAGGACGGGCCGTCGATGCAGATGGCGCAAGGCTCCGACCGGGTGT
TTATCAACGGCCAGCCCGCTTCCCGCGTCGGCGACAAAACCAACTGCGGTGCCACGGTGATGGCCGGCTCGCCCAACGTG
CACATCGGCGGCGGCACCGCCACCACGCTGGCGATAAAACCCGAAGTGCCGGAGTGGGCCTNCAAGGCCTCTGACCTGAC
GCTGCTGTTTACCGGGCTGCTCGGCGGTGCCGGCGGCGCGGCCGGTAAGGCTGGCAGGCTGGGTAAACTGCTGAGCAGGC
TGCCCGGCATCAGTAAGCTTGCGCAGGTGGCCTGCCGCTTCGGCACCCTGATGACCGCCAGCGCCGCAGCGGGCATCATC
GCCCGCCCGGTGGATATCATCAGCGGGCAGAANTTTCTCTCCGGCGACGACGAGCTGGACTTCGTGCTGCCCTCACGTCT
GCCGGTCGAATGGCAGCGCTACTGGCGCAGCGGCAACCCGGCGGAAAGCGTGCTGGGGCGCGGCTGGAGCCTGTTCTGGG
AAAGCCGCCTGCAGCATTATGATGACGGCCTGGTGTGGCGCGCGCCGTCCGGTGACTTTGTCCCGTTCCCGATGGTGCCA
CGCGGCCGCAAAAGCTGGTGCGAAGCGGAAAAATGCTGGCTGATGCACAATGCCGACGGCAGCTGGCAGGTGTCCGACGT
CAGTGAACAGGTCTGGCACTATCCGCCGCCCGAGGGTAAGCATCCCGCCCGGCTGCACATGCTGACGGACGCCGGCGGNA
ACGCCACCTCGCTGTTTTACGATGAGCAGGGACGGCTGAGCGAACTGGTGGACAGCGCCGGTCAGCGCCTGAGCTGCCGC
TATCTGACCCGCGCCGCCGGGCATGACCGCCTGAGCGCGGTGCTGCTGCACACCCCGGACGGGGAGTGCACGCTGGTCAG
CTACGATTATGACGACGAGGGGCAGCTTGTCACCGTGCGCAACCGCGCCGGCGAGGTGACGCGCCGCTTCAGCTGGCGCG
ACGGGCTGATGGCCAGCCACGAGGATGCCAACGGGCTGCTGAACGAATATCTGTGGCAGGAGATTGACGGCCTGCCGCGC
GTCACCGGCTGGCGGCACAGCGCCGGGGAAGAGCTGGCGCTGCACTACGACTTTAGCGGCGGCACGCGCCGGGCGGTGCG
CGACGACGGCATGCAGGCGTGGTGGCAGCTGGACGACGACGACAGCGTGGCGCAGTTCACCGACTTTGACGGCCGCCGGC
TGGCGTTTGTCTACGCCCGCGGCGAGCTGTGCAGCGTGCTGCTGCCGGACGGCGGCCAGCGTCAGAGCGAGTGGGACCGC
TACGGGCGACTGCTGAGCGAAACCGACCCGACCGGGCGCAAAACCCTTTACCAGTACCAGCGTAACAGCGACCGGCTGGT
CTGTGTCACCCACCCCGACGGCAGCCGCGAGAGCCGGTCATGGGACCGCCAGGGGCGCCTGATTAAACAGACTGACGCGG
CAGAAAACACCACGCTTTACCACTACCCGGACGAAGAAGAGAGCCTGCCGGCGCGCATCACCGANGCCTCCGGCGGCGTG
GTGCAGCTTGAGTGGAACGGCCGGGGGCTGCTGACGCGCCATACCGACTGTTCCGGCAGCGTCACCGCCTATGGCTATGA
CGTTTTCGGCCAGCTCACCGACCGTACCGATGCGGAAGGCAACGTGACCCGCTACCGCCGGGATGCCGCCGGTCGCCTGC
ACACCCTGCACCACGCGGACGGCAGCGAAGAGCATTTCACCTGGAACGAACGCGGGCAGCTGGTGCGGCATCAGGATCCG
CCCGGCAGCGAGACGCACTGGCGCTACAACCTGCTGGGCCAGCCGGTCAGCATCACCGACCGCATCAACCGCACGCGAAA
CTGGCACTACAACCCGCGCGGCTGGCTGACGCGGCTGGAGAACGGCAACGGCGGCGAGTATCACTTCAGCCACGATGCCG
CCGGGCGCATCACCGCCGAACGGCGTCCGGACAACACCGACCACCTGTACCGCTACGGCCCGGACGGCCAGCTGGCCGAA
CACCGGGAAACCGGCCCGCAGAACAGCCTTGCGCCGCCCGCGCACCGCCTGCACCGCTTCCGCTTTGACGGGGCGGGTCG
CCCGGCATGGCGCGGCAACGACAGCGCCGAATGGCAGTATCACTACGATGCCGCCGGCAGGCTGAGCCGGCTCACGCGTA
CCCCCACCGCCGCCGGGGCGGAGGCGGGGATTGAAGCGGACCGCATTGAGCTGCAGTACGACCGGGCGGGCAACCTGCTG
TGCGAGCGCGGCGTGAACGGCGGGCTGCACTACCAGTGGGACGCGCTGNCTAACCTGCAGGCGCTGACGCTGCCGCAGGG
CGACAGCCTGCAGTGGCTGCACTACGGCTCCGGCCACGTCAGCGCGCTGAAGTTCAACCGGCAGCGGGTCAGTGAATTTA
CCCGTGACCGCCTGCACCGCGAAACCGGGCGCAGCCAGGGCGCGCTGCACCAGCAGCGGCGCTACGATGCGCTGGGCAGG
CGCAGCTGGCAGAGCAGCGCCTTCAGTGACGGGAAGATAACCCGGCCGGAGGACGGTATTCTGTGGCGGGCGTTCCGCTA
TACCGGGCGCGGCGAGCTGGCGGGCGTCAGCGATGCGCTGCGCGGCGAGGTGCACTACGGCTACGACGCCGAAGGCCGGC
TGTTGCAGCACCGCGAGCTGAAGTCCGGCAGGGTTGGCAACCGGCTGCTGTATGACGCCGCCGATAACCTGCTGGGCGGG
CAAAGCCCGCACGACGACCCGGAACAGCCGCCGCCGCCGCCGCTGAGCAGCAACCGCCTGCCGCACTGGCAGCGGCTGTT
CTACCGCTACGACGTCTGGGGCAATCTGGTCAGCCGCCGCCACGGCGTCAACGAACAGCATTACACCTACGACGCCGACA
ACCGCCTGATACGCGCGCGCGGCTTCGGTCCGCAGGGCGAATTCAGCGCGCGGTATCACTATGACGCGCTGGGCAGGCGC
AGCCGCAAGGAGGTCACCTTCGCGGCTAAAGCGCCGCAGACCACGCGCTTCCTGTGGCAGGGCTACCGGCTGCTGCAGGA
GCAGCGCGGCAACGGCACGCGCCGCACCTGGAGCTACGACCCGGCCAGCCCGTGGACGCCGCTGGCGGCCATCGAACAGG
CGGGTGACGCTGAGCAGGCCGATATTTACTGGCTGAACGCCGACCTCAACAGCGCGCCGCTGGAGGTCACCGACGCAGAG
GGCAATCTGCGCTGGTCGGGACACTACGACACCTTCGGCAAACTGCTGGGCCAGACGGTCGCCGGGGCAGCACAGCGCAC
CGGGCCGGTCTATGACCAGCCGCTGCGCTACGCCGGGCAGTACCAGGACAACGAGAGCGGACTGCACTATAATCTGTTCC
GTTACTACGAGCCTGATGTAGGAAGATTCACGACCCAGGACCCGGTGGGGCTGGCGGGAGGGATGAACCTGTATGCTTAT
GCGCCGAATCCGTATGGGTGGGTTGATCCGCTGGGGTTAAGTAAGTGTGCACTGGAAGGAAAATATAAAGAAGTCGATAA
GGCTAATTTACCTGATTGGATTAAAGATTCTTTCAAGAATGGCGAATATAAAACGGTAAGAACAACTGATGAAGTGAATT
TATATCGTGTGTTCGGTGGTAATGCGAAAATAGACGGATCATTTGTTAGTACATCACCAGCGTTGAATAAAATACAAGCC
AAAATTGATTCGGCACTTTTACCAGAATGGAAAAATACGCGACAGTTTGAAGCTACTATTACTGTACCTAAAGGAACAAT
CCTTCAGGTCGGCAAGGTTGAACAGCAAGTTATGCTCTCTGGTGCAAAACTCCAGGGAGGGGCTGACCAAATATTGTTAC
CACATGGCTATCCTACAAGTTGGATAAGTGATGTCAGATTTTTATAA

Protein sequence :
MSEAARVGDATGHSSALAGMIGGTIVGGLIAAAGAVAAGALFVAGLASACLGVGVLLMGASLAVGYLTGEAATAARDGMA
AAGAASLSASGQILTGSPDVFINGKPAAIATVSQAGCDKDGPSMQMAQGSDRVFINGQPASRVGDKTNCGATVMAGSPNV
HIGGGTATTLAIKPEVPEWAXKASDLTLLFTGLLGGAGGAAGKAGRLGKLLSRLPGISKLAQVACRFGTLMTASAAAGII
ARPVDIISGQXFLSGDDELDFVLPSRLPVEWQRYWRSGNPAESVLGRGWSLFWESRLQHYDDGLVWRAPSGDFVPFPMVP
RGRKSWCEAEKCWLMHNADGSWQVSDVSEQVWHYPPPEGKHPARLHMLTDAGGNATSLFYDEQGRLSELVDSAGQRLSCR
YLTRAAGHDRLSAVLLHTPDGECTLVSYDYDDEGQLVTVRNRAGEVTRRFSWRDGLMASHEDANGLLNEYLWQEIDGLPR
VTGWRHSAGEELALHYDFSGGTRRAVRDDGMQAWWQLDDDDSVAQFTDFDGRRLAFVYARGELCSVLLPDGGQRQSEWDR
YGRLLSETDPTGRKTLYQYQRNSDRLVCVTHPDGSRESRSWDRQGRLIKQTDAAENTTLYHYPDEEESLPARITXASGGV
VQLEWNGRGLLTRHTDCSGSVTAYGYDVFGQLTDRTDAEGNVTRYRRDAAGRLHTLHHADGSEEHFTWNERGQLVRHQDP
PGSETHWRYNLLGQPVSITDRINRTRNWHYNPRGWLTRLENGNGGEYHFSHDAAGRITAERRPDNTDHLYRYGPDGQLAE
HRETGPQNSLAPPAHRLHRFRFDGAGRPAWRGNDSAEWQYHYDAAGRLSRLTRTPTAAGAEAGIEADRIELQYDRAGNLL
CERGVNGGLHYQWDALXNLQALTLPQGDSLQWLHYGSGHVSALKFNRQRVSEFTRDRLHRETGRSQGALHQQRRYDALGR
RSWQSSAFSDGKITRPEDGILWRAFRYTGRGELAGVSDALRGEVHYGYDAEGRLLQHRELKSGRVGNRLLYDAADNLLGG
QSPHDDPEQPPPPPLSSNRLPHWQRLFYRYDVWGNLVSRRHGVNEQHYTYDADNRLIRARGFGPQGEFSARYHYDALGRR
SRKEVTFAAKAPQTTRFLWQGYRLLQEQRGNGTRRTWSYDPASPWTPLAAIEQAGDAEQADIYWLNADLNSAPLEVTDAE
GNLRWSGHYDTFGKLLGQTVAGAAQRTGPVYDQPLRYAGQYQDNESGLHYNLFRYYEPDVGRFTTQDPVGLAGGMNLYAY
APNPYGWVDPLGLSKCALEGKYKEVDKANLPDWIKDSFKNGEYKTVRTTDEVNLYRVFGGNAKIDGSFVSTSPALNKIQA
KIDSALLPEWKNTRQFEATITVPKGTILQVGKVEQQVMLSGAKLQGGADQILLPHGYPTSWISDVRFL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
YpsIP31758_3692 YP_001402646.1 RHS/YD repeat-containing protein Not tested YAPI Protein 0.0 47
api89 CAF28563.1 putative membrane-bound sugar-binding protein Not tested YAPI Protein 0.0 47