Gene Information

Name : rhsD (PAJ_0397)
Accession : YP_005933273.1
Strain : Pantoea ananatis AJ13355
Genome accession : NC_017531
Putative virulence/resistance : Unknown
Product : protein rhsD precursor RhsD
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 488026 - 492213 bp
Length : 4188 bp
Strand : -
Note : similar to Erwinia tasmaniensis Et1/99, putative membrane-bound sugar-binding protein (NCBI: YP_001908915.1); COG: cell wall/envelope/membrane biogenesis; subcellular localization as predicted by Psort 2.0: inner membrane
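
The Position and Length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon of the gene is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates as listed in the Position field (assumed 1-based, inclusive).
start, end = 488026, 492213

# Inclusive interval length in base pairs.
length_bp = end - start + 1

# Number of codons, and residues once the terminal stop codon is excluded
# (assumption: the coding sequence ends in a stop codon).
codons = length_bp // 3
protein_len = codons - 1

print(length_bp, codons, protein_len)
```

Under these assumptions the computed length agrees with the Length field (4188 bp), and the residue count agrees with the protein sequence listed in the record.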

DNA sequence :
ATGTTTGAAGCTGCACGCCTCGGCGATGATATCGGTCACTCTCATGCCCTGGCGGGCATGATTGCAGGCACTATCGTCGG
TGGACTGATCGCTGCGGCGGGCGGTATTGCTGCGGGTGCATTGATGATTGCCGGGATAGGCGCCTCCTGCCTTGGCGTGG
GCGTATTGCTGGTTGGACTCAGTATCGGCGTGGGTTACCTTACTGGCGAACTGGCCACAGCCGCGCGAGACGGAATAGCG
GATGCTGGGGCGGCCAGCATGACCCCAAAGGGCAAGATTACCACCGGTTCGCCCAACGTCTTTATTAACAGTAAACCCGC
CGCCATGGCCACCAACAGCATAGTGGCCTGCAAGGACGATGGTCCGCAGCAGATGGCGGAAGGCTCCTCCCGGGTGTATA
TCAACGGCCTGCCTGCATCACGCATCGACGATCGCACCACCTGCGATGCCAAAGTGATGACGGGCTCTGACAACGTCTAT
ATCGGCGGCGAGCCTGAACAAACGCTGCCTATCCAACCTGAAGTGCCCGAATGGGTTTATAAAGCCTCCGATCTTACCCT
GCTGTTTGCCGGGCTGGCCGGTGGCGTGGGCGGTGCGGCTGGCAAGGTCGGCGCGCTGGGTAAATTACTGAGCAAGATTC
CCGGTATCAATAAAATCGCCCGGGTGGCCTGCCGTGCTGGTGCCTTGATGACCGGTGTGGCGGCCGCAGGTATTCTTGCC
CGTCCCGTTGATATCGTAAGCGGCCAGAAGTTTCTTAGCGGCGATGATGAACTGGATTTTGTGCTGCCCTCCCGGCTGCC
GCTACGCTGGCAGCGCTACTGGCGCAGCGGTAATCCCGGCGACAGCGTATTAGGCCGTGGCTGGAACCTGTTCTGGGAAA
CACGGCTGGAACGTTACCAGGATGGGCTGGTCTGGCGTGCGTCCTCCGGCGATTACGTCTCTTTTCCTCTGGTGCCAAAA
GGCCAGCGTACCTACTGCGAAGCGGAGAAACGCTGGCTGGAACATCATCGGGACGACAGTTGGTCATTGTATGACATCAG
CGGTGAGCGCTGGCATTTTATGCCCCTGCGCGATGATGCCCCTTCTCTTCCCCTCTGCCTGACCGAGCCCTGCGGCAATG
AGATTCAGTTCGACTGGAACCCCGATCACACATTGGCTGCGCTGACAGACAGTGCCGGTCAGCGCGTAACCTGCCGCTAT
GCGAATAACCGGCTCGCAGGTGCCTGGCTGGATGACGATATTTGTCTGGTCAGTTACGCGTATGATGATATCGGCCAACT
CGTTACGGTAACGGGCCGGGGCGGTAGCGTGCGCCGACGCTTCCAGTGGTGCGATGGCCTGATGGTCGCCCATGAAGATA
TGAATGGCCTGCTAAGTGAGTATCGCTGGCAGGAGATCGATGGCCTCCCAAGAGTGGTGGCCTTCCGTCACAGCGGCGGC
GAGCAGCTGGATTTTGAGTACGATTTTGATAATGGCATCCGACGCGCGCGACGTGATGACGGCGTTGAAGCTCACTGGCT
CATTGACGATGACGATCATGTCGCCCGCTTTACCGACTTCGACGGCCGCCAGACCATGCTGGTTTACCGCGCAGGCGAGT
TGTGCGATGTGATTATGCCCGGCGGTGCCATGCGCTGCAGCAACTGGGACCGCTATGGTCGCATGACGCAGGAGATCGAT
CCGGTCGGACGTCGAACCACTTATCACTGGTTCCGCATGACTGACCGTGTGATCCGTACGGACTATCCCGATCTGAGCGC
GACTCAGGCGGCTTACGATCTTAATGTTCGCCTGCTGACTGAAACCGATGCGCTCAATAACGTCACCACCTATCACTACC
CTGATGACACAGAACTGCTGCCAGACAGCATCACCGATGCAACGGGCGGCGTGGTGAAGCTGGAATGGAACCGACAAGGG
CTGCTGACGCAGCGCACCGACTGCTCTGGCAGCGTAACGACGTTTAGCTACGACCGATTTGGCCAGCTTGTTCGCAGCGT
GGATGCTGAAGGTCATGTCACCCAGCGCGAGTGGAATGACAAGGGACAACTCTGCGCCATTATCCATCCAGACGGTAGCC
GGGAAACCCTACACTGGAACACCCAGGGGCAGCTCAGTGCGTGGCGCGATCCGCTGGAAACCGAAGTCCGCTGGACCTAT
AACGCACTCGGATTACCGGTGAGCCTGACGGACCGCATTGGTCGCACGCGTCGCTGGCATTACGATGCCCGTGGTAATCT
GCTGCGGCTGGAAAACGGCAACGGCGGTGACTATCGCTTTACCTGCGATCCGCTCGGCCGTCCATTGAGTGAGATCCGGC
CCGATAAGACCTCACGCAATATGGAATGGAATGCTCGTGGCTTTCTCATTGGTTTACAGGAAAATGGCCAGCCCGCTAAT
GACGGTGGCATAGCTCGACGCTGGCAGCGCTTTAGCTACGACGACAGCGGCTTGATTACCCAGCGCACAACGCAGCATGC
TGAATACCACTACCGGCGTCATCGTAGCGGTCAACTTGCCAGCCTGGTTCGCACGCCGACCAGCGACGGAATGGCGCTTG
GTATCGAAGACGATGAAATCGCCTTTACCTATGACGTCGCAGGTCAGCTCTTAACAGAAGCTGGTATAAACGGGAAACTG
GATTACGAATGGGATGCGCTGGGCAACCTCACCCATCTGACCCTGCCGGGTGAACAGCAACTTGCCTGGCTGCACTACGG
TTCGGGTCACGTCAGCGCGATTCGCTTTAATCAGCAACTGGTCAGTGAATTTACCCGCGACCGCCTGCACCGGGAAATTC
GCCGCAGTCAGGGCGCGCGTGAGCAGGCGCGTCAGTACGACAGCCTTGGCAGGCGCACGATGCAGCGCAGTGAATTGCAT
AGCGAGGTGGTGCTACCGGAAAAAGCCATTCTGGAACGCGCTTTCCGCTATTCGGCGCGCAGTGAACTGGAGTCTGTCAG
TGATACGCTGCGCGGCGACATTATCTATGGTTACGATGATGAAGGCCGCCTGCTGAAGCACTACGAAGCCAGACAGGGTC
ACAGCACCTCGCATTTTGCTTACGATAACGCGGACAACCTGGCGGCAAACGACGATGCACTGCACGCGTTGCCAGTTACC
GACAACCGTCTGCATCACTGGCAGAACCTGTTTATGAAGTACGATGACTGGGGCAACCTGGTCAGTCGACGCAGCGGCCT
GCATGAGCAGCATTTTACCTATGACGCCGAAAACCGGCTCATCAGCGCCAAAGGCAATGGCCCGGACGGCGGGTTTACCG
CACACTATCATTATGATGCGCTGGGCAGGCGCACCCGCAAAGTGGTCAGCACCCAGCACGATCGTAAAGAAGTCCGTTTT
CTGTGGCAGGGCTATCGCCTGTTACAGGAGCAGCATGAAAACGGCCAGTGCCAGACCTACGTGTACGATCCCAACGAGGC
CTGGAGCCCGCTGGCGCGCATCGACCATATGGCAGCAGGCGAACGGGGCGATGTGTTGTGGTTCAATACCGATCTCAACG
GGGCACCGCTAGAGGTAACCGATGAACGCGGCGACATCCGCTGGAGCGGTCAGTACGGCAGCTTTGGTGAAGTGCGTCGG
CAGACAGAGGGCTTTACCCGGCTGGCAAAACAGTCTGCCCTGCCCCATCAGCCGCTGCGCTATGCGGGCCAGTATGCGGA
CAGCGAAACCGGTCTGCATTACAATCTGTTCCGCTATTACGATCCGCAGGTGGGACGGTTCACGGTTCAGGACCCGATAG
GGCTGGAAGGCGGCTGGAACCTGTATCAGTATGCGCCGAACCCGCTGAGTTGGATCGACCCGTTGGGGTTGAATAAGTGT
GGGAGCTTTACAAAGAATCCTGACGATATCCATTTCATGCAAAGTTCAATTAAAAATCAAACGGGCGAGCATACCGTTCT
CAATAACGCAGCTGCTCTTAAAAACGGTACTCTTAAGCCCACTGATTTACCCGCAATTAAAATTTGGCAAGACTCAAGCG
GAAAATTATGGACTCTGGACCACAGGAGACTGGCTGCATTCAAACTGTCAGGGCTGAAAGAAATACCGGTGCAGTGGGCT
ACGGAGAAAGAAATTGCTGGCCAAATGTGGAAAATGACTACGAAAACAGATGGAAAAAGTATCATTCTTAAAATGGGTGA
TGGTATTAAGAGAGTTATAGGTGGCTAA

Protein sequence :
MFEAARLGDDIGHSHALAGMIAGTIVGGLIAAAGGIAAGALMIAGIGASCLGVGVLLVGLSIGVGYLTGELATAARDGIA
DAGAASMTPKGKITTGSPNVFINSKPAAMATNSIVACKDDGPQQMAEGSSRVYINGLPASRIDDRTTCDAKVMTGSDNVY
IGGEPEQTLPIQPEVPEWVYKASDLTLLFAGLAGGVGGAAGKVGALGKLLSKIPGINKIARVACRAGALMTGVAAAGILA
RPVDIVSGQKFLSGDDELDFVLPSRLPLRWQRYWRSGNPGDSVLGRGWNLFWETRLERYQDGLVWRASSGDYVSFPLVPK
GQRTYCEAEKRWLEHHRDDSWSLYDISGERWHFMPLRDDAPSLPLCLTEPCGNEIQFDWNPDHTLAALTDSAGQRVTCRY
ANNRLAGAWLDDDICLVSYAYDDIGQLVTVTGRGGSVRRRFQWCDGLMVAHEDMNGLLSEYRWQEIDGLPRVVAFRHSGG
EQLDFEYDFDNGIRRARRDDGVEAHWLIDDDDHVARFTDFDGRQTMLVYRAGELCDVIMPGGAMRCSNWDRYGRMTQEID
PVGRRTTYHWFRMTDRVIRTDYPDLSATQAAYDLNVRLLTETDALNNVTTYHYPDDTELLPDSITDATGGVVKLEWNRQG
LLTQRTDCSGSVTTFSYDRFGQLVRSVDAEGHVTQREWNDKGQLCAIIHPDGSRETLHWNTQGQLSAWRDPLETEVRWTY
NALGLPVSLTDRIGRTRRWHYDARGNLLRLENGNGGDYRFTCDPLGRPLSEIRPDKTSRNMEWNARGFLIGLQENGQPAN
DGGIARRWQRFSYDDSGLITQRTTQHAEYHYRRHRSGQLASLVRTPTSDGMALGIEDDEIAFTYDVAGQLLTEAGINGKL
DYEWDALGNLTHLTLPGEQQLAWLHYGSGHVSAIRFNQQLVSEFTRDRLHREIRRSQGAREQARQYDSLGRRTMQRSELH
SEVVLPEKAILERAFRYSARSELESVSDTLRGDIIYGYDDEGRLLKHYEARQGHSTSHFAYDNADNLAANDDALHALPVT
DNRLHHWQNLFMKYDDWGNLVSRRSGLHEQHFTYDAENRLISAKGNGPDGGFTAHYHYDALGRRTRKVVSTQHDRKEVRF
LWQGYRLLQEQHENGQCQTYVYDPNEAWSPLARIDHMAAGERGDVLWFNTDLNGAPLEVTDERGDIRWSGQYGSFGEVRR
QTEGFTRLAKQSALPHQPLRYAGQYADSETGLHYNLFRYYDPQVGRFTVQDPIGLEGGWNLYQYAPNPLSWIDPLGLNKC
GSFTKNPDDIHFMQSSIKNQTGEHTVLNNAAALKNGTLKPTDLPAIKIWQDSSGKLWTLDHRRLAAFKLSGLKEIPVQWA
TEKEIAGQMWKMTTKTDGKSIILKMGDGIKRVIGG
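
As a rough consistency check between the two sequences above, the first and last codons of the DNA sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, assuming the standard codon assignments (NCBI translation table 1, whose codon-to-amino-acid assignments are the same as those of bacterial table 11) and assuming the DNA is listed in coding orientation, which the leading ATG suggests. The helper name `translate` is hypothetical.

```python
# Build the standard codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINOS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINOS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 nt and last 15 nt of the DNA sequence in this record.
head = "ATGTTTGAAGCTGCACGCCTCGGCGATGAT"
tail = "GTTATAGGTGGCTAA"

print(translate(head))  # MFEAARLGDD
print(translate(tail))  # VIGG*
```

The translated ends match the start (MFEAARLGDD) and the end (VIGG) of the listed protein, with the final TAA codon acting as the stop.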

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
YpsIP31758_3692 | YP_001402646.1 | RHS/YD repeat-containing protein | Not tested | YAPI | Protein | 0.0 | 45
api89 | CAF28563.1 | putative membrane-bound sugar-binding protein | Not tested | YAPI | Protein | 0.0 | 45