Gene Information

Name : YpsIP31758_0330 (YpsIP31758_0330)
Accession : YP_001399324.1
Strain : Yersinia pseudotuberculosis IP 31758
Genome accession: NC_009708
Putative virulence/resistance : Unknown
Product : RHS/YD repeat-containing protein
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3209
EC number : -
Position : 385627 - 389835 bp
Length : 4209 bp
Strand : +
Note : identified by match to protein family HMM PF03527; match to protein family HMM PF05488; match to protein family HMM PF05593; match to protein family HMM TIGR01643
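
The Position and Length fields above are consistent with 1-based, inclusive coordinates: 389835 - 385627 + 1 = 4209. A minimal Python sketch of that check follows. The function name `feature_length` is hypothetical, and the inclusive-coordinate convention is an assumption inferred from these numbers rather than something the record states.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


# Values copied from the Position field of this record.
start, end = 385627, 389835

length_bp = feature_length(start, end)
codons = length_bp // 3          # complete codons, including the stop codon
residues = codons - 1            # amino acids once the stop codon is excluded

print(length_bp, length_bp % 3, residues)
```

Under that assumption the length is a whole number of codons, and the residue count matches the 1402 amino acids in the protein sequence listed in this record.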

DNA sequence :
ATGTTTGAAGCGGCCCGTGTTGATGACAAGCTTTATCATTCCAGTGCCTTAGCGGGTTTTATTATTGGCTCCATTATTGG
TGCCGCCGTGATTTTTGCGGCCGCGGCTTACGCCGCCTCCATTGTTCTCACCGGCGGGGCGACGCTGGTCGCTACCGGCT
TTATTGTGGGTATGGGGGTGACCACGCTGGGCGTCGTTGCCGGTGGGTTAATACGCTCCGTGGGCGAAAAAATAGGGAGC
ATGTGCCATCACGATGTCGGACAAATTACGACAGGGTCCAAAAACGTTAAAGTGAACAGTAAACGGGCGGCGCATGTCGA
GCTCAGTACCGTGGCCTGTAAAGATGACTCCGCCATTCAGCGCATGGCCGAAGGTTCGTCAAATATCTTTATTAACAGTA
AAGCCGCCGTTCGTCTGGAAGATAAAACGACCTGTGATGCGGTTGTCGATTCCGCTTCCAGTAATGTGACGTTTGGTGGG
GGGCGCGTTCAGTATCTCGATATTAAACGCGAGATTTCTGATGAAATGCGTGATTTGTCAGAGAAGCTGTTTATTGTCGC
CGGGCTGGCGGGCGGCATATTTGGGGCGGCAAAACAGGCGGGGTGTTTCGGCCTTAAATGCCTGAGCAAGATTGCGTTGG
GTGAGATGGCCGGGGCGGCTGCCGGGTATGGGCTGGAAAAAGGGGTTGGGGCCATCGCCGGTTATTTCGGTTACCCGGTT
GATGTGATCAGTGGACAGAAATTGCTGACAGGTGAGGGCGATGATACCGATTTTATTCTGCCGGGTATCTTCCCGCTGCA
CTGGAGCCGGATTTATCGCAGTGAAAATCACCATGTCGGGGCGCTGGGACAAGGCTGGTCTCTGGTATGGGAGCGTTCAT
TACGCAAAGAAGATGACAGCATTGTTTATCAGAATGATGAAGGTCGGGAGATTGTCTTTCCCCTGATTAAACGTGGAGAG
CGCTATTTCTCCCCCACGGAGCATATCTGGCTGGCACGTACCGAGCGTGATACCTATGCCATCAGCAGCCCGTTTGAAAC
CTGTTTTATTTTTGAGGCCTTTTCTGAGGCTGGCGTCGCGAAATTAGCCAGCCTCGAAGATATCAATGGTCATGCCCTGT
ATTTCTTTTATGACGATATCGGGCAACTGAAAAAAATATCGACCACCAGCGGCTATGGGGTGTATTGCCAGTATGAAAAA
GGGCGTCTGGTGTCCGTTGCCTGTGTTAAGGGCGGTACGCCGGGCACACTGGTCCGCTACCAGTATAATGAACAGCACCA
GTTGGTTAGCGTCACTAACCGTGAGGGGCAAATCACCCGCCAGTTTGGTTACCATGGCCATCTGATCAATAAACTGGCGG
ATGTCAGGGGGCTGGAGTGCCGTTACACATGGGCTGATATCGGCGGAACCCCGCGAATTACGCACAGTGCCACCAATCTG
GGGGAGCAGTGGCAGTTTGATTATGATATCGACAATCAACAGACCACCCTGACGGACCTCAATACCGGGCAGACCGCCTG
CTGGGGATATAACGCCCAACATTTAATTACCGACTATCGGGATTTTGATGGCGGGAAATATGCATTTGACTACAACGACC
TCAATATGCCGGTACGCGTTGTGCTGGCAGGCGAGAGAACGCTCGTTCTGGTTTACGATGCACTGGCGCGCCCGATCCAG
ATCACCGATCCGCTAAAACGTGAAACCCACATTGATTATCACCGTAACAGTCTGCGGGTGGTGCGCCGTCAGTACCCTGA
CGGGCAGGTCTGGAAGGGGGAATATGACCGTACCGGCCGTTTGCTGAAAGAGAACGCGCCGGATGGCGGGGTGACGCTTT
ATCATTATCCAGGGGCCTCATCCCTTCCTGAACGCATAACCAATGCCGTAGGGGCGCAGACACACCTTGGTTGGGAAAGG
CACGGGCAACTGACGGAGCACACCGACTGCTCGGGTAAACTGACCCGCTACGAATATGATATCGATGGCCATCTGCTGAC
GGTCATCGATGCTGAAAACCATTCAACACATTACAGCTACAACCGTCTCGGGCAGCCCACCGGGGTCAGGTACGCCGATG
GCCGCAAAGAGCAGTTGCGGTATAACGCTCAGGGGCTGGTTGAACAGTTTACCGATCCTGTCGGGCGGCAGTTGCACTGG
CGTTATAACCTGCGGGGTCAGCCGGTCAGCTTTACTGATCGTCTGCAACGGGAATACCGTTACCGCTATGACTGCCATGG
GCAGATGATTGAGCTGGATAATGCCAATGGGGGCCAGTATCACTTCCGGTGGAGCAGCGGCGGGCAATTGGTGGAAGAGC
AGTATCCCGATAACCTTGTCCGGCGTTATCGCTATGGGGAGAGCGGGATGCTGATGGCGCTGGAGACCACCGCGCCCACG
GTTGACGATCTTACCGTCTCCCGGCAGGTCAGTTTTGACTATGATGCGGGCGGGCGAATGACGCAGCGCCTGACGGGCAT
GAGTGCGACCCGGTATGACTGGGACATCATGGACCGTTTATTGCTGGCCGAGCGTGTGCCAACGGCGGTGGGCGAACAGG
CGGGGATCGTCGGTAATGGTGTTCGTTTGGCGTATGACAAGGCCGGGCATTTACTGACGGAAAGCGGTGACCTGGGTGCG
GTGACGTATCAGTGGGATCCGCTGCATCATCTGGCCGCCCTGACGCTGCCCGATGGTCAGACGCTGTCATGGTTGCGTTA
CGGTGCGGGCCATGTCAGTGCCATTCGTCATGGTGATACGCTTATTTCCGAGTTCAGCCGGGATAATCTTCATCGGGAAG
TGAGCCGGACCCAGGGTATTTTGACGCAGTATCGTGATTATGACGCGATGGGGCGGCGGTTGTGGCAATCGGCGGGTTCT
GATGCGCCGACAGTGGCGGCCGATCTGCTGCCCCGTCAGGGGGATATCTGGCGTAAATTTAGCTTTGACACTGCCGGTGA
ACTGAGCATGGCCACCGATTTTATCCGGGGTGAGCAGCAGTACCGTTATGATGCGGAAGGGCGGCTGACTGACAGCCGGG
AGCGTCATCAGTTATCCGTTGCGGAGGATTTTGCTTACGACAATGCGGATAACCTGCTGAACCTGAGGAAACTGCCGTTT
GACACCGTCGATCCACTGTACGATACACCGGTCGCCAACAACCGTTTGACGCAATGGCAGCATTACCGTTTTGAGTATGA
TGCCTGGGGAAACATGACCACGCGGCATGCCGGTGGTCGGATGCAACATTTTGCCTATGACGATGATAACCGGCTGCTGC
GGGCCTGGGGAACCGGGCCGTTAGGGGAGCATGACAGCCACTATCGGTATGATGCGCTGGGGCGGCGTATCCACAAATCG
GTGACGATAAAGCGCGGCGCAGAAAAAACCACCCGTCAGACCGATTTTATCTGGCAGGGACTGCGGTTATTGCAGGAGCA
ACATGCGGACGGCAACGCGACCTATATTTACGACCCGAACGAAAGTTATACGCCGCTGGCGCGGGTCGATCAGCGTCATG
GCGAGACAGAAAGTCAGGTGTATTATTTTCATACGGATATCAACGGTACCCCGCTGGATGTCACGGACGGAGAGGGTAAG
CACCGCTGGTCAGGGAAATACCACGCCTGGGGCAAAGTTACCCGGCAGAATGTCAGCGATCCAAGGCAAAGCACGGTCAG
CCGGTTCGCGCAGCCGCTGCGTTATCCGGGGCAATACAGTGATGACGAGACGGGTTTGCACTACAATACGTTCAGGTACT
ATGACCCGGAGATAGGGCGATTTAGTACGCAGGACCCGATAGGGCTGGCGGGGGGGGTGAATCTTTATCAGTATGGGCCA
AATCCGTTAACGTGGATCGATCCTTGGGGTTATACAGGAACATATATTTTTACTGACGGTGTTGTATCTTATATAGGTAA
GGGCCCGTTAGGACGAATGGTAGCGTCTATGGGACAAAGAATTGGCGGTTCTTTGAATGCAATACAGTCGGCTCATTTGG
ACTTTGGTAGTGATAAGTTAGGATTCATGGTAGAACATCGGATGATGGAAAAGTATGGTGCTCGTTATTCTCCCGACTTT
GCTAACAGTGAACGCGTTGGTTCACCGGGAAAAAAATTATATGATGCCGCCGATTTGAAAACACAAAAAAAGGTTGACCG
TCTAGCTAATAAATTAGATAAGAATTTTAAGTCATCTAAAGGATGTTAA

Protein sequence :
MFEAARVDDKLYHSSALAGFIIGSIIGAAVIFAAAAYAASIVLTGGATLVATGFIVGMGVTTLGVVAGGLIRSVGEKIGS
MCHHDVGQITTGSKNVKVNSKRAAHVELSTVACKDDSAIQRMAEGSSNIFINSKAAVRLEDKTTCDAVVDSASSNVTFGG
GRVQYLDIKREISDEMRDLSEKLFIVAGLAGGIFGAAKQAGCFGLKCLSKIALGEMAGAAAGYGLEKGVGAIAGYFGYPV
DVISGQKLLTGEGDDTDFILPGIFPLHWSRIYRSENHHVGALGQGWSLVWERSLRKEDDSIVYQNDEGREIVFPLIKRGE
RYFSPTEHIWLARTERDTYAISSPFETCFIFEAFSEAGVAKLASLEDINGHALYFFYDDIGQLKKISTTSGYGVYCQYEK
GRLVSVACVKGGTPGTLVRYQYNEQHQLVSVTNREGQITRQFGYHGHLINKLADVRGLECRYTWADIGGTPRITHSATNL
GEQWQFDYDIDNQQTTLTDLNTGQTACWGYNAQHLITDYRDFDGGKYAFDYNDLNMPVRVVLAGERTLVLVYDALARPIQ
ITDPLKRETHIDYHRNSLRVVRRQYPDGQVWKGEYDRTGRLLKENAPDGGVTLYHYPGASSLPERITNAVGAQTHLGWER
HGQLTEHTDCSGKLTRYEYDIDGHLLTVIDAENHSTHYSYNRLGQPTGVRYADGRKEQLRYNAQGLVEQFTDPVGRQLHW
RYNLRGQPVSFTDRLQREYRYRYDCHGQMIELDNANGGQYHFRWSSGGQLVEEQYPDNLVRRYRYGESGMLMALETTAPT
VDDLTVSRQVSFDYDAGGRMTQRLTGMSATRYDWDIMDRLLLAERVPTAVGEQAGIVGNGVRLAYDKAGHLLTESGDLGA
VTYQWDPLHHLAALTLPDGQTLSWLRYGAGHVSAIRHGDTLISEFSRDNLHREVSRTQGILTQYRDYDAMGRRLWQSAGS
DAPTVAADLLPRQGDIWRKFSFDTAGELSMATDFIRGEQQYRYDAEGRLTDSRERHQLSVAEDFAYDNADNLLNLRKLPF
DTVDPLYDTPVANNRLTQWQHYRFEYDAWGNMTTRHAGGRMQHFAYDDDNRLLRAWGTGPLGEHDSHYRYDALGRRIHKS
VTIKRGAEKTTRQTDFIWQGLRLLQEQHADGNATYIYDPNESYTPLARVDQRHGETESQVYYFHTDINGTPLDVTDGEGK
HRWSGKYHAWGKVTRQNVSDPRQSTVSRFAQPLRYPGQYSDDETGLHYNTFRYYDPEIGRFSTQDPIGLAGGVNLYQYGP
NPLTWIDPWGYTGTYIFTDGVVSYIGKGPLGRMVASMGQRIGGSLNAIQSAHLDFGSDKLGFMVEHRMMEKYGARYSPDF
ANSERVGSPGKKLYDAADLKTQKKVDRLANKLDKNFKSSKGC
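
The protein sequence above can be cross-checked against the DNA sequence by translating it codon by codon. The sketch below is an illustration only: it assumes the standard codon assignments (which bacterial translation table 11 shares for sense and stop codons), and the helper name `translate` is hypothetical. It is applied to the first 30 and the last 9 nucleotides of the DNA sequence in this record.

```python
from itertools import product

# Standard codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop, 'X' an unknown codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3   # ignore a trailing incomplete codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


# First 30 nt and last 9 nt of the DNA sequence in this record.
head = translate("ATGTTTGAAGCGGCCCGTGTTGATGACAAG")
tail = translate("GGATGTTAA")
print(head, tail)
```

The first fragment should reproduce the start of the protein sequence (MFEAARVDDK) and the second its last two residues followed by the stop codon.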

Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
YpsIP31758_3692 | YP_001402646.1 | RHS/YD repeat-containing protein | Not tested | YAPI | Protein | 0.0 | 41
api89 | CAF28563.1 | putative membrane-bound sugar-binding protein | Not tested | YAPI | Protein | 0.0 | 41