Gene Information

Name : YPTB3621 (YPTB3621)
Accession : YP_072103.1
Strain : Yersinia pseudotuberculosis IP 32953
Genome accession: NC_006155
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3209
EC number : -
Position : 4307136 - 4311404 bp
Length : 4269 bp
Strand : -
Note : similar to Yersinia pestis YPO3608 exported protein (99.6% evalue=0)

DNA sequence :
ATGTTTGAAGCGGCCCGTGTTGATGACAAGCTTTATCATTCCAGTGCCTTAGCGGGTTTTATTATTGGCTCCATTATTGG
TGCCGCCGTGATTTTTGCGGCCGCGGCTTACGCCGCCTCCATTGTTCTCACCGGCGGGGCGACGCTGGTCGCTACCGGCT
TTATTGTGGGTATGGGGGTGACCACGCTGGGCGTCGTTGCCGGTGGGTTAATACGCTCCGTGGGCGAAAAAATAGGGAGC
ATGTGCCATCACGATGTCGGACAAATTACGACAGGGTCCAAAAACGTTAAAGTGAACAGTAAACGGGCGGCGCATGTCGA
GCTCAGTACCGTGGCCTGTAAAGATGACTCCGCCATTCAGCGCATGGCCGAAGGTTCGTCAAATATCTTTATTAACAATA
AAGCCGCCGTTCGTCTGGAAGATAAAACGACCTGTGATGCGGTTGTCGATTCCGCTTCCAGTAATGTGACGTTTGGTGGG
GGGCGCGTTCAGTATCTCGATATTAAACGCGAGATTTCTGATGAAATGCGTGATTTGTCAGAGAAGCTGTTTATTGTCGC
CGGGCTGGCGGGCGGCATATTTGGGGCGGCAAAACAGGCGGGGTGTTTCGGCCTTAAATGCCTGAGCAAGATTGCGTTGG
GTGAGATGGCCGGGGCGGCTGCCGGGTATGGGTTGGAAAAAGGGGTTGGGGCCATCGCCGGTTATTTCGGTTACCCGGTT
GATGTGATCAGTGGACAGAAATTGCTGACAGGTGAGGGCGATGATACCGATTTTATTCTGCCGGGTATCTTCCCGCTGCA
CTGGAGCCGGATTTATCGCAGTGAAAATCACCATGTCGGGGCGCTGGGACAAGGCTGGTCTCTGGTATGGGAGCGTTCAT
TACGCAAAGAAGATGACAGCATTGTTTATCAGAATGATGAAGGTCGGGAGATTGTCTTTCCCCTGATTAAACGTGGAGAG
CGCTATTTCTCCCCCACGGAGCATATCTGGCTGGCACGTACCGAGCGTGATACCTATGCCATCAGCAGCCCGTTTGAAAC
CTGTTTTATTTTTGAGGCCTTTTCTGAGGCTGGCGTCGCGAAATTAGCCAGCCTCGAAGATCTCAATGGTCATGCCCTGT
ATTTCTTTTATGACGATATCGGGCAACTGAAAAAAATATCGACCACCAGCGGCTATGGGGTGTATTGCCAGTATGAAAAA
GGGCGTCTGGTGTCCGTTGCCTGCGTCAAGGGCGGTACGCCGGGCACACTGGTCCGCTACCAGTATAATGAACAGCACCA
GTTGGTCAGCGTCACTAACCGTGAGGGGCAAATCACCCGCCAGTTTGGTTACCATGGCCATCTGATCAATAAACTGGCGG
ATGTCAGGGGGCTGGAGTGCCGTTACACATGGGCTGATATCGGCGGAACCCCGCGAATTACGCACAGTGCCACCAATCTG
GGGGAGCAGTGGCAGTTTGATTATGATATCGACAATCAACAGACCACCCTGACGGACCTCAATACCGGGCAGACCGCCTG
CTGGGGATATAACGCCCAACATTTAATTACCGACTATCGGGATTTTGATGGCGGGAAATATGCATTTGACTACAACGACC
TCAATATGCCGGTACGCGTTGTGCTGGCAGGCGAGAGAACGCTCGTTCTGGTTTACGATGCACTGGCGCGCCCGATCCAG
ATCACCGATCCGCTAAAACGTGAAACCCACATTGATTATCACCGTAACAGTCTGCGGGTGGTGCGCCGTCAGTACCCTGA
CGGGCAGGTCTGGAAGGGGGAATATGACCGTACCGGCCGTTTGCTGAAAGAGAACGCGCCGGATGGCGGGGTGACGCTTT
ATCATTATCCAGGGGCCTCATCCCTTCCTGAACGCATAACCAATGCCGTAGGGGCGCAGACACACCTTGGTTGGGAAAGG
CACGGGCAACTGACGGAGCACACCGACTGCTCGGGTAAACTGACCCGCTACGAATATGATATCGATGGCCATCTGCTGAC
GGTCATCGATGCTGAAAACCATTCAACACATTACAGCTACAACCGTCTCGGGCAGCCCACCGGGGTCAGGTACGCCGATG
GCCGCAAAGAGCAGTTGCGGTATAACGCTCAGGGGCTGGTTGAACAGTTTACCGATCCTGTCGGGCGGCAGTTGCACTGG
CGTTATAACCTGCGGGGTCAGCCGGTCAGCTTTACTGATCGTCTGCAACGGGAATACCGTTACCGCTATGACTGCCATGG
GCAGATGATTGAGCTGGATAATGCCAATGGTGGCCAGTATCACTTCCGGTGGAGCAGTGGCGGGCAATTGGTGGAAGAGC
AGTATCCCGATAACCTTGTCCGGCGTTATCGCTATGGGGAGAGCGGGATGCTGATGGCGCTGGAGACCACCGCGCCCACG
GTTGACGATCTTACCGTCTCCCGGCAGGTCAGTTTTGACTATGATGCGGGCGGGCGAATGACGCAGCGCCTGACGGGCAT
GAGTGCGACCCGGTATGACTGGGACATTATGGACCGTTTATTGTTGGCCGAGCGTGTGCCAACGGCGGTGGGCGAACAGG
CGGGGATCGTCGGTCATGGTGTTCGTTTGGCGTATGACAAGGCCGGGCATTTACTGACGGAAAGCGGTGACCTGGGTGCG
GTGACGTATCAGTGGGATCCGCTGCATCACCTGGCCGCCCTGACGCTGCCCGATGGTCAGACGCTGTCATGGTTGCGTTA
CGGTGCGGGCCATGTCAGTGCCATTCGTCATGGTGATACGCTTATTTCCGAGTTCAGCCGGGATAACCTTCATCGGGAAG
TGAGCCGGACCCAGGGTATTTTGACGCAGTATCGTGATTATGACGCGATGGGGCGGCGGTTGTGGCAATCGGCGGGTTCT
GATGCGCCGACAGTGGCGGCCGATCTGCTGCCCCGTCAGGGGGATATCTGGCGTAAATTCAGCTTTGACACTGCCGGTGA
ACTGAGCATGGCCACCGATTTTATCCGGGGTGAGCAGCAGTACCGTTATGATGCGGAAGGGCGGCTGACTGACAGCCGGG
AGCGTCATCAGTTATCCGTTGCGGAGGATTTTGCTTACGACAATGCGGATAACCTGCTGAACCTGAGGAAACTGCCGTTT
GACACCGTCGATCCACTGTACGATACACCGGTCGCCAACAACCGTTTGACGCAATGGCAGCATTACCGTTTTGAGTATGA
TGCCTGGGGAAACATGACCACGCGGCATGCCGGTGGTCGGATGCAACATTTTGCCTATGACGATGATAACCGGCTGCTGC
GGGCCTGGGGAACCGGGCCGTTAGGGGAGCATGACAGCCACTATCGGTATGATGCGCTGGGGCGGCGTATCCACAAATCG
GTGACGATAAAGCGCGGCGCAGAAAAAACCACCCGTCAGACCGATTTTATCTGGCAGGGGTTGCGGTTATTGCAGGAGCA
ACATGCGGACGGCAACGCGACCTATATTTACGACCCGAACGAAAGTTATACGCCGCTGGCACGGGTCGATCAGCGTCATG
GCGAGACAGAAAGTCAGGTGTATTATTTTCATACGGATATCAATGGTACCCCGCTGGATGTCACGGACGGAGAGGGTAAG
CACCGCTGGTCAGGGAAATACCACGCCTGGGGCAAAGTCACCCGGCAGAATGTCAGCGATCCAAGGCAAAGCACGGTCAG
CCGGTTCGCGCAGCCGCTGCGTTATCCGGGGCAATACAGTGATGACGAGACGGGTTTGCACTACAATACGTTCAGGTACT
ATGACCCGGAGATAGGGCGATTTAGCACGCAGGACCCGATAGGGCTGGCGGGGGGGGTGAATCTTTATCAGTATGGGCCA
AATCCGCTAGGTTGGGTGGATCCTTTAGGACTAGCTAATCTTTTTGATTTAGGTACTTATGGCGGTTTAAATGGAGGGAT
TCATGTCGGCGATGGTTTACAAGCACATGAGCTTATTCGTCATGAATTTCTTAAGCAATTAGGATTAGCTAATGATACTC
GTTTATCTTCGAATCCTTCTATTGCGTTAGACCTTGATCACCATACTCGGGGACCATTAAAAGATTCTAGAGGCATTGGT
GGTGTTCATTACCATGAAGCTCAGGTTAGAGCTGAGAGAGGACTTGGAATTAACCAATTTGCCTCAAAAATAGCGGATGA
GTTAGATATAACATCTGAAGCCATGAAAAGAGCGGGTGTTCCTGAAACACAAATTAGTAAATTACGAGGCAACGCGGAAA
AATTTTATGGCAATCTTTCTGGGTGTTAA

Protein sequence :
MFEAARVDDKLYHSSALAGFIIGSIIGAAVIFAAAAYAASIVLTGGATLVATGFIVGMGVTTLGVVAGGLIRSVGEKIGS
MCHHDVGQITTGSKNVKVNSKRAAHVELSTVACKDDSAIQRMAEGSSNIFINNKAAVRLEDKTTCDAVVDSASSNVTFGG
GRVQYLDIKREISDEMRDLSEKLFIVAGLAGGIFGAAKQAGCFGLKCLSKIALGEMAGAAAGYGLEKGVGAIAGYFGYPV
DVISGQKLLTGEGDDTDFILPGIFPLHWSRIYRSENHHVGALGQGWSLVWERSLRKEDDSIVYQNDEGREIVFPLIKRGE
RYFSPTEHIWLARTERDTYAISSPFETCFIFEAFSEAGVAKLASLEDLNGHALYFFYDDIGQLKKISTTSGYGVYCQYEK
GRLVSVACVKGGTPGTLVRYQYNEQHQLVSVTNREGQITRQFGYHGHLINKLADVRGLECRYTWADIGGTPRITHSATNL
GEQWQFDYDIDNQQTTLTDLNTGQTACWGYNAQHLITDYRDFDGGKYAFDYNDLNMPVRVVLAGERTLVLVYDALARPIQ
ITDPLKRETHIDYHRNSLRVVRRQYPDGQVWKGEYDRTGRLLKENAPDGGVTLYHYPGASSLPERITNAVGAQTHLGWER
HGQLTEHTDCSGKLTRYEYDIDGHLLTVIDAENHSTHYSYNRLGQPTGVRYADGRKEQLRYNAQGLVEQFTDPVGRQLHW
RYNLRGQPVSFTDRLQREYRYRYDCHGQMIELDNANGGQYHFRWSSGGQLVEEQYPDNLVRRYRYGESGMLMALETTAPT
VDDLTVSRQVSFDYDAGGRMTQRLTGMSATRYDWDIMDRLLLAERVPTAVGEQAGIVGHGVRLAYDKAGHLLTESGDLGA
VTYQWDPLHHLAALTLPDGQTLSWLRYGAGHVSAIRHGDTLISEFSRDNLHREVSRTQGILTQYRDYDAMGRRLWQSAGS
DAPTVAADLLPRQGDIWRKFSFDTAGELSMATDFIRGEQQYRYDAEGRLTDSRERHQLSVAEDFAYDNADNLLNLRKLPF
DTVDPLYDTPVANNRLTQWQHYRFEYDAWGNMTTRHAGGRMQHFAYDDDNRLLRAWGTGPLGEHDSHYRYDALGRRIHKS
VTIKRGAEKTTRQTDFIWQGLRLLQEQHADGNATYIYDPNESYTPLARVDQRHGETESQVYYFHTDINGTPLDVTDGEGK
HRWSGKYHAWGKVTRQNVSDPRQSTVSRFAQPLRYPGQYSDDETGLHYNTFRYYDPEIGRFSTQDPIGLAGGVNLYQYGP
NPLGWVDPLGLANLFDLGTYGGLNGGIHVGDGLQAHELIRHEFLKQLGLANDTRLSSNPSIALDLDHHTRGPLKDSRGIG
GVHYHEAQVRAERGLGINQFASKIADELDITSEAMKRAGVPETQISKLRGNAEKFYGNLSGC

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
YpsIP31758_3692 YP_001402646.1 RHS/YD repeat-containing protein Not tested YAPI Protein 0.0 41
api89 CAF28563.1 putative membrane-bound sugar-binding protein Not tested YAPI Protein 0.0 41