Gene Information

Name : BB2000_1115 (BB2000_1115)
Accession : YP_008397670.1
Strain : Proteus mirabilis BB2000
Genome accession: NC_022000
Putative virulence/resistance : Unknown
Product : Rhs-family protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1232812 - 1236645 bp
Length : 3834 bp
Strand : -
Note : matching_protein_id: YP_002150527.1; matching_locus_tag: PMI0761
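
The Position and Length fields can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the gene ends with a single stop codon; the function names are illustrative and not part of any PAI DB tooling.

```python
# Sanity check of the Position and Length fields.
# Assumption: coordinates are 1-based and inclusive at both ends.

def gene_length_bp(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count of the encoded protein, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length_bp(1232812, 1236645)
protein_len = expected_protein_length(length_bp)
print(length_bp, protein_len)
```

Under these assumptions the computed length agrees with the 3834 bp given in the Length field.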

DNA sequence :
ATGTTAGAGGCGGCAAGGGTTGGAGATATTATCGGTCACTCAAAATCCATGTGGGGGATGCTCGTTGGTACTATATTAGG
CGCTGCTATCGCCATCGGTGGCGCCATGATTTCTGGTGTATTAATGGGTGCGGGTATTGCTGCAAGCTGCATTGGTATTG
GCGTGCTTGCCATTGGTGCTTCCATCGCCGTCGGTTACGGAACTACCCTATTAGCCGACTGGGTTCGGGATAAATGTGTT
GAAACAGGTTCAAAATCCTTAAGCCCCTGTGGGGAAATCCTTGACGGCTCCAAAAATGTTCGTATTAACGGCAAGCCTGC
CGCAATGTCTACCCGTAGCAACGTCAAATGTGATAAAGAAAACAGTGAACGCCAAATCGCTCAAGGCTCGGGTAGTGTCT
ATATCAACGGTTTTCCAGCCTCTCGGGTCGGTGATAAAACCACCTGTGATGCGACTATCATGGAAGGTTCCCCCAATGTC
CGGATTGGCGGTGGAACACAAGCAACCGAAGATATTGAACCTGAAATTCCCTCTTGGGTTACCACCGCATCCGATCTCAC
GATGTTATTTGCCGGTTTCCTGAGTTTTGGTTCAGCGGCAGCCAAAGGCCCCGGGGCAGTAGCAAAGTTGTGGAGCAAAT
TACCGGGTAGTGCCAAAATAAGTCGCTTTGCTTGTCGTTATGGCAAGATTTTTACTGGGCTTAGTCTCGCTATTCCGGCC
ATTGGCATCTTAACCCGCCCTGTTGAAGTGATTGGCGGACAAAAAATCCTCAATGACGAAGATGAATTGGATTTTAGCTA
TGAGGCGGAATTACCACTTTATTGGCAACGCAATTACTTAAGTAGCTATTGCTATGAGGGTGCACTCGGTCGAGGTTGGA
GTTTCTTTTGGGAAAGCCAGCTAATTAAAAGCGAAGATGGCTTTGTTTGGCAAAATCTATCCGGTGATATCCTTCCATTT
CCTGACATTCCACTCGGACACCGTAACTTTAACGAAGCTGCTCAAGGCTGGATCATCCACCATGAGGATGACAGTTGGAC
GTTTCAGGATGCTGGTGAGCTACGCTATCATTATCCCCCTTTTGATGACAAAGGTTATAGTCGTTTAAGTCATATCGTTG
ATAATGTGGGAAACGAGCAGCGGTTTCATTACAACGAGCACCACCAACTGATCCATATTACGGGATGTGGTGATCTCAAT
ATCGAGTGTGAATATCAATCTTTTCAACTTGCTGAAAAGACCGTATCGCGTCTTACTGCAGTCTATCAAGTAAACCCCCA
TCAGATACGCCGTCGTCTATGTGCTTATTTTTACAACGAAAGTGCCCAATTAATACGTGTTGAGCAACAGACAGATCACC
CTTATCGTCAATTTGGTTGGACAGATACGGGTGTAATGGCATGGCATAGCGATAAATATGGCTTACGCAGTGAATACCGC
TGGGCACTTTCAGAAGATAATCTTTGGCGAGTGATTGAAAACAAAACCAGCGAAGGCGAAAGCTATCGTCTTGAATATGA
TGATATCAACCTCACACGAACCGCTTATTGGCACGATGGTTCAACCTCATTTTGGCAACTTAATCATGATCACCAAATTA
TTCATTATACCGATCGTACAGGCATTAAAACGGCGCTAATTTGGGATGAGTTTGGATTACCTTGCGGTTGTCGCAATGCG
TTGGGTCATACACATATTAGTGAATGGGACGCATTGGGCCGACTACTCAGCATAACGGATGGTAATGGTAACCAAACGCG
TTGGCAATATCAGAATGAACGCGAGCGCCTTATCACTGTATTTTGGCCGGATAACACAGAGTCGCGTTTAGTGTATGACA
GTTTAGGGCGACTGATTAAGGAAATTTCTCCCCTTCATCAGATCACCGAATACCGCTATGACTTTAAGACCACATTGCGC
CCTACTGCACGCATTGATGCCAAGCAAGGACGCAGTGAGTTTCTCTGGAACAAACGTGGGCAATTACTGCGCCATACTGA
TTGCTCAGGCAAACAACATATTTGGTGTTATGATGACGAAGGCCGTGTTGTTTCTCAAACCAACGCCTTGCAGGAAGCCA
CCGAGTACCAGTACAATGAGGCGGGACACCTCACTCGCATCGTATTGCCCGATAATTCTACCGTGCAACTGGCATGGAAT
GCCGCAGGATCACTTACTCATCATCAACGCAATGACAATACCCCGCGTCAATGGCAATACAACGCCTTTGGTCGTGTCAC
CACAGAAATTGATAAACTCGCTCGGCATATCTACTACCATTACAATGCACAAGGTGCATTAATTTCAATTGAAAATGCCA
ATGGCGGGCGTTATCTCCTCAATCGTGATGCCGAAGGTCGGTTAGTCGAAGAAATACGTCCTGATGAGACGCTACTCCAA
TACACCTATAACGTCGCTGGGCGACTGGTTGAAGAAGCTCACTTAGGCGACCGAGTCTTCACATCCGCACCACGCACAAT
ATTACTCGACTATGATGCGGCGGGGAACCTTGCCAAACGAGAGACTTTAACTGATCGCTATCAATATCAATGGGATAGTA
TGAACCGCCTATTGGTCGCCAGCAAACAACCTAATCAACGTGGTCTTGAAATGGGTTTGCAAGCGAATCAAGTCCACTTT
ACCTACGATGCACTTGGCCGCATTATTCGTGAACAAACAGGCGACGATATCGTCGAATTTAATTATGATGAACTCAATAA
CCTAAGCCGTCTGACGTTACCTCAAGGTGACAGCCTCAACTGGCTCTATTATGGCTCTGGGCATGCCACGGCTATCAACC
ACCTTGTTGATAGTCGCTCTCAGTTAATTACTGAATTTGAACGTGACGACCTACACCGTGAAATCAGTCGAACTCAAGGA
GAGCTCACCCAATATCGGCAATACGATAAACTGGGGCGAACCATTAGCACTTTCAGCTCGCGTGATAAGCAACATCCGTT
AAATGGTATTACTTTATGGCGTAAGTGGTTTTATGATCCCCAAGATAATCTTGGTGCCATGGAAGACACCTATCGAGGTT
GGGTAGAATACCTGTATGACTCAGAGCAACGTTTAAAAAAAGTCGCCAGTAGTGAAAACCTTGATGCTATGCTGTTTTAC
GATCGCGCGGATAATCTACTCGAACGCCCACAATCCGAAATTGATGCTGAACACTCCCCTACTTTAGAACTAAGTCCCCA
AGGGGATAAGCTACGTCAATTTCAAGGGTGGCACTATCAGTATGATGCCTATGGTAATGTTATTGCTCGCCGTTACCGTA
ACCAATCACCACAAACCTATGCTTATGATGGTGATAATCGTCTGGTTATCGCTCATAATCAAGGCATAAAAGCTCAATAC
CACTACGATGCTCTGGGCCGTCGTATTCACAAAACCGTTGAAAACCGAGAAAGTGGCCAAGTTAAACGACAAGAGACGCA
TTTTATTTGGCAAGGGCTACGGTTACTGCAAGAGCAGGATCTCAATACTGGTAAACACCAAACTTATTGCTACGAAGAAC
ACGGCAGTTATACCCCTCTTGCCGTTATCGTGAAACAATCCAGCGGTTTTCATTATTACTGGCACCACTGTGATATTAAC
AGTGCCCCACTTGAAGTCACCAATGCACAAGGCAATACGCTATGGTCAGGGAAATATGAACGCTTTGGTTTTGTTCGCAA
TAGCCCATTAAGTTTCTACTCTGACCCTGAACGTGTGATGGAGTCCTTTGAGCAAAATCTACGCTATGCCGGACAATATT
TTGACAATGACAAATTTGTCGGGAACAAATTTGAACAGCGCTTGCGCTGGCCCGTAGGGCGAGTATCAGGATGA

Protein sequence :
MLEAARVGDIIGHSKSMWGMLVGTILGAAIAIGGAMISGVLMGAGIAASCIGIGVLAIGASIAVGYGTTLLADWVRDKCV
ETGSKSLSPCGEILDGSKNVRINGKPAAMSTRSNVKCDKENSERQIAQGSGSVYINGFPASRVGDKTTCDATIMEGSPNV
RIGGGTQATEDIEPEIPSWVTTASDLTMLFAGFLSFGSAAAKGPGAVAKLWSKLPGSAKISRFACRYGKIFTGLSLAIPA
IGILTRPVEVIGGQKILNDEDELDFSYEAELPLYWQRNYLSSYCYEGALGRGWSFFWESQLIKSEDGFVWQNLSGDILPF
PDIPLGHRNFNEAAQGWIIHHEDDSWTFQDAGELRYHYPPFDDKGYSRLSHIVDNVGNEQRFHYNEHHQLIHITGCGDLN
IECEYQSFQLAEKTVSRLTAVYQVNPHQIRRRLCAYFYNESAQLIRVEQQTDHPYRQFGWTDTGVMAWHSDKYGLRSEYR
WALSEDNLWRVIENKTSEGESYRLEYDDINLTRTAYWHDGSTSFWQLNHDHQIIHYTDRTGIKTALIWDEFGLPCGCRNA
LGHTHISEWDALGRLLSITDGNGNQTRWQYQNERERLITVFWPDNTESRLVYDSLGRLIKEISPLHQITEYRYDFKTTLR
PTARIDAKQGRSEFLWNKRGQLLRHTDCSGKQHIWCYDDEGRVVSQTNALQEATEYQYNEAGHLTRIVLPDNSTVQLAWN
AAGSLTHHQRNDNTPRQWQYNAFGRVTTEIDKLARHIYYHYNAQGALISIENANGGRYLLNRDAEGRLVEEIRPDETLLQ
YTYNVAGRLVEEAHLGDRVFTSAPRTILLDYDAAGNLAKRETLTDRYQYQWDSMNRLLVASKQPNQRGLEMGLQANQVHF
TYDALGRIIREQTGDDIVEFNYDELNNLSRLTLPQGDSLNWLYYGSGHATAINHLVDSRSQLITEFERDDLHREISRTQG
ELTQYRQYDKLGRTISTFSSRDKQHPLNGITLWRKWFYDPQDNLGAMEDTYRGWVEYLYDSEQRLKKVASSENLDAMLFY
DRADNLLERPQSEIDAEHSPTLELSPQGDKLRQFQGWHYQYDAYGNVIARRYRNQSPQTYAYDGDNRLVIAHNQGIKAQY
HYDALGRRIHKTVENRESGQVKRQETHFIWQGLRLLQEQDLNTGKHQTYCYEEHGSYTPLAVIVKQSSGFHYYWHHCDIN
SAPLEVTNAQGNTLWSGKYERFGFVRNSPLSFYSDPERVMESFEQNLRYAGQYFDNDKFVGNKFEQRLRWPVGRVSG
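
The protein sequence corresponds to a codon-by-codon translation of the DNA sequence. A minimal sketch follows, assuming the standard genetic code and that the DNA is already given in coding orientation (it begins with ATG even though the Strand field is "-"). The `translate` helper is illustrative, and only the first and last few codons from the record are used as input.

```python
# Codon-by-codon translation with the standard genetic code.
# Assumption: the input is in coding orientation and in frame.
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 12 nt of the DNA sequence in this record.
print(translate("ATGTTAGAGGCGGCAAGGGTTGGAGATATT"))  # MLEAARVGDI
print(translate("GTATCAGGATGA"))                    # VSG
```

Both fragments reproduce the start (MLEAARVGDI) and the end (VSG) of the protein sequence listed above.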

Homologs from PAI DB

Gene             GenBank Accn    Product                                        Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity (%)
YpsIP31758_3692  YP_001402646.1  RHS/YD repeat-containing protein               Not tested               YAPI        Protein         0.0    48
api89            CAF28563.1      putative membrane-bound sugar-binding protein  Not tested               YAPI        Protein         0.0    48