Gene Information

Name : SMWW4_v1c23490 (SMWW4_v1c23490)
Accession : YP_007406169.1
Strain : Serratia marcescens WW4
Genome accession: NC_020211
Putative virulence/resistance : Unknown
Product : RHS family protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2535037 - 2539323 bp
Length : 4287 bp
Strand : +
Note : -
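
The Length value above can be checked against the Position coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention, not stated in this record); `feature_length` is a hypothetical helper name used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(2535037, 2539323)

# A complete coding sequence is a whole number of codons; the last codon
# is the stop codon, so the protein is one residue shorter than the codon count.
codons = length_bp // 3
residues = codons - 1

print(length_bp, codons, residues)  # 4287 1429 1428
```

The result matches the stated Length of 4287 bp, and 1428 residues is the length of the protein sequence listed in this record.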

DNA sequence :
ATGACCGAAGCGGCCCGCGTCGGCGATACCATCGGGCACTCCCATGCCCTGGCCGGCATGATTGCCGGCACCATTGTCGG
CGGCCTGATCGCCGCCGCAGGCGCATTGGCAGCGGGGGCGCTGTTCGTCGCCGGCCTGGCGGCTTCCTGTATCGGCGTCG
GCGTGCTGCTCATCGGCGCCAGCCTGGCGGTGGGGTATCTCACCGGGGAGGCGGCCACCGCGGCGCGTGACGGCATTGCC
GACGCCGGTGCCGGTAGCCTGACCCCCAAAGGCAATATCGTCACCGGCTCCCCCAACGTCTTCATCAACGGCAAACCCGC
CGCGCTCGCCACCAACAGCCAGGTGGCCTGCAGCGACGACGGCCCGAGCATGCAGATGGCGCAGGGCTCCGACAAGGTCA
GCATCAACGGCCAGCCCGCCTCGCGCGTGGGGGACAAAACCAACTGCGACGCGCAGGTGATGGAAGGTTCGCCCAACGTG
TTTATCGGCGGCGGCACCGTCACCACCTTGCCGATCAAGCCCGAAGTGCCGGATTGGCTGTACAAGGTCTCTGACCTGAC
GCTGCTGTTCGCCGGCCTGGTGGGCGGCGTGGGCGGCGCCGCCGGCAAGCTGGGGGCGCTCGGCAAACTGCTGGGCAAGC
TGCCCGGCATCAACAAACTGGCGCGCATCGCCTGCCGCGCCGGCACCCTGATGACCGCCACCGCCGCGGTCGGCATTATC
GCCCGGCCGGTGGATATCGTCAGCGGCCAGAAATTCCTCGACGGCGAAGACGATCTCGATTTCGTGCTGCCTTCGCGCCT
GCCGGTCGCCTGGCAGCGCTATTGGCGCAGCGGCAACCCCGGCGACAGCGTACTGGGCCGCGGCTGGAACCTGTTTTGGG
AGAGTAGCCTGCAGCCGTATCAGGACGGCCTGGTGTGGCGTGCGCCTTCCGGCGACTTCGTCGCTTTCCCTATGGTGCCG
CGCGGCCACAAAACCTACTGCGAAGCCGAAAAATGCTGGCTGATGCACAACGACGACGGCAGCTGGCAGCTGTTTGACGT
CGGCGAACAGATTTTCCACTACCCGCCGCTGGCGGGCGACCAGCCGAGCCGGCTCAGCATGATCACCGACGCCATCGGCA
ACGCCACCTCGCTGTTCTACGACGACGAGGGGCTGCTGAGCGAACTGGTGGACAGCGCCGGGCAGCGCCTGATGTGCCGC
TATGCACAGGGCCGTCTGCGCGAAGTGGCGCTGCAAACCGCCGACGGTGAACGAACGCTGGCGCGTTATGGTTACGATGA
GCAGGGCCAGCTGACGACGGTGAGCAACCGCGCCGGTGAGGTGACGCGGCGCTTTGGCTGGCGCGACGGCCTGATGATCA
GCCACCAGGACGCCGCCGGGCTGCTGAACGAATACCAATGGCAAGAGATCGACGGCGTGCCGCGCGTGACGGCCTACCGC
AACAGCGCCGGGGAATCCCTTGAGTTCGGTTACGACTTCGCCGGCGGGCGCCGCAGCGCGGTGCGCGGCGACGGCAAACG
GGCGGAGTGGCGGCTGGATGACGACGACAACGTGGCGCAGTACACCGATTTCGACCAGCGCCGCTACGGTTTTATCTACC
AACGCGGCGAGCTGTGCAGCGTGCTGCTGCCCGGCGGCGCGCAGCGCCAGAGCGAATGGGATCCCTACGGCCGCCTGCTG
TCGGAGACCGATCCGCTCGGCCGCGTGACGCGCTATCAATATTCGCGCAACAGCGGCCGGCTGTTCGCCGTCGCCTACCC
GGACGGCAGCAGTGAGGCGCAGCATTGGGACACGCTGGGGCGCCCGACGCGCTATGTCGATGCGCTGGGCAATGCCACGC
TGTACCGTTATCCGGATGACGAAGAGAGCCTGCCCGCTAGTATGATCGACGCGCTGGGCGGGGAAGTGAAGCTGGAGTGG
GACGCCCGCGGCCAGCTGACGCGCTACACCGACTGTTCCGGCAGCGTGACGGCTTATACCTACGATGCGCTGGGGCAACT
GACGGCGCAGACCGATGCCGAAGGCCATCAGACGCGCTACTTGTGGGATAACGGCGGCCGCCTGCATACCCTGATCCACC
CGGACGGCGGCGAAGAGCGCTTTAACTGGAATGCGCACGGCCAGCTGGCCGAGCATCAGGACGCGTTGGGCAGCCTGACC
CGCTGGCAGTACAACGCCCTGGGGCTGCCGGTCAGCATCACCGATCGCATCAACCGCACCCGGCGCTATCACTACAGCCC
GCAGGGCTGGCTGACACGGCTGGAGAACGGCAACGGCGGCGAATACCGCTTCAGCTACGATGCGGTGGGCCGCGTGCTGA
CCGAAGAGCGCCCGGACGACACCCGCCACGATTATCGCTACGGCGCGGCCGGGCTGCTGGAGGAGCACCGCGAGGTCGGC
CTGCCGGGCAGCGCAGGTGAGCTGACGCAGCGCGAACAGCGCTTCCGCTTCGACGAGGCGGGGCAGCTGGTCTGGCGCGG
CAACGCCAGCGCGGAATGGCATTACCGTTTCGACGCCATGGGGCGGCTGCGCGAGCTGAACCGCCTGCCGACGGCGAGCG
GCGCGGCGCTGGGCATCGAGCCGGACAGCGTGCAGATGCGCTATGACGCCGCCGGGCGGCTGCTCGGCGAACAGGGCGTG
AACGGCGAGCTGCAATACCAGTGGGATGCGCTGGCCAACCTGCAGGCGCTGACGCTGCCGCAGGGCGATCGCCTGCAGTG
GCTGTATTACGGCTCCGGCCACGCCAGCGCCATCAAATTCAACCAGCAGGTGGTGAGCGAGTTCACCCGCGATCGCCTGC
ACCGGGAAACCGGCCGTAGCCAGGGCGCGCTGCAGCAGCAGCGGCGTTACGACGCCATGGGCCGCCGCAGCTGGCAGAGC
AGCGCCTTCGGCCACGATAAACTGACCCGGCCGGAAGACGGCGTGCTGTGGCGCGCCTATCGCTACACCGGCCGCGGCGA
GCTGGCCGGGGTGAGCGACGCGCTGCGCGGCGAAGTGCACTATGGTTACGACGCCGAAGGCCGCCTGCTGCAACACCGCG
AACCCAATCAGGGCAAACCGGGCGCACGGCTGGTGTACGATCTGGCGGATAACCTGCTGGGCGAACGCAGCCCGCAGAGC
GACATCGACGCGCACCTGCCGCTGGCGCCGATCGCCGACAACCGCCTGACGCACTGGCAAAAACTGTTTTACCGTTACGA
CGCCTGGGGCAACCTGATCAGCCGGCGCAACGGCCTGTACGAACAGCACTACCGCTACGACGCCGACAACCGGCTGGTGC
AGGCGCACGGCCGCGGCCCGCAGGGTGAGTTCGAGGCGCAGTATCATTACGACGCGCTGGGCCGCCGCAGCCGCAAAACG
GTGCGCTATAAGGGCAAAACCGAACAGACGACCCGTTTCCTGTGGCAGGGCTACCGGTTGCTGCAGGAGCAGCGCGACGA
CGGCAGCCGCCGCAGCTGGAGCTACGATCCGGCCAGCCCGTGGAGCCCACTGGCGGCGCTGGAGCAGGCGGGCGACAGCC
GCTCGGCGGATATTTACTGGTATCACACCGATCTGAACAGCGCGCCGCTGGAAGTGACCGACGCAGCGGGCAACCTGTGC
TGGTCCGGGCAGTACGACACCTTCGGCAAGCTGCAGGGCCAGACGGTAGCCGGCGCGGCGAAGCGGCAGGGCGTGCAATA
CCAGCAGCCGCTGCGCTACGCCGGGCAATATCAGGACGACGAAAGCGGCCTGCACTACAACCTGTTCCGCTACTACGAAC
CCGAGGTGGGGCGTTTCACCACGCAGGATCCGATAGGGCTACGCGGCGGGTTGAACCTTTATAGATATGCTCCAAATCCT
TTGGGATGGATAGATCCTCTAGGACTTAGTGGCTTAAATAGCCTTCAGCAACAAATTGATGAGATTCTATTGGAACATCT
TCCTGCTATACAGAAGATAGATCCAAACGCAACCGTTGGATATAGAGGTAGTGCTGCAAGTGGAATAAGTAAGTCCCATG
ATCCTGCGATTGCTAGACCTATTAACATGAATGATTTCGATGTTGATGGATTTATTAAGTCGGATTACTTAGCTAGTTCA
CCTGAGTTTAGGAACAGACGCCGGGACGCATCTAAGCTTGGGGGGATGAAATCAATAGAGGAGTCTATCGATTCGAAGTT
AAGGCAGAAATTCCCAGGGTTAAGAAATGAACCATTTGGGTTTAGGGTGTTCTACACCCATGAGTTAGATGATCTGGCTA
GAAAAGGCGACGTCCAAAGAAGGTTAGGGCGTTCCGGATGTTCATGA

Protein sequence :
MTEAARVGDTIGHSHALAGMIAGTIVGGLIAAAGALAAGALFVAGLAASCIGVGVLLIGASLAVGYLTGEAATAARDGIA
DAGAGSLTPKGNIVTGSPNVFINGKPAALATNSQVACSDDGPSMQMAQGSDKVSINGQPASRVGDKTNCDAQVMEGSPNV
FIGGGTVTTLPIKPEVPDWLYKVSDLTLLFAGLVGGVGGAAGKLGALGKLLGKLPGINKLARIACRAGTLMTATAAVGII
ARPVDIVSGQKFLDGEDDLDFVLPSRLPVAWQRYWRSGNPGDSVLGRGWNLFWESSLQPYQDGLVWRAPSGDFVAFPMVP
RGHKTYCEAEKCWLMHNDDGSWQLFDVGEQIFHYPPLAGDQPSRLSMITDAIGNATSLFYDDEGLLSELVDSAGQRLMCR
YAQGRLREVALQTADGERTLARYGYDEQGQLTTVSNRAGEVTRRFGWRDGLMISHQDAAGLLNEYQWQEIDGVPRVTAYR
NSAGESLEFGYDFAGGRRSAVRGDGKRAEWRLDDDDNVAQYTDFDQRRYGFIYQRGELCSVLLPGGAQRQSEWDPYGRLL
SETDPLGRVTRYQYSRNSGRLFAVAYPDGSSEAQHWDTLGRPTRYVDALGNATLYRYPDDEESLPASMIDALGGEVKLEW
DARGQLTRYTDCSGSVTAYTYDALGQLTAQTDAEGHQTRYLWDNGGRLHTLIHPDGGEERFNWNAHGQLAEHQDALGSLT
RWQYNALGLPVSITDRINRTRRYHYSPQGWLTRLENGNGGEYRFSYDAVGRVLTEERPDDTRHDYRYGAAGLLEEHREVG
LPGSAGELTQREQRFRFDEAGQLVWRGNASAEWHYRFDAMGRLRELNRLPTASGAALGIEPDSVQMRYDAAGRLLGEQGV
NGELQYQWDALANLQALTLPQGDRLQWLYYGSGHASAIKFNQQVVSEFTRDRLHRETGRSQGALQQQRRYDAMGRRSWQS
SAFGHDKLTRPEDGVLWRAYRYTGRGELAGVSDALRGEVHYGYDAEGRLLQHREPNQGKPGARLVYDLADNLLGERSPQS
DIDAHLPLAPIADNRLTHWQKLFYRYDAWGNLISRRNGLYEQHYRYDADNRLVQAHGRGPQGEFEAQYHYDALGRRSRKT
VRYKGKTEQTTRFLWQGYRLLQEQRDDGSRRSWSYDPASPWSPLAALEQAGDSRSADIYWYHTDLNSAPLEVTDAAGNLC
WSGQYDTFGKLQGQTVAGAAKRQGVQYQQPLRYAGQYQDDESGLHYNLFRYYEPEVGRFTTQDPIGLRGGLNLYRYAPNP
LGWIDPLGLSGLNSLQQQIDEILLEHLPAIQKIDPNATVGYRGSAASGISKSHDPAIARPINMNDFDVDGFIKSDYLASS
PEFRNRRRDASKLGGMKSIEESIDSKLRQKFPGLRNEPFGFRVFYTHELDDLARKGDVQRRLGRSGCS
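
The protein sequence above should be the translation of the DNA sequence. The minimal sketch below shows how that can be checked on the first and last codons. It assumes the standard genetic code, whose codon assignments are the same as in the bacterial table for elongation. `translate` and the table constants are illustrative names, not part of any database tool.

```python
# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a plus-strand coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks from the wrapped listing
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 18 nt of the DNA sequence in this record.
print(translate("ATGACCGAAGCGGCCCGCGTCGGCGATACC"))  # MTEAARVGDT
print(translate("CGTTCCGGATGTTCATGA"))              # RSGCS
```

Both fragments agree with the start (MTEAARVGDT) and end (RSGCS) of the listed protein. Passing the full DNA sequence to the same function would extend the check to the whole record.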

• Homologs from PAI DB

Gene             GenBank Accn    Product                                       Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity
YpsIP31758_3692  YP_001402646.1  RHS/YD repeat-containing protein              Not tested               YAPI        Protein         0.0    50
api89            CAF28563.1      putative membrane-bound sugar-binding protein  Not tested               YAPI        Protein         0.0    49