Gene Information

Name : D781_0050 (D781_0050)
Accession : YP_007342637.1
Strain : Serratia marcescens FGI94
Genome accession: NC_020064
Putative virulence/resistance : Unknown
Product : RHS repeat-associated core domain protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 55417 - 59625 bp
Length : 4209 bp
Strand : +
Note : PFAM: RHS protein; PAAR motif; RHS Repeat; TIGRFAM: RHS repeat-associated core domain; YD repeat (two copies)

DNA sequence :
ATGAGCGAAGCGGCACGCGTCGGCGATGCGATTGGACATTCCCATGCGCTGGCCGGGATGATTGGCGGCACGATTGTCGG
CGGCCTGATTGCTGCCGCCGGCGCGGTGGCAGCGGGGGCGCTGTTCGTTGCCGGCCTGGCGGCGTCCTGCGTGGGCGTCG
GTGTATTACTGGTGGGCGCCAGCCTGGCGGTGGGTTATCTCACCGGAGAGCTGGCGGCCAAGGCGCGAGATGGCATTGCG
GAGGCCGGCGCAGGGAGCCTGACGCCGGCCGGCAAGATAGTGACCGGCTCGCCGGATGTGCGCATCAATGGCAAGCCGGC
GGCCATCGCCACCGTCAGCCGGGTTGTCTGCGAGCAGGATGGTCCCAGCATGCAGATGGCGCAGGGTTCGGACAAGGTGT
ATATCAACGGCCGGCCGGCGGCGCGCGTGGGGGATAAAACTAACTGTGACGCCAAGGTGATGGAAGGCTCACCCAACGTG
CGCATCGGCGGCGGCACTGTTACCACCCTGCCGATAAAACCGGAAGTACCGGACTGGGTGTATAAGATTTCCGACCTGAC
GCTGCTGTTTGCCGGCCTGGTAGGTGGCGTCGGCGGCGCGGCCAGCAAGCTGGGCGCGCTGGGCAGAATGCTGAGCAAGG
CGCCCGGCATCAACAAACTGGGGCGAACCGCCTGCCGGTTCGGCACGCTGATGACCGCCACGACCGCCGCCGGCATTATC
GCCCGGCCGGTTGATATCATCAGCGGCCAGAAACTCCTGTCCGGTGACGACGAGCTGGACTTTGTGCTGCCATCGCGCCT
GCCGGTTGAGTGGCAGCGCTACTGGCGCAGCGGCAACCCGGCAGAGAGCGTACTGGGCCGTGGCTGGAGCCTGTTCTGGG
AAAGCAGCCTGCAGACCTACCAGGATGGCCTGGTGTGGCGTGCGCCGTCCGGCGACTATGTCTCCTTCCCGATGGTGCCC
AAAGGGCATAAAACCTACTGCGAAGCGGAAAAGTGCTGGCTGATGCACAACAGCGACGACAGCTGGCAGGTGTTCGACGT
CAGCGAGCAGGCCTGGCACTATCCGGCCTTATCCGATGAACAACCGAGCCGTCTGCAGATGGTGACTGACCTCGCCGGCA
ACGCGGTTTCCCTGTTCCACGATGACCATGGCCGGCTGACCGAGCTGGTCGACAGCGCCGGTCAGCGGCTGGCGTGTCGC
TACCTGACCATCGCCAACGGGCTGTCCCGTCTGAGCACGGTGCTGCTGCATACCCCGGATGGCGAGCTGCCGCTGGTGCA
CTACGCCTACGATGAGGAAGGGCAACTGGTGACCGTCAGCAACCGCGCCGGCGAGGTCACCCGACGCTTTGGCTGGCAGG
ACGGACTGATGGTCAGTCATCAGGACCACAACGGCCTGCTGAACGAGTACCGCTGGCAGGAGCTTGACGGCCTGCCGCGG
GTAGTGGCCTACCGCAACAGCGCCGGCGAGCAGTTGGAACTCTATTATGATTTTGCCGGCGGCATGCGGCGGGTGGTGCG
CGATGATGGCCGGCAGGCATTGTGGCAGTTGGATGACGACGACAACGTGGCGCAGTTCACCGACTTTGACAGCAGAAAAT
CGGTGTTTATCTATGAACGGGGCGAGCTGTGCGGCGTGGTGCTGCCGGGCGGTGCGCAGCGTCAGAGCGAGTGGGACCGT
TACGGCCGCCTGCTGAGCGAGACCGACCCGCTGGGGCGCACCACCACCTATCAATACTCCCGCAACAGCGGCCGCCTGTT
CTCGGTCACTTATCCGGACGGCAGTCAGGCATTTCAGCACTGGGACACTCAGGGGCGCCCGACGCAGCAGATTGATGCGC
TGGGCAACGTCACCCGCTATCACTATCCCGACGAGGAAGAGAGCCTGCCGGAATGCGTCATCGATGCGCTGGGCGGCGAG
GTGAAACTGGTCTGGAACGCACAGGGACTGCTGACGCGCTATACCGACTGTTCCGGCAGCGTCACCGCCTACGCCTATGA
TGCGCTGGGCCAGCTGACGCACCGCACCGATGCGGAAGGTCACCTGACCCGTTACCGCTGGGACCGTGCCGGCCGACTGC
AGACGCTACTGCATCCGGACGGCAACGAAGAACTATTCGACTGGAATGCGCAGGGCCAGCTGGCCCGACACCAGGACCCG
CTCGGCAGCGAGACGCGCTGGCAGTACAACCTGCTGGGGCAGCCGGTCAGCGTCACCGACCGCATCCAGCGTACGCGCCG
TTACCACTATAACTGCCGCGGCTGGCTGACGAGACTGGAAAATGGCAACGGTGCCGATTACCAGTTCAGCCACGATGCGG
TTGGTCGGCTGATGATTGAACGGCGGCCGGACAGCATCGAGCGTTTATATCGTTATGGCCCCGATGGCCAGCTGAGCGAA
TACCGGGAAGTCGCGTCACCGGACGTAGAGCCGCAGCCGTCCCCGCGGCTGCACCAATTCCGTTATGACGAGGCCGGCCA
GCTGGTCTGGCGAGCCAACGACAGCGCCGAGTGGCATTACCACTATGATGCGGCCGGGCGCATGAACCGCCTGACGCGCA
CGCCGACCGCTGCCGGCAGCGAACTGGGTATCGAGCCGGACAGCGTGCAGCTGCGCTACGACCAGGCCGGCAACCTGCTG
AGCGAGCAGGGCGTCAACGGTGAGCTGCAGTACCAGTGGGATGCCTTGTCCAACCTGCAGACGCTGACGCTGCCGCAGGG
CGACCGGTTTCAGTGGCTCCATTACGGCTCCGGCCACGCCAGCGCCATCAGGTTTAACCAGCAACTGGTGAGTGAATTCA
GCCGCGACCGGCTGCACCGGGAAACCGGCCGTACGCAGGGTGCGCTGCAGCAGCGGCGCCAATACGATGCGCTGGGGCGG
CGCAGTTGGCAAAGTAGTGCTTTCAGCCACGGACAGATAACCAAACCGGAAGAGGGCGTGCTGTGGCGCACGTTCCGCTA
TACCGGGCGCGGCGAACTGGAAGGTGTCGGTGACGCGCTGCGCGGTGAGATTCATTATGGCTACGACGCGGAAGGACGCC
TGCTGCAGCACCGTGAGGCGCAGCAGGGCAGGCCGGGCCACCGGCTGCGCTACGACCTGGCGGATAATCTGACCGGTGAG
CAGCGCGTCAGCCGGGACCCGGATACCGACCTGCCGCCGGCGCCGGTGGTCAATAACCGGCTCGAATACTGGCAGCGGAT
GTTTTACCGCTATGACGGCTGGGGCAACCTGACCCGCCGGCGTAACGGGGTTTACCAACAGCATTACGTCTATGATGCCG
ACAACCGGCTGATAAAGGCGCACGGCCGCGGCCCGCAGGGTGATTTTGAGGCGCAGTATCACTATGACGCACTGGGACGC
CGCACGCGCAAGACGGTGACTCTCAAGGGGAAAGCGCCGGAAACCACGCGCTTCGTGTGGGAGGGCTACCGGCTGCTGCA
GGCGCAGCGGGACAACGGCACGCGCCGCACCTGGAGCTATGACCCGGCAAGCCGGTGGACGCCACTGGCAGCCCTCGAAC
AGGCGGGCGATGGCCAGCAGGCGGATATCTACTGGCTGCACACCGACCTTAACAGCGCGCCGCTGGAGGTTACGGATAGC
GAGGGTAACCTGCGCTGGTCCGGCAACTACGATACCTTCGGCAAACTGCAGGGGCAGACGGTCGCCGGCGCCGAACGGCG
CAAGGGCGCGCTCATTGAGCAGCCGCTGCGTTATGCCGGCCAGTACCAGGATAACGAAAGCGGGCTGCATTATAATCTGT
TCCGTTACTATGAGCCGGAGGTAGGGCGTTTTACCACGCAGGATCCGATAGGATTGCGCGGCGGGCTGAACCTGTATCAA
TATGCGCCAAACCCGTATGGATGGGTGGATCCGTTAGGGTTGAAATCATGTGGCCCAACGCGCACTAGACATGTACCTAA
CCGTCATATCCGGCGTCATAATAATATTGGTAATAGCAAGTTCTCCTTGAGAGAGAGAATAAAACTTCAAGGTAAGAGTA
AATATAAAAAACCTAATCAATATAGAAAGTTAGAAGATAGAACAATGAGTAATCCAAGCCGAGTAATACATCAAGGGGAT
GGAAGGATTAGGTATGAGAGAGATTATGATCGTGTCATTGGTACAAGAGGTGAGCAAGGGCATGTGACCGTTTATGATCC
AGTAAAGGACAAGATTATCACATCTTACCCAGCTCATTTAGAGGATTAA

Protein sequence :
MSEAARVGDAIGHSHALAGMIGGTIVGGLIAAAGAVAAGALFVAGLAASCVGVGVLLVGASLAVGYLTGELAAKARDGIA
EAGAGSLTPAGKIVTGSPDVRINGKPAAIATVSRVVCEQDGPSMQMAQGSDKVYINGRPAARVGDKTNCDAKVMEGSPNV
RIGGGTVTTLPIKPEVPDWVYKISDLTLLFAGLVGGVGGAASKLGALGRMLSKAPGINKLGRTACRFGTLMTATTAAGII
ARPVDIISGQKLLSGDDELDFVLPSRLPVEWQRYWRSGNPAESVLGRGWSLFWESSLQTYQDGLVWRAPSGDYVSFPMVP
KGHKTYCEAEKCWLMHNSDDSWQVFDVSEQAWHYPALSDEQPSRLQMVTDLAGNAVSLFHDDHGRLTELVDSAGQRLACR
YLTIANGLSRLSTVLLHTPDGELPLVHYAYDEEGQLVTVSNRAGEVTRRFGWQDGLMVSHQDHNGLLNEYRWQELDGLPR
VVAYRNSAGEQLELYYDFAGGMRRVVRDDGRQALWQLDDDDNVAQFTDFDSRKSVFIYERGELCGVVLPGGAQRQSEWDR
YGRLLSETDPLGRTTTYQYSRNSGRLFSVTYPDGSQAFQHWDTQGRPTQQIDALGNVTRYHYPDEEESLPECVIDALGGE
VKLVWNAQGLLTRYTDCSGSVTAYAYDALGQLTHRTDAEGHLTRYRWDRAGRLQTLLHPDGNEELFDWNAQGQLARHQDP
LGSETRWQYNLLGQPVSVTDRIQRTRRYHYNCRGWLTRLENGNGADYQFSHDAVGRLMIERRPDSIERLYRYGPDGQLSE
YREVASPDVEPQPSPRLHQFRYDEAGQLVWRANDSAEWHYHYDAAGRMNRLTRTPTAAGSELGIEPDSVQLRYDQAGNLL
SEQGVNGELQYQWDALSNLQTLTLPQGDRFQWLHYGSGHASAIRFNQQLVSEFSRDRLHRETGRTQGALQQRRQYDALGR
RSWQSSAFSHGQITKPEEGVLWRTFRYTGRGELEGVGDALRGEIHYGYDAEGRLLQHREAQQGRPGHRLRYDLADNLTGE
QRVSRDPDTDLPPAPVVNNRLEYWQRMFYRYDGWGNLTRRRNGVYQQHYVYDADNRLIKAHGRGPQGDFEAQYHYDALGR
RTRKTVTLKGKAPETTRFVWEGYRLLQAQRDNGTRRTWSYDPASRWTPLAALEQAGDGQQADIYWLHTDLNSAPLEVTDS
EGNLRWSGNYDTFGKLQGQTVAGAERRKGALIEQPLRYAGQYQDNESGLHYNLFRYYEPEVGRFTTQDPIGLRGGLNLYQ
YAPNPYGWVDPLGLKSCGPTRTRHVPNRHIRRHNNIGNSKFSLRERIKLQGKSKYKKPNQYRKLEDRTMSNPSRVIHQGD
GRIRYERDYDRVIGTRGEQGHVTVYDPVKDKIITSYPAHLED

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
api89 CAF28563.1 putative membrane-bound sugar-binding protein Not tested YAPI Protein 0.0 46
YpsIP31758_3692 YP_001402646.1 RHS/YD repeat-containing protein Not tested YAPI Protein 0.0 45