Gene Information

Name : D781_2006 (D781_2006)
Accession : YP_007344474.1
Strain : Serratia marcescens FGI94
Genome accession : NC_020064
Putative virulence/resistance : Unknown
Product : RHS repeat-associated core domain protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2106820 - 2111082 bp
Length : 4263 bp
Strand : +
Note : PFAM: RHS protein; PAAR motif; RHS Repeat; TIGRFAM: RHS repeat-associated core domain; YD repeat (two copies)
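
The Position and Length fields above agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal sketch of the check, using a hypothetical helper named `feature_length`:

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Values copied from the Position and Length fields of this record.
START, END, REPORTED_LENGTH = 2106820, 2111082, 4263

length_bp = feature_length(START, END)
codons = length_bp // 3   # number of codons, including the terminal stop codon
residues = codons - 1     # amino acids once the stop codon is excluded

print(length_bp, length_bp == REPORTED_LENGTH, codons, residues)
```

Under that assumption the computed length matches the reported 4263 bp, which is a multiple of three: 1421 codons, or 1420 residues once the stop codon is excluded.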

DNA sequence :
ATGGGTGAAGCAGCGCGCGTCGGGGACAGCATCGGCCATTCCCATGCGCTGGCAGGGATGATTGGCGGCACGCTTATCGG
CGGCCTGATTGCCGCCGCCGGTGCGGTGGCGGCGGGCGCCCTGTTTGTCGCCGGGCTGGCGGCCTCCTGCGTGGGCGTCG
GCGTGCTGCTGATTGGCGCCAGCCTGGCGGTGGGCTACCTCTCCGGCGAGCTGGCCACCCAGGCGCGCGACGGCATTGCC
GCCGCCGGCGCCGGCAGTCTGTCGCCCAAGGGCACGATACTGACCGGCTCCGGCAACGTGTTTATCAACGGCAAACCGGC
GGCCATCGCCACCGTCAGCCGCGTGGTCTGCGAGGATGACGGCCCGAGCATGCAGATGGCGCAGGGCTCGGACAAGGTGT
TTATCAACGGCTACCCGGCGGTGCGCAGCGGGGACAAGACCAACTGCGACGCGCAGGTGATGGCCGGCTCGCCCGATGTG
CGCATCGGCGGCGGCACCGTCACCACGCTGCCCATCAAACCGGAAGTGCCCGACTGGCTGTATAAAATCTCTGACCTGAC
GCTGCTGTTCGCCGGCCTGATAGGCGGCGTCGGCGGCGCGGCCAGCAAGCTGGGGGCGCTCGGGCGCATGCTCAGCAAGG
CGCCGGGCATCAACAAGCTTGGCCGCGTGGCCTGCCGCGCCGGTGCATTGATGACCGCCACCGCGGCGACGGGCATTATC
GCCCGTCCGCTGGACGTGGTCAGCGGCCAGAAATTTCTCGACGGCGACGACGAGCTGGACTTTGTGCTGCCGTCGCGCCT
GCCGGTGGCGTGGCAGCGCTACTGGCGCAGCGGCAACCCGGCGGAGGGCGTGCTGGGGCGCGGCTGGAGCCTGTTCTGGG
AGAGCAGCCTGCAAATCTGGCAGGAGGGGCTGGTGTGGCGCGCGCCGTCCGGCGATTATGTCTCCTTCCCGATGGTGCCG
CGTGGCCACAAGACCTACTGCGAAGCGGAAAAATGCTGGCTGATGCATAACGCCGACGGCAGCTGGCAGGTGTTCGACGT
CAGCGAACAGGCGTGGCACTATCCGCGGCTGGAGGCGCAGCAGCCGAGCCGGCTGAGCATGATGACCGACGCCGTCGGCA
ACGCCACCTCGCTGTTCTATAACGACGCGGGGCAACTGAGTGAGCTGGTGGACAGCGCCGGGCAGCGTCTGGTCTGCCGC
TACCTGACGACGGCCAACGGTGCGTTGCGGCTGGCGGCGGTAGCGTGGCAAAACGGCCAGGATGAGCAGGTGCTGGCGAG
TTACGGCTACGACGACGCGGGGCAGCTGGTCACCGTCCGCAACCGCGCCGGCGAGGTGACGCGGCGTTTTGGCTGGCAGG
ACGGGCTGATGGTCAGCCATCAGGACCAGAACGGCCTGCTGAACGAATACCGCTGGCAGGAGATTGACGGCCTGCCGCGG
GTGGTGGCCTACCGCAACAGCGCCGGCGAGCAGCTGTCCGTCTATTATGATTTCGCCAACGGGACGCGCCGGGCGGTGCG
TGACGACGGCAAACAGGCGCTGTGGCAGCTGGATGATGACGACAACGTCGCGCAGTTCACCGACTATGACAGCCGTCGCT
ACGGCCTTATCTACGCGCGCGGCGAACTGTGCAGCGTGGTGCTGCCGGGCGGTGCGCAGCGGCAGAGCGAGTGGGACCCG
TACGGCCGCATGCTGAGCGAGACCGACCCGCTGGGCCGCACCACCACCTACCAATATTCCCGCAACAGCGGCCGCCTGTT
CTCGGTCACCGGGCCTGACGGTAGCCAGGCGTTCCAGCACTGGGACGAGCAGGGGCGTCTGGTCAGACAGACGGACGCGC
AGGGGCAGAGCACGCATTACCACTACCCGGACCCGGAAGAGAGCCTGCCGGAGCGCATCACCGACGCGCTGGGCGGTGAA
GTGCAGCTGGTGTGGAACGCACAGGGGCAGCTGACGCGCCATACCGACTGTTCCGGCAGCGTCACCGCCTACACCTATGA
CGCGCTGGGCCAGCTGACGCACCGCACCGATGCGGAAGGCCACCTGACCCGCTACCGCTGGGACGCCGCCGGCCGGCTGC
AGCAGCTGCGCCATCCGGACGGCAGCGACGAGCAGTTTGACTGGAACGCGCAGGGCCAGCTGGCCGCGCACCGGGACCCG
CTCGGCAGCGAGACGCGCTGGCAGTACACCCCGCTGGGCCTGCCGGACAGCATCACCGACCGCATCAACCGCACGCGTCG
TTATCACTACGGCCCGCGCGGCTGGCTGATGCGGCTGGAGAACGGCAACGGCGCCGACTACCAGTTCAGCTACGACGCGG
CAGGCCGCCTGCAGGTGGAACAGCGGCCGGACGGGCAGCGCCGTTACTACCATTACGGCGCCGACGGGCTGCCGACGACG
CTGCTGGAAACCGGCGCACCGGTCGCCGATGGCGCGGTGGCAGAGCGGCGGCAGCACTTCCGCTTTGATGAGGCGGGCCA
GCTGACGGCGCGCACCACCGACAGCGCCGAATGGCGCTACGACTACGACGCCGGCGGACGGCTGACGACGCTGACGCGCA
CGCCGACCGCCGCCGGTGCGGCGCTGGGCATCGAGCCGGACAGTATCCGGCTGCGCTACGACCGCGCCGGCAACCTGCTG
AGCGAGCAAGGCGTTAACGGCGAGCTGCAGTATCAGTGGGACGCGCTGGGCAACCTGCAGGCGCTGACGCTGCCGCAGGG
CGACCGGCTGCAGTGGCTCTATTACGGTTCCGGCCACGCCAGCGCCATCAGGTTCAATCAGCAGCTGGTGAGCGAGTTCA
GCCGCGACCGGCTGCACCGGGAGACCGGACGTACGCAGGGGGCGCTGCACCAGCGGCGGCAGTACGATGCGCTGGGCCGC
CGCAGCTGGCAGAGCAGCGGCTTCAGCCACGGGCAACTGACGAAGCCGGAAGACGGCGTGCTGTGTCGGGTGTACCACTA
CAGCGGCCGCGGCGAGATAGCCGGCGTCGACGACGCGCTGCGCGGAGAAGTCCGCTACGGCTACGATGCGGAAGGACGCC
TGCTGCAGCACCGCGAGGCGCAGCAGGGCAAGCCGGGCCACCGCCTGCAGTACGACATGGCGGACAACCTGCTGGGCGCG
CAGAGCGCCAGCCGCGCGCTGGAGGAGCAGCTGCCGCCGGCGCCGCTGGGGGATAACCGGCTGACGCACTGGCAGCAGCT
GTTCTACCGTTACGACGGCTGGGGCAACCTGATAAGCCGGCGCAACGGGCTGTACGAGCAGCACTATGTCTACGATGCGG
ACAACCGGCTGACGGCAGCGCACGGTCGCGGCCCGCAGGGCGAGTTCCGGGCGCAGTATCACTACGATGCGCTGGGCAGG
CGCACGCGCAAGCAGGTGGACTACAAGGGCAAGGCGGCGCAGAGCGCGCGTTTCCTGTGGCAGGGCTACCGGCTGTTGCA
GGAGCAGCGGGACGATGGCACGCGCCGCAGCTGGAGCTATGAGCCGGACAGCCCGTGGACGCCGCTGGCGGCCATTGAGC
AGGCGGGGGAGAGCCGGCAGGCGGATATCTTCTGGCTGCACAGCGAGCAGAACGGCGCGCCGCTGGAGGTGACGGACGGT
GAAGGCGGGCTGCGCTGGTCGGGGGATTACGACACCTTCGGCAGGCTGAAGGGGCAGACGGCGGCGGGCATCATGCAGCG
TCGGGGGGCGGCCTATGAGCAGCCGCTGCGCTACGCCGGGCAGTATCAGGATAGCGAGAGCGGACTACACTATAATCTGT
TCCGCTACTACGAGCCGGAGGTAGGTCGCTTTACTACCCAGGATCCGATAGGGCTGCAGGGTGGGCTGAACCTGTACCAG
TATGCGCCGAACCCGTATGGGTGGGTGGATCCGTTGGGGTTGACTGCTTGTTCTATTAAGGCCGGACGTAATCGCCGAAT
GGCGATGAATAAAGCGAAGGATACGGCTGGAATTCCTAGATCACAGCAACATGAAAGTCATTGGCAGATAGGTAATGATC
GTAGAAAGCAGGGGTATAGTAATTATATTTATTCTGAAAACCCGGCAGAGCATGGGAAATTTTACCAATATAGAAATGCT
GAGGGACATAAAGTTGTAGTGGTAGAACATACGAGCGATCCGAGAAACAAGATTGGTGATGCACATTTCCATGCCGGCAG
AGCAAAAACATCACCGCACACTTATGATTTTAAGACCGAAAGGTATGGTAAGGTTCCGGCGGATAGTTCAGGAGACCATC
ACATATATTATGATTATGACTAA

Protein sequence :
MGEAARVGDSIGHSHALAGMIGGTLIGGLIAAAGAVAAGALFVAGLAASCVGVGVLLIGASLAVGYLSGELATQARDGIA
AAGAGSLSPKGTILTGSGNVFINGKPAAIATVSRVVCEDDGPSMQMAQGSDKVFINGYPAVRSGDKTNCDAQVMAGSPDV
RIGGGTVTTLPIKPEVPDWLYKISDLTLLFAGLIGGVGGAASKLGALGRMLSKAPGINKLGRVACRAGALMTATAATGII
ARPLDVVSGQKFLDGDDELDFVLPSRLPVAWQRYWRSGNPAEGVLGRGWSLFWESSLQIWQEGLVWRAPSGDYVSFPMVP
RGHKTYCEAEKCWLMHNADGSWQVFDVSEQAWHYPRLEAQQPSRLSMMTDAVGNATSLFYNDAGQLSELVDSAGQRLVCR
YLTTANGALRLAAVAWQNGQDEQVLASYGYDDAGQLVTVRNRAGEVTRRFGWQDGLMVSHQDQNGLLNEYRWQEIDGLPR
VVAYRNSAGEQLSVYYDFANGTRRAVRDDGKQALWQLDDDDNVAQFTDYDSRRYGLIYARGELCSVVLPGGAQRQSEWDP
YGRMLSETDPLGRTTTYQYSRNSGRLFSVTGPDGSQAFQHWDEQGRLVRQTDAQGQSTHYHYPDPEESLPERITDALGGE
VQLVWNAQGQLTRHTDCSGSVTAYTYDALGQLTHRTDAEGHLTRYRWDAAGRLQQLRHPDGSDEQFDWNAQGQLAAHRDP
LGSETRWQYTPLGLPDSITDRINRTRRYHYGPRGWLMRLENGNGADYQFSYDAAGRLQVEQRPDGQRRYYHYGADGLPTT
LLETGAPVADGAVAERRQHFRFDEAGQLTARTTDSAEWRYDYDAGGRLTTLTRTPTAAGAALGIEPDSIRLRYDRAGNLL
SEQGVNGELQYQWDALGNLQALTLPQGDRLQWLYYGSGHASAIRFNQQLVSEFSRDRLHRETGRTQGALHQRRQYDALGR
RSWQSSGFSHGQLTKPEDGVLCRVYHYSGRGEIAGVDDALRGEVRYGYDAEGRLLQHREAQQGKPGHRLQYDMADNLLGA
QSASRALEEQLPPAPLGDNRLTHWQQLFYRYDGWGNLISRRNGLYEQHYVYDADNRLTAAHGRGPQGEFRAQYHYDALGR
RTRKQVDYKGKAAQSARFLWQGYRLLQEQRDDGTRRSWSYEPDSPWTPLAAIEQAGESRQADIFWLHSEQNGAPLEVTDG
EGGLRWSGDYDTFGRLKGQTAAGIMQRRGAAYEQPLRYAGQYQDSESGLHYNLFRYYEPEVGRFTTQDPIGLQGGLNLYQ
YAPNPYGWVDPLGLTACSIKAGRNRRMAMNKAKDTAGIPRSQQHESHWQIGNDRRKQGYSNYIYSENPAEHGKFYQYRNA
EGHKVVVVEHTSDPRNKIGDAHFHAGRAKTSPHTYDFKTERYGKVPADSSGDHHIYYDYD
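
The protein sequence above can be spot-checked against the DNA sequence by translating the two ends of the coding region. The sketch below is illustrative only: `translate` is a hypothetical helper, and it assumes the standard codon assignments (which the bacterial genetic code shares) and a plus-strand reading frame starting at the first base. The two excerpts are the first 30 and last 24 nucleotides of the DNA sequence in this record.

```python
from itertools import product

# Standard codon table, built in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the DNA sequence of this record.
first_30_nt = "ATGGGTGAAGCAGCGCGCGTCGGGGACAGC"
last_24_nt = "CACATATATTATGATTATGACTAA"

n_terminus = translate(first_30_nt)
c_terminus = translate(last_24_nt)
print(n_terminus, c_terminus)
```

The two results correspond to the first ten and last seven residues of the listed protein sequence.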

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
YpsIP31758_3692 | YP_001402646.1 | RHS/YD repeat-containing protein | Not tested | YAPI | Protein | 0.0 | 48
api89 | CAF28563.1 | putative membrane-bound sugar-binding protein | Not tested | YAPI | Protein | 0.0 | 48