Gene Information

Name : XNC1_3348 (XNC1_3348)
Accession : YP_003713514.1
Strain : Xenorhabdus nematophila ATCC 19061
Genome accession: NC_014228
Putative virulence/resistance : Unknown
Product : Rhs-family protein
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3209
EC number : -
Position : 3272276 - 3276583 bp
Length : 4308 bp
Strand : +
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology

DNA sequence :
ATGCCAGAAGCAGCAAGGCTTGGGGATACGATAGGCCACTCCAGTGCAATGGCCGGGTTGATTGGCGGAACTATCATCGG
TTCCCTGATTTCAGCTGCAGGCGGGATTGCCGCCGGTTTTCTTTTTGTGGCCGGGGTTGCGGCCTCCTGCATTGGGGTCG
GTGTGTTGTTAATTGGTGCGTCCATTGCCGTGGGAATGGCGGCGGGGGCGTTAGGGGACAAAGCCCGTGATGCCTGTGTT
GCGGCAGGTGCCTCGTCACGAAGTCCCAGCGGAACCATTACCAATGGTTCCGCCAATGTTTTTATTAATAATAAACCGGC
TGCAATAGCAACACGAAGTACAGTGGCGTGCAGTAAAGAAGCGGGCATCCGCCAGATGGCACAAGGCTCTGATTCCGTTT
TTATTAATAGCTTACCTGCCTCACGGGTGGGCGATAAAACGACCTGTGATGCCGCCGTCATGAGCGGTTCACCGAATGTC
ATTATCGGTGGGGGGACAGCCGCCACAGAAAGTATCACATCGGAAATTCCCAAGTGGGCGTATACCGTTTCGGATTTAAC
CATGTTTGCTGCGGGATTGATCAGCTTTGGCGGGGCGGCATCTAAGGGACCCGGTGCATTACAAAAATTGTTCAGCAGAG
TACCGGGTGTCAGCAGAATCAGCAGAGTTGCCTGCCGTCTGGCTTGGCTCGGAGTGGCAATACCGGTGGTGGGGATTCTG
ACGAATCCTGTCGAAGTGATTGCCGGGCAAAAATTCCTGAATGATGACGATGAGCTGGATTTTGTTTTTGAGGCGGAATT
CCCGCTGTACTGGCAGCGTAGTTACCTGAGTCGTTATCAATACGAAAGCGCGTTGGGGCAGGGTTGGAATCTGTTCTGGG
AAAGCCGGTTAACCCGTGTTGAAGAGAGCATTCTCTGGCGCAATTTATCCGGCGATATTATCCCGTTTCCGGATGTACCG
GAAGGCCATCGCTGCTTTTGTCCTGATGCGCAAAGCTGGCTGATACACACGGAAGACGGTGAATGGGAAATCCGGGATGC
GGGCGAGCTGGTTTATCACTATGTTGCTTTTGACAATGGAGGTATCAGCCGGTTAAGCCATATCCGTGATAATGTCGGTA
ATGAACAGCGATTCCATTACAACGACCAGCAGCAGATGGTCAATATCACCGGTTGCGGCGCAATGAATCTGCACTGTGAT
TATGACGTCATGGAAATCGGGAACAAAACGGTGTCCCGGCTCACCACTGTCTGGCGGGAACTCTACAACGGCCAGCGTGT
GCGCCTTTGTCACTATCACTACGATGAGAACGCCCGGCTTATCGGTGTCAGCCATCGCAATGACCATCTCCAGCGTCAAT
TCGGCTGGACTGAACAGGGAATGATGGCATGGCATCAGGACACGCGCGGACTACGTTGTGATTATGAATGGGAACAAACA
GAAGACGGATTGTGGCGGGTAATAGCTCAGAAAACCAGTGAAGGCGCAGGCTACCGGCTGGATTATGATGATGAAAACCT
GACCCGCACAGCACACTGGTATGATGACTCCCGCACTGTCTGGACATTAAACGAAGACCATCAGATTGTTCACTGTGCTG
ACCGTAATGGCACAGAACATCATCTTTTATGGGATGAATTCGGCCTGCCGAACGGCTATAAAAATGCAGAAGGGCATTCC
CGTTCAGGAGAGTGGGATAAATCCGGTCGGTTGTTGAGCATGACCGACGGCAACGGCAATCAGACCCAATGGCAATACCA
GAATGATACGGACAGACTGACCTTCATTTTCTGGCCGGATGGTACAGAAACCGTGCTGGAATATGATGAATTCGGGCGTC
TGGTCAGCGAAACCACCCCTTTGAAACACACGACCCGTTATGATTACGGCTTAAAGACCCTGCGGCCTTCTCAGCGTACT
GATGCGAAAGGCGGCAACAGCCAATTTTTGTGGAATGGTCAGGGACAGTTAATCAGCCATGCCGACTGCTCAGGGCAACG
CAACACATGGGGTTATGATGATGAAAATCGGCTGAGCCGTTTTATCAATGCCCTGATGGAAAGCGTAAGTTATCGCTATG
ATGATAACGGTCAACTGGTGCTCGTAACCTATCCCGATGGTTCAACAGAACAGATGGCTTGGGATCGTGCCGGTCAGCTT
ATCTATCATCAGCGCAATGAAAATGCATCCCGTGGCTGGGAATACAATGCACTGGGACAGATAGTTTGTGCCTCTGACCG
CCTGCAACGACAGCTCCGCTATCAATACAACGCCGAAGGACATCTGGTACGGATTGAGAATGCCAACGGGGATCGTTATC
TGCTCAATCGCGATGCAGAAGGGCGACTGATTGAAGAAATCCGTCCTGACGATACCCTGATCCGGTACGAATACAATGCC
GCTGGCCTGTTGTGTACAGAAAAGCGTATGGGTGATCGCGTATTCCGGCAACCTGTCCGCGCGGTTTATCTGCATTATGA
TGCCGCAGACAACCTGATTAAACGGGAAACCGATACCGACAATTACCAGTACCAGTGGGATAACATGGATCGGTTACTGG
CCGCCACCCGTGAACCGAATGCCGCCGGAAAACAACTAGGATTATTGCCGAATACGGTCAGCTTTGCCTACGATGCGCTG
GGGCGGGTGATCCGGGAGCAGAACGGCGAGGATGTGCTCCAGTTCAGTCATGATGAAATGGATAACCTGACCGAACTGAC
CCTGCCACAAGGTGATACCCTGCGCTGGCTTTACTACGGTTCCGGTCATCTCAGTGCTATTCGCCATAATCAACAGATGA
TTACGGAGTTTGAGCGTGACAATTTGCATCGCGAAACCAGCCGGACACAAGGGAAGTTGTTCCAGTTCCGTGAATATGAC
CCTCTTGGACGCCGCATCAGCCAATACAGTGTCAGGGATAAATCAGCCACTATCGGACAGGGGAAACCGTGGCGCGCATG
GCATTACGACAGGCAGGATGACCTGTCCCTGATGGAAGACCATTACCGGGGCTGGGTGGAGTATCTGTATGACTCGGAAA
GCCGGTTGAAAAAAGTCACCAGTGTGGACAGCTTTGAAGAAATGCTGTGGTACGACCGCGCGGATAACCTGCTTGAACGT
CCGCAATCCATGCTGGAAAAAGAGGCAGAAAGTAATAAACACCTGACCCCACAAGGCGACAGGCTGACACAATGGAACCA
GTGGCGCTATGAACACGATGCGCATGGCAATGTTATCAGTCGTGGCACCAACGCCAGCAACCGGCAGACCTATCGCTACG
ATGGGGATAACCGCCTGACCCTTGCCGAAGGCAACGGCATGAAGGCCAGCTATCATTATGATGCCCTTGGGCGGCGCATT
CGTAAGGTGGTGACAACCTGGCCGACAGGGACGCCTCAGCAGGAACAGACGGATTTTGTCTGGCACGGCCTGAAATTGTT
ACAGGAACGCCATACCAATACGGGTAAAACCCAGACCTACTGTTACGAATCCCATGAAAGTTATACGCCGCTGGCTTGTA
TTGTTGCTAAAGGCACTGTGCATGATTATTTCTGGTATCACACTGATATCAACAGTGCACCGCTCGAAGTGACGGATGAA
GACGGCAAAATTGCGTGGTCAGGCAAATATGACGCTTTCGGTGCCGTCAACAGCACCACGATGGCCTATTTTACCGATAC
GGAACGTTCAACCCGCAACTTTGATCAGAACCTGCGTTATGCCGGGCAATATTTTGACAAAGAGACGGGGCTGCACTTTA
ATACTTACAGATTTTATGCCCCTGAAATTGGTCGGTTTATTTCGCCTGATCCGATAGGGTTGAATGGTGGGGTGAATTTA
TACGTTTATGCACCAAATCCACTTACATGGATCGACCCACTTGGGCTTGCTAAATTATTCGAGCTAGGTACATATGGTGA
ATTAAATGGGCCGACACATGTTGGTGATAAATTGCAAGCACATGAGTTACTGAGGCACGAATATTTACGTCAACAAGGAC
TTGCAGAAACATCACGATTATCAGGCAATCCATCAATTGCACTGGATTTAGATCACCACACTCGTGGCCCGCAAAAAGAT
ACTCGTGGTATTGGTGGTGCACACTGGCATGAAACTCAAATTAGGGCAAATGAAGGACTCGGAAAAAATGATTTTGCGTC
AACGTTGAAAAGAGAACTTGATATCACCTCTGGTGGTTTAAGAAAAGCTGGTGTTCCTGCTAGTAGAGTCAAAAGAATGA
GAAAACAAGCTGAGAAATTCTACAGGGGATTATCAAATAAGGTTAAAAATGCTGGCACTTGTAAATAG

Protein sequence :
MPEAARLGDTIGHSSAMAGLIGGTIIGSLISAAGGIAAGFLFVAGVAASCIGVGVLLIGASIAVGMAAGALGDKARDACV
AAGASSRSPSGTITNGSANVFINNKPAAIATRSTVACSKEAGIRQMAQGSDSVFINSLPASRVGDKTTCDAAVMSGSPNV
IIGGGTAATESITSEIPKWAYTVSDLTMFAAGLISFGGAASKGPGALQKLFSRVPGVSRISRVACRLAWLGVAIPVVGIL
TNPVEVIAGQKFLNDDDELDFVFEAEFPLYWQRSYLSRYQYESALGQGWNLFWESRLTRVEESILWRNLSGDIIPFPDVP
EGHRCFCPDAQSWLIHTEDGEWEIRDAGELVYHYVAFDNGGISRLSHIRDNVGNEQRFHYNDQQQMVNITGCGAMNLHCD
YDVMEIGNKTVSRLTTVWRELYNGQRVRLCHYHYDENARLIGVSHRNDHLQRQFGWTEQGMMAWHQDTRGLRCDYEWEQT
EDGLWRVIAQKTSEGAGYRLDYDDENLTRTAHWYDDSRTVWTLNEDHQIVHCADRNGTEHHLLWDEFGLPNGYKNAEGHS
RSGEWDKSGRLLSMTDGNGNQTQWQYQNDTDRLTFIFWPDGTETVLEYDEFGRLVSETTPLKHTTRYDYGLKTLRPSQRT
DAKGGNSQFLWNGQGQLISHADCSGQRNTWGYDDENRLSRFINALMESVSYRYDDNGQLVLVTYPDGSTEQMAWDRAGQL
IYHQRNENASRGWEYNALGQIVCASDRLQRQLRYQYNAEGHLVRIENANGDRYLLNRDAEGRLIEEIRPDDTLIRYEYNA
AGLLCTEKRMGDRVFRQPVRAVYLHYDAADNLIKRETDTDNYQYQWDNMDRLLAATREPNAAGKQLGLLPNTVSFAYDAL
GRVIREQNGEDVLQFSHDEMDNLTELTLPQGDTLRWLYYGSGHLSAIRHNQQMITEFERDNLHRETSRTQGKLFQFREYD
PLGRRISQYSVRDKSATIGQGKPWRAWHYDRQDDLSLMEDHYRGWVEYLYDSESRLKKVTSVDSFEEMLWYDRADNLLER
PQSMLEKEAESNKHLTPQGDRLTQWNQWRYEHDAHGNVISRGTNASNRQTYRYDGDNRLTLAEGNGMKASYHYDALGRRI
RKVVTTWPTGTPQQEQTDFVWHGLKLLQERHTNTGKTQTYCYESHESYTPLACIVAKGTVHDYFWYHTDINSAPLEVTDE
DGKIAWSGKYDAFGAVNSTTMAYFTDTERSTRNFDQNLRYAGQYFDKETGLHFNTYRFYAPEIGRFISPDPIGLNGGVNL
YVYAPNPLTWIDPLGLAKLFELGTYGELNGPTHVGDKLQAHELLRHEYLRQQGLAETSRLSGNPSIALDLDHHTRGPQKD
TRGIGGAHWHETQIRANEGLGKNDFASTLKRELDITSGGLRKAGVPASRVKRMRKQAEKFYRGLSNKVKNAGTCK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
YpsIP31758_3692 YP_001402646.1 RHS/YD repeat-containing protein Not tested YAPI Protein 0.0 55
api89 CAF28563.1 putative membrane-bound sugar-binding protein Not tested YAPI Protein 0.0 53