Gene Information

Name : api89 (PAU_03009)
Accession : YP_003041839.1
Strain : Photorhabdus asymbiotica ATCC 43949
Genome accession: NC_012962
Putative virulence/resistance : Unknown
Product : putative membrane-bound sugar-binding protein
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG3209
EC number : -
Position : 3484950 - 3489221 bp
Length : 4272 bp
Strand : -
Note : Contains an RHS domain and YD repeats. These repeats appear, in general, to be involved in carbohydrate binding.
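
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the final codon is a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Cross-check the Position and Length fields of the record.
# Assumption: coordinates are 1-based and inclusive.

def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature spanning start..end, both inclusive."""
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(3484950, 3489221)
protein_len = expected_protein_length(length_bp)
print(length_bp, protein_len)
```

The strand ("-") does not change the length. It only means the coding sequence is read from the reverse complement of the genome at these coordinates.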

DNA sequence :
ATGCCAGAAGCCGCACGCTTGGGGGATACCATCGGGCATTCCAGTGCCATGGCCGGATTGATTGCCGGCACCGTCATCGG
GTCATTAATTTCAGCTGCCGGAGGAATTTTATCCGGCGCTCTCTTTATCGCGGGATTGGCTGCATCCTGTATCGGGGTGG
GCGTTTTGTTGATTGGCGCCGCGATTGCGGTAGGCATGGCGGCGGGATATTTAGGCGACATGGCGCGTGATGCCAGTGTC
GCGGCGGGCGCTTCATCGCGAAGTCCCTGCGGGGAAATCAAAACAGGCTCCCCCAATGTCGCTATCAACAACAAACCCGC
TGCTATAGCCACACGCAGTCAGGTAGCGTGCAGTAAAGAGAGCGGCATTCGCCAGATGGCGGAAGGCTCTGACTCTGTTT
TCATCAATGGCTGTCCGGCGGTACGGGTTGGCGACAAAACGGTCTGTGATGCCGCAGTCATGACCGGGTCGTCGAATGTC
TTTATCGGCGGGGGGACTGCGCGGAAAGTCACTATTGTGCCGGAAGTCCCGCCATGGGCGTATACCGTTTCCGATTTAAC
GATGCTGGCGGCAGGGTTCGCCAGCTTCGGTGGGGCGGCAGCCAAAGGGCCCGGGGCTTTACAAAAGCTGTTCGGCAAGA
TACCGGGGGCGGCTAAAATTCGCCAAATCACCTGTCGGCTGGGGGTGCTGGCTGTCGCCGTGCCTGTTGTGGGGATTCTC
ATGAACCCCGTGGAAGTTATCGCCGGCCAGAAGTTCCTGAACGATGAGGATGAGCGGGATTTTATGCTGGCGGGCGAACT
GCCGCTTGATTGGCAACGCAGCTACCTGAGCCGTTATCGCTATGACAGCGTGCTCGGACCGGGCTGGAGCCTGTTCTGGG
AAAGCCGCCTCACGCGCGTGGAAGACGGATTACTCTGGCGTTCCCCGTCAGGAGATATCGTGCCCTTTCCTGATGTGCCG
GCAGGACACCGCTGTTTCTGCCCGGATGCCCAAAGTGAGCTGATACACACGCCGGAAGAGACCTGGGAAATCCGTGATGC
CGGCGAACAGGTTTATCATTATGCGGCGTTCGATGATGACGGCATTAGCCGCCTGAGTCATATTCGCGATAACGTGGGCA
ACGAGCAACGCTGCCATTACAACGCGCAGCATCAGATGGTCACCATCACGGGCAGCGGCGGGCTTACCCTGCACTGCGAT
TACATGATGAGGGAAAACGATACCCAAACGCTCTCCCGTCTCACCGCTGTCTGGCGTGAACTCCCTGATGGCCAGCGCAT
CCTGCTTTGCCGCTACCACTATAATGACCAGGCGCAATTGACCGGGGTCAGCCATCGCAACGATTCTCTCCAGCGCCAGT
TCGGCTGGACTGAGCACGGCCTGATGGCCTGGCATCAGGATGCGCGGGGGCTACGCTGTGATTACCAGTGGGAGCAGGAT
AAAGAGGGTTGGTGGCGCGTCATGGCCCAGCAAACCAGCGAAGGGGCGGGTTACCAGCTGGCCTACGATGACGAAAACCT
CACCCGCACCGCCCACTGGCATGACGGCACCCGCACGGTGTGGCACCTCAACGACGCACACCACATTATCCATTGCCTCG
ACCGCACGGGCACCGAACATCATATCCTGTGGGATGAATTTGGCCTGCCGAACGGCTACAAAGATGCGGATGGCAACACC
CGCCTGAGTGAATGGGACCAACACGGTCGCCAGCTGAGTTTCACCGATGCTAACGGCAACCAGACCCGCTGGCAGTACGA
GAATGATAGCGACCGGCTCACCTTTATTTTCTGGCCGGATGGCACAGAAACAGCGCTGACTTACGATAACCTCGGGCGGC
TGATTAGCGAAACCTCGCCACTAAAACAGACCACCCATTACCACTATAGGCATCGTCACACGCAGCGGCCGGACCGGCGC
ACGGATGTTAAAGGGGGCGAGAGCCAGTTTTTGTGGAATGAACAGGGGCAGCTTATCCGCCATGCGGACTGTTCTGGTCA
GTCCACAATATGGAGTTATGACCGGGAGAATCGGCTGGAAAAAGTCACCAATGCCCTGATGGAAAGTACCCGCTATCACT
ATGGTGATGATGGTCAACTGATTCAGGTCACCTATCCGGATGACTCCACCGAACAGATGACCTGGGACCCGGCAGGCCAG
CTTATCAGTCACCAGCGCAATCAAAACCCGCCCCGTCATTGGGCATATAACGCGCGGGGTCAGGTCGTTTGTACCACTGA
CCGGCTGCAACGGCAGGTGTATTACCACTACAACCCCGAAGGCCATCTGATCCGGCTCGACAACGCCAACGAGGGCCGCT
ACCTGCTGAACCGGGATGCCGAAGGACGCTTAATCGAAGAAATCCGCCCCGACGACACCTTAATCCAGTACGAATACAAC
GCGGCGGGCTTGCTGAGCCTCGAACGGCGGCTGGGAGACCGCGTGTTCCGGCACCCGGAACGTCGGGTACATCAGCATTA
TGATGCGGCGGGATATCTTATTCAGCGGGAAACCCACACGGACACTTATCAGTACCAGTGGGACAACATGGGCAGATTAC
TGACAGCCCGCCGTGAACCCAATGACAACGGAAAACAGCTGGGGATAGAGCCCAACACGGTGCGTTTTAATTATGACGCT
GTGGGGCGGGTCATTCGGGAGCAGAATGGCGACGACAGCCTCCAATTCAGCTACGACGAGCTGGATAACCTGACCGCCCT
CACCTTGCCCCATGGCGGCACCTTAAGCTGGCTCTACTACGGCTCGGGTCACCTGAGTGCCATCCGGCACGGGCAGACAC
TCCTCACTGAATTCGAGCGTGACCGCCTGCATCGAGAAACCCGCCGCACGCAGGGCAAACTGTTCCAGCAGCGCGATTAC
GATCGACTGGGACGCTGTACCCATCAATATAGCCTGCCGTTGAAACAGGCTGATGCTGAACCGCTGCCTTACCTTAGCGA
AGGCAAGCCATGGCGGGCGTGGATCTACAACCCCCAAGACGAATTGCAGGTGATGGAAGACCATTACCGGGGCGTGATTG
AATATCTTTATGACTCGGAAAGTCGCCTGAAAAAAGTGACACACTGGGGCAGTGCTTATGATGATATGTTGTGGTATGAC
CGGGCGGATAACCTGCTGGAGCAACCTCAGGCAATACTGGAGCGTGAGGCGGCAGAGCGGGGGCTGTCCAAAACGATGGA
GCCGCAGGGTGACCGCCTGAACCACTGGCGTGACTGGTGCTATGAACATGACACGCATGGCAACGTTATCAGCCGGGGAC
GCGAGACGCGGGAAACCCAGCATTACCGTTATGATGGTGACAATCGGCTGACTGTGGCGGTTATCGGCACCACAACGGCA
CGTTATCACTACGATGCCCTGGGGCGGCGTATCCGTAAGGTGGTGAAAATCGGGATTGATGGACTGGCGCGTTATGAACA
GACCGATTTTGTCTGGCACGGCCTGCGACTGTTGCAGGAGCGGGATGGGAAAAGTGGCGAAACCCAGACCTACTGCTACG
AATCCCATGACAGTTACACCCCGCTGGCCAGCATCGTCACGAGAGGCGCCACGCACAATTACTTCTGGTATCACACCGAT
ATCAACGGTGCGCCGTTGGAAGTGACGGATGAGGAGGGCAAAATTGCATGGGCGGAGAAATACAGTACCTTCGGTGAACT
GGGGGGAACTCCGCTGGATTACTTCACCGATCCTGACCGCTCATCGTGGAGCTCGCGTTTCAGACAAAATCTGCGTTATG
CGGGACAATATTTTGACAAAGAGACGGGGCTGCACTTTAATACTTATCGGTATTATGCCCCGGAGATTGGCCGGTTTATC
TCGCCAGACCCGCTCGGGCTGGAAGGGGGTCCGAATCCCTATTCGTATGTTCATAACCCGGCGAACTGGATTGACCCATT
CGGGTTAGCAGCTTGTCCAACACAAAAATACGAAGTTAGTACATTCGATGATTTACAGAGACGTTCTAAGGTGGGTGATA
AATTAGATATCCACCATGCGGCTCAAAAACATCCTGCGGGGCAGGTAATTACTGGTTATGATCCTAAAGTAGCTCCATCA
ATAGCTCTCCCTAGAGGAGAACATAAACTGATTCCTACTATGAAAGGACCGTATACGGGATCGGCTAGAGATTTATTGGC
AAAAGATATTAGAGATTTGCGCAATTATACTAATGCCCCTCCTTCAGCAATAAAAGATTTGCTCAATTTAAATAAAGAAA
TGTATCCAGAAGCCTTCACTAAAATAAGGTAA

Protein sequence :
MPEAARLGDTIGHSSAMAGLIAGTVIGSLISAAGGILSGALFIAGLAASCIGVGVLLIGAAIAVGMAAGYLGDMARDASV
AAGASSRSPCGEIKTGSPNVAINNKPAAIATRSQVACSKESGIRQMAEGSDSVFINGCPAVRVGDKTVCDAAVMTGSSNV
FIGGGTARKVTIVPEVPPWAYTVSDLTMLAAGFASFGGAAAKGPGALQKLFGKIPGAAKIRQITCRLGVLAVAVPVVGIL
MNPVEVIAGQKFLNDEDERDFMLAGELPLDWQRSYLSRYRYDSVLGPGWSLFWESRLTRVEDGLLWRSPSGDIVPFPDVP
AGHRCFCPDAQSELIHTPEETWEIRDAGEQVYHYAAFDDDGISRLSHIRDNVGNEQRCHYNAQHQMVTITGSGGLTLHCD
YMMRENDTQTLSRLTAVWRELPDGQRILLCRYHYNDQAQLTGVSHRNDSLQRQFGWTEHGLMAWHQDARGLRCDYQWEQD
KEGWWRVMAQQTSEGAGYQLAYDDENLTRTAHWHDGTRTVWHLNDAHHIIHCLDRTGTEHHILWDEFGLPNGYKDADGNT
RLSEWDQHGRQLSFTDANGNQTRWQYENDSDRLTFIFWPDGTETALTYDNLGRLISETSPLKQTTHYHYRHRHTQRPDRR
TDVKGGESQFLWNEQGQLIRHADCSGQSTIWSYDRENRLEKVTNALMESTRYHYGDDGQLIQVTYPDDSTEQMTWDPAGQ
LISHQRNQNPPRHWAYNARGQVVCTTDRLQRQVYYHYNPEGHLIRLDNANEGRYLLNRDAEGRLIEEIRPDDTLIQYEYN
AAGLLSLERRLGDRVFRHPERRVHQHYDAAGYLIQRETHTDTYQYQWDNMGRLLTARREPNDNGKQLGIEPNTVRFNYDA
VGRVIREQNGDDSLQFSYDELDNLTALTLPHGGTLSWLYYGSGHLSAIRHGQTLLTEFERDRLHRETRRTQGKLFQQRDY
DRLGRCTHQYSLPLKQADAEPLPYLSEGKPWRAWIYNPQDELQVMEDHYRGVIEYLYDSESRLKKVTHWGSAYDDMLWYD
RADNLLEQPQAILEREAAERGLSKTMEPQGDRLNHWRDWCYEHDTHGNVISRGRETRETQHYRYDGDNRLTVAVIGTTTA
RYHYDALGRRIRKVVKIGIDGLARYEQTDFVWHGLRLLQERDGKSGETQTYCYESHDSYTPLASIVTRGATHNYFWYHTD
INGAPLEVTDEEGKIAWAEKYSTFGELGGTPLDYFTDPDRSSWSSRFRQNLRYAGQYFDKETGLHFNTYRYYAPEIGRFI
SPDPLGLEGGPNPYSYVHNPANWIDPFGLAACPTQKYEVSTFDDLQRRSKVGDKLDIHHAAQKHPAGQVITGYDPKVAPS
IALPRGEHKLIPTMKGPYTGSARDLLAKDIRDLRNYTNAPPSAIKDLLNLNKEMYPEAFTKIR
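
The DNA and protein sequences above can be checked against each other by translating the coding sequence. The sketch below is a minimal illustration, assuming the standard codon assignments (which bacterial translation table 11 shares for sense codons) and that the DNA is listed 5' to 3' on the coding strand. It translates only the first and last few codons of the record. The `translate` helper is hypothetical.

```python
# Minimal codon-by-codon translation, used to spot-check the record.
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Map each codon (in TCAG order) to its one-letter amino acid; '*' is stop.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string in frame 1, including stops."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_codons = "ATGCCAGAAGCCGCACGCTTGGGGGATACC"  # first 30 bp of the DNA sequence
last_codons = "AAAATAAGGTAA"                     # last 12 bp of the DNA sequence

print(translate(first_codons))  # should match the start of the protein sequence
print(translate(last_codons))   # should match the end, followed by '*'
```

Running the same function on the full 4272 bp sequence should give the 1423-residue protein listed above, followed by a single stop.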

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
YpsIP31758_3692 | YP_001402646.1 | RHS/YD repeat-containing protein | Not tested | YAPI | Protein | 0.0 | 53
api89 | CAF28563.1 | putative membrane-bound sugar-binding protein | Not tested | YAPI | Protein | 0.0 | 52