Gene Information

Name : PAU_03026 (PAU_03026)
Accession : YP_003041856.1
Strain : Photorhabdus asymbiotica ATCC 43949
Genome accession: NC_012962
Putative virulence/resistance : Unknown
Product : similar to hemagglutinin/hemolysin-related proteins. putativ transmembrane protein
Function : -
COG functional category : U : Intracellular trafficking, secretion and vesicular transport
COG ID : COG3210
EC number : -
Position : 3538625 - 3546715 bp
Length : 8091 bp
Strand : -
Note : This highly divergent repeat occurs in number of proteins implicated in cell aggregation

DNA sequence :
ATGAACAAACAGTTATATCGTCTTATCTTTAACCGATCCCGAAATAGGCTGATGGTGGTGGCGGAGATTGCCCGGGCCGG
GCAAGGCAGCACCGCGCGCCGTCGCGGTCGGCCATCGGCTCAACGGTTGTGTCGCCTTACCGCGTTCCAGTTCGGTTTAT
TGCTGGCCCTGGGCGGGATTTCACTCACCGCCCAGGCGGCGATTGTGGCCGATGGACAGGCACCGGGCCAGCAACAGCCG
ACCATTATCCCCAGTGCCAACGGCACGCCGCAGGTGAATATCCAGACCCCCAGCGCTGCCGGCGTGTCCCACAACACTTA
CCGCCAGTTTGATGTCGATAAACGCGGGGTTATCCTCAACAACAGCGCAAAGGCGACCGAGACCCAACTGGGCGGCATGG
TCGCCGGCAACCCCTGGCTGGCTAAAGGGGAAGCCAAAGTGATCCTCAATGAAGTCAACAGCCGTGACCCCAGCCACCTC
AATGGCTGGATTGAAGTCGCCGGGCGCAAGGCTGAGGTGGTGATTGCCAATCCGTCCGGGATTACCTGTAATGGCTGTGG
TTTTATCAATGCCCATCGCACCACCCTGACCACCGGTGAAGCCCTGATGGAGCGGGGGCATCTGACCGGTTTTGACGTGA
ATCAGGGCGAAGTGCGTATTGAGGGGCAAGGGATGGACAGCCGGCAGCAAAATTACACGGACATCATTGCCCGCGCCGTT
GCCCTCAACGCCAAATTACACGCCCAGAACCTGAAAGTGACCACCGGGCGCAATCGCGTGGATGCCGCACACCAGACCAT
CACGAAAAAATCCGCCGCTGAAGATGAGGCCCACCCACTGTTCGCGCTGGACAGTACGGCACTCGGCGGGATGTATGCTC
ATAAGATTCTGCTGATAGGCACCGAAGCCGGTGTCGGGGTGCGGAATGCCGGCGATATCGGCGTGCCCGCGGGGGAAGTC
TTCGTCACCGCCGATGGCCGGATTGAGAACCGGGGCACAATCAGCAGCCGGGACGCGCTGCAATTAACCAGTACCGCCGG
GATAGATAATCAGGGCAAACTGCTGTCGCAATCGACGGTCACTTTGCAGGCTGGGGGCCCACTGCACAACCGGGGCCGGA
TTGAGGCGCGGGGGGATATCACCGCCACCGCCCAGACGATACAGAGTGACCGCCACAGTGTCTGGGCCGCCGGACTGGAT
GATAACGGGAACACCACCCGCCCCGGCTCACTGACCCTGACCGCCCAGCAGGTTCAGGCTGGCGGGAAAAATCTGGCGAC
CCATACGCTGAATATTCACGGTCAACAGATTGACCTCAGCGGCAGTCAAACTGTAGCGGGCGACATTCAGCTGACGGCCA
GTCAACCGGGCATCAGCACGGCACACGCCAGCGTTAACGCAGACCGTTTTACTGCCCATACCCCGGGCCAGTTTAATAAT
ACCGGGGGGCAATTAACAGCGCGGGAAATCCATTTAACCACGCCCGATATCGCGAACCAGCAAGGAAAAATGACCCAGAC
CGGTCCCGGTGAACTGACCCTCCACACCCGGACCCTGAATAACCGGGGTGGCACCCTGTTTAATGCCGGATCACAGCTGA
CGATCACCACCGACCAGCTCGATAACCGTCAGGGCAACCTCGTCAATCAGGGGGATAACTTCCACCTGACCGCGCAAACC
GCAGACAACACTCAGGGGCAAGTGCAACTGGCCGGCAACGGTCAGCTGTCCCTGACAGCCCAACACTGGCTGGGCCATCA
GGGCAAACTGCTGACCAACGGCACGCTGGCTATTCAGGCCGGTGACCTGCAACTGAATCAGGCCGAGACCCGCGCTCATC
GCATTACTCTCAACGCCGATACGCTGAATCATCAGCAGGGTGTGATGCAACAATCGGGCACCGATACCCTGGCCCTGACG
GTCAACACGCTGAATAATCAGGGCGGCAAGATTGCCGGTAACGGAAACCTGAATGTTGAGGCAACCACAGTCGATAATCG
CCATGGCAACATCGTGGCGGCTGAAAACGGCTCGTTGACATTGACGGTTAACGACACGCTGGATAACCAGAATGGCCGAC
TGGAAGCCGGTAACGATATCCAACTCACCGCGACCCAACTGGATAATCGCCGGGGAACCCTGGCGGCCTCTGGGGGCAGT
GCCACGCTGACTATCGGGCAACAGATTCAGAACACCCACGGCCATATCGAAGCCAAAACCCGACTCACTACGACCAGTCA
GGCTTTGGATAACACCCAGGGGACATTACTGGCGCAACACATCACCAGCCAGACGACCGGGCACCGCTTCACCAATACCG
CCGGACAAGTGATCGCGCAAGATACCCTGACCTTACACAGTGGTGAGCTGGAAAACACCGCCGGACTATTACAGTCCGGG
GGCGACATGACGGTGAAAACTCACGGTCACTGGCTCGCCAACACCGCCACGACGGACCAAAAAGGCGGCCAGTTATTGAG
CGGCGGACACTTAACGCTGCATACCGGCGATATCGACAACACCGGCGGGATCATCGCCGCAGACGGCCACACTACCCTGA
CCAGCACCGCGCTTAACAATACCCGGGGACAGATAGCCGGCAATGGCGGACTGGACATTCACAGTCAGCGGCTTACCAAC
CGTCAGGGCACCTTACAATCCGCCGATGCGCTGAATCTCGATACCGATGGACAGGGACTGGATAACCAGCAGGGCCGTAT
TCTCGGCGATGGCATCACCACCGTCACCAGCGGCCCGCTGGACAATCGCCACGGTCATCTGCAAGGCGGACAGTTGGTTA
TTGGCACCCGACAGGCGCAGACAGATAACCGGGACGGCAAATTGTTGTCGGCAGGTACGTTCAACCTGAAAACCCAGCAG
CTCGACAATCGTCATGGGCAGGTGCAGGCGGTGGGCGACACCGTATTGAATGTTAAAACCCAAACCGATAACACCGGCGG
CCTGATTCGTGGCGGTGCGCAATTGACCTTGAGCACCGCTCACCTGATTAATCGCGACACGGCGCAGACGGATAAAGGAC
TGGAAGCCCAAAACCTAACGGTTAGCACCCAACAGGCGGACAACAGTCAGGGCACTTTACGGGCCGCCGACCACCTGCAA
GCGAATATCAGCCAGACCTTGAATAATACTCAGGGGCGGGTCTCCGCAGGCAAACAGCTCACTATCAACCGCGAGGCTCA
GCAACCGCCTCTGAGGATTAACAACCAACAAGGCACCTTGATTGCCGGGAAACAGGTCGATATTCAGGCTAATACGCTCA
GCGGTGACGGCCAACTGTTATCGCAGGGCGATATGGCGGTGACCTTAACCGAGGATTTTCACCATACCGGCAACACGGCG
GCCAATGGCAATCTGACCCTGAAAACCACCGGCAATCTCCTCAATGACCGCCCGATAAAAGCCGGTCAGGCGCTGCATCT
TGGGGCGCAAAATCTTACTAACAGCGCCACCGGTGAAATCAGCGCCGAACAAACGCAAATCAAGGTCCACGATACGCTGA
CCAATACCGGACTGATTGACGGCAACCTGACCCATCTTACGGCTCACACGTTGACCAACACCGGCACCGGACGGATTTAC
GGGGACCATCTCGCCCTCCAGACGGGGACCCTGAATAACACCGCGCACGCCGGCAACGCGGCGGTGATTGCCGCCCGTGA
CCGGCTGGCTATCGGCACTGACACCCTGAACAACCAGCATCATGCGCAAATTTACAGTGTGGGCGAGACGCATATCGGCG
GCCAGCTGGATAACACCTTATCCGCCACCGGTCAGGCGCGTGAACTCACTAACCACGCCGCCACCCTCGAAGCCGGCGGT
AATCTGACGATTGACGCCAACCAGCTCCACAACACCAACGCCGGGCTGGTGACTCAGGTCGCCGAAACCGAACACGCCCG
GCATCACGATGCGGCGCTCAGCGGCCAGACTGCCCGCTATGACTGGTCGCAGGTAGACACCGCCCGGCGGGACAAATACG
GCGTGCATACCGCCCGGATGCCGGACGGCAGCCGCAGCGATAACTTCTATGAGTATCAGTACACCCGTACCGTGACGGAA
ACGCAGGTCAAACACAGCGACCCGGGTAAAATTCTGGCCGGCGGCCATCTCACCCTTAACAGTGCAAACGTGACTAACCA
CGACAGCCAAATTATTGCCGGTGGCGCGCTGACGGGCACGATTGGCGAACTGCACAATCTCGCCACGCAGGGCGAACGTA
TCACCACCGAGACCGGCAGCCAGACCCACTGGTACGCCAAGAAAAAACGGAACAAACTCGGCATCGGCGGCACCAAAACT
TCGCAGGGAAAAAGTCGCAGTGGTTATAACCCTGCGCCGGTGGTAGAAACGATTGACCTGAAAACGCTGGCCTGGCAGGA
CCATGCCCGTCCACAGAACACCGATATCACGATTACGGACCGACAGACCGGCCAAATTCACGCGGCTCCCACGGCGGTGA
AACCGGTGAGCGGGATAAACGACCAGCCGCGGGTCTTGCCACCTGGTCAACCGTTTGAACTGAGTTTGCCCCCAGAAACG
GTGAAAGGGCAACCCATCGACCCGGTGATACGTGTCGTGACCCCCAATACCCGCCTGCCGGATAACAGCCTGCATACGGT
GCAACCGGCCAGTGACAGCCACTATCTGGTCGAGACCGACCCGAAATTTACCCAGTATAAACAGTGGCTGGGCTCGGATT
ATATGCGGCAACAGTTAACGCACGACCCGGCGCTGGTGCACAAACGCCTGGGCGATGGGTTCTATGAACAGCGACTGGTG
CGGGAGCAGATTACCCAGCTGACCGGCCAGCGCTACCTGCCCGGTTACCATAACGATGAAGCGCAGTTTAAGGCGCTGAT
GGATGCCGGAGTGGCTTTCGGTAAGGAACAACAACTCACCCCCGGCATTGCCCTGAGTCCGGCCCAGATGGCGCTGCTGA
CCGCCGACATTATCTGGCTGACCAACCAGACGGTGACTCTGCCGGACGGCAGCACGCAGGTGGTGACGGTGCCCCAGGTT
TACGCCCGGGTGAGACCCGGCGACCTGAGCGGAGACGGCGCTTTACTGGCCGGTAATACCGTGGCGCTCAACAGTCAGGG
CGATATCACCAACAGCGGCACGATTAGCGGGCGTGACGTTACCCAATTGACGGCCAACAACCTGACCAACAGCGGCTTTA
TTCGCGGCGGCAAAGTGGATGTGACGGCCCGGCAGGACATCACCAACCGGGGCGGCCAGATTCAGGGGGCGGATAAAGTT
GTGCTGCAAGCCGGTCGGGATATCACCAGCGCTGCGACCCTGCGCGGAGATGCCGCGAACCGCTGGCGGGACCGTCCGGC
GGGGATTTATGTGCAGAACGACCAGGGCACCCTGTCGCTGAGTGCCATCAACAATGTGCAATTAACCGCCAGTGAGGTGA
AGAACGCCGGTAAAGACGGTCGCACCGAGATAACCGCCGGCCATAACCTGACGCTGGATACCCTAAGCACGCACCGTACA
GAACAGGGCGACTGGGGGAAAGATAACTATCGCCATCTGAGCCAACAGCAGGATATCGGCAGCCAGATAACCGGTGCCGG
TGAGGTGACCCTGCAAGCCGGGCAGGATTTGACCGCTACCGCCGCCCACGTCAATGCCGGTCAACAATTGACGGCGCAGG
CCGGAAATAACCTGACATTGACAACCGGTACCGCCTCATCCGACTTAGTGGAGCACAGTAAGCAAACCAGCAAAGGCTGG
CTGTCCAAGTCGTCAGTGGAAACCCACGACGAAGTGCATGACCGACAAGCCCTGAGCACCACCTTCAGCGGCGATAAAGT
GACTTTACAGGCCGGTAAAGACCTGCATATCCGCGGCAGCAATGTGGCGGGCACACAGGATGTCCGCCTGAACGCCGGTC
ATCAGCTGACCGTCACCACCGCCGCAGAATCTCACGGTGAAACCCATTTGCGGCAAGAGAAAAAATCGGGCTTGATGGGC
ACGGGCGGTATTGGTTTCACATTAGGCCAGGCCAGCCAGAAAGTCACCACGGACAGCGACAGCCAGCGGAATAAAGGCAG
CACCGTGGGCAGCAGCCAGGGCCATGTCACCCTCAATGCCGGTACCCAGCTCAACCTTCACGGCAGTGACCTGGTGGCCA
GCAAGGACATCACCCTGACCGGCCAGAACGTGAATATCACCAGTGCGGCAAACCATCACACTACCCTGACCAAAACCGAA
CAGAAACAAAGTGGCCTGACGGTGGCGCTGAGTGGCACGGCAGGCGGCGCCCTCAACAGTGCGGTGCAGACGGCCCGGGC
CGCGCAGCAAACCGAAGACCCGCGCCTCAAAGCCCTGCAACACACTCAGGCCGCCCTCAGTGGGGTACAGGCGACACAGG
CTGCCCGGCTGGCCGAAGCCCAGGGCAGTGATAAAGGCAATAACACCCTGGCCGGGGTGAGCCTGTCCTACGGTCGTCAG
TCTTCCCGTTCAGAACAGCAGCACCAGCAGACTACCCAACAGGGCAGCCACCTGACCGCCGGGGATAACCTCACGATAAC
GGCGCACGGTGACGATAAAGGGGCATCCGGTCCGAACGGCGATATCCGCATTCAGGGCAGCCAGTTACAGGCGGGCAAAG
ACCTACAACTCAATGCCAGTCGGGATATTCAGCTCTCCGCCAGCCAGAACACCGAACAGACCACCGGCAAGAGCCGCAGT
CACGGCAGCGCAATGGGCATGGGCGTCACCGCCGGCCCGGGCGGCACTGGCTTTACGGTGTCGGCCAATGTCAGTCGGGG
CAACGGCCATGAAACCGGTCACGGTGTCAGCCACAATAACACCACGTTACAGGCGGGACAGACCGTTGAGCTGAACAGTG
GCCGGGATACGACGCTGAAAGGGGCACAAGTCAGCGGCGAACAGATGACCGCTGAGGTAAAACGCCATCTGACACTCAGC
AGCGAGCAGGACAGCCAGCGCTATGACAGCCAGCAACACAACGCCCGGGCCGGGGTCAGTACGACAGTGGGGCCACAGCC
GGACGGTACCCTGAGCCTCAATGCCAGCCGCAGCAAACTGCACAGCAATTACGATTCGGTGCAGGAGCAAACCGGGCTGT
TTGCCGGTAAAGGCGGCTATCAGGTTAATGTGGGGGACCACACCCAACTGGACGGGGCCGTGATAGCGAGTCAGGCCGAC
AACGCGAAAAACACCCTCAACACCGGGACACTGGGCTTTAAAGATATTCAAAATAAAGCGGACTTTAGCGTTGAGCAGCA
AAGTGCCGGCGTCAGCCTTGGTCAGCCAACCACCGGTCAGGTGCTGAATAATCTGGCGGTCAATGCGCTGACCGGCTCAA
ATAATCAGGGCCATGACCGCAGTACCACGCACGCAGCGGTCAGTGACGGCACCCTGATTATTCGGGATAAGGACCACCAG
ACGCAGGATATCGCGAACTTAAGCCGTGATACCGATAATGCGGCCAATGTCCTGAGGCCGATATTTGATAAGGAGAAAGC
GCAACAACGGCTGAAACAGGCGCAACTGATGGGCGAACTCAGCGCCCAGATGACCGAGATAGCCGGCACGGAAGGGAAAA
TTATCGCCACTAAGGCGGCGAAAGCGAAACTCAATCATATCAGTGAACAGGATAAAGCCGATGCCAGAGAAAAGTTGATC
AACGCAGGTAATAAAGCCCCCTCGCCAGCGGATATCAATAAGCAAGTTTATGACACCGCTTACACGCAGGCGCTGAATGA
CTCCGGTTTTGGCACGGGTGGCCCATACCAGAAAGCCTTACAGGCGGCCACGGCAGCCATTCAGGGGCTGGCCGGAAACC
ATTTGGGGCAAGCACTGGCCGGCGGTGCTTCGCCTTATTTGGCGGGCGTGATAAAAGAGCTGACCACCGACCCGCAAACC
CATCAGGTGGATATCGCCACCAACACGCTGGCCCATGCGCTTTTAGGCGCGGTGGCAGCGGAGGTGAGTGGTAACAACGC
TGTGGCGGGAGCAGCCGGGGCCGCGTCGGGCGAACTGGCGGCTCAGGTGCTGATAAAGCAGCTTTACGGCGACGCGGCTA
AAGTCAGCTAG

Protein sequence :
MNKQLYRLIFNRSRNRLMVVAEIARAGQGSTARRRGRPSAQRLCRLTAFQFGLLLALGGISLTAQAAIVADGQAPGQQQP
TIIPSANGTPQVNIQTPSAAGVSHNTYRQFDVDKRGVILNNSAKATETQLGGMVAGNPWLAKGEAKVILNEVNSRDPSHL
NGWIEVAGRKAEVVIANPSGITCNGCGFINAHRTTLTTGEALMERGHLTGFDVNQGEVRIEGQGMDSRQQNYTDIIARAV
ALNAKLHAQNLKVTTGRNRVDAAHQTITKKSAAEDEAHPLFALDSTALGGMYAHKILLIGTEAGVGVRNAGDIGVPAGEV
FVTADGRIENRGTISSRDALQLTSTAGIDNQGKLLSQSTVTLQAGGPLHNRGRIEARGDITATAQTIQSDRHSVWAAGLD
DNGNTTRPGSLTLTAQQVQAGGKNLATHTLNIHGQQIDLSGSQTVAGDIQLTASQPGISTAHASVNADRFTAHTPGQFNN
TGGQLTAREIHLTTPDIANQQGKMTQTGPGELTLHTRTLNNRGGTLFNAGSQLTITTDQLDNRQGNLVNQGDNFHLTAQT
ADNTQGQVQLAGNGQLSLTAQHWLGHQGKLLTNGTLAIQAGDLQLNQAETRAHRITLNADTLNHQQGVMQQSGTDTLALT
VNTLNNQGGKIAGNGNLNVEATTVDNRHGNIVAAENGSLTLTVNDTLDNQNGRLEAGNDIQLTATQLDNRRGTLAASGGS
ATLTIGQQIQNTHGHIEAKTRLTTTSQALDNTQGTLLAQHITSQTTGHRFTNTAGQVIAQDTLTLHSGELENTAGLLQSG
GDMTVKTHGHWLANTATTDQKGGQLLSGGHLTLHTGDIDNTGGIIAADGHTTLTSTALNNTRGQIAGNGGLDIHSQRLTN
RQGTLQSADALNLDTDGQGLDNQQGRILGDGITTVTSGPLDNRHGHLQGGQLVIGTRQAQTDNRDGKLLSAGTFNLKTQQ
LDNRHGQVQAVGDTVLNVKTQTDNTGGLIRGGAQLTLSTAHLINRDTAQTDKGLEAQNLTVSTQQADNSQGTLRAADHLQ
ANISQTLNNTQGRVSAGKQLTINREAQQPPLRINNQQGTLIAGKQVDIQANTLSGDGQLLSQGDMAVTLTEDFHHTGNTA
ANGNLTLKTTGNLLNDRPIKAGQALHLGAQNLTNSATGEISAEQTQIKVHDTLTNTGLIDGNLTHLTAHTLTNTGTGRIY
GDHLALQTGTLNNTAHAGNAAVIAARDRLAIGTDTLNNQHHAQIYSVGETHIGGQLDNTLSATGQARELTNHAATLEAGG
NLTIDANQLHNTNAGLVTQVAETEHARHHDAALSGQTARYDWSQVDTARRDKYGVHTARMPDGSRSDNFYEYQYTRTVTE
TQVKHSDPGKILAGGHLTLNSANVTNHDSQIIAGGALTGTIGELHNLATQGERITTETGSQTHWYAKKKRNKLGIGGTKT
SQGKSRSGYNPAPVVETIDLKTLAWQDHARPQNTDITITDRQTGQIHAAPTAVKPVSGINDQPRVLPPGQPFELSLPPET
VKGQPIDPVIRVVTPNTRLPDNSLHTVQPASDSHYLVETDPKFTQYKQWLGSDYMRQQLTHDPALVHKRLGDGFYEQRLV
REQITQLTGQRYLPGYHNDEAQFKALMDAGVAFGKEQQLTPGIALSPAQMALLTADIIWLTNQTVTLPDGSTQVVTVPQV
YARVRPGDLSGDGALLAGNTVALNSQGDITNSGTISGRDVTQLTANNLTNSGFIRGGKVDVTARQDITNRGGQIQGADKV
VLQAGRDITSAATLRGDAANRWRDRPAGIYVQNDQGTLSLSAINNVQLTASEVKNAGKDGRTEITAGHNLTLDTLSTHRT
EQGDWGKDNYRHLSQQQDIGSQITGAGEVTLQAGQDLTATAAHVNAGQQLTAQAGNNLTLTTGTASSDLVEHSKQTSKGW
LSKSSVETHDEVHDRQALSTTFSGDKVTLQAGKDLHIRGSNVAGTQDVRLNAGHQLTVTTAAESHGETHLRQEKKSGLMG
TGGIGFTLGQASQKVTTDSDSQRNKGSTVGSSQGHVTLNAGTQLNLHGSDLVASKDITLTGQNVNITSAANHHTTLTKTE
QKQSGLTVALSGTAGGALNSAVQTARAAQQTEDPRLKALQHTQAALSGVQATQAARLAEAQGSDKGNNTLAGVSLSYGRQ
SSRSEQQHQQTTQQGSHLTAGDNLTITAHGDDKGASGPNGDIRIQGSQLQAGKDLQLNASRDIQLSASQNTEQTTGKSRS
HGSAMGMGVTAGPGGTGFTVSANVSRGNGHETGHGVSHNNTTLQAGQTVELNSGRDTTLKGAQVSGEQMTAEVKRHLTLS
SEQDSQRYDSQQHNARAGVSTTVGPQPDGTLSLNASRSKLHSNYDSVQEQTGLFAGKGGYQVNVGDHTQLDGAVIASQAD
NAKNTLNTGTLGFKDIQNKADFSVEQQSAGVSLGQPTTGQVLNNLAVNALTGSNNQGHDRSTTHAAVSDGTLIIRDKDHQ
TQDIANLSRDTDNAANVLRPIFDKEKAQQRLKQAQLMGELSAQMTEIAGTEGKIIATKAAKAKLNHISEQDKADAREKLI
NAGNKAPSPADINKQVYDTAYTQALNDSGFGTGGPYQKALQAATAAIQGLAGNHLGQALAGGASPYLAGVIKELTTDPQT
HQVDIATNTLAHALLGAVAAEVSGNNAVAGAAGAASGELAAQVLIKQLYGDAAKVS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
S4 AAQ19127.1 putative adhesin/hemagglutinin/hemolysin Not tested PAI I CL3 Protein 0.0 45