Gene Information

Name : irp1 (PSPTO_2600)
Accession : NP_792409.1
Strain : Pseudomonas syringae DC3000
Genome accession: NC_004578
Putative virulence/resistance : Virulence
Product : yersiniabactin polyketide/non-ribosomal peptide synthetase
Function : -
COG functional category : Q : Secondary metabolites biosynthesis, transport and catabolism
COG ID : COG3321
EC number : -
Position : 2873730 - 2883251 bp
Length : 9522 bp
Strand : -
Note : This synthetase catalyzes the final steps in the biosynthesis of the siderophore yersiniabactin, adding one acetate unit with concomitant reduction to the alcohol and dimethylation followed by the addition, cyclization and methylation of one cysteine resi

DNA sequence :
ATGCCCACCTTTGAACACACCTACGGCGCCAGCGAGCCCATTGCGGTCATCGGCCTGGCCTGCCGCTTCCCCGGTGCCCG
CGACAGCACCGAGTATTGGCAAAACCTGCTGGCCGGTCGCGAGTGCAGCCGTCACTTCAGCCGTGAAGAACTGCTCGCCG
CAGGGCTGTCAGCAGAGCTGATCGACAACCCGGACTTCGTCAATGTGGCAGCAATCATCGACGACCCTGACCGCTTTGAC
GCGGCTCTGTTCGGTTACTCGCGTCAGGAAGCCGAATCCATCGACCCGCAGCAGCGGCTGTTCCTGCAGACCGTCTGGCA
CGCGCTGGAACACGCCGGTTTCGCGCCTCGCGACGTGGCGCACAAAACCGGTGTGTTTGCCTCGGGGCGCATGAGCACTT
ATCCGGGGCGAGACGCTATTCGCGTCACCGAAGTGGCGCAGGTCAAGGGCTTGCAGGCGCTGATGGGCAACGACAAGGAT
TACCTGGCCAGCCGCGCCGCCTACAAACTTAACCTGCGCGGCCCGGCCATGAGCGTGCAGACCGCCTGCTCAAGTTCGCT
GGTGGCGGTGCACATGGCCTGCGAAAGCCTGCGCAGTGGCGAGTGCGAAATGGCCGTGGCAGGCGGCGTGGCGGTGTCGT
TTCCGCAGCATGCCGGTTACCTGTACCAGCCGGGGATGATTTTCTCACCCGACGGTCGCTGCCGCCCGTTTGACGCCAGT
GCCCAGGGCACCTTTGCCGGTAACGGCGTCGGGGCCGTGACCCTGCGTCGCCTGGAGGACGCCTTGCGCGACGGCGACCC
GGTATTGGCCGTGTTGCGCGGCAGCGCGATCAATAACGACGGTCATCACAAGGTCGGCTACACAGCGCCCTCGGTGATCG
GTCAGCGTGAAGTCATCCAGGATGCGCTGCTGCTGGCCGACATCGATTGCGCCAGCATCGGCATGCTCGAAGCGCATGGT
ACCGGCACGCCACTGGGCGACCCCATCGAGGTTCAGGCACTGCGTGATGCCTTCGCACACCGCACGGACATCGGTGGCTG
CGCGCTGGGCTCGGTGAAGGGCAACCTCGGCCACCTGGACACCGCCGCAGGCATCGCCAGCCTGATCAAGACCGTTCTGG
CGGTCAGCCACGGGCGCATTCCGCCCAGCATCAATGTCGAGCGGATCAACCCGGCGCTGCAACTGGAACAGAGCCCGTTT
TACGTTCCGATCCAGGACCGTTCATGGCCGGCCGGACCACGCCGCGCCGGTGTGTCGTCATTCGGCATTGGTGGCACCAA
CTGCCACGTGATTGTCGAGGCGCTGCCCGACGCATTGCGCAGCTCAGGCCCGGTCGCGCAGGCCAGTGCCCTGCTACTCA
GTGCTGCCAGCCAGCACTCGTTGCGCCAACTGGCGGGCCGTTACGCTGAACGCCTGCACGCAGCAGACGCTGCCGCCGAT
CTTGCACACACCGCGTTACAGGCCCGGCAACTGGACCTGCCGTTTCGTCTGGCGGTGCCGTTGCACGAAGAAACCGCCCC
GGCGCTGCAGGCGTTTGCCCAAGGCGGCAGCGATGCCTTGCTGTATCAGGGACAGGCAAGCCAGGGCGCGCAGCTATGGC
TATGCAGCGGACAAGGCAGCCAGTGGGCCGGAATGGGCAAAAGCCTCTACGGTCAGTCGAAGGCCTTCAGCGAAAGTCTG
GACCGCAGCTTTGCAGCCTGTGCCGCGCACCTGCAACCCTCGTTGCAATCGGTCATGTTCGGCGAACATGAAGGGCTGAT
CGACCGCATGGATTACGCGCAGCCGGCCATCGTCGCCTTTGAAGTGGCGATGGCGGCTCACTGGCGCGAAGCGGGCCTGA
GCCCGGACCTGTTGATCGGCCATTCGGTGGGTGAATTCGCTGCCGTGGTCATCGCCGGGATCTATCGACTGGAAGACATT
CTGCCGCTGGTGATCATCCGCGGCCGGCTGATGAATCAGTGCGCAGCGCAAGGCTCGATGCTCGCGGTGTTCTGCGACGC
GGCAACGCTGCAGCCTCTGGCGCTTGAACACGGCGTGGAAATCGCCGTGCATAACGCAGCGCAGCATCTGGTGGCCTCCG
GTGAACGCCACGCCATCCGCGCACTGGCGCAAACGCTGCAACAGCGAAACCTGCGGCATAACCACCTGAGCGTGGCCGGC
GCGGCGCACTCGCGCTTGCTGGACCCGATTCTCGATGAATTCCAAAAAGCCAGTGCAGCTTTGCGCCCGGCCCAGGCGAA
AATACCGCTGATCTCGACCCTGACTGGCCAACCGCTGAGCCATGCCGAGCTTGAGCGCGGCGATTACTGGCGCAGGCACC
TGCGTCAACCGGTGCGCTATCACCAGGCACTGACTCACGCCTTGCAAACCGGCGTGACGATCGCGCTGGAACTGGGTGCC
GATGCACCATTGACCGGCATCGGCACGCGCCTGGAACACGCCGGCGTGCACTGGATCGCCAGTGCACGCCGCCATAAACC
GGCTACCACCGTCTTGCAGGACAGCCTGCTGCGCCTGTTCGCCGCCGGCGCCGCCTTGCCCTGGCGCACCCTGTTACCCT
CGGTCGGCAAGCGCATCCATGCCCCGCTGTATGTTTTCGACGAGCAGCGCTACTGGTGCGATGCGTCACAGCAGCCGAGC
ACGCAAGCACAGGACCTGTTGCTGGAGGCCGGTCGCAAGGTCGTGTTGCAGGAAGGCGCCAATCTGGACCTGCCACGTCT
GGAGCGCCTGTATCACTGCGTGACCCAACTGCATGCCATCTACGTTGACCAGATGGTTCGTCAGTGTGTGCCCGAGAGCA
TCGATCAGGGTGCCGAACCGCTGAGTATCCTGCGTGGGGGCCGGTTGCTGCCGCGCCATCGGCAACTGCTTGTGCGCCTG
CTCAATGCCTGTGTCGAGGATGGCTATTACACCCTAGAACACGGCCGTTACCGCAGCGCCCGGCCCATCCCCTATGAACA
ACGCCCGGCATTGCTGACTGAGCTGCGCAGTTGCTGTGAAGGGCTGGACGTGATCGCTGACACCGTCGAGCGGGCGGGAG
AGCAACTCTTCGCCATGATGAGCGGTGCGGTAGAGCCGGTGTCGGTGATCTTCCCGCAAAGCACCTCCAGCGGTGTTGAA
GTGCTGTATCAGCAATTCAGCTTCGGCCGCTATTTCAATCAGATCGCAGCAGGTGTGATCACCGGTCTGATCCAGGCACA
TCAGCACTCCGGACACGGGCCGCTGCGCATTCTGGAAGTCGGGGGCGGCACCGGTGGCACCACCGCCTGGCTGCTCCCGG
AACTGCGCAACGTCGCCGATGTGCGTTATTGCTTCACGGACATTTCCGCGCTGTTCAGCCGCCGCGCCGAAGACAAGTTC
AGCGAGTACGATTTTGTCGAGTATGCTCAGTTCGACCTGCAAAAACCCGCCAGCGAGCAGGGTTTCCAGGCCGGGCACTA
CGACCTGATCGTAGCCGCCAACGTGATCCACGCCACCCAGCACGTGGGCCAGACCCTGCACAACCTGCGCCCGTTGCTCA
AGCCGGGCGGCGCGCTATTGATGCGCGAGATTACCCGTCCGATGCGCCTGTTCGATTTTGTCTTCGGTCCGTTGGTATTG
CCGCTGCATGACGAGCAAGCGCGCGGCGGCGAACTGTTCCTGTCCACCGCACACTGGCAGCAACAATGCCTGGAAAGCGG
CTTTGAACGCCTCGACTGGCTGCCCGACGATGGCAGTGCCACCGCCGGTATCAGCGAACACATCCTGCTGGCACGCACCC
AGGCAGCCAACGTTGCGCCAGCCCTGCTCACTGCCGGGCACGACAGCGGCAGCGCGGTGCTGGGCAGGCAACTGGGCGAA
CACCTTTACCAGCCCGACTGGACAGATTGCGCAGGCCAGCCGCAACGCTGGAAGACACGCTTGCAGGAAGCCTGCGAACA
ACTGGCCGCACGCCACGGCGATGCTCGCCGCCCTCCCGTATTTAGCCGCGTTGTGCCGTTACCCGAAACCCTGACCGGCC
TGGCGCTGCACTGGAGTGCCGAGCCATTTGGAGTGGCCAGCGTCGAGCTCAAGCAGTGCAATGATCAGGGACAATGGCAG
CGGCTCGGCAGTGCCGACAACCGTTCCGAGCGCCTCACCACGCTGCCCGCCGCCAACACTGCATCCGGCACCCATTACGT
CTGGCAATGGGCGCCAGTGAATGCGCAGGATACCCCGTTGGCAAGGCTGCGCGTCGAGCCGGCCAGCGTCCGGGCGGCAT
TGGCCGCCGTTGGCGTGGCCCATGACCCAGAGGCTTCGGCGTGCCTGCTGATTGTACAAGGCGGCTCGCTGGCGGAGGTC
GCCAGCCAGGTGCTCGATGCAGTGAAGACGGACACCGGGCAGCCTCTGCTGGTGGTCACCCGCAACGCGTGGTCACTGGC
TGCCGATCACACGGTAAATCCCGAGCAACGCGCACTGTGGGGCTTGCTGCGGGTTGCCTGCGCCGAACAACCAGAGCGCG
CACTGGCTGTCATTGACCTGGATAGCAGCAAGCTCAACGGTAGCGATGACTGGCAGGCGCTGCTGCCGGGCTTGAAGGCG
GCGCAAAGCGGTGAACGCTGGATTGCCGTGCGTGACGGCGTCGCGCAGGTGCAGACGCTGAGCGTGCAGCGTCATCAGAG
CGCCAGCCTGCCGGCACAAAGCTTCAAGGACACCGGCTGGCACATCGTGACCGGCGCATTCGGCGGCCTGGGTCGCCTGA
GCAGCCATTGGCTGGCCGATCAGGGGGCGTCGCGCATCGCCCTGTTCGCGCCGCGCTGCCCGCATGACGGCGAGCAATGG
ATAGACACCCTGCAACAGCACTACGGCTGTGAAATACGCTGGATGGCCTGTGATATCAGTGATCAGGCTGCGCTGGCGGC
CTGCATCGACGCGTTGCGGGCCGAGGGCGGCCTGAGCGGCGCGATCCACAGTGCCGGCTTGCTGGACGACACGCCGCTGA
GCAACCTCGACGCCGCGCGCATGCAGCCACTGCTGCAGGTCAAATGCTCGGCAGCCCGCCAACTGCACACGGCGCTGGCG
GATCAAGGTCGCTACCTGTTGCTGTACTCCTCGGCTGCCGCCAGCCTCGGCGCAGCGGGCCAGGGAGCTCATGCGCTGGC
CAGTGCCTACCTGGACGGGCTGGCCGAATCGCAAACCGACACCCGCCTGCACACGGTCAGTATCGCGTGGGGCGCCTGGG
GCGAAACCGGACGTGCCGCCGAGGATCAACTGCATGCGCGACTGGCACTGGGCGGCATGGGTACGCTGGCAACCGGCGAA
GGCTTGTGGCACCTGGAACAGGCAGTCATGCGCAATTCGCCGTGGCGTCTGGCGATGCGTATAGACCCTGAGCGCATCGA
TCCGCGCCGGCGTCTGCTGACCCGGCACATCGAGCAACCCGCGACGCGCGCGCCGGTAAAAAGCACCCGCCACAGCGATG
CGCTCCCGGCACCGACCCTGACCGGTGATCCGCAGGCAGATCAACACGCGCTCAGCCAGTGGTTGAGCGCATCGATCTGC
CGTCAGTTGCGCCTGAGCCCTGACGCTGCACCTGCGCACAACCAGGACCTGATGCAACTGGGCCTCGACTCGTTACTGTT
CCTTGAGCTGAGCAGCGACATCCAGCGCCAATTGGGCGTTCGTCTGGACGCCGAACAGGCCTATCGCGACCTGAGCATTC
GTGGCCTGAGCGCACTGTTGCTGTCCAGCACCGAAAAAGCGCCGCTTGCAGCCCGTGACAATTTCATCGTGCCGCAACCC
GATAGCCGCTTCGAGCCTTTCCCGCTAACGCCGATCCAGCACGCTTACTGGCTGGGCCGTACCGACCTGATTGATTACGG
CGGCGTGGCCTGCCATGTGCTGTTTGAATGGGACAAAGCTTACGCCGACTTTGACCTGACCCGCTTCGAGCAGGCCTGGA
ACGCGCTGATCGCCCGTCATGACATGCTGCGCATGGTCATCGACAGTGACGGCCGCCAGCGCATTCTGCAAAACACGCCC
TGGTATCGCCTGCCGCGTAACGACCTGCGCGAGTTGCCTGCCGAACAACAGCAGCAACGCCTGCTGGACATTCGTGAAGA
CATGTCCTATCGCGTATTGCCCACCGACTGCTGGCCGCTGTTCGAAGTCACCGTCAGCGAACTCGACGCCGGTCATTGCC
GCCTGCACATGAACCTCGACCTGCTGCTGTTCGACGTGCAGAGCTTCAAGGTGATGATGGACGATCTGGCCAGCGCCTAC
GCCGGCCAGACACTCAAGCCGCTGGAACTGACCTTCCGCGACTACGTGATGGCTGACCTGGCGCAGCGCGACAGCCTGCA
ATGGCGACAAGCCTGGCGCTACTGGCAGGACACGCTGGAACAATTGCCTGGCGCCCCGCAGTTGCCGCTCGCGGACAACC
CGCCCAAGGGCCAGCCACGTTTCCGCACCGTTCAGGGCAAGCTCGACGCGGCGCAATGGAACCGTTTCAAGGCGCACTGT
CAGCACGTGGGTGTGACGGCCTCTGCCGCGCTGCTGGCGTTGTTCGCGCAGACACTGGAAAGCGTCAGCCGTACGCCGGA
ATTCACCCTGAATCTGACTTACTTCAACCGTCGCCCCCTGCACCCGCAGGTTCAGCAACTGATCGGTGACTTCACGTCAG
TGTTGTTGATCGATTTCCAGCTGGGGCGTGGTGACAGCCCAGGCCAGGTCATGGCCAACACGCAGGCCAGGCTCTGGCAG
CGTCTGGCACACACTGCCGTGAACGGCGTGGAATTGATGCGTGAACTCGGCCGCCGTCAGGGGCAGACGCGTCAGCCGGC
CATGCCGGTGGTGTTCACAAGCATGCTCGGCATGTCGCTGGACGGCAAAGCCATCGACCAGGCCATGACCTCAACCCTCG
GCGACCCGGTGCACGTCTTCACTCAGACGCCGCAAGTCTGGCTGGACCATCAAGTGATGGAAATCGACGGCGAGCTGGTG
TTCAGTTGGTATTGCATGGAAGACGTACTCGCCGACGGCCTGATCGACAGCCTGTTTCAGAGCTACTGCGACCTGTTACA
AACGCTGGCCGACCAACCACAGGGCTTCGACAGTCTGCCCGAGTTGCCGCGCCATGACTGGTCGGTGAACCTGGACGGCG
AGCGTTTCGACCCGCAGCGGCTCGAGGCGCAGTTGCGTCGTGAGCCGGGCGTGCAGAGCGCGCGCGTCAGCGTGGATCGC
GAAGGTCGCACGCTGCTTGGCGAGTTGGTCAGCCAGTCACCGAGTGCGGTCGATGAAGCGTTGCGCGCCCCGCTGCCGCT
GCTGCTGCCGCTGAGCGAGTTGCCGCAACTGAGCGACATGCAACGCCAGGAAGTTGACCTGACCTGGCAGGCACTGGAAA
GCCGCGCTCGTGAAGGCATTCTGAGCACCCTGCAAAAACACGGGCTGTTCAGCGGCGCCGGACAGCGTCACGACCTGGCA
CAGGTGATGACCCGACTGGGCGCGCAGCCGCAATTTGCGGGCTTGTTGCGCCAGTGGCTGGCAATGCTCTGCCAGCAAGG
CTGCTTGCAGCAGGACGGCCAGCACTATCAGGCCCTGCCCGCGCAGGCGCTTCAAGCCGCTGGCCAGTTGCTCCCCGACG
CCGAGTGGAGCCAGACGCTCGGCACGTACCTGGACGCCTGTATCGAACAGCACGCAGAACTGCTGCGCGGTGATTGTTCG
CCGTTGAGCCTGTTGTTCGGTCACAGCGATGCCGTCGTTCAGGCCCTCTACAGCAATAACCCGGTGCTGCACTGCCTGAA
CAGCGCACTGGCGCAAACTGCCAAAGCGCTGGCGGGCACACGTCGCGATCTGCGGGTGCTGGAAGTCGGTGCTGGCACCG
GTGCGACAACCCGGCACCTGTTGCCGATGCTTGAAGAACACTTGAGCGAGTACCGCTTCACTGATGTGTCCAATCTGTTT
CTCACCCAGGCCCAGGAGCATTTCGCTGCCTGGCCACAACTGACCTGCTCGATGCTCGACGTCAACCAGCCGGTGGATTT
CAGTCAGCACCCGGCACAAGGCTACGACCTGGTCGTGGCGGTCAACGTGATGCACGACGCCGCGCATGTCACACGCTCGC
TCAAGCGTCTGCACCGCCTGTTACGCAGCGGCGGGCATTTGCTGTTGCTGGAAGCCACCGAACGCGACAGCGCGCTGCAA
CTGGCGAGCATCGGCTTTATCGAAGGGCTGAGCAACTTTGAAGACGAGCGCAGTGAAGACGACAAGGCCATGCTCGATCT
GCCGCGCTGGCGCACTGCCGTGCAGGCATCGGGGTTCAGTTGGGTCATGAACTGGCCACAGCAGGCCGACGACAGCATGC
GTCAGCACTTCATGCTCGCCCGGGCCGAAGGCGTCAGCCATCTGGACCTGGCTGCCGTGGCCGGTCACCTGGAGCCACAC
CCTGCGCGATGGCCACTGGCATTGCGTCAGGTGGAACAAGTGGTTACAGCGCCTGCGGCCAGACAAAATCCGCAACAGAC
CTGCGACTCCGGGCCGGCCCGGGAGGTTGATCCGGCATTGCTGGAGGCCGTCTCGCAACTCTGGCAGGAACTGCTCGACC
AACCTATCGAGGCCGACAGCGACTTCTTCCTCAGCGGCGGCGACAGTCTGATCGCCACTCGCATGATCGCCCGCCTGAAC
CGCATGGGGCATTCCGGCAGCAGCCTGCGCAACCTGTTCGACAACCCGCGGCTGAGTGATTTCTGCGCCACCCTGCTCGA
CCACAGCGTGCAGACCGATGACAACCCGCTGGCGCTGGCCAGAGGGCGTAACACGCTGTCGCTGTTTGTGTTCCATGCCT
CGGATGGCGAGGTCAGCGCTTATCTGCCGCTGGCCCAGGCCCTGGACATGCAGGTGTTCGGGCTACAAGCGACGAATGCG
CTGGGCAGCGCTTCGCTCAAGGCGCTGGCCGCCCAATACGAACAGGCCATTCGTCGCCAGCAGGCCAGCGGGCCTTACGT
GCTGCTGGGCTGGTCGTACGGCACCTTCGTCGCCGAAGAAACCGCGCGTCTGCTGCTGCGCCAGGGTGAGCAGGTACGAC
TGATTCTTCTAGATCCGGTGTGCCGCGCGGACTTGCGCTTCGATGATCGCCCAGGCTTGTTGCGCCTGATGGCCGAGGGC
GCAAAAAGCATCGCCCTGCCTGACGACCTCGAACAATTGCCGGCCGCCGAGCAGTTGAACGTGTTTATGAACAACGCCAC
GCAGGCCGGCGTGCTGAAAAACCCGCCGCAAGCGCAGCAGGCCGAACAGTGGCTGCAGCGCATCGAGCACCTGATGACAC
TGCTGGCGCAACACTCGCAACCACGACAACTGGACCTGCCCTGCCTGTGGCTGAGCGCCGAAGGCCGCCCGCAGCACTGG
CTGCCCGCCGAGCAGGACTGGCAGGAATGGGCCGCTACGGCGCGCCGCGAATCGATGCCGTGCGATCACTGGCAACTGCT
GCTCGACAGCGATCAGGTTCAGCGTACTGCGGCGTCGATCAGCGCCTGGCTGGCCGCCACCCACAAGGAGAGTCATCCAT
GA

Protein sequence :
MPTFEHTYGASEPIAVIGLACRFPGARDSTEYWQNLLAGRECSRHFSREELLAAGLSAELIDNPDFVNVAAIIDDPDRFD
AALFGYSRQEAESIDPQQRLFLQTVWHALEHAGFAPRDVAHKTGVFASGRMSTYPGRDAIRVTEVAQVKGLQALMGNDKD
YLASRAAYKLNLRGPAMSVQTACSSSLVAVHMACESLRSGECEMAVAGGVAVSFPQHAGYLYQPGMIFSPDGRCRPFDAS
AQGTFAGNGVGAVTLRRLEDALRDGDPVLAVLRGSAINNDGHHKVGYTAPSVIGQREVIQDALLLADIDCASIGMLEAHG
TGTPLGDPIEVQALRDAFAHRTDIGGCALGSVKGNLGHLDTAAGIASLIKTVLAVSHGRIPPSINVERINPALQLEQSPF
YVPIQDRSWPAGPRRAGVSSFGIGGTNCHVIVEALPDALRSSGPVAQASALLLSAASQHSLRQLAGRYAERLHAADAAAD
LAHTALQARQLDLPFRLAVPLHEETAPALQAFAQGGSDALLYQGQASQGAQLWLCSGQGSQWAGMGKSLYGQSKAFSESL
DRSFAACAAHLQPSLQSVMFGEHEGLIDRMDYAQPAIVAFEVAMAAHWREAGLSPDLLIGHSVGEFAAVVIAGIYRLEDI
LPLVIIRGRLMNQCAAQGSMLAVFCDAATLQPLALEHGVEIAVHNAAQHLVASGERHAIRALAQTLQQRNLRHNHLSVAG
AAHSRLLDPILDEFQKASAALRPAQAKIPLISTLTGQPLSHAELERGDYWRRHLRQPVRYHQALTHALQTGVTIALELGA
DAPLTGIGTRLEHAGVHWIASARRHKPATTVLQDSLLRLFAAGAALPWRTLLPSVGKRIHAPLYVFDEQRYWCDASQQPS
TQAQDLLLEAGRKVVLQEGANLDLPRLERLYHCVTQLHAIYVDQMVRQCVPESIDQGAEPLSILRGGRLLPRHRQLLVRL
LNACVEDGYYTLEHGRYRSARPIPYEQRPALLTELRSCCEGLDVIADTVERAGEQLFAMMSGAVEPVSVIFPQSTSSGVE
VLYQQFSFGRYFNQIAAGVITGLIQAHQHSGHGPLRILEVGGGTGGTTAWLLPELRNVADVRYCFTDISALFSRRAEDKF
SEYDFVEYAQFDLQKPASEQGFQAGHYDLIVAANVIHATQHVGQTLHNLRPLLKPGGALLMREITRPMRLFDFVFGPLVL
PLHDEQARGGELFLSTAHWQQQCLESGFERLDWLPDDGSATAGISEHILLARTQAANVAPALLTAGHDSGSAVLGRQLGE
HLYQPDWTDCAGQPQRWKTRLQEACEQLAARHGDARRPPVFSRVVPLPETLTGLALHWSAEPFGVASVELKQCNDQGQWQ
RLGSADNRSERLTTLPAANTASGTHYVWQWAPVNAQDTPLARLRVEPASVRAALAAVGVAHDPEASACLLIVQGGSLAEV
ASQVLDAVKTDTGQPLLVVTRNAWSLAADHTVNPEQRALWGLLRVACAEQPERALAVIDLDSSKLNGSDDWQALLPGLKA
AQSGERWIAVRDGVAQVQTLSVQRHQSASLPAQSFKDTGWHIVTGAFGGLGRLSSHWLADQGASRIALFAPRCPHDGEQW
IDTLQQHYGCEIRWMACDISDQAALAACIDALRAEGGLSGAIHSAGLLDDTPLSNLDAARMQPLLQVKCSAARQLHTALA
DQGRYLLLYSSAAASLGAAGQGAHALASAYLDGLAESQTDTRLHTVSIAWGAWGETGRAAEDQLHARLALGGMGTLATGE
GLWHLEQAVMRNSPWRLAMRIDPERIDPRRRLLTRHIEQPATRAPVKSTRHSDALPAPTLTGDPQADQHALSQWLSASIC
RQLRLSPDAAPAHNQDLMQLGLDSLLFLELSSDIQRQLGVRLDAEQAYRDLSIRGLSALLLSSTEKAPLAARDNFIVPQP
DSRFEPFPLTPIQHAYWLGRTDLIDYGGVACHVLFEWDKAYADFDLTRFEQAWNALIARHDMLRMVIDSDGRQRILQNTP
WYRLPRNDLRELPAEQQQQRLLDIREDMSYRVLPTDCWPLFEVTVSELDAGHCRLHMNLDLLLFDVQSFKVMMDDLASAY
AGQTLKPLELTFRDYVMADLAQRDSLQWRQAWRYWQDTLEQLPGAPQLPLADNPPKGQPRFRTVQGKLDAAQWNRFKAHC
QHVGVTASAALLALFAQTLESVSRTPEFTLNLTYFNRRPLHPQVQQLIGDFTSVLLIDFQLGRGDSPGQVMANTQARLWQ
RLAHTAVNGVELMRELGRRQGQTRQPAMPVVFTSMLGMSLDGKAIDQAMTSTLGDPVHVFTQTPQVWLDHQVMEIDGELV
FSWYCMEDVLADGLIDSLFQSYCDLLQTLADQPQGFDSLPELPRHDWSVNLDGERFDPQRLEAQLRREPGVQSARVSVDR
EGRTLLGELVSQSPSAVDEALRAPLPLLLPLSELPQLSDMQRQEVDLTWQALESRAREGILSTLQKHGLFSGAGQRHDLA
QVMTRLGAQPQFAGLLRQWLAMLCQQGCLQQDGQHYQALPAQALQAAGQLLPDAEWSQTLGTYLDACIEQHAELLRGDCS
PLSLLFGHSDAVVQALYSNNPVLHCLNSALAQTAKALAGTRRDLRVLEVGAGTGATTRHLLPMLEEHLSEYRFTDVSNLF
LTQAQEHFAAWPQLTCSMLDVNQPVDFSQHPAQGYDLVVAVNVMHDAAHVTRSLKRLHRLLRSGGHLLLLEATERDSALQ
LASIGFIEGLSNFEDERSEDDKAMLDLPRWRTAVQASGFSWVMNWPQQADDSMRQHFMLARAEGVSHLDLAAVAGHLEPH
PARWPLALRQVEQVVTAPAARQNPQQTCDSGPAREVDPALLEAVSQLWQELLDQPIEADSDFFLSGGDSLIATRMIARLN
RMGHSGSSLRNLFDNPRLSDFCATLLDHSVQTDDNPLALARGRNTLSLFVFHASDGEVSAYLPLAQALDMQVFGLQATNA
LGSASLKALAAQYEQAIRRQQASGPYVLLGWSYGTFVAEETARLLLRQGEQVRLILLDPVCRADLRFDDRPGLLRLMAEG
AKSIALPDDLEQLPAAEQLNVFMNNATQAGVLKNPPQAQQAEQWLQRIEHLMTLLAQHSQPRQLDLPCLWLSAEGRPQHW
LPAEQDWQEWAATARRESMPCDHWQLLLDSDQVQRTAASISAWLAATHKESHP

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
irp1 YP_853076.1 yersiniabactin biosynthetic protein Virulence PAI IV APEC-O1 Protein 0.0 53
irp1 YP_002346901.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 52
irp1 NP_669707.1 HMWP1 nonribosomal peptide/polyketide synthase Virulence HPI Protein 0.0 52
irp1 CAA21391.1 - Virulence HPI Protein 0.0 52
irp1 YP_070123.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 52
irp1 YP_001006816.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 52
irp1 CAA73127.1 HMWP1 protein Virulence HPI Protein 0.0 52
irp1 NP_993006.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 52