Gene Information

Name : irp (ECED1_2250)
Accession : YP_002398189.1
Strain : Escherichia coli ED1a
Genome accession: NC_011745
Putative virulence/resistance : Virulence
Product : High-molecular-weight nonribosomal peptide/polyketide synthetase 1
Function : -
COG functional category : Q : Secondary metabolites biosynthesis, transport and catabolism
COG ID : COG3321
EC number : -
Position : 2179482 - 2188973 bp
Length : 9492 bp
Strand : +
Note : Evidence 2a : Function of homologous gene experimentally demonstrated in another organism; PubMedId : 15719346, 15582399, 11927258; Product type e : enzyme
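
The Position and Length fields agree if the coordinates are read as 1-based and inclusive; that convention is an assumption here, not something the record states. A minimal Python sketch of the check, using a hypothetical helper named `feature_length`:

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the Position field of this record.
start, end = 2179482, 2188973

length = feature_length(start, end)
print(length)        # 9492, matching the Length field

# A coding sequence should be a whole number of codons.
print(length % 3)    # 0
print(length // 3)   # 3164 codons, counting the terminal stop codon
```

Under the same assumption, 3164 codons including one stop codon correspond to a 3163-residue product.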

DNA sequence :
ATGGATAACTTGCGCTTCTCTTCTGCGCCGACAGCAGATTCCATTGATGCATCGATCGCTCAACACTACCCGGACTGCGA
ACCTGTCGCGGTTATCGGCTACGCCTGCCATTTTCCTGAATCGCCGGATGGCGAAACGTTCTGGCAAAATCTGCTGGAAG
GTCGTGAATGCAGCCGACGCTTTACGCGCGAAGAGCTTCTGGCCGTCGGTCTGGATGCCGCCATCATTGACGATCCTCAT
TATGTCAATATCGGTACGGTGTTAGACAACGCCGACTGCTTCGACGCCACCCTGTTTGGCTATTCGCGACAGGAAGCGGA
GTCGATGGACCCGCAGCAGCGCCTATTTTTGCAGGCGGTCTGGCATGCGCTGGAACATGCCGGTTATGCCCCCGGCGCCG
TCCCCCATAAGACCGGCGTTTTCGCCTCTTCCCGGATGAGTACCTACCCCGGTCGCGAAGCATTGAACGTGACAGAAGTC
GCGCAGGTAAAAGGTCTGCAATCTCTGATGGGCAATGATAAAGACTATATTGCCACCCGCGCCGCGTACAAACTCAACCT
GCACGGCCCGGCGTTATCGGTACAGACCGCCTGCTCCAGCTCGCTGGTTGCCGTGCATCTGGCCTGTGAAAGCCTGCGCG
CAGGCGAATCCGATATGGCGGTTGCCGGCGGCGTGGCGCTCTCTTTCCCCCAGCAGGCAGGCTACCGCTACCAGCCCGGA
ATGATTTTCTCTCCTGATGGTCACTGTCGTCCCTTTGACGCCTCGGCTGAGGGCACCTGGGCCGGTAACGGTCTCGGCTG
CGTGGTGCTGCGTCGCCTGAGAGACGCGCTGCTGTCAGGCGATCCGATTATCTCGGTGATCCTCTCCAGCGCGGTCAACA
ACGACGGCAACAGAAAGGTCGGCTATACCGCCCCTTCCGTCGCAGGGCAACAGGCAGTCATCGAAGAGGCGTTAATGCTG
GCGGCCATCGACGACAGGCAGGTAGGTTACATTGAAACCCACGGCACCGGCACACCGCTGGGCGACGCGATTGAAATTGA
AGCGTTACGCAACGTCTATGCGCCTCGCCCGCAGGATCAGCGCTGTGCGCTCGGTTCCGTGAAAAGTAACATGGGCCATC
TGGATACCGCGGCGGGCATTGCCGGACTGCTGAAAACCGTTCTGGCAGTCAGTCGCGGGCAAATTCCTCCCTTACTGAAT
TTTCACACCCCCAACCCGGCGCTGAAACTTGAAGAGAGCCCCTTTACCATACCGGTGTCGGCACAGGCATGGCAGGACGA
AATGCGCTATGCGGGCGTCTCCTCCTTTGGTATTGGCGGCACCAACTGCCATATGATCGTCGCCTCGCTGCCCGACGCGC
TCAACGCGCGCCTCCCCAATACGGATAGCGGCAGAAAAAGTACCGCGCTGCTGCTCAGCGCAGCCAGCGACAGCGCGTTG
CGGCGGCTGGCGACGGATTATGCCGGGGCGCTGAGAGAGAATGCGGATGCCAGCTCTCTGGCCTTCACAGCCCTGCACGC
GCGCCGTCTCGATCTCCCCTTCCGCCTGGCGGCGCCATTAAACCGTGAAACCGCCGAGGCGCTCAGCGCCTGGGCCGGTG
AGAAATCGGGGGCGCTGGTTTACAGCGGCCACGGCGCCAGCGGCAAGCAGGTGTGGCTGTTTACCGGCCAGGGCTCGCAC
TGGCGCACTATGGGTCAAACGATGTACCAGCACTCAACGGCGTTTGCCGACACGCTGGATCGCTGTTTTTCCGCCTGTAG
CGAAATGCTCACGCCGTCACTGCGCGAAGCGATGTTTAACCCCGATTCGGCGCAACTGGACAATATGGCCTGGGCGCAGC
CGGCGATTGTCGCGTTTGAAATCGCGATGGCGGCGCACTGGCGTGCTGAAGGACTGAAGCCAGACTTCGCCATTGGGCAT
TCCGTCGGTGAATTTGCCGCTGCTGTTGTCTGCGGACACTATACGATTGAACAGGTCATGCCACTGGTTTGTCGGCGCGG
CGCGCTAATGCAGCAGTGCGCAAGCGGCGCGATGGTGGCGGTATTTGCAGACGAAGACACGCTGATGCCGCTGGCTCGCC
AGTTTGAGCTGGATCTCGCCGCCAATAACGGTACGCAACATACGGTATTTTCCGGACCGGAAGCCCGTCTCGCAGTATTT
TGCGCCACGCTCTCGCAGCATGACATTAACTATCGTCGCCTGAGCGTAACCGGTGCGGCGCACTCCGCTTTACTGGAGCC
GATACTCGATCGGTTCCAGGACGCCTGCGCGGGACTGCACGCGGAGCCGGGGCAAATACCGATTATTTCCACGCTCACCG
CCGACGTCATTGATGAGTCAACGCTCAACCAGGCGGATTACTGGCGCCGACACATGCGCCAGCCGGTGCGTTTTATCCAG
AGTATTCAGGTGGCGCATCAGCTCGGCGCCCGCGTTTTTCTGGAGATGGGGCCCGATGCCCAGTTGGTTGCTTGCGGGCA
GCGCGAATACCGCGATAACGCATACTGGATAGCCAGCGCCCGGCGTAACAAAGAGGCGAGCGATGTCCTCAATCAGGCCC
TGCTCCAGCTTTACGCTGCCGGCGTCGCCCTACCGTGGGCCGACCTGCTGGCGGGCGATGGACAACGTATCGCTGCGCCA
TGTTATCCGTTTGATACTGAGCGTTACTGGAAAGAGCGCGTCTCCCCGGCCTGCGAGCCTGCCGACGCAGCGCTGTCTGC
CGGGCTGGAGGTGGCGAGTCGCGCCGCGACAGCGCTCGATCTCCCTCGCCTGGAAGCGCTTAAACAGTGCGCCACGCGAC
TGCACGCCATCTACGTCGATCAACTGGTACAACGCTGTACCGGCGATGCCATTGAGAACGGCGTGGACGCCATGACCATC
ATGCGCCGTGGACGTCTGCTGCCCCGCTACCAGCAGCTACTCCAGCGCCTGCTGAATAACTGCGTGGTCGACGGCGATTA
CCGCTGCACCGACGGGCGATACGTCCGCGCCCGCCCCATTGAACATCAACAGCGGGAATCACTGCTGACGGAACTTGCCG
GTTATTGTGAAGGTTTTCAGGCTATTCCCGACACCATCGCCCGTGCCGGCGATCGGTTATATGAAATGATGAGCGGCGCG
GAAGAACCGGTGGCGATTATCTTCCCGCAAAGCGCCTCCGACGGCGTGGAAGTGCTGTATCAGGAATTCAGCTTTGGCCG
CTATTTCAACCAAATCGCCGCCGGGGTATTACGCGGCATTGTCCAGACGCGTCAGCCCCGCCAGCCGTTGCGTATTCTTG
AAGTTGGCGGCGGAACCGGCGGCACCACCGCGTGGCTGCTGCCGGAACTCAACGGCGTTCCGGCACTGGAGTACCATTTC
ACCGATATCTCGGCGCTGTTCACCCGTCGCGCCCAGCAGAAATTCGCCGACTATGATTTTGTGAAGTATAGCGAGCTGGA
TCTCGAAAAAGAGGCGCAGTCTCAGGGTTTCCAGGCACAGTCTTACGATCTTATCGTGGCAGCGAACGTGATTCACGCCA
CCCGCCATATTGGCCGCACGCTCGATAATCTGCGCCCCCTGCTCAAGCCGGGCGGGCGCCTGCTGATGCGCGAAATCACC
CAGCCAATGCGTCTGTTTGACTTCGTTTTCGGCCCGCTGGTTCTTCCGCTACAGGATCTCGACGCCCGCGAAGGTGAGTT
ATTCCTCACCACCGCTCAGTGGCAACAACAGTGCCGCCACGCCGGATTCAGCAAAGTGGCGTGGCTACCGCAGGATGGCA
GCCCGACCGCCGGGATGAGCGAACATATCATTCTCGCCACGCTGCCCGGTCAGGCGGTTAGCGCCGTAACATTCACCGCG
CCATCAGAACCCGTGTTGGGGCAGGCGCTGACGGATAACGGTGATTATCTCGCCGACTGGTCTGATTGCGCAGGTCAGCC
CGAACGGTTTAACGCCCGCTGGCAGGAGGCCTGGCGTCTGCTTTCACAGCGTCATGGCGACGCTCTGCCTGTGGAACCGC
CCCCCGTCGCCGCCCCGGAGTGGCTGGGGAAGGTTCGCTTAAGCTGGCAAAACGAAGCCTTTTCCCGCGGTCAGATGCGC
GTTGAAGCCCGTCATCCTGCTGGCGAGTGGCTGCCGCTATCGCCCGCCGCGCCTCTTCCTGCGCCGCAGACGCATTATCA
ATGGCGCTGGACGCCCCTCAACGTCGCCAGCATTGACCAACCGCTTACCTTTAGCTTCAGCGCCGGTACGCTTGCGCGCA
GCGACGAGCTGGCGCAATACGGCATCATTCACGATCCGCACGCCTCTTCGCGACTGATGATTGTTGAGGAGAGCGAGGAT
ACGCTGGCCTTAGCGGAGAAAGTGATAGCAGCGCTCACCGCCAGCGCAGCCGGATTGATTGTGGTTACTCGCCGCGCGTG
GCGAGTCGAGGAAAATGAAGCACTCTCTGCATCCCATCACGCGCTATGGGCCTTGCTTCGAGTCGCGGCCAACGAACAGC
CGGAACGGTTGCTTGCCGCCATCGATCTCGCCGAAAACACCCCGTGGGAAACGCTGCATCAAGGGTTGAGCGCAGTCTCA
CTATCACAGCGCTGGCTCGCCGCACGGGGTGACACCCTTTGGCTCCCTTCACTGGCGCCCAATACGGGATGCGCCGCTGA
ATTACCGGCAAACGTGTTTACCGGCGATAGCCGCTGGCATCTGGTGACCGGAGCGTTTGGCGGATTAGGCCGCCTTGCCG
TGAACTGGCTCAGAGAAAAAGGGGCGCGACGCATCGCCCTGCTGGCGCCGCGCGTGGATGAGTCATGGCTACGCGACGTG
GAGGGCGGGCAGACGCGCGTCTGCCGTTGTGATGTGGGCGATGCCGGGCAACTGGCCACGGTTCTTGACGATCTGGCGGC
CAACGGCGGCATTGCCGGAGCGATTCATGCCGCTGGCGTATTGGCTGACGCGCCCTTGCAGGAGCTTGATGACCACCAGC
TGGCTGCCGTTTTCGCGGTAAAAGCGCAGGCGGCAAGCCAGCTGTTGCAAACCCTGCGCAACCACGACGGACGCTATCTT
ATTCTCTACTCTTCCGCTGCCGCCACCCTCGGCGCGCCGGGTCAGAGCGCCCATGCGCTGGCCTGCGGCTACCTGGACGG
GCTGGCCCAGCAGTTTTCCACCCTTGATGCGCCGAAAACGCTCTCTGTCGCCTGGGGCGCATGGGGAGAAAGCGGTCGGG
CGGCCACGCCGGAAATGCTGGCGACGCTCGCCAGCCGAGGTATGGGCGCGTTAAGCGATGCCGAAGGCTGCTGGCACCTG
GAACAGGCGGTGATGCGCGGCGCCCCGTGGCGACTGGCGATGCGCGTTTTTACCGACAAAATGCCCCCGTTACAACAGGC
TCTGTTTAACATCAGCGCCACAGAAAAAGCCGCAACGCCGGTCATTCCTCCTGCTGATGACAACGCCTTTAACGGCAGCC
TGAGCGATGAAACAGCGGTGATGGCATGGCTGAAAAAGCGGATTGCGATTCAGCTAAGGCTGAGCGATCCGGCGTCACTG
CATCCAAACCAGGATCTGTTGCAACTCGGCATGGACTCGCTGCTCTTCCTTGAACTCAGTAGCGATATTCAGCACTACCT
GGGCGTACGCATCAATGCGGAACGGGCGTGGCAGGATCTGTCTCCTCATGGACTCACGCAGCTTATCTGTTCTAAGCCAG
AGGCGACGCCTGCCGCTTCGCAGCCGGAAGTGTTGCGGCACGACGCCGACGAGCGTTATGCGCCCTTCCCTTTGACGCCC
ATTCAGCACGCCTACTGGCTGGGGCGAACCCACCTCATTGGCTATGGCGGCGTCGCCTGTCACGTCCTGTTTGAGTGGGA
TAAACGCCACGATGAGTTCAATCTCGCCATACTGGAGAAAGCATGGAACCAGCTCATCGCACGCCACGATATGTTGCGTA
TGGTGGTTGATGCCGACGGGCAGCAGCGAATCCTGGCGACAACGCCGGAGTATCACATCCCGCGTGACGATCTGCGCGCG
CTTTCCCCGGAAGAACAGCGCATCGCGCTGGAAAAACGGCGGCATGAACTGAGCTATCGCGTTTTGCCTGCCGACCAGTG
GCCTCTTTTTGAGCTGGTGGTCAGCGAAATCGACGATTGCCATTACCGTCTGCATATGAACCTCGACCTTTTGCAGTTTG
ATGTGCAGAGTTTTAAAGTCATGATGGACGACCTGGCGCAGGTCTGGCGCGGTGAAACGCTGGCGCCGCTCGCTATTACC
TTCCGTGATTATGTGATGGCTGAACAGGCGCGCCGACAGACATCGGCATGGCACGATGCCTGGGATTACTGGCAGGAAAA
ACTGCCGCAACTGCCCTTAGCGCCAGAGCTGCCGGTGGTTGAGACGCCCCCGGAAACGCCACACTTCACCACCTTCAAAT
CGACGATCGGCAAGACAGAATGGCAGGCCGTGAAACAGCGCTGGCAGCAGCAAGGCGTCACACCGTCTGCCGCGCTGCTC
ACGCTGTTTGCCGCCACCCTTGAGCGCTGGAGCCGTACCACAACATTTACGCTGAACCTGACGTTCTTCAATCGCCAGCC
GATCCATCCGCAAATCAACCAGTTGATTGGTGATTTTACCTCCGTCACGCTGGTTGATTTTAACTTCTCAGCGCCGGTGA
CGTTGCAAGAGCAGATGCAACAGACCCAACAGCGCCTCTGGCAAAACATGGCGCACAGTGAAATGAACGGTGTTGAGGTG
ATCCGTGAGCTGGGCCGCCTGCGCGGATCACAACGTCAACCGCTGATGCCGGTAGTGTTTACCAGTATGCTGGGGATGAC
GCTGGAAGGCATGACTATCGATCAGGCGATGAGCCATCTGTTCGGCGAACCCTGCTATGTATTCACGCAAACGCCGCAGG
TCTGGCTGGATCATCAGGTCATGGAGAGCGACGGCGAGTTGATGTTTAGCTGGTACTGCATGGACAACGTGCTGGAACCC
GGCGCTGCCGAGGCGATGTTTAATGACTATTGCGCCATCCTGCAAGCCGTCATCGCCGCCCCTGAAAGCCTGAAGACTCT
CGCCAGCGGCATCGCCGGGCACATTCCCCGCCGACGCTGGCCGCTGAACGCGCAGGCGGACTACGACCTGCGGGATATTG
AGCAGGCGACGCTCGAATACCCCGGCATCCGGCAGGCCAGAGCGGAAATAACCGAACAGGGCGCGTTGACGCTGGATATC
GTGATGGCCGACGATCCGTCGCCATCAGCGGCGATGCCTGATGAGCACGAACTTACCCAACTGGCGCTGCCGTTGCCTGA
GCAGGCGCAGCTTGATGAGCTGGAGGCGACCTGGCGCTGGCTGGAGGCGCGTGCGCTACAGGGGATCGCGGCTACGCTAA
ATCGTCACGGCCTGTTTACCACGCCGGAGATCGCCCATCGCTTTAGCGCAATAGTACAGGCGCTGTCCGCGCAAGCGTCT
CACCAGCGTCTGCTGCGCCAGTGGCTACAGTGTCTGACGGAAAGAGAGTGGTTAATCCGCGAAGGTGAAAGCTGGCGCTG
CCGCATTCCGCTCAGCGAGATTCCTGAGCCTCAGGAAGCGTGCCCGCAAAGCCAATGGAGCCAGGCGCTGGCGCAGTATC
TGGAAACCTGCATCGCCCGGCACGACGCCCTCTTCTCCGGGCGGTGTTCTCCGCTGGAATTGCTGTTCAACGAGCAGCAT
CGCGTTACCGACGCGCTGTATCGCGACAACCCCGCCAGCGCCTGTCTGAATCGCTATACCGCGCAGATTGCCGCCTTGTG
CAGCGCAGAACGGATTCTGGAGATTGGCGCCGGAACCGCAGCCACTACCGCGCCGGTGCTGAAGGCCACGCGGAACACGC
GGCAGTCGTACCACTTCACGGACGTCTCCGCGCAGTTCCTCAATGACGCCAGAGCCCGTTTCCATGATGAATCGCAGGTG
TCTTATGCCTTGTTCGACATCAACCAGCCGCTGGATTTCACCGCCCACCCGGAGGCGGGTTACGACCTGATCGTTGCCGT
CAATGTGCTCCACGACGCCAGCCATGTCGTCCAGACGTTGCGCAGATTAAAACTGTTGCTGAAAGCCGGCGGACGTTTGC
TGATCGTTGAAGCGACGGAGCGAAACAGCGTATTCCAGCTGGCGAGCGTGGGCTTTATTGAGGGATTAAGCGGATACCGC
GATTTCCGCCGCCGGGATGAGAAACCGATGCTCACCCGCTCCGCATGGCAGGAGGTTCTCGTTCAGGCCGGGTTTGCAAA
CGAGCTGGCGTGGCCCGCGCAGGAATCGTCGCCGCTGCGCCAGCATCTGCTGGTGGCGCGTTCGCCTGGCGTAAATCGCC
CGGATAAAAAAGCCGTGAGCCGCTATTTACAGCAGCGCTTTGGCACCGGTCTGCCCATTTTACAGATCCGGCAAAGAGAA
GCGTTATTTACGCCGCTGCATGCCCCGTCTGATACGCCGACTGAGCCAGCCAAACCCACGCCAGTTGCCGGGGGGAATCC
GGCGCTGGAAAGACAGGTGGCTGAACTCTGGCAATCGCTGCTGTCTCGCCCCGTGGCAAGGCATCACGACTTTTTCGAAC
TGGGTGGCGACAGCCTGATGGCGACAAGGATGGTCGCGCAGCTAAACCGGAGAGGGATTGCTAGGGCTAACCTTCAGGAT
CTGTTCAGCCATTCGACGCTGAGCGACTTCTGCGCCCATCTACAGGCGGCTACGTCAGGAGAGGACAACCCGATACCCCT
TTGCCAGGGCGACGGTGAGGAAACCCTGTTTGTCTTCCACGCTTCAGACGGCGATATCAGCGCCTGGCTGCCGCTCGCTA
GCGCGTTGAACAGGCGCGTTTTCGGCCTGAAAGCAAAATCGCCGCAGCGCTTTGCCACGCTCGACCAGATGATCGATGAG
TATGTCGGGTGCATCCGTCGTCAGCAGCCTCACGGCCCTTATGTGCTGGCGGGTTGGTCGTATGGCGCGTTTCTTGCGGC
GGGCGCCGCACAGCGCCTGTACGCCAAAGGCGAGCAGGTTCGGATGGTGTTAATCGATCCCGTGTGCCGACAGGATTTCT
GTTGCGAAAACCGGGCGGCCCTGCTGCGCCTGTTAGCCGAAGGACAAACGCCTCTGGCACTGCCCGAACATTTCGACCAG
CAGACGCCCGACAGCCAGCTTGCCGACTTTATCAGCCTCGCTAAAACGGCCGGTATGGTGTCGCAAAACCTGACGCTGCA
AGCGGCAGAAACGTGGCTCGACAACATCGCGCATCTGCTGCGTTTACTGACTGAGCATACGCCGGGCGAAAGCGTTCCGG
TCCCCTGTCTCATGGTGTATGCCGCCGGGAGACCCGCGCGCTGGACGCCAGCAGAAACCGAGTGGCAGGGCTGGATAAAC
AACGCCGACGACGCTGTGATTGAAGCCAGCCACTGGCAAATCATGATGGAAGCCCCCCACGTTCAGGCTTGTGCGCAACA
CATTACGCGCTGGCTTTGCGCAACCTCAACGCAACCGGAGAACACGTTATGA
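
The DNA sequence above can be checked against the protein sequence by translating it in frame 1. The sketch below assumes the standard genetic code (NCBI translation table 1, which is also used for bacteria apart from alternative start codons); the record itself does not name a table. `translate` is a hypothetical helper, and only the first 42 and last 9 nucleotides of the gene are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


head = "ATGGATAACTTGCGCTTCTCTTCTGCGCCGACAGCAGATTCC"  # first 42 nt of the gene
tail = "ACGTTATGA"                                   # last 9 nt of the gene

print(translate(head))  # MDNLRFSSAPTADS
print(translate(tail))  # TL
```

Both fragments match the corresponding ends of the protein sequence in this record (it begins MDNLRFSSAPTADS and ends TL), and the final codon TGA is a stop.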

Protein sequence :
MDNLRFSSAPTADSIDASIAQHYPDCEPVAVIGYACHFPESPDGETFWQNLLEGRECSRRFTREELLAVGLDAAIIDDPH
YVNIGTVLDNADCFDATLFGYSRQEAESMDPQQRLFLQAVWHALEHAGYAPGAVPHKTGVFASSRMSTYPGREALNVTEV
AQVKGLQSLMGNDKDYIATRAAYKLNLHGPALSVQTACSSSLVAVHLACESLRAGESDMAVAGGVALSFPQQAGYRYQPG
MIFSPDGHCRPFDASAEGTWAGNGLGCVVLRRLRDALLSGDPIISVILSSAVNNDGNRKVGYTAPSVAGQQAVIEEALML
AAIDDRQVGYIETHGTGTPLGDAIEIEALRNVYAPRPQDQRCALGSVKSNMGHLDTAAGIAGLLKTVLAVSRGQIPPLLN
FHTPNPALKLEESPFTIPVSAQAWQDEMRYAGVSSFGIGGTNCHMIVASLPDALNARLPNTDSGRKSTALLLSAASDSAL
RRLATDYAGALRENADASSLAFTALHARRLDLPFRLAAPLNRETAEALSAWAGEKSGALVYSGHGASGKQVWLFTGQGSH
WRTMGQTMYQHSTAFADTLDRCFSACSEMLTPSLREAMFNPDSAQLDNMAWAQPAIVAFEIAMAAHWRAEGLKPDFAIGH
SVGEFAAAVVCGHYTIEQVMPLVCRRGALMQQCASGAMVAVFADEDTLMPLARQFELDLAANNGTQHTVFSGPEARLAVF
CATLSQHDINYRRLSVTGAAHSALLEPILDRFQDACAGLHAEPGQIPIISTLTADVIDESTLNQADYWRRHMRQPVRFIQ
SIQVAHQLGARVFLEMGPDAQLVACGQREYRDNAYWIASARRNKEASDVLNQALLQLYAAGVALPWADLLAGDGQRIAAP
CYPFDTERYWKERVSPACEPADAALSAGLEVASRAATALDLPRLEALKQCATRLHAIYVDQLVQRCTGDAIENGVDAMTI
MRRGRLLPRYQQLLQRLLNNCVVDGDYRCTDGRYVRARPIEHQQRESLLTELAGYCEGFQAIPDTIARAGDRLYEMMSGA
EEPVAIIFPQSASDGVEVLYQEFSFGRYFNQIAAGVLRGIVQTRQPRQPLRILEVGGGTGGTTAWLLPELNGVPALEYHF
TDISALFTRRAQQKFADYDFVKYSELDLEKEAQSQGFQAQSYDLIVAANVIHATRHIGRTLDNLRPLLKPGGRLLMREIT
QPMRLFDFVFGPLVLPLQDLDAREGELFLTTAQWQQQCRHAGFSKVAWLPQDGSPTAGMSEHIILATLPGQAVSAVTFTA
PSEPVLGQALTDNGDYLADWSDCAGQPERFNARWQEAWRLLSQRHGDALPVEPPPVAAPEWLGKVRLSWQNEAFSRGQMR
VEARHPAGEWLPLSPAAPLPAPQTHYQWRWTPLNVASIDQPLTFSFSAGTLARSDELAQYGIIHDPHASSRLMIVEESED
TLALAEKVIAALTASAAGLIVVTRRAWRVEENEALSASHHALWALLRVAANEQPERLLAAIDLAENTPWETLHQGLSAVS
LSQRWLAARGDTLWLPSLAPNTGCAAELPANVFTGDSRWHLVTGAFGGLGRLAVNWLREKGARRIALLAPRVDESWLRDV
EGGQTRVCRCDVGDAGQLATVLDDLAANGGIAGAIHAAGVLADAPLQELDDHQLAAVFAVKAQAASQLLQTLRNHDGRYL
ILYSSAAATLGAPGQSAHALACGYLDGLAQQFSTLDAPKTLSVAWGAWGESGRAATPEMLATLASRGMGALSDAEGCWHL
EQAVMRGAPWRLAMRVFTDKMPPLQQALFNISATEKAATPVIPPADDNAFNGSLSDETAVMAWLKKRIAIQLRLSDPASL
HPNQDLLQLGMDSLLFLELSSDIQHYLGVRINAERAWQDLSPHGLTQLICSKPEATPAASQPEVLRHDADERYAPFPLTP
IQHAYWLGRTHLIGYGGVACHVLFEWDKRHDEFNLAILEKAWNQLIARHDMLRMVVDADGQQRILATTPEYHIPRDDLRA
LSPEEQRIALEKRRHELSYRVLPADQWPLFELVVSEIDDCHYRLHMNLDLLQFDVQSFKVMMDDLAQVWRGETLAPLAIT
FRDYVMAEQARRQTSAWHDAWDYWQEKLPQLPLAPELPVVETPPETPHFTTFKSTIGKTEWQAVKQRWQQQGVTPSAALL
TLFAATLERWSRTTTFTLNLTFFNRQPIHPQINQLIGDFTSVTLVDFNFSAPVTLQEQMQQTQQRLWQNMAHSEMNGVEV
IRELGRLRGSQRQPLMPVVFTSMLGMTLEGMTIDQAMSHLFGEPCYVFTQTPQVWLDHQVMESDGELMFSWYCMDNVLEP
GAAEAMFNDYCAILQAVIAAPESLKTLASGIAGHIPRRRWPLNAQADYDLRDIEQATLEYPGIRQARAEITEQGALTLDI
VMADDPSPSAAMPDEHELTQLALPLPEQAQLDELEATWRWLEARALQGIAATLNRHGLFTTPEIAHRFSAIVQALSAQAS
HQRLLRQWLQCLTEREWLIREGESWRCRIPLSEIPEPQEACPQSQWSQALAQYLETCIARHDALFSGRCSPLELLFNEQH
RVTDALYRDNPASACLNRYTAQIAALCSAERILEIGAGTAATTAPVLKATRNTRQSYHFTDVSAQFLNDARARFHDESQV
SYALFDINQPLDFTAHPEAGYDLIVAVNVLHDASHVVQTLRRLKLLLKAGGRLLIVEATERNSVFQLASVGFIEGLSGYR
DFRRRDEKPMLTRSAWQEVLVQAGFANELAWPAQESSPLRQHLLVARSPGVNRPDKKAVSRYLQQRFGTGLPILQIRQRE
ALFTPLHAPSDTPTEPAKPTPVAGGNPALERQVAELWQSLLSRPVARHHDFFELGGDSLMATRMVAQLNRRGIARANLQD
LFSHSTLSDFCAHLQAATSGEDNPIPLCQGDGEETLFVFHASDGDISAWLPLASALNRRVFGLKAKSPQRFATLDQMIDE
YVGCIRRQQPHGPYVLAGWSYGAFLAAGAAQRLYAKGEQVRMVLIDPVCRQDFCCENRAALLRLLAEGQTPLALPEHFDQ
QTPDSQLADFISLAKTAGMVSQNLTLQAAETWLDNIAHLLRLLTEHTPGESVPVPCLMVYAAGRPARWTPAETEWQGWIN
NADDAVIEASHWQIMMEAPHVQACAQHITRWLCATSTQPENTL

• Homologs from PAI DB

Gene  GenBank Accn     Product                                         Virulence or Resistance  PAI or REI       Alignment Type  E-val  Identity
irp1  YP_070123.1      yersiniabactin biosynthetic protein             Virulence                HPI              Protein         0.0    99
irp1  NP_993006.1      yersiniabactin biosynthetic protein             Virulence                HPI              Protein         0.0    99
irp1  YP_002346901.1   yersiniabactin biosynthetic protein             Virulence                HPI              Protein         0.0    99
irp1  NP_669707.1      HMWP1 nonribosomal peptide/polyketide synthase  Virulence                HPI              Protein         0.0    99
irp1  CAA21391.1       -                                               Virulence                HPI              Protein         0.0    99
irp1  YP_853076.1      yersiniabactin biosynthetic protein             Virulence                PAI IV APEC-O1   Protein         0.0    99
irp1  YP_001006816.1   yersiniabactin biosynthetic protein             Virulence                HPI              Protein         0.0    98
irp1  CAA73127.1       HMWP1 protein                                   Virulence                HPI              Protein         0.0    98
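
Because the Product and PAI or REI columns can contain spaces, the homolog rows cannot be split on whitespace alone. One possible way to recover the fields is sketched below. It assumes that the class column is always the single word "Virulence" or "Resistance" and that the last three fields never contain spaces; `parse_homolog_row` and the dictionary keys are hypothetical names, not part of PAI DB.

```python
def parse_homolog_row(row: str) -> dict:
    """Split one whitespace-delimited homolog row into named fields."""
    tokens = row.split()
    # Assumption: the first "Virulence"/"Resistance" token after the
    # accession marks the boundary between product and island name.
    k = next(
        i for i, tok in enumerate(tokens)
        if i >= 2 and tok in ("Virulence", "Resistance")
    )
    return {
        "gene": tokens[0],
        "accession": tokens[1],
        "product": " ".join(tokens[2:k]),
        "class": tokens[k],
        "island": " ".join(tokens[k + 1:-3]),
        "alignment": tokens[-3],
        "e_value": float(tokens[-2]),
        "identity": int(tokens[-1]),
    }


row = ("irp1 YP_853076.1 yersiniabactin biosynthetic protein "
       "Virulence PAI IV APEC-O1 Protein 0.0 99")
rec = parse_homolog_row(row)
print(rec["product"])   # yersiniabactin biosynthetic protein
print(rec["island"])    # PAI IV APEC-O1
print(rec["identity"])  # 99
```

A product name that itself contained the word "Virulence" or "Resistance" would break this heuristic, so a tab-delimited export, if one is available, would be a safer input.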