Gene Information

Name : irp1 (ECABU_c22420)
Accession : YP_006106298.1
Strain : Escherichia coli ABU 83972
Genome accession: NC_017631
Putative virulence/resistance : Virulence
Product : yersiniabactin biosynthetic protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2208726 - 2218217 bp
Length : 9492 bp
Strand : +
Note : HMWP1 nonribosomal peptide/polyketide synthase
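
The Position and Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so explicitly) and that the final codon of the gene is a stop codon. `span_length` is a hypothetical helper name, not part of the record.

```python
# Coordinates copied from the Position field of this record.
# Assumption: 1-based, inclusive coordinates on the + strand.
START, END = 2208726, 2218217


def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end - start + 1


length_bp = span_length(START, END)
codons = length_bp // 3
# Assumption: the last codon is a stop codon and is not translated.
residues = codons - 1

print(length_bp, codons, residues)  # 9492 3164 3163
```

Under those assumptions the computed length agrees with the Length field (9492 bp), and the residue count agrees with the length of the protein sequence listed in this record.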

DNA sequence :
ATGGATAACTTGCGCTTCTCTTCTGCGCCGACAGCAGATTCCATTGATGCATCGATCGCTCAACACTACCCGGACTGCGA
ACCTGTCGCGGTTATCGGCTACGCCTGCCATTTTCCTGAATCGCCGGATGGCGAAACGTTCTGGCAAAATCTGCTGGAAG
GTCGTGAATGCAGCCGACGCTTTACGCGCGAAGAGCTTCTGGCCGTCGGTCTGGATGCCGCCATCATTGACGATCCTCAT
TATGTCAATATCGGTACGGTGTTAGACAACGCCGACTGCTTCGACGCCACCCTGTTTGGCTATTCGCGACAGGAAGCGGA
GTCGATGGACCCGCAGCAGCGCCTGTTTTTGCAGGCGGTCTGGCATGCGCTGGAACATGCCGGTTATGCCCCCGGCGCCG
TCCCCCATAAGACCGGCGTTTTCGCCTCTTCCCGGATGAGTACCTACCCCGGTCGCGAAGCATTGAACGTGACAGAAGTC
GCGCAGGTAAAAGGTCTGCAATCTCTGATGGGCAATGATAAAGACTATATTGCCACCCGCGCCGCGTACAAACTCAACCT
GCACGGCCCGGCGTTATCGGTACAGACCGCCTGCTCCAGCTCGCTGGTTGCCGTGCATCTGGCCTGTGAAAGCCTGCGCG
CTGGCGAATCCGATATGGCGGTTGCCGGCGGCGTGGCGCTCTCTTTCCCCCAGCAGGCAGGCTACCGCTACCAGCCCGGA
ATGATTTTCTCTCCTGATGGTCACTGTCGTCCCTTTGACGCCTCGGCTGAGGGCACCTGGGCCGGTAACGGTCTCGGCTG
CGTGGTGCTGCGTCGCCTGAGAGACGCGCTGCTGTCAGGCGATCCGATTATCTCGGTGATCCTCTCCAGCGCGGTCAACA
ACGACGGCAACAGAAAGGTCGGCTATACCGCCCCTTCCGTCGCAGGGCAACAGGCAGTCATCGAAGAGGCGTTAATGCTG
GCGGCCATCGACGACAGGCAGGTAGGTTACATTGAAACCCACGGCACCGGCACACCGCTGGGCGACGCGATTGAAATTGA
AGCGTTACGCAACGTCTATGCGCCTCGCCCGCAGGATCAGCGCTGTGCGCTCGGTTCCGTGAAAAGTAACATGGGCCATC
TGGATACCGCGGCGGGCATTGCCGGACTGCTGAAAACCGTTCTGGCAGTCAGTCGCGGGCAAATTCCTCCCTTACTGAAT
TTTCACACCCCCAACCCGGCGCTGAAACTTGAAGAGAGCCCCTTTACCATACCGGTGTCGGCACAGGCATGGCAGGACGA
AATGCGCTATGCGGGCGTCTCCTCCTTTGGTATTGGCGGCACCAACTGCCATATGATCGTCGCCTCGCTGCCCGACGCGC
TCAACGCGCGCCTCCCCAATACGGATAGCGGCAGAAAAAGTACCGCGCTGCTGCTCAGCGCCGCCAGCGACAGCGCGTTG
CGGCGGCTGGCGACGGATTATGCCGGGGCGCTGAGAGAGAATGCGGATGCCAGCTCTCTGGCCTTCACAGCCCTGCACGC
GCGCCGTCTCGATCTCCCCTTCCGCCTGGCGGCGCCATTAAACCGTGAAACCGCCGAGGCGCTCAGCGCCTGGGCTAGTG
AGAAATCGGGGGCGCTGGTTTACAGCGGCCACGGCGCCAGCGGCAAGCAGGTGTGGCTGTTTACCGGCCAGGGCTCGCAC
TGGCGCACTATGGGTCAAACGATGTACCAGCACTCAACGGCGTTTGCCGACACGCTGGATCGCTGTTTTTCCGCCTGTAG
CGAAATGCTCACGCCGTCACTGCGCGAAGCGATGTTTAACCCCGATTCGGCGCAGCTGGACAATATGGCCTGGGCGCAGC
CGGCGATTGTCGCGTTTGAAATCGCGATGGCGGCGCACTGGCGTGCTGAAGGACTGAAGCCAGACTTCGCCATTGGGCAT
TCCGTCGGTGAATTTGCCGCTGCCGTTGTCTGCGGACACTATACGATTGAACAGGTCATGCCACTGGTTTGTCGGCGCGG
CGCGCTAATGCAGCAGTGCGCAAGCGGCGCGATGGTGGCGGTATTTGCAGACGAAGACACGCTGATGCCGCTGGCTCGCC
AGTTTGAGCTGGATCTCGCCGCCAACAACGGTACGCAACATACGGTATTTTCCGGGCCGGAAGCCCGTCTCGCGGTATTT
TGCGCCACGCTCTCGCAGCATGACATTAACTATCGTCGCCTGAGCGTAACCGGTGCGGCGCACTCCGCTTTACTGGAGCC
GATACTCGATCGGTTCCAGGACGCCTGCGCGGGACTGCACGCGGAGCCGGGGCAAATACCGATTATTTCCACGCTCACCG
CCGACGTCATTGATGAGTCAACGCTCAACCAGGCGGATTACTGGCGCCGACACATGCGCCAGCCGGTGCGTTTTATCCAG
AGTATTCAGGTGGCGCATCAGCTCGGCACCCGCGTTTTTCTGGAGATGGGGCCCGATGCCCAGTTGGTTGCTTGCGGGCA
GCGCGAATACCGCGATAACGCATACTGGATAGCCAGCGCCCGGCGTAACAAAGAGGCGAGCGATGTCCTCAATCAGGCCC
TGCTCCAGCTTTACGCTGCCGGCGTCGCCCTACCGTGGGCCGACCTGCTGGCGGGCGATGGACAACGTATCGCTGCGCCA
TGTTATCCGTTTGATACTGAGCGTTACTGGAAAGAGCGCGTCTCCCCGGCCTGCGAGCCTGCCGACGCAGCGCTGTCTGC
CGGGCTGGAGGTGGCGAGTCGCGCCGCGACAGCGCTCGATCTCCCTCGCCTGGAAGCGCTTAAACAGTGCGCCACGCGAC
TGCACGCCATCTACGTCGATCAACTGGTACAACGCTGTACCGGCGATGCCATTGAGAACGGCGTGGACGCCATGACCATC
ATGCGCCGTGGACGTCTGCTGCCCCGCTACCAGCAGCTACTCCAGCGCCTGCTGAATAACTGCGTGGTCGACGGCGATTA
CCGCTGCACCGACGGGCGATACGTCCGCGCCCGCCCCATTGAACATCAACAGCGGGAATCACTGCTGACGGAACTTGCCG
GTTATTGTGAAGGTTTTCAGGCTATTCCCGACACCATCGCCCGTGCCGGCGATCGGTTATATGAAATGATGAGCGGCGCG
GAAGAACCGGTGGCGATTATCTTCCCGCAAAGCGCCTCCGACGGCGTGGAAGTGCTGTATCAGGAATTCAGCTTTGGCCG
CTATTTCAACCAAATCGCCGCCGGGGTATTACGCGGCATTGTCCAGACGCGTCAGCCCCGCCAGCCGTTGCGTATTCTTG
AAGTTGGCGGCGGAACCGGCGGCACCACCGCGTGGCTGCTGCCGGAACTCAACGGCGTTCCGGCACTGGAGTACCATTTC
ACCGATATCTCGGCGCTGTTCACCCGTCGCGCCCAGCAGAAATTCGCCGACTATGATTTTGTGAAGTATAGCGAGCTGGA
TCTCGAAAAAGAGGCGCAGTCTCAGGGTTTCCAGGCACAGTCTTACGATCTTATCGTGGCAGCGAACGTGATTCACGCCA
CCCGCCATATTGGCCGCACGCTCGATAATCTGCGCCCCCTGCTCAAGCCGGGCGGGCGCCTGCTGATGCGCGAAATCACC
CAGCCAATGCGTCTGTTTGACTTCGTTTTCGGCCCGCTGGTTCTTCCGCTACAGGATCTCGACGCCCGCGAAGGTGAGTT
ATTCCTCACCACCGCTCAGTGGCAACAACAGTGCCGCCACGCCGGATTCAGCAAAGTGGCGTGGCTACCGCAGGATGGCA
GCCCGACCGCCGGGATGAGCGAACATATCATTCTCGCCACGCTGCCCGGTCAGGCGGTTAGCGCCGTAACATTCACCGCG
CCATCAGAACCCGTGTTGGGGCAGGCGCTGACGGATAACGGTGATTATCTCGCCGACTGGTCTGATTGCGCAGGTCAGCC
CGAACGGTTTAACGCCCGCTGGCAGGAGGCCTGGCGTCTGCTTTCACAGCGTCATGGCGACGCTCTGCCTGTGGAACCGC
CCCCCGTCGCCGCCCCGGAGTGGCTAGGGAAGGTTCGCTTAAGCTGGCAAAACGAAGCCTTTTCCCGCGGTCAGATGCGC
GTTGAAGCCCGTCATCCTGCTGGCCAGTGGCTGCCGCTATCGCCCGCCGAGCCTCTTCCTGCGCCGCAAACGCATTATCA
ATGGCGCTGGACGCCCCTCAACGTCGCCAGCATTGACCATCCGCTTACCTTTAGCTTCAGCGCCGGTACGCTTGCGCGCA
GCGACGAGCTGGCGCAATACGGCATCATTCACGATCCGCACGCCTCTTCACGACTGATGATTGTTGAGGAGAGCGAGGAT
ACGCTGGCCTTAGCGGAGAAAGTGATAGCAGCGCTCACCGCCAGCGCAGCCGGATTGATTGTGGTTACTCGCCGCGCGTG
GCGAGTCGAGGAAAATGAAGCACTCTCTGCATCCCATCACGCGCTATGGGCCTTGCTTCGCGTCGCGGCCAACGAACAGC
CGGAACGGTTGCTTGCCGCCATCGATCTCGCCGAAAACACCCCGTGGGAAACGCTGCATCAAGGGTTGAGCGCAGTCTCA
CTATCACAGCGCTGGCTCGCCGCACGGGGTGACACCCTTTGGCTTCCTTCACTGTCGCCCAATACGGGATGCGCCGCTGA
ATTACCGGCAAACGTGTTTACCGGCGATAGCCGCTGGCATCTGGTGACCGGAGCGTTTGGCGGATTAGGCCGCCTTGCCG
TGAACTGGCTCAGAGAAAAAGGGGCGCGACGCATCGCCCTGCTGGCACCGCGCGTGGATGAGTCATGGCTACGCGACGTG
GAGGGCGGGCAGACGCGCGTCTGCCGTTGTGATGTGGGCGATGCCGGGCAACTGGCCACGGTTCTTGACGATCTGGCGGC
CAACGGCGGCATTGCCGGAGCGATTCATGCCGCTGGCGTATTGGCTGACGCGCCCTTGCAGGAGCTTGATGACCACCAGC
TGGCTGCCGTTTTCGCGGTAAAAGCGCAGGCGGCAAGCCAACTGTTGCAAACCCTGCGCAACCACGACGGACGCTATCTT
ATTCTCTACTCTTCCGCTGCCGCCACCCTCGGCGCGCCGGGTCAGAGCGCCCATGCGCTGGCCTGCGGCTACCTGGACGG
GCTGGCCCAGCAGTTTTCCACCCTTGATGCGCCGAAAACGCTCTCTGTCGCCTGGGGCGCATGGGGAGAAAGCGGTCGGG
CGGCCACGCCGGAAATGCTGGCGACGCTCGCCAGCCGAGGTATGGGCGCGTTAAGCGATGCCGAAGGCTGCTGGCACCTG
GAACAGGCGGTGATGCGCGGCGCCCCGTGGCGACTGGCGATGCGCGTTTTTACCGACAAAATGCCCCCGTTACAACAGGC
TCTGTTTAACATCAGCGCCACAGAAAAAGCCGCAACGCCGGTCATTCCTCCTGCTGATGACAACGCCTTTAACGGCAGCC
TGAGCGATGAAACAGCGGTGATGGCATGGCTGAAAAAGCGGATTGCGGTTCAGCTAAGGCTGAGCGATCCGGCGTCACTG
CATCCAAACCAGGATCTGTTGCAACTCGGCATGGACTCGCTGCTCTTCCTTGAACTCAGTAGCGATATTCAGCACTACCT
GGGTGTACGCATCAATGCGGAACGGGCGTGGCAGGATCTGTCTCCTCATGGACTCACGCAGCTTATCTGTTCTAAGCCAG
AGGCGACGCCTGCCGCTTCGCAGCCGGAAGTGTTGCGGCACGACGCCGACGAGCGTTATGCGCCCTTCCCTTTGACGCCC
ATTCAGCACGCCTACTGGCTGGGGCGAACCCACCTCATTGGCTATGGCGGCGTCGCCTGTCACGTCCTGTTTGAGTGGGA
TAAACGCCACGATGAGTTCGATCTCGCCATACTGGAGAAAGCATGGAACCAGCTCATCGCACGCCACGATATGTTGCGTA
TGGTGGTTGATGCCGACGGGCAGCAGCGAATCCTGGCGACAACGCCGGAGTATCACATCCCGCGTGACGATCTGCGCGCG
CTTTCCCCGGAAGAACAGCGCATCGCGCTGGAAAAACGGCGGCATGAACTGAGCTATCGCGTTTTGCCTGCCGACCAGTG
GCCTCTTTTTGAGCTGGTGGTCAGCGAAATCGACGATTGCCATTACCGTCTGCATATGAACCTCGACCTTTTGCAGTTTG
ATGTGCAGAGTTTTAAAGTCATGATGGACGACCTGGCGCAGGTCTGGCGCGGTGAAACGCTGGCACCGCTCGCTATTACC
TTCCGTGATTATGTGATGGCTGAACAGGCGCGCCGACAGACATCGGCATGGCACGATGCCTGGGATTACTGGCAGGAAAA
ACTGCCGCAACTGCCCTTAGCGCCAGAGCTGCCGGTGGTTGAGACGCCCCCGGAAACGCCACACTTCACCACCTTCAAAT
CGACGATCGGCAAGACAGAATGGCAGGCCGTGAAACAGCGCTGGCAGCAGCAAGGCGTCACACCGTCTGCCGCGCTGCTC
ACGCTGTTTGCCGCCACCCTTGAGCGCTGGAGCCGTACCACAACATTTACGCTGAACCTGACGTTCTTCAATCGCCAGCC
GATCCATCCGCAAATCAACCAGTTGATTGGTGATTTTACCTCCGTCACGCTGGTTGATTTTAACTTCTCAGCGCCGGTGA
CGTTGCAAGAGCAGATGCAACAGACCCAACAGCGCCTCTGGCAAAACATGGCGCACAGTGAAATGAACGGTGTTGAGGTG
ATCCGTGAGCTGGGGCGCCTGCGCGGATCACAACGTCAACCGCTGATGCCGGTAGTGTTTACCAGTATGCTGGGGATGAC
GCTGGAAGGCATGACTATCGATCAGGCGATGAGCCATCTGTTCGGCGAACCCTGCTATGTATTCACGCAAACGCCGCAGG
TCTGGCTGGATCATCAGGTCATGGAGAGCGACGGCGAGTTGATGTTTAGCTGGTACTGCATGGACAACGTGCTGGAACCC
GGCGCTGCCGAGGCGATGTTTAATGACTATTGCGCCATCCTGCAAGCCGTCATCGCCGCCCCTGAAAGCCTGAAGACTCT
CGCCAGCGGCATCGCCGGGCACATTCCCCGCCGACGCTGGCCGCTGAACGCGCAGGCGGACTACGACCTGCGGGATATTG
AGCAGGCGACGCTCGAATACCCCGGCATCCGGCAGGCCAGAGCGGAAATAACCGAACAGGGCGCGTTGACGCTGGATATC
GTGATGGCCGACGATCCGTCGCCATCAGCGGCGATGCCTGATGAGCACGAACTTACCCAACTGGCGCTGCCGTTGCCTGA
GCAGGCGCAGCTTGATGAGCTGGAGGCGACCTGGCGCTGGCTGGAGGCGCGAGCGCTACAGGGGATCGCGGCTACGCTAA
ATCGTCACGGCCTGTTTACCACGCCGGAGATCGCCCATCGCTTTAGCGCAATAGTACAGACGCTGTCCGCGCAAGCGTCT
CACCAGCGTCTGCTGCGCCAGTGGCTACAGTGTCTGACGGAAAGAGAGTGGTTAATCCGCGAAGGTGAAAGCTGGCGCTG
CCGCATTCCGCTCAGCGAGATTCCTGAGCCTCAGGAAGCGTGCCCGCAAAGCCAATGGAGCCAGGCGCTGGCGCAGTATC
TGGAAACCTGCATCGCCCGGCACGACGCCCTCTTCTCCGGGCAGTGTTCTCCGCTGGAGTTGCTGTTCAACGAGCAGCAT
CGCGTTACCGACGCGCTGTATCGCGACAACCCCGCCAGCGCCTGTCTGAATCGCTATACCGCGCAGATTGCCGCCTTGTG
CAGCGCAGAACGGATTCTGGAGGTTGGCGCCGGAACCGCAGCCACTACCGCGCCGGTGCTGAAGGCCACGCGGAACACGC
GGCAGTCGTACCACTTCACGGACGTCTCCGCGCAGTTCCTCAATGACGCCAGAGCCCGTTTCCATGATGAATCGCAGGTG
TCTTATGCCTTGTTCGACATCAACCAGCCGCTGGATTTCACCGCCCACCCGGAGGCGGGTTACGACCTGATCGTTGCCGT
CAATGTGCTCCACGACGCCAGCCATGTCGTCCAGACGTTGCGCAGATTAAAACTGTTGCTGAAAGCCGGCGGACGTTTGC
TGATCGTTGAAGCGACGGAGCGAAACAGCGTATTCCAGCTGGCGAGCGTGGGCTTTATTGAGGGATTAAGCGGATACCGC
GATTTCCGCCGCCGGGATGAGAAACCGATGCTCACCCGCTCCGCATGGCAGGAGGTTCTCGTTCAGGCCGGGTTTGCAAA
CGAGCTGGCGTGGCCCGCGCAGGAATCGTCGCCGCTGCGCCAGCATCTGCTGGTAGCGCGTTCGCCTGGCGTAAATCGCC
CGGATAAAAAAGCCGTGAGCCGCTATTTACAGCAGCGCTTTGGCACCGGTCTGCCCATTTTACAGATCCGGCAAAGAGAA
GCGTTATTTACGCCGCTGCATGCCCCGTCTGATGCGCCGACTGAGCCAGCCAAACCCACGCCAGTTGCCGGGGGGAATCC
GGCGCTGGAAAAACAGGTGGCTGAACTCTGGCAATCGCTGCTGTCTCGCCCCGTGGCAAGGCATCACGACTTTTTCGAAC
TGGGCGGCGACAGCCTGATGGCGACAAGGATGGTCGCGCAGCTAAACCGGAGAGGGATTGCCAGGGCTAACCTTCAGGAT
CTGTTCAGCCATTCGACGCTGAGCGACTTCTGCGCCCATCTACAGGCGGCTACGTCAGGAGAGGACAACCCGATACCCCT
TTGCCAGGGCGACGGTGAGGAAACCCTGTTTGTCTTCCACGCTTCAGACGGCGATATCAGCGCCTGGCTGCCGCTCGCTA
GCGCGTTGAACAGGCGCGTTTTCGGCCTGCAAGCAAAATCGCCGCAGCGCTTTGCCACGCTCGACCAGATGATCGATGAG
TATGTCGGGTGCATCCGTCGTCAGCAGCCTCACGGCCCTTATGTGCTGGCGGGTTGGTCGTATGGCGCGTTTCTTGCGGC
GGGCGCCGCACAACGCCTGTACGCCAAAGGCGAGCAGGTTCGGATGGTGTTAATCGATCCCGTGTGCCGACAGGATTTCT
GTTGCGAAAACCGGGCGGCCCTGCTGCGCCTGTTAGCCGAAGGACAAACGCCTCTGGCACTGCCCGAACATTTCGACCAG
CAGATGCCCGACAGCCAGCTTGCCGACTTTATCAGCCTCGCTAAAACGGCCGGTATGGTGTCGCAAAACCTGACGCTGCA
AGCGGCAGAAACGTGGCTCGACAACATCGCGCATCTGCTGCGTTTACTGACTGAGCATACGCCGGGCGAAAGCGTTCCGG
TCCCCTGTCTCATGGTGTATGCCGCCGGGAGACCCGCGCACTGGACGCCAGCAGAAACCGAGTGGCAGGGCTGGATAAAC
AACGCCGACGACGCTGTGATTGAAGCCAGCCACTGGCAAATCATGATGGAAGCCCCCCACGTTCAGGCTTGTGCGCAACA
CATTACGCGCTGGCTTTGCGCAACCTCAACGCAACCGGAGAACACGTTATGA

Protein sequence :
MDNLRFSSAPTADSIDASIAQHYPDCEPVAVIGYACHFPESPDGETFWQNLLEGRECSRRFTREELLAVGLDAAIIDDPH
YVNIGTVLDNADCFDATLFGYSRQEAESMDPQQRLFLQAVWHALEHAGYAPGAVPHKTGVFASSRMSTYPGREALNVTEV
AQVKGLQSLMGNDKDYIATRAAYKLNLHGPALSVQTACSSSLVAVHLACESLRAGESDMAVAGGVALSFPQQAGYRYQPG
MIFSPDGHCRPFDASAEGTWAGNGLGCVVLRRLRDALLSGDPIISVILSSAVNNDGNRKVGYTAPSVAGQQAVIEEALML
AAIDDRQVGYIETHGTGTPLGDAIEIEALRNVYAPRPQDQRCALGSVKSNMGHLDTAAGIAGLLKTVLAVSRGQIPPLLN
FHTPNPALKLEESPFTIPVSAQAWQDEMRYAGVSSFGIGGTNCHMIVASLPDALNARLPNTDSGRKSTALLLSAASDSAL
RRLATDYAGALRENADASSLAFTALHARRLDLPFRLAAPLNRETAEALSAWASEKSGALVYSGHGASGKQVWLFTGQGSH
WRTMGQTMYQHSTAFADTLDRCFSACSEMLTPSLREAMFNPDSAQLDNMAWAQPAIVAFEIAMAAHWRAEGLKPDFAIGH
SVGEFAAAVVCGHYTIEQVMPLVCRRGALMQQCASGAMVAVFADEDTLMPLARQFELDLAANNGTQHTVFSGPEARLAVF
CATLSQHDINYRRLSVTGAAHSALLEPILDRFQDACAGLHAEPGQIPIISTLTADVIDESTLNQADYWRRHMRQPVRFIQ
SIQVAHQLGTRVFLEMGPDAQLVACGQREYRDNAYWIASARRNKEASDVLNQALLQLYAAGVALPWADLLAGDGQRIAAP
CYPFDTERYWKERVSPACEPADAALSAGLEVASRAATALDLPRLEALKQCATRLHAIYVDQLVQRCTGDAIENGVDAMTI
MRRGRLLPRYQQLLQRLLNNCVVDGDYRCTDGRYVRARPIEHQQRESLLTELAGYCEGFQAIPDTIARAGDRLYEMMSGA
EEPVAIIFPQSASDGVEVLYQEFSFGRYFNQIAAGVLRGIVQTRQPRQPLRILEVGGGTGGTTAWLLPELNGVPALEYHF
TDISALFTRRAQQKFADYDFVKYSELDLEKEAQSQGFQAQSYDLIVAANVIHATRHIGRTLDNLRPLLKPGGRLLMREIT
QPMRLFDFVFGPLVLPLQDLDAREGELFLTTAQWQQQCRHAGFSKVAWLPQDGSPTAGMSEHIILATLPGQAVSAVTFTA
PSEPVLGQALTDNGDYLADWSDCAGQPERFNARWQEAWRLLSQRHGDALPVEPPPVAAPEWLGKVRLSWQNEAFSRGQMR
VEARHPAGQWLPLSPAEPLPAPQTHYQWRWTPLNVASIDHPLTFSFSAGTLARSDELAQYGIIHDPHASSRLMIVEESED
TLALAEKVIAALTASAAGLIVVTRRAWRVEENEALSASHHALWALLRVAANEQPERLLAAIDLAENTPWETLHQGLSAVS
LSQRWLAARGDTLWLPSLSPNTGCAAELPANVFTGDSRWHLVTGAFGGLGRLAVNWLREKGARRIALLAPRVDESWLRDV
EGGQTRVCRCDVGDAGQLATVLDDLAANGGIAGAIHAAGVLADAPLQELDDHQLAAVFAVKAQAASQLLQTLRNHDGRYL
ILYSSAAATLGAPGQSAHALACGYLDGLAQQFSTLDAPKTLSVAWGAWGESGRAATPEMLATLASRGMGALSDAEGCWHL
EQAVMRGAPWRLAMRVFTDKMPPLQQALFNISATEKAATPVIPPADDNAFNGSLSDETAVMAWLKKRIAVQLRLSDPASL
HPNQDLLQLGMDSLLFLELSSDIQHYLGVRINAERAWQDLSPHGLTQLICSKPEATPAASQPEVLRHDADERYAPFPLTP
IQHAYWLGRTHLIGYGGVACHVLFEWDKRHDEFDLAILEKAWNQLIARHDMLRMVVDADGQQRILATTPEYHIPRDDLRA
LSPEEQRIALEKRRHELSYRVLPADQWPLFELVVSEIDDCHYRLHMNLDLLQFDVQSFKVMMDDLAQVWRGETLAPLAIT
FRDYVMAEQARRQTSAWHDAWDYWQEKLPQLPLAPELPVVETPPETPHFTTFKSTIGKTEWQAVKQRWQQQGVTPSAALL
TLFAATLERWSRTTTFTLNLTFFNRQPIHPQINQLIGDFTSVTLVDFNFSAPVTLQEQMQQTQQRLWQNMAHSEMNGVEV
IRELGRLRGSQRQPLMPVVFTSMLGMTLEGMTIDQAMSHLFGEPCYVFTQTPQVWLDHQVMESDGELMFSWYCMDNVLEP
GAAEAMFNDYCAILQAVIAAPESLKTLASGIAGHIPRRRWPLNAQADYDLRDIEQATLEYPGIRQARAEITEQGALTLDI
VMADDPSPSAAMPDEHELTQLALPLPEQAQLDELEATWRWLEARALQGIAATLNRHGLFTTPEIAHRFSAIVQTLSAQAS
HQRLLRQWLQCLTEREWLIREGESWRCRIPLSEIPEPQEACPQSQWSQALAQYLETCIARHDALFSGQCSPLELLFNEQH
RVTDALYRDNPASACLNRYTAQIAALCSAERILEVGAGTAATTAPVLKATRNTRQSYHFTDVSAQFLNDARARFHDESQV
SYALFDINQPLDFTAHPEAGYDLIVAVNVLHDASHVVQTLRRLKLLLKAGGRLLIVEATERNSVFQLASVGFIEGLSGYR
DFRRRDEKPMLTRSAWQEVLVQAGFANELAWPAQESSPLRQHLLVARSPGVNRPDKKAVSRYLQQRFGTGLPILQIRQRE
ALFTPLHAPSDAPTEPAKPTPVAGGNPALEKQVAELWQSLLSRPVARHHDFFELGGDSLMATRMVAQLNRRGIARANLQD
LFSHSTLSDFCAHLQAATSGEDNPIPLCQGDGEETLFVFHASDGDISAWLPLASALNRRVFGLQAKSPQRFATLDQMIDE
YVGCIRRQQPHGPYVLAGWSYGAFLAAGAAQRLYAKGEQVRMVLIDPVCRQDFCCENRAALLRLLAEGQTPLALPEHFDQ
QMPDSQLADFISLAKTAGMVSQNLTLQAAETWLDNIAHLLRLLTEHTPGESVPVPCLMVYAAGRPAHWTPAETEWQGWIN
NADDAVIEASHWQIMMEAPHVQACAQHITRWLCATSTQPENTL
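
The protein sequence above corresponds to a codon-by-codon translation of the DNA sequence. A minimal sketch of that translation follows, assuming the standard genetic code and reading in frame from the first base. `translate` and `CODON_TABLE` are illustrative names, not part of the record, and only two short fragments of the gene are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 24 bases of the DNA sequence in this record.
print(translate("ATGGATAACTTGCGCTTCTCTTCT"))  # MDNLRFSS
# Final 21 bases (seven codons, ending in the TGA stop codon).
print(translate("CAACCGGAGAACACGTTATGA"))     # QPENTL
```

The two outputs match the first eight and last six residues of the protein sequence, which is consistent with the listed protein being a direct translation of the listed DNA.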

• Homologs from PAI DB

Gene  GenBank Accn    Product                                         Virulence or Resistance  PAI or REI      Alignment Type  E-val  Identity (%)
irp1  YP_070123.1     yersiniabactin biosynthetic protein             Virulence                HPI             Protein         0.0    99
irp1  NP_993006.1     yersiniabactin biosynthetic protein             Virulence                HPI             Protein         0.0    99
irp1  YP_002346901.1  yersiniabactin biosynthetic protein             Virulence                HPI             Protein         0.0    99
irp1  NP_669707.1     HMWP1 nonribosomal peptide/polyketide synthase  Virulence                HPI             Protein         0.0    99
irp1  CAA21391.1      -                                               Virulence                HPI             Protein         0.0    99
irp1  YP_853076.1     yersiniabactin biosynthetic protein             Virulence                PAI IV APEC-O1  Protein         0.0    99
irp1  CAA73127.1      HMWP1 protein                                   Virulence                HPI             Protein         0.0    98
irp1  YP_001006816.1  yersiniabactin biosynthetic protein             Virulence                HPI             Protein         0.0    98
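
Because the Product and PAI/REI columns can contain spaces, a homolog row cannot be split on whitespace alone. The sketch below is one possible way to parse a row, anchoring on the Virulence/Resistance column. It assumes that column only ever holds one of those two words, that the last three fields contain no spaces, and that Identity is a whole-number percentage. `ROW_PATTERN` and `parse_homolog_row` are hypothetical names.

```python
import re

# Assumption: columns are whitespace-separated and the category column
# is exactly "Virulence" or "Resistance".
ROW_PATTERN = re.compile(
    r"^(?P<gene>\S+)\s+(?P<accession>\S+)\s+(?P<product>.+?)\s+"
    r"(?P<category>Virulence|Resistance)\s+(?P<island>.+?)\s+"
    r"(?P<alignment>\S+)\s+(?P<evalue>\S+)\s+(?P<identity>\d+)$"
)


def parse_homolog_row(line: str) -> dict:
    """Split one homolog row into named fields."""
    match = ROW_PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line!r}")
    row = match.groupdict()
    row["evalue"] = float(row["evalue"])
    row["identity"] = int(row["identity"])
    return row


row = parse_homolog_row(
    "irp1 YP_853076.1 yersiniabactin biosynthetic protein "
    "Virulence PAI IV APEC-O1 Protein 0.0 99"
)
print(row["accession"], "|", row["island"], "|", row["identity"])
# YP_853076.1 | PAI IV APEC-O1 | 99
```

A row whose product name itself contains the word "Virulence" or "Resistance" would be split in the wrong place, so the pattern is only a starting point for rows like the ones listed here.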