Gene Information

Name : irp1 (YE2618)
Accession : YP_001006816.1
Strain : Yersinia enterocolitica 8081
Genome accession: NC_008800
Putative virulence/resistance : Virulence
Product : yersiniabactin biosynthetic protein
Function : -
COG functional category : Q : Secondary metabolites biosynthesis, transport and catabolism
COG ID : COG3321
EC number : -
Position : 2826454 - 2835939 bp
Length : 9486 bp
Strand : +
Note : -

DNA sequence :
ATGGATAACTTGCGCTTCTCTTCTGCGCCGACAGCAGATTCCATTGATGCATCGATCGCTCAACACTACCCGGACTGCGA
ACCTGTCGCGGTTATCGGCTACGCCTGCCATTTTCCTGAATCGCCGGATGGCGAAACGTTCTGGCAAAATCTGCTGGAAG
GTCGTGAATGCAGCCGACGCTTTACGCGCGAAGAACTTCTGGCCGTCGGTCTGGATGCCGCCATCATTGACGATCCTCAT
TATGTCAATATCGGTACGGTGTTAGACAACGCCGACTGCTTCGACGCCACCCTGTTTGGCTATTCGCGACAGGAAGCGGA
ATCGATGGACCCACAGCAGCGCCTGTTTTTGCAGGCGGTCTGGCATGCGCTGGAACATGCCGGTTATGCCCCCGGCGCCG
TCCCCCATAAGACCGGCGTTTTCGCCTCTTCCCGGATGAGTACCTACCCCGGTCGCGAAGCATTGAACGTGACAGAAGTC
GCGCAGGTAAAAGGTCTGCAATCTCTGATGGGCAATGATAAAGACTATATTGCCACCCGCGCCGCGTACAAACTCAACCT
GCACGGCCCAGCGTTATCGGTACAGACCGCCTGCTCCAGTTCGCTGGTTGCCGTGCATCTGGCCTGTGAAAGCCTGCGCG
CAGGCGAATCCGATATGGCGGTTGCCGGCGGCGTGGCGCTCTCTTTCCCCCAGCAGGCGGGCTACCGCTACCAGCCCGGA
ATGATTTTCTCTCCCGATGGTCACTGTCGTCCTTTTGACGCCTCGGCTGAGGGCACCTGGGCCGGTAACGGTCTCGGCTG
CGTGGTACTGCGTCGCCTGAGAGACGCGCTGCTGTCAGGCGATCCGATTATCTCGGTGATCCTCTCCAGCGCGGTCAACA
ACGACGGCAACAGAAAGGTCGGCTATACCGCCCCTTCCGTCGCAGGGCAACAGGCAGTCATCGAAGAGGCGTTAATGCTG
GCGGCCATCGACGACAGGCAGGTAGGTTACATTGAAACCCACGGCACCGGCACACCGCTGGGCGACGCGATTGAAATTGA
AGCGTTACGCAACGTCTATGCGCCTCGCCCGCAGGATCAGCGCTGTGCGCTCGGTTCCGTGAAAAGTAATATGGGCCATC
TGGATACCGCAGCGGGCATTGCCGGACTGCTGAAAACCGTTCTGGCAGTCAGCCGCGGGCAAATTCCACCCTTACTGAAT
TTTCATACCCCCAACCCGGCGCTGAAACTTGAAGAGAGTCCCTTTACCATACCGATGTCGGCGCAGGCGTGGCAGGATGA
AATGCGCTATGCGGGCGTCTCCTCCTTTGGTATTGGCGGCACCAACTGCCATATGATCGTCGCCTCGCTGCCCGACGCGC
TCAACGCGCGCCTCCCCAATACGGATAGCGGCAGAAAAAGTACCGCGCTGCTGCTCAGCGCCGCCAGCGACAGCGCGTTG
CGGCGGCTGGCGACGGATTATGCCGGGGCGCTGAGAGAGAATACGGATGCCAGCGATCTGGCCTTCACGGCCCTGCACGC
GCGCCGTCTCGATCTTCCCTTTCGCCTGGCGGCGCCATTAAACCGTGAAACCGCCGCGGCGCTCAGCGACTGGGCCGGTG
AGAAATCGGGGGCGCTGGTTTATAGCGGCCACGGCGCCAGCGGCAAGCAGGTGTGGCTGTTTACCGGCCAGGGCTCGCAC
TGGCGCACTATGGGTCAAACGATGTACCAGCACTCAACGGCGTTTGCCGACATGCTGGATCGCTGTTTTTCCGCCTGTAG
CGAAATGCTCACGCCGTCACTGCGCGAAGCGATGTTTAACCCCGATTCGGCGCAGCTGGACAATATGGCCTGGGCGCAGC
CGGCGATTGTCGCGTTTGAAATCGCGATGGCGGCGCACTGGCACGCTGAAGGACTGAAGCCAGACTTCGCCATTGGGCAT
TCCGTCGGTGAATTTGCCGCTGCCGTCGTCTGCGGACACTATACGATTGAACAGGTCATGCCACTGGTTTGTCGACGCGG
CGCACTGATGCAGCAGTGCGCAAGCGGCGCAATGGTGGCGGTATTTGCAGACGAAGACACGCTGATGCCGCTGGCTCGCC
AGTTTGAGCTGGATCTCGCCGCCAACAACGGTACGCAACATACGGTATTTTCCGGGCCGGAAGCCCGTCTCGCGGTATTT
TGCACCACGCTCTCGCAGCATAACATTAACTATCGTCGCCTGAGCGTAACCGGCGCGGCGCACTCCGCTTTACTGGAACC
GATACTCGATCGGTTCCAGGACGCCTGCGCGGGGCTGCACGCGGAGCCGGGGCAAATACCGATTATTTCCACGCTCACCG
CCGACGTCATTGATGAGTCAACGCTCAACCAGGCGGATTACTGGCGCCGACACATGCGCCAGCCGGTGCGTTTTATCCAG
AGTATTCAGATGGCGCATCAGCTCGGCGCCCGCGTTTTTCTGGAGATGGGGCCCGATGCCCAGTTGGTTGCTTCCGGGCA
GCGCGAATACCGCGATAACGCATACTGGATAGCCAGCGCCCGGCGTAACAAAGAGGCGAGCGATGTCCTCAATCAGGCCC
TGCTCCAGCTTTACGCTGCCGGTGTCGCCTTACCGTGGACCGACCTACTGGCGGGTGATGGACAACGTATCGCTGCGCCA
TGTTATCCGTTTGATACTGAGCGTTACTGGAAAGAGCGCGTCTCCCCGGCCTGCGAACCTGCCGACGCAGCGCTGTCTGC
CGGGCTGGAGGTGGCGAGTCGCGCCGCGACAGCGCTCGATCTCCCCCGTCTGGAAGCGCTTAAACAGTGCGCCACGCGAC
TGCACGCCATCTACGTCGATCAACTGGTACAACGCTGTACCGGCGATGCCATTGAAAACGGCGTGGACGCCATAACCATC
ATACGCCGTGGACGTCTGCTGCCCCGCTACCAGCAGCTACTCCAGCGCCTGCTGAATAACTGCGTGGTCGACGGCGATTA
CCGCTGCACCGACGGGCGATACGTCCGCGCCCACCCCATTGAACATCAACAGCGGGAATCACTGCTGACGGAACTTGCCG
GTTATTGTGAAGGTTTTCAGGCTATTCCCGACACCATCGCCCGTGCCGGCGATCGGTTATATGACATGATGAGCGGCGCG
GAAGAACCGGTGGCGATTATCTTCCCGCAAAGCGCCTCCGACGGCGTGGAAGTGCTGTATCAGGAATTCAGCTTTGGCCG
CTATTTCAACCAAATCGCCGCCGGGGTATTACGCGGCATTGTCCAGACGCGTCAGCCCCGCCAGTCGTTGCGTATTCTTG
AAGTTGGCGGCGGAACCGGCGGCACCACCGCGTGGCTGCTGCCGGAACTCAACGGCGTTCCGGCGCTGGAGTACCACTTC
ACCGATATCTCAGCGCTGTTCACCCGCCGCGCCCAGCAGAAATTCGCCGACTATGATTTTGTGAAGTATAGCGAGCTGGA
TCTCGAAAAAGAGGCGCAGTCTCAGGGTTTCCAGGCACAGTCTTACGATCTTATCGTGGCGGCGAACGTGATTCACGCCA
CCCGCCATATTGGCCGCACGCTCGATAATCTGCGCCCCCTGCTCAAGCCGGGCGGGCGCCTGCTGATGCGCGAAATCACC
CAGCCAATGCGTCTGTTTGACTTCGTTTTCGGCCCGCTGGTTCTTCCGCTACAGGATCTCGACGCCCGCGAAGGTGAATT
ATTCCTCACCACCGCTCAGTGGCAGCAACAGTGCCGCCACGCCGGATTCAGCAAAGTGGCATGGCTACCGCAGGATGGCA
GCCCAACCGCCGGGATGAGCGAACATATCATTCTCGCCACGCTGCCCGGTCAGGCGGTTAGTGCCGTAACATTCACCGCG
CCATCAGAACCCGTGTTGGGGCAGGCGCTGACGGATAACGGTGATTATCTCGCCGACTGGTCTGATTGCGCAGGTCAGCC
CGAACGGTTTAACGCCCGCTGGCAGGAGGCCTGGCGTCTGCTCTCACAGCGTCATGGCGACGCTCTGCCTGTGGAACCGC
CCCTCGTCGCCGCCCCGGAGTGGCTGGGGGAGGTTCGCTTAAGCTGGCAAAACGAAGCCTTTTCCCGCGGTCAGATGCAC
GTTGAAGCCCGTCATCCTGATGGCGAGTGGCTGCCGCTATCGCCCGCCGCGCCTCTTCCTGCGCCGCAGACGCATTATCA
ATGGCGCTGGACGCCCCTCAACGTCGCCAGCGTTGACCATCCGCTTACCTTCAGCGCCGGTACGCTTGCGCGCAGCGACG
AGCTGGCGCAATACGGCATCATTCACGATCCGCACGCCTCTTCGCGACTGATGATTGTTGAGGAGAGCGAGGATACGCTG
GCCTTAGCGGAGAAAGTGATAGCAGCGCTCATCGCCAGCGCAGCCGGATTGATTGTGGTCACTCGCCGCGCGTGGCGAGT
CGAGGAAAATGAGGCACTCTCTGCGTCCCATCACGCGCTATGGGCCTTGCTTCGCGTCGCGGCCAACGAACAGCCGGAAC
GGTTGATTGCCGCCATCGATCTCGCCGAAAACACCCCGTGGGAAACACTGCATCAAGGGTTGAGCGCAGTCTCACTATCA
CAGCGCTGGCTCGCCGCGCGGGGTAACACCCTCTGGCTCCCTTCACTGGCGCTCAATACGGGATGCGCCGCTGAATTACC
AGCAAACGTGTTTACCGGCGATAACCGCTGGCATCTGGTGACCGGAGCGTTTGGCGGATTAGGCCGCCTTGCCGTGAACT
GGCTCAGAGAAAAAGGCGCGCGACGCATCGCCCTGCTGGCGCAGCGCGTGGATGAGTCATGGCTACGCGACGTGGAGGGC
GGGCAGACGCGCGTCTGCCGTTGTGATGTGGGCGATGCCGGGCAACTGGCCACGGTTCTTGACGATCTGGCGGCCAACGG
CGGCATTGCCGGAGCGATTCATGCCGCTGGCGTATTGGCTGACGCGCCCTTGCAGGAGCTTGATGACCACCAGCTGGCCA
CCGTTTTTGCGGTAAAAGCGCAGGCGGCAAACCAGCTGTTGCAAACCCTGCGCAACCACGACGGACGCTATCTTATTCTC
TACTCTTCCGCTGCCGCCACCCTCGGCGCGCCGGGTCAGAGCGCCCATGCGCTGGCCTGCGGCTACCTGGACGGGCTGGC
ACAGCAGTTTTCCACCCTTGATGCGCCGAAAACGCTCTCTGTCGCCTGGGGCGCATGGGGAGAAAGCGGTCGGGCGGCCA
CGCCGGAAATGCTGGCGACGCTCGCCAGCCGTGGTATGGGCGCGTTAAGCGATGCCGAAGGCTGCTGGCACCTGGAACAG
GCGGTGATGCGCGGCGCCCCGTGGCGACTGGCGATGCGCGTTTTTACCGACAAAATGCCCCCGTTACAACAGGCTCTGTT
TAACATCAGCGCCACAGAAAAAGCCGCAACGCCTGTCATTCCTCCTGCTGATGACAACGCCTTTAACGGCAGCCTGAGCG
ATGAAACGGCGGTGATAGCATGGCTGAAAAAGCGGATTGCGGTTCAGCTAAGGCTGAGCGATCCGGCGTCACTGCGCCCA
AACCAGGATCTGTTGCAACTCGGCATGGACTCGCTGCTCTTCCTTGAACTCAGTAGCGATATTCAGCACTACCTGGGTGT
ACGCATCAATGCGGAACGGGCGTGGCAGGATCTGTCTCCTCATGGACTCACGCAGCTTATCTGTTCTAAGCCAGAGACGA
CGCCTGCCGCTTCGCAGCCGGAAGTGTTGCAGCACGACGCCGACAAGCGTTATGCGCCCTTCCCTTTGACGCCCATTCAG
CACGCCTACTGGCTGGGGCGAACCCACCTCATTGGCTATGGCGGCGTCGCCTGTCACGTCCTGTTTGAGTGGGATAAACG
CCACGATGAGTTCGATCTCGCCATACTGGAGAAAGCATGGAACCAGCTTATCGCACGCCACGATATGTTGCGCATGGTGG
TTGATGCCGACGGGCAGCAGCGAGTCCTGGGGACAACGCCGGAGTATCACATCCAGCGTGACGATCTGCGCGCGCTTTCC
CCGGAAGAACAACGCATCGCGCTGGAAAAACGGCGGCATGAAATGAGCTATCGCGTTTTGCCTGCCGACCAGTGGCCTCT
TTTTGAGCTGGTGGTCAGCGAAATCGACGATTGCCATTACCGCCTGCACATGAACCTCGACCTTTTGCAGTTCGATGTGC
AGAGTTTTAAAGTCATGATGGACGATCTGGCGCAGGTCTGGCGCGGTGAAACGCTGGCGCCGCTCGCTATTACCTTCCGT
GATTATGTGATGGCTGAACAGGCGCGCCGACAGACATCGGCATGGCACGATGCCTGGGATTACTGGCAGGGAAAACTGCC
GCAACTGCCCTTAGCGCCAGAGCTGCCGGTGGTTGAGACGCGCCCGGAAACGCCACACTTCACCACCTTCAAATCGACGA
TCGGCAAGACAGAATGGCAGGCCGTGAAACAGCGCTGGCAGCAGCAAGGCGTCACACCGTCTGCCGCGCTGCTCACGCTG
TTTGCCGCCACCCTTGAGCGCTGGAGCCGCACCACGGCATTTACGCTGAACCTGACGTTCTTCAATCGCCAGCCGATCCA
TCCGCAAATCAACCAGTTGATTGGTGATTTTACCTCCGTCACGCTGGTTGATTTTAACTTCTCAACGCTGGTGACGTTGC
AAGAGCAGATGCAACAGACCCAACAGCGCCTCTGGCAAAACATGGCGCACAGTGAAATGAACGGTGTTGAGGTGATCCGT
GAGCTGGGCCGCCTGCGCGGGTCACAACGTCAACCGCTGATGCCGGTGGTGTTTACCAGTATGCTGGGGATGACGCTGGA
AGGCATGACTATCGATCAGGCGATGAGCCATCTGTTCGGCGAACCCTGCTATGTATTCACGCAAACACCGCAGGTCTGGC
TGGATCATCAGGTCATGGAGAGCGACGGCGAGTTGATGTTTAGCTGGTACTGCATGGACAACGTGCTGGAACCCGGCGCT
GCCGAGGCGATGTTTAATGACTACTGCGCCATCCTGCAAGCCGTCATCGCCGCCCCTGAAAGCCTGAAGACTCTCGCCAG
CGGTATCGCCGGGCACATTCCCCGTCGACGCTGGCCGCTGAACGCACAGGCGGACTACGACCTGCGGGATATTGAGCAGG
CGACGCTCGAATACCCCGGCATCCGGCAGGCCAGAGCGGAAATAACCGAACAGGGCGCGTTGACGCTGGATATCGTAATG
GCCGACGATCCGTCGCCATCAGCGGCGACGCCTGATGAGCACGAACTTACCCAACTGGCGCTGTCGTTGCCTGAGCAGGC
GCAGCTTGATGAGCTGGAGGCGACCTGGCGCTGGCTGGAGGCGCGTGCGCTACAGGGGATCGCGGCTACGCTAAATCGTC
ACGGCCTGTTTACCACGCCGGAGATCGCCCATCGCTTTAGCGCAATAGTACAGGCGCTGTCCGCGCAAGCGTCTCACCAG
CGTCTGCTGCGCCAGTGGCTACAGTGTCTGACGGAAAGAGCGTGGTTAATCCGCGAGGGTGAAAGCTGGCGCTGCCGCGT
TCCGCTCAGCGAGATTCCTGAGCCTCAGGAAGCGTGCCCGCCAAGCCAATGGAGCCAGGCGCTGGCGCAGTATCTGGAAA
CCTGCATCGCCCGGCACGACGCCCTCTTCTCCGGGCAGTGTTCACCGCTGGAATTGCTGTTCAACGAGCAGCATCGTGTG
ACCGACGCGCTGTATCGCGACAACCCCGCCAGCGCCTGTCTGAATCGCTATACCGCGCAGATTGCCGCCTTGTGCGGCGC
AGAACGGATTCTGGAGGTTGGCGCCGGAACCGCAGCCACTACCGCGCCGGTGCTGAAGGCCACGCGGAACACGCGGAAGT
CGTACCACTTCACGGACGTCTCCGCGCAGTTCCTCAATGACGCCAGAGCCCGTTTCCATGATGAATCGCGGGTGTCTTAT
GCCTTGTTCGACATCAACCAGCCGCTGGATTTCACCGCCCACCCGGAGGCGGGTTACGACCTGATCGTTGCCGTCAATGT
GCTCCACGACGCCAGCCATGTCGTCCAGACGTTGCGCAGATTAAAACTGTTGCTGAAAGCCGGCGGACGTTTGCTGATCG
TTGAAGCGACGGAGCGAAACAGCGTATTCCAGCTGGCAAGCGTGGGCTTTATTGAGGGATTAAGCGGATACCGCGATTTC
CGCCGCCGGGATGAGAAACCGATGCTCACCCGCTCCGCATGGCAGGAGGTTCTCGTTCAGGCCGGTTTTGCAAACGAGCT
GGCGTGGCCCGCGCAGGAATCGTCGCCGCTGCGCCAGCATCTGCTGGTGGCGCATTCGCCTGGCGTAAATCGCCCGGATA
AAGAAGCCGTGAGCCGCTATTTACAGCAGCGCTTTGGCACCGGTCTTCCCGTTTTACAGATCCGGCAAAGAGAAGCGTTA
TTTACGCCGCTGCATGCCCCGTCTGATGCGCTGATTGAGCCAGCCAAACCCACGCCAGTTGCCGGGGGGAATCCGGCGCT
GGAAAAACAGGTGGCTGAACTCTGGCAATCGCTGCTGTCTCGCCCCGTGGCAAGGCATCACGACTTTTTCGAACTGGGCG
GCGACAGCCTGATGGCGACAAGGATGGTCGCGCAGCTAAACCGGAGAGGGATTGCCAGGGCTAACCTTCAGGATCTGTTC
AGCCATTCGACGCTGAGCGACTTCTGCGCCCATCTACAGGCGGCTACGTCAGGAGAGGACAACCCGATTCCCCTTTGCCA
GGGCGACGGCGACGAAACCCTGTTCGTCTTCCACGCTTCGGACGGCGATATCAGCGCCTGGCTGCCGCTCGCCAGCGCGC
TGAACAGGCGTGTTTTCGGCCTGCAAGCAAAATCGCCGCAGCGCTTTGCCACGCTTGACCAGATGATCGATGAGTATGTC
GGGTGCATCCGTCGCCAGCAGCCTCACGGCCCTTATGTACTGGCGGGTTGGTCGTATGGCGCGTTTCTCGCGGCGGGCGC
CGCACAGCGCCTGTACGCCAAAGGCGAGCAGGTTAGGATCGCGTTAATCGATCCCGTGTGCCGACAGGATTTCTGTTGCG
AAAACCGGGCGGCCCTGCTGCGCCTGTTAGCCGAAGGACAAACGCCTCTGGCACTGCCCGAACATTTCGACCAGCAGACG
CCCGACAGCCAGCTTGCCGACTTTATCAGCCTCGCTAAAACGGCCGGTATGGTGTCGCAAAACCTGACGCTGCAAGCGGC
AGAAACGTGGCTCGACAACATCGCGCATCTGCTGCGTTTACTGACTGAGCATACGCCGGGCGAAAACGTTCCTGTCCCCT
GTCTCATGGTGTATGCCGCCGGGAGACCCGCGCGCTGGACGCCAGCAGAAACCGAGTGGCAGGGCTGGATAAACAACGCC
GACGACGCTGTGATTGAAGCCAGCCACTGGCAAATCATGATGGAAGCCCCTCACGTTCAGGTTTGTGCGCAACACATTAC
GCGCTGGCTTTGCGCAACCTCAACGCAACCGGAGAACACGTTATGA

Protein sequence :
MDNLRFSSAPTADSIDASIAQHYPDCEPVAVIGYACHFPESPDGETFWQNLLEGRECSRRFTREELLAVGLDAAIIDDPH
YVNIGTVLDNADCFDATLFGYSRQEAESMDPQQRLFLQAVWHALEHAGYAPGAVPHKTGVFASSRMSTYPGREALNVTEV
AQVKGLQSLMGNDKDYIATRAAYKLNLHGPALSVQTACSSSLVAVHLACESLRAGESDMAVAGGVALSFPQQAGYRYQPG
MIFSPDGHCRPFDASAEGTWAGNGLGCVVLRRLRDALLSGDPIISVILSSAVNNDGNRKVGYTAPSVAGQQAVIEEALML
AAIDDRQVGYIETHGTGTPLGDAIEIEALRNVYAPRPQDQRCALGSVKSNMGHLDTAAGIAGLLKTVLAVSRGQIPPLLN
FHTPNPALKLEESPFTIPMSAQAWQDEMRYAGVSSFGIGGTNCHMIVASLPDALNARLPNTDSGRKSTALLLSAASDSAL
RRLATDYAGALRENTDASDLAFTALHARRLDLPFRLAAPLNRETAAALSDWAGEKSGALVYSGHGASGKQVWLFTGQGSH
WRTMGQTMYQHSTAFADMLDRCFSACSEMLTPSLREAMFNPDSAQLDNMAWAQPAIVAFEIAMAAHWHAEGLKPDFAIGH
SVGEFAAAVVCGHYTIEQVMPLVCRRGALMQQCASGAMVAVFADEDTLMPLARQFELDLAANNGTQHTVFSGPEARLAVF
CTTLSQHNINYRRLSVTGAAHSALLEPILDRFQDACAGLHAEPGQIPIISTLTADVIDESTLNQADYWRRHMRQPVRFIQ
SIQMAHQLGARVFLEMGPDAQLVASGQREYRDNAYWIASARRNKEASDVLNQALLQLYAAGVALPWTDLLAGDGQRIAAP
CYPFDTERYWKERVSPACEPADAALSAGLEVASRAATALDLPRLEALKQCATRLHAIYVDQLVQRCTGDAIENGVDAITI
IRRGRLLPRYQQLLQRLLNNCVVDGDYRCTDGRYVRAHPIEHQQRESLLTELAGYCEGFQAIPDTIARAGDRLYDMMSGA
EEPVAIIFPQSASDGVEVLYQEFSFGRYFNQIAAGVLRGIVQTRQPRQSLRILEVGGGTGGTTAWLLPELNGVPALEYHF
TDISALFTRRAQQKFADYDFVKYSELDLEKEAQSQGFQAQSYDLIVAANVIHATRHIGRTLDNLRPLLKPGGRLLMREIT
QPMRLFDFVFGPLVLPLQDLDAREGELFLTTAQWQQQCRHAGFSKVAWLPQDGSPTAGMSEHIILATLPGQAVSAVTFTA
PSEPVLGQALTDNGDYLADWSDCAGQPERFNARWQEAWRLLSQRHGDALPVEPPLVAAPEWLGEVRLSWQNEAFSRGQMH
VEARHPDGEWLPLSPAAPLPAPQTHYQWRWTPLNVASVDHPLTFSAGTLARSDELAQYGIIHDPHASSRLMIVEESEDTL
ALAEKVIAALIASAAGLIVVTRRAWRVEENEALSASHHALWALLRVAANEQPERLIAAIDLAENTPWETLHQGLSAVSLS
QRWLAARGNTLWLPSLALNTGCAAELPANVFTGDNRWHLVTGAFGGLGRLAVNWLREKGARRIALLAQRVDESWLRDVEG
GQTRVCRCDVGDAGQLATVLDDLAANGGIAGAIHAAGVLADAPLQELDDHQLATVFAVKAQAANQLLQTLRNHDGRYLIL
YSSAAATLGAPGQSAHALACGYLDGLAQQFSTLDAPKTLSVAWGAWGESGRAATPEMLATLASRGMGALSDAEGCWHLEQ
AVMRGAPWRLAMRVFTDKMPPLQQALFNISATEKAATPVIPPADDNAFNGSLSDETAVIAWLKKRIAVQLRLSDPASLRP
NQDLLQLGMDSLLFLELSSDIQHYLGVRINAERAWQDLSPHGLTQLICSKPETTPAASQPEVLQHDADKRYAPFPLTPIQ
HAYWLGRTHLIGYGGVACHVLFEWDKRHDEFDLAILEKAWNQLIARHDMLRMVVDADGQQRVLGTTPEYHIQRDDLRALS
PEEQRIALEKRRHEMSYRVLPADQWPLFELVVSEIDDCHYRLHMNLDLLQFDVQSFKVMMDDLAQVWRGETLAPLAITFR
DYVMAEQARRQTSAWHDAWDYWQGKLPQLPLAPELPVVETRPETPHFTTFKSTIGKTEWQAVKQRWQQQGVTPSAALLTL
FAATLERWSRTTAFTLNLTFFNRQPIHPQINQLIGDFTSVTLVDFNFSTLVTLQEQMQQTQQRLWQNMAHSEMNGVEVIR
ELGRLRGSQRQPLMPVVFTSMLGMTLEGMTIDQAMSHLFGEPCYVFTQTPQVWLDHQVMESDGELMFSWYCMDNVLEPGA
AEAMFNDYCAILQAVIAAPESLKTLASGIAGHIPRRRWPLNAQADYDLRDIEQATLEYPGIRQARAEITEQGALTLDIVM
ADDPSPSAATPDEHELTQLALSLPEQAQLDELEATWRWLEARALQGIAATLNRHGLFTTPEIAHRFSAIVQALSAQASHQ
RLLRQWLQCLTERAWLIREGESWRCRVPLSEIPEPQEACPPSQWSQALAQYLETCIARHDALFSGQCSPLELLFNEQHRV
TDALYRDNPASACLNRYTAQIAALCGAERILEVGAGTAATTAPVLKATRNTRKSYHFTDVSAQFLNDARARFHDESRVSY
ALFDINQPLDFTAHPEAGYDLIVAVNVLHDASHVVQTLRRLKLLLKAGGRLLIVEATERNSVFQLASVGFIEGLSGYRDF
RRRDEKPMLTRSAWQEVLVQAGFANELAWPAQESSPLRQHLLVAHSPGVNRPDKEAVSRYLQQRFGTGLPVLQIRQREAL
FTPLHAPSDALIEPAKPTPVAGGNPALEKQVAELWQSLLSRPVARHHDFFELGGDSLMATRMVAQLNRRGIARANLQDLF
SHSTLSDFCAHLQAATSGEDNPIPLCQGDGDETLFVFHASDGDISAWLPLASALNRRVFGLQAKSPQRFATLDQMIDEYV
GCIRRQQPHGPYVLAGWSYGAFLAAGAAQRLYAKGEQVRIALIDPVCRQDFCCENRAALLRLLAEGQTPLALPEHFDQQT
PDSQLADFISLAKTAGMVSQNLTLQAAETWLDNIAHLLRLLTEHTPGENVPVPCLMVYAAGRPARWTPAETEWQGWINNA
DDAVIEASHWQIMMEAPHVQVCAQHITRWLCATSTQPENTL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
irp1 YP_001006816.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 100
irp1 CAA73127.1 HMWP1 protein Virulence HPI Protein 0.0 99
irp1 NP_993006.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 98
irp1 YP_070123.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 98
irp1 YP_002346901.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 98
irp1 NP_669707.1 HMWP1 nonribosomal peptide/polyketide synthase Virulence HPI Protein 0.0 98
irp1 CAA21391.1 - Virulence HPI Protein 0.0 98
irp1 YP_853076.1 yersiniabactin biosynthetic protein Virulence PAI IV APEC-O1 Protein 0.0 98