Gene Information

Name : irp (EC55989_2207)
Accession : YP_002403253.1
Strain : Escherichia coli 55989
Genome accession: NC_011748
Putative virulence/resistance : Virulence
Product : High-molecular-weight nonribosomal peptide/polyketide synthetase 2 (HMWP2)
Function : -
COG functional category : Q : Secondary metabolites biosynthesis, transport and catabolism
COG ID : COG1020
EC number : -
Position : 2241989 - 2248114 bp
Length : 6126 bp
Strand : +
Note : Evidence 2a : Function of homologous gene experimentally demonstrated in an other organism; PubMedId : 15719346, 15582399, 8366034, 11927258, 9709002; Product type e : enzyme

DNA sequence :
GTGCCATCAGGAGGAAGAATGATTTCTGGCGCACCATCTCAGGATTCGCTGTTACCGGACAACCGCCACGCGGCTGATTA
CCAACAATTACGCGAGCGGCTCATACAGGAACTGAATTTAACGCCGCAGCAGTTACATGAAGAGAGCAACCTGATCCAGG
CCGGCCTGGATTCCATAAGATTGATGAGATGGTTACACTGGTTTCGTAAAAATGGCTACCGCCTTACCCTTCGCGAGCTG
TATGCCGCCCCCACGCTGGCGGCATGGAACCAGTTAATGCTCAGCCGGTCGCCGGAGAACGCGGAAGAAGAAACGCCGCC
CGACGAATCATCCTGGCCGAACATGACCGAAAGTACCCCCTTCCCATTGACGCCAGTACAGCACGCCTACCTGACGGGCC
GCATGCCGGGGCAGACGCTTGGCGGCGTGGGTTGCCACCTGTATCAGGAGTTTGAAGGCCATTGTCTGACGGCGTCGCAG
CTGGAGCAGGCCATCACGACCTTGCTGCAACGCCACCCAATGCTGCATATCGCCTTTCGCCCCGACGGGCAGCAGGTCTG
GCTACCGCAACCTTACTGGAACGGCGTCACCGTTCATGATTTACGCCATAACGACGCTGAAAGCCGCCAGGCCTATCTGG
ACGCACTGCGCCAGCGCCTGAGCCACCGTCTTTTACGCGTGGAAATCGGCGAAACGTTTGATTTTCAGCTGACGCTCTTG
CCGGACAATCGCCACCGCCTCCATGTCAATATTGACCTGCTGATTATGGATGCCTCCAGCTTTACGCTTTTCTTCGATGA
GCTTAACGCCCTGCTGGCCGGAGAATCGCTGCCGGCTATCGACACCCGCTATGATTTCCGCTCGTATTTGCTGCACCAGC
AGAAGATCAATCAACCACTGAGAGACGTCGCGCGCGCTTACTGGCTGGCGAAAGCATCGACGCTTCCCCCCGCGCCCGTC
TTGCCGCTGGCCTGCGAACCCGCCACGCTACGTGAAGTCCGTAATACCCGACGCCGCATGATTGTCCCGGCAACACGCTG
GCACGCCTTTAGCAACCGGGCCGGCGAGTATGGCGTGACGCCGACAATGGCACTGGCGACCTGTTTTTCTGCCGTGCTGG
CTCGCTGGGGCGGCCTGACGCGTCTGCTGCTTAACATCACCTTATTCGACCGCCAGCCGCTGCAACCGGCGGTTGGCGCG
ATGCTTGCCGACTTCACCAATATTCTTCTGCTGGATACCGCCTGCGATGGCGATACCGTCAGCAACCTGGCGCGTAAAAA
CCAGCTCACGTTTACGGAGGACTGGGAGCATCGCCACTGGTCCGGCGTCGAATTACTCCGTGAACTCAAACGCCAGCAGC
GCTACCCCCACGGCGCCCCGGTGGTATTTACCAGCAATCTGGGGCGTTCCCTCTACAGCAGCCGCGCAGAATCGCCGTTG
GGCGAGCCGGAATGGGGCATCTCGCAAACGCCGCAGGTCTGGATAGATCATCTGGCGTTCGAGCATCACGGCGAGGTCTG
GCTACAATGGGACAGCAACGACGCGCTGTTCCCTCCGGCGTTAGTCGAAACATTGTTCGACGCCTACTGCCAGTTGATTA
ACCAACTCTGCGATGACGAAAGCGCCTGGCAAAAGCCGTTCGCAGATATGATGCCCGCCAGCCAGCGCGCGATACGCGAA
CGGGTCAACGCCACCGGCGCCCCCATTCCCGAAGGCTTGCTGCATGAAGGCATTTTCCGTATCGCTCTGCAACAGCCGCA
GGCGCTGGCGGTAACGGACATGCGTTATCAGTGGAATTATCATGAGCTGACAGACTATGCCCGCCGTTGCGCGGGCAGGT
TAATCGAGTGCGGGGTTCAGCCCGGCGATAATGTGGCTATCACGATGTCGAAAGGCGCAGGACAACTTGTTGCGGTTCTG
GCCGTCCTGCTGGCCGGGGCGGTTTACGTTCCGGTTTCGCTGGATCAGCCTGCCGCACGGCGCGAGAAAATCTACGCTGA
CGCCAGCGTCCGGCTGGTGCTCATTTGTCAGCACGACGCCAGCGCCGGGTCAGACGATATTCCCGCCCTTGCCTGGCAGC
AGGCCATTGAGGCGGAGCCGATCGCCAACCCGGTAGTACGCGCCCCCACGCAACCGGCCTACATTATCTACACCTCCGGC
TCTACCGGTACGCCGAAAGGGGTAGTCATTTCTCACCGGGGAGCGCTTAACACCTGTTGCGATATCAATACCCGCTATCA
GGTTGGCCCGCATGACAGGGTGCTGGCCCTCTCCGCCCTACATTTTGATTTATCGGTTTACGACATTTTTGGCGTACTGC
GCGCGGGCGGCGCGCTGGTGATGGTGATGGAAAATCAACGGCGCGATCCTCACGCATGGTGTGAGCTGATCCAGCGCCAT
CAGGTCACGCTCTGGAACAGCGTCCCGGCGCTGTTCGATATGCTGCTGACCTGGTGTGAAGGTTTCGCCGACGCCACGCC
GGAAAACCTGCGCGCAGTGATGCTTTCCGGCGACTGGATCGGGCTTGACCTCCCCGCCCGTTATCGGGCCTTCCGGCCAC
AAGGACAATTTATCGCGATGGGCGGCGCCACCGAGGCGTCTATCTGGTCTAACGCCTGCGAAATTCACGACGTCCCCGCC
CACTGGCGCTCCATCCCTTACGGTTTTCCGCTAACCAACCAACGCTACCGGGTGGTGGATGAACAGGGCCGGGACTGCCC
TGACTGGGTGCCGGGTGAATTATGGATTGGCGGCATTGGGGTCGCGGAAGGCTATTTCAACGATCCCCTGCGTAGCGAGC
AGCAATTTTTGACGCTCCCGGACGAGCGCTGGTATCGCACCGGCGATCTCGGCTGCTACTGGCCAGATGGCACAATCGAG
TTCCTCGGTCGTCGCGACAAGCAGGTCAAAGTCGGAGGATATCGCATCGAGCTGGGCGAAATCGAAAGCGCGCTCAGCCA
GCTGGTGGGGGTGAAACAAGCAACCGTTCTGGCGATCGGCGAAAAAGAAAAAACGCTGGCGGCATACGTTGTTCCTCAGG
GCGAGGCTTTTTGCGTTACCGATCATCGGAACCCGGCACTGCCGCAGGCGTGGCACACGCTTGCGGGAACGTTGCCCTGT
TGCGCCATCTCGCCAGAGATCTCCGCAGAACAGGTAGCCGATTTCCTTCAGCATCGCCTGCTAAAACTGAAGCCGGGTCA
CACCGCTGGCGCCGATCCTCTCCCCCTGATGAACTCACTCGCTATCCAGCCGCGCTGGCAGGCCGTGGTGGAACGCTGGT
TAGCATTTCTGGTGACACAACGGCGACTGAAGCCCGCTGCTGAAGGTTATCAGGTCTGCGCTGGTGAAGAACGCGAGGAT
GAGCACCCGCACTTCAGCGGACATGATTTAACGTTATCGCAAATTCTTCGCGGTGCCCGTAACGAACTGTCGTTACTGAA
CGACGCGCAGTGGTCGCCGGAAAGCCTGGCCTTTAACCATCCGGCCAGTGCCCCGTATATTCAGGAACTGGCGACAATTT
GCCAACAGCTTGCACAGCGCTTACAGCGCCCGGTACGCCTGCTTGAGGTGGGAACCCGCACTGGCCGCACCGCAGAATCG
CTGTTAGCACAGCTCAACGCCGGACAGATTGAGTATGTCGGGCTTGAGCAGAGCCAGGAGATGCTGCTGAGCGCCCGGCA
GAGGCTCGCCCCCTGGCCTGGCGCCCGTCTGTCCCTCTGGAATGCAGACACGCTGGCGGCGCACGCTCACTCGGCGGACA
TTATCTGGCTTAATAACGCCCTGCATCGTCTGCTGCCGGAAGATCCCGGGCTCCTTGCGACATTACAACAGCTTGCCGTT
CCCGGCGCGCTGCTCTACGTGATGGAGTTTCGCCAGTTAACGCCGTCCGCCCTACTCAGCACGCTCCTGTTAACCAATGG
GCAGCCGGAGGCCTTGCTGCATAACAGCGCCGACTGGGCGGCATTATTTAGCGCGGCCGGCTTCAACTGTCAGCATGGCG
ATGAGGTCGCGGGGTTACAACGCTTCCTCGTACAATGTCCTGACAGGCAGGTGCGCCGCGATCCCCGTCAACTTCAGGCC
GCCCTCGCCGGGCGTCTGCCGGGGTGGATGGTGCCGCAACGGATCGTCTTCCTCGACGCCTTACCGCTGACAGCTAACGG
GAAAATTGACTACCAGGCGCTGAAGCGTCGTCATACCCCTGAAGCGGAAAACCCGGCCGAAGCGGATTTACCCCAGGGCG
ACATTGAAAAACAGGTTGCCGCCCTCTGGCAGCAACTCTTATCAACTGGCAATGTCACCAGAGAAACCGACTTCTTCCAG
CAAGGCGGCGATAGCCTGCTGGCGACCCGTCTGACCGGGCAACTTCATCAGGCAGGTTATGAAGCGCAATTAAGCGACCT
GTTTAATCATCCCCGGCTGGCGGATTTTGCCGCCACGCTGCGGAAAACCGACGTCCCGGTCGAACAACCATTCGTCCACT
CCCTTGAAGATCGCTACCAGCCCTTTGCGCTTACCGACGTGCAGCAGGCTTACCTGGTGGGGCGTCAGCCGGGCTTTGCC
CTGGGCGGCGTCGGCTCACATTTCTTTGTTGAATTTGAAATTGCCGATCTGGACCTCACCCGGCTGGAGACGGTCTGGAA
CCGATTAATCGCCCGCCACGATATGCTGCGCGCCATCGTGCGTGATGGACAGCAACAGGTGCTCGAACAGACGCCCCCCT
GGGTGATACCCGCACACACCCTCCATACGCCTGAAGAGGCGTTGCGGGTGCGCGAAAAACTGGCGCATCAGGTACTCAAC
CCCGAAGTGTGGCCGGTATTCGATCTCCAGGTCGGATACGTGGACGGGATGCCTGCCCGCCTGTGGCTGTGTCTGGATAA
CCTGTTGCTTGACGGTCTGAGCATGCAGATCCTGCTGGCGGAGCTGGAGCACGGCTACCGCTACCCGCAACAGCTGCTTC
CGCCGCTGCCCGTCACCTTCAGGGATTATCTGCAACAACCCTCGCTACAGTCGCCCAATCCAGATTCTCTGGCATGGTGG
CAGGCGCAGCTTGATGATATTCCTCCGGCGCCTGCGTTGCCGCTGCGCTGCTTGCCTCAGGAGGTTGAAACACCGCGCTT
CGCCCGCCTGAACGGCGCACTGGACAGCACGCGCTGGCATCGGCTGAAAAAACGGGCGGCTGACGCCCATCTCACCCCGT
CGGCCGTACTGTTGTCGGTGTGGTCAACGGTTCTCTCTGCATGGAGTGCGCAGCCTGACTTCACGCTTAACCTTACGCTT
TTCGACAGGCGACCGCTGCACCCGCAAATCAACCAGATTCTGGGCGATTTCACCTCGCTGATGCTGCTGAGCTGGCATCC
CGGCGAAAGCTGGCTGCACAGCGCGCAGTCACTACAGCAGCAGCTGAGCCAGAACCTCAACCACCGCGATGTGTCAGCCA
TCCGCGTGATGCGTCAACTGGCGCAACGGCAAAACGTGCCAGCCGTTCCGATGCCCGTCGTCTTTACCAGCGCGCTGGGC
TTTGAGCAGGATAACTTCCTCGCCCGGCGTAATCTGCTCAAACCGGTCTGGGGCATCTCCCAGACGCCGCAGGTCTGGCT
CGATCACCAGATTTATGAATCCGAAGGCGAACTGCGCTTTAACTGGGATTTTGTCGCCGCGCTGTTTCCTGCCGGGCAGG
TGGAGCGCCAGTTTGAACAGTATTGCGCATTGCTAAACCGAATGGCCGAGGATGAAAGCGGCTGGCAACTGCCGCTCGCC
GCGCTGGTGCCTCCCGTTAAACACGCAGGGCAATGCGCAGAGCGCTCACCGCGCGTATGCCCTGAGCACTCTCAGCCACA
CATTGCGGCGGACGAGAGCACCGTCAGCCTGATTTGCGACGCCTTCCGCGAGGTGGTTGGCGAGTCTGTCACGCCCGCAG
AAAACTTCTTTGAGGCGGGCGCAACGTCGCTGAATCTGGTGCAACTGCACGTTTTGTTACAACGTCACGAATTTTCCACC
CTGACGTTGCTTGACCTCTTCACCCACCCTTCTCCTGCTGCCCTGGCCGATTATCTGGCCGGCGTCGCCACGGTGGAGAA
AACACAACGACCTCGCCCTGTTCGCCGTCGTCAGCGGCGGATATAG

Protein sequence :
MPSGGRMISGAPSQDSLLPDNRHAADYQQLRERLIQELNLTPQQLHEESNLIQAGLDSIRLMRWLHWFRKNGYRLTLREL
YAAPTLAAWNQLMLSRSPENAEEETPPDESSWPNMTESTPFPLTPVQHAYLTGRMPGQTLGGVGCHLYQEFEGHCLTASQ
LEQAITTLLQRHPMLHIAFRPDGQQVWLPQPYWNGVTVHDLRHNDAESRQAYLDALRQRLSHRLLRVEIGETFDFQLTLL
PDNRHRLHVNIDLLIMDASSFTLFFDELNALLAGESLPAIDTRYDFRSYLLHQQKINQPLRDVARAYWLAKASTLPPAPV
LPLACEPATLREVRNTRRRMIVPATRWHAFSNRAGEYGVTPTMALATCFSAVLARWGGLTRLLLNITLFDRQPLQPAVGA
MLADFTNILLLDTACDGDTVSNLARKNQLTFTEDWEHRHWSGVELLRELKRQQRYPHGAPVVFTSNLGRSLYSSRAESPL
GEPEWGISQTPQVWIDHLAFEHHGEVWLQWDSNDALFPPALVETLFDAYCQLINQLCDDESAWQKPFADMMPASQRAIRE
RVNATGAPIPEGLLHEGIFRIALQQPQALAVTDMRYQWNYHELTDYARRCAGRLIECGVQPGDNVAITMSKGAGQLVAVL
AVLLAGAVYVPVSLDQPAARREKIYADASVRLVLICQHDASAGSDDIPALAWQQAIEAEPIANPVVRAPTQPAYIIYTSG
STGTPKGVVISHRGALNTCCDINTRYQVGPHDRVLALSALHFDLSVYDIFGVLRAGGALVMVMENQRRDPHAWCELIQRH
QVTLWNSVPALFDMLLTWCEGFADATPENLRAVMLSGDWIGLDLPARYRAFRPQGQFIAMGGATEASIWSNACEIHDVPA
HWRSIPYGFPLTNQRYRVVDEQGRDCPDWVPGELWIGGIGVAEGYFNDPLRSEQQFLTLPDERWYRTGDLGCYWPDGTIE
FLGRRDKQVKVGGYRIELGEIESALSQLVGVKQATVLAIGEKEKTLAAYVVPQGEAFCVTDHRNPALPQAWHTLAGTLPC
CAISPEISAEQVADFLQHRLLKLKPGHTAGADPLPLMNSLAIQPRWQAVVERWLAFLVTQRRLKPAAEGYQVCAGEERED
EHPHFSGHDLTLSQILRGARNELSLLNDAQWSPESLAFNHPASAPYIQELATICQQLAQRLQRPVRLLEVGTRTGRTAES
LLAQLNAGQIEYVGLEQSQEMLLSARQRLAPWPGARLSLWNADTLAAHAHSADIIWLNNALHRLLPEDPGLLATLQQLAV
PGALLYVMEFRQLTPSALLSTLLLTNGQPEALLHNSADWAALFSAAGFNCQHGDEVAGLQRFLVQCPDRQVRRDPRQLQA
ALAGRLPGWMVPQRIVFLDALPLTANGKIDYQALKRRHTPEAENPAEADLPQGDIEKQVAALWQQLLSTGNVTRETDFFQ
QGGDSLLATRLTGQLHQAGYEAQLSDLFNHPRLADFAATLRKTDVPVEQPFVHSLEDRYQPFALTDVQQAYLVGRQPGFA
LGGVGSHFFVEFEIADLDLTRLETVWNRLIARHDMLRAIVRDGQQQVLEQTPPWVIPAHTLHTPEEALRVREKLAHQVLN
PEVWPVFDLQVGYVDGMPARLWLCLDNLLLDGLSMQILLAELEHGYRYPQQLLPPLPVTFRDYLQQPSLQSPNPDSLAWW
QAQLDDIPPAPALPLRCLPQEVETPRFARLNGALDSTRWHRLKKRAADAHLTPSAVLLSVWSTVLSAWSAQPDFTLNLTL
FDRRPLHPQINQILGDFTSLMLLSWHPGESWLHSAQSLQQQLSQNLNHRDVSAIRVMRQLAQRQNVPAVPMPVVFTSALG
FEQDNFLARRNLLKPVWGISQTPQVWLDHQIYESEGELRFNWDFVAALFPAGQVERQFEQYCALLNRMAEDESGWQLPLA
ALVPPVKHAGQCAERSPRVCPEHSQPHIAADESTVSLICDAFREVVGESVTPAENFFEAGATSLNLVQLHVLLQRHEFST
LTLLDLFTHPSPAALADYLAGVATVEKTQRPRPVRRRQRRI

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
irp2 YP_001006815.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 99
irp2 YP_002346902.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 99
irp2 NP_669706.1 HMWP2 nonribosomal peptide synthetase Virulence HPI Protein 0.0 99
irp2 YP_070124.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 99
irp2 NP_993007.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 99
irp2 CAA21390.1 - Virulence HPI Protein 0.0 99
irp2 YP_853075.1 yersiniabactin biosynthetic protein Virulence PAI IV APEC-O1 Protein 0.0 99
PMI2599 YP_002152317.1 non-ribosomal peptide synthase Not tested Not named Protein 0.0 41