Name : irp2 (EC042_2213) Accession : YP_006096519.1 Strain : Escherichia coli 042 Genome accession: NC_017626 Putative virulence/resistance : Virulence Product : non-ribosomal peptide synthase (yersiniabactin siderophore biosynthetic protein) Function : - COG functional category : - COG ID : - EC number : - Position : 2282655 - 2288780 bp Length : 6126 bp Strand : + Note : - DNA sequence : GTGCCATCAGGAGGAAGAATGATTTCTGGCGCACCATCTCAGGATTCGCTGTTACCGGACAACCGCCACGCGGCTGATTA CCAACAATTACGCGAGCGGCTCATACAGGAACTGAATTTAACGCCGCAGCAGTTACATGAAGAGAGCAACCTGATCCAGG CCGGCCTGGATTCCATAAGATTGATGAGATGGTTACACTGGTTTCGTAAAAATGGCTACCGCCTTACCCTTCGCGAGCTG TATGCCGCCCCCACGCTGGCGGCATGGAACCAGTTAATGCTCAGCCGGTCGCCGGAGAACGCGGAAGAAGAAACGCCGCC CGACGAATCATCCTGGCCGAACATGACCGAAAGTACCCCCTTCCCATTGACGCCAGTACAGCACGCCTACCTGACGGGCC GCATGCCGGGGCAGACGCTTGGCGGCGTGGGTTGCCACCTGTATCAGGAGTTTGAAGGCCATTGTCTGACGGCGTCGCAG CTGGAGCAGGCCATCACGACCTTGCTGCAACGCCACCCAATGCTGCATATCGCCTTTCGCCCCGACGGGCAGCAGGTCTG GCTACCGCAACCTTACTGGAACGGCGTCACCGTTCATGATTTACGCCATAACGACGCTGAAAGCCGCCAGGCCTATCTGG ACGCACTGCGCCAGCGCCTGAGCCACCGTCTTTTACGCGTGGAAATCGGCGAAACGTTTGATTTTCAGCTGACGCTCTTG CCGGACAATCGCCACCGCCTCCATGTCAATATTGACCTGCTGATTATGGATGCCTCCAGCTTTACGCTTTTCTTCGATGA GCTTAACGCCCTGCTGGCCGGAGAATCGCTGCCGGCTATCGACACCCGCTATGATTTCCGCTCGTATTTGCTGCACCAGC AGAAGATCAATCAACCACTGAGAGACGACGCGCGCGCTTACTGGCTGGCGAAAGCATCGACGCTTCCCCCCGCGCCCGTC TTGCCGCTGGCCTGCGAACCCGCCACGCTATGTGAAGTCCGTAATACCCGACGCCGCATGATTGTCCCGGCAACACGCTG GCACGCCTTTAGCAACCGGGCCGGCGAGTATGGCGTGACGCCGACAATGGCACTGGCGACCTGTTTTTCTGCCGTGCTGG CTCGCTGGGGCGGCCTGACGCGTCTGCTGCTTAACATCACCTTATTCGACCGCCAGCCGCTGCACCCGGCGGTTGGCGCG ATGCTTGCCGACTTCACCAATATTCTTCTGCTGGATACCGCCTGCGATGGCGATACCGTCAGCAACCTGGCGCGTAAAAA CCAGCTCACGTTTACGGAGGACTGGGAGCATCGCCACTGGTCCGGCGTCGAATTACTCCGTGAACTCAAACGCCAGCAGC GCTACCCCCACGGCGCCCCGGTGGTATTTACCAGCAATCTGGGGCGTTCCCTCTACAGCAGCCGCGCAGAATCGCCGTTG GGCGAGCCGGAATGGGGCATCTCGCAAACGCCGCAGGTCTGGATAGATCATCTGGCGTTCGAGCATCACGGCGAGGTCTG GCTACAATGGGACAGCAACGACGCGCTGTTCCCTCCGGCGTTAGTCGAAACATTGTTCGACGCCTACTGCCAGTTGATTA ACCAACTCTGCGATGACGAAAGCGCCTGGCAAAAGTCGTTCGCAGATATGATGCCCGCCAGCCAGCGCGCGATACGCGAA CGGGTCAACGCCACCGGCGCCCCCATTCCCGAAGGCTTGCTGCATGAAGGCATTTTCCGTATCGCTCTGCAACAGCCGCA GGCGCTGGCGGTAACGGACATGCGTTATCAGTGGAATTATCATGAGCTGACAGACTATGCCCGCCGTTGCGCGGGCAGGT TAATCGAGTGCGGGGTTCAGCCCGGCGATAATGTGGCTATCACGATGTCGAAAGGCGCAGGACAACTTGTTGCGGTTCTG GCCGTCCTGCTGGCCGGGGCGGTTTACGTTCCGGTTTCGCTGGATCAGCCTGCCGCACGGCGCGAGAAAATCTACGCTGA CGCCAGCGTCCGGCTGGTGCTCATTTGTCAGCACGACGCCAGCGCCGGGTCAGACGATATTCCCGTCCTTGCCTGGCAGC AGGCCATTGAGGCGGAGCCGATCGCCAACCCGGTAGTACGCGCCCCCACGCAACCGGCCTACATTATCTACACCTCCGGC TCTACCGGTACGCCGAAAGGGGTAGTCATTTCTCACCGGGGAGCGCTTAACACCTGTTGCGATATCAATACCCGCTATCA GGTTGGCCCGCATGACAGGGTGCTGGCCCTCTCCGCCCTACATTTTGATTTATCGGTTTACGACATTTTTGGCGTACTGC GCGCGGGCGGCGCGCTGGTGATGGTGATGGAAAATCAACGGCGCGATCCTCACGCATGGTGTGAGCTGATCCAGCGCCAT CAGGTCACGCTCTGGAACAGCGTCCCGGCGCTGTTCGATATGCTGCTGACCTGGTGTGAAGGTTTCGCCGACGCCACGCC GGAAAACCTGCGCGCAGTGATGCTTTCCGGCGACTGGATCGGGCTTGACCTCCCCGCCCGTTATCGGGCCTTCCGGCCAC AAGGACAATTTATCGCGATGGGCGGTGCCACCGAGGCGTCTATCTGGTCTAACGCCTGCGAAATTCACGACGTCCCCGCC CACTGGCGCTCCATCCCTTACGGTTTTCCGCTAACCAACCAACGCTACCGGGTGGTGGATGAACAGGGCCGGGACTGCCC TGACTGGGTGCCGGGTGAATTATGGATTGGCGGCATTGGGGTCGCGGAAGGCTATTTCAACGATCCCCTGCGTAGCGAGC AGCAATTTTTGACGCTCCCGGACGAGCGCTGGTATCGCACCGGCGATCTCGGCTGCTACTGGCCAGATGGCACAATCGAG TTCCTCGGTCGTCGCGACAAGCAGGTCAAAGTCGGAGGATATCGCATCGAGCTGGGCGAAATCGAAAGCGCGCTCAGCCA GCTGGCGGGGGTGAAACAAGCAACCGTTCTGGCGATCGGCGAAAAAGAAAAAACGCTGGCGGCATACGTTGTTCCTCAGG GCGAGGCTTTTTGCGTTACCGATCATCGGAACCCGGCACTGCCGCAGGCGTGGCACACGCTTGCGGGAACGTTGCCCTGT TGCGCCATCTCGCCAGAGATCTCCGCAGAACAGGTAGCCGATTTCCTTCAGCATCGCCTGCTAAAACTGAAGCCGGGTCA CACCGCTGGCGCCGATCCTCTCCCCCTGATGAACTCACTCGCTATCCAGCCGCGCTGGCAGGCCGTGGTGGAACGCTGGT TAGCATTTCTGGTGACACAACGGCGACTGAAGCCCGCTGCTGAAGGTTATCAGGTCTGCGCTGGTGAAGAACGCGAGGAT GAGCACCCGCACTTCAGCGGACATGATTTAACGTTATCGCAAATTCTTCGCGGTGCCCGTAACGAACTGTCGTTACTGAA CGACGCGCAGTGGTCGCCAGAAAGCCTGGCCTTTAACCATCCGGCCAGCGCCCCGTATATTCAGGAACTGGCGACAATTT GCCAACAGCTTGCACAGCGCTTACAGCGCCCGGTGCGCCTGCTTGAGGTGGGAACCCGCACTGGCCGCGCCGCAGAATCG CTGTTAGCACAGCTCAACGCCGGACAGATTGAGTATGTCGGGCTTGAGCAGAGCCAGGAGATGCTGCTGAGCGCCCGGCA GAGGCTCGCCCCCTGGCCTGGCGCCCGTCTGTCCCTCTGGAATGCAGACACGCTGGCGGCGCACGCTCACTCGGCGGACA TTATCTGGCTTAATAACGCCCTGCATCGTCTGCTGCCGGAAGATCCCGGGCTCCTTGCGACATTACAACAGCTTGCCGTT CCCGGCGCGCTGCTCTACGTGATGGAGTTTCGCCAGTTAACGCCGTCCGCCCTACTCAGCACGCTCCTGTTAACCAATGG GCAGCCGGAGGCCTTGCTGCATAACAGCGCCGACTGGGCGGCATTATTTAGCGCGGCCGCCTTCAACTGTCAGCATGGCG ATGAGGTCGCGGGGTTACAACGCTTCCTCGTACAATGTCCTGACAGGCAGGTGCGCCGCGATCCCCGTCAACTTCAGGCC GCCCTCGCCGGGCGTCTGCCGGGGTGGATGGTGCCGCAACGGATCGTCTTCCTCGACGCCTTACCGCTGACGGCTAACGG GAAAATTGACTACCAGGCGCTGAAGCGTCGTCATACCCCTGAAGCGGAAAACCCGGCCGAAGCGGATTTACCCCAGGGCG ACATTGAAAAACAGGTTGCCGCCCTCTGGCAGCAACTCTTATCAACTGGCAATGTCACCAGAGAAACCGACTTCTTCCAG CAAGGCGGCGATAGCCTGCTGGCGACCCGTCTGACCGGGCAACTTCATCAGGCAGGTTATGAAGCGCAATTAAGCGACCT GTTTAATCATCCCCGGCTGGCGGATTTTGCCGCCACGCTGCGGAAAACCGACGTCCCGGTCGAACAACCATTCGTCCACT CCCCTGAAGATCGCTACCAGCCCTTTGCGCTTACCGACGTGCAGCAGGCTTACCTGGTGGGGCGTCAGCCGGGCTTTGCC CTGGGCGGCGTCGGCTCACATTTCTTTGTTGAATTTGAAATTGCCGATCTGGATCTCACCCGGTTGGAGACGGTCTGGAA CCGATTAATCGCCCGCCACGATATGCTACGCGCCGTCGTGCGTGATGGACAGCAACAGGTGCTCGAACAGACGCCTCGCT GGGTAATACCCGCACACATCCTCCATACGCCTGAAGAGGCGTTGCAGGTGCGCGAAAAACTGGCCCATCAGGTACTCAAC CCCGAAGTCTGGCCGGTATTCGATCTCCAGGTCGGTTACGTTGACGGGATGCCCGCCCGCCTGTGGCTGTGTCTGGATAA CCTGTTACTTGACGGGCTGAGCATGCAGATTCTGTTGGCGGAGCTGGAGCACGGCTACCGCTACCCGCAACAGCTGCTTC CGCCGCTGCCCGTCACCTTCAGGGATTATCTGCAACAACCCTCGCTACAGTCGCCCAATCCAGATTCTCTGGCATGGTGG CAGGCGCAGCTTGATGATATTCCTCCGGCGCCTGCGTTGCCGCTGCGCTGCTTGCCTCAGGAGGTTGAAACACCGCGCTT CGCCCGCCTGAACGGCGCACTGGACAGCACGCGCTGGCATCGGCTGAAAAAACGGGCGGCTGACGCCCATCTCACCCCGT CGGCCGTACTGTTGTCGGTGTGGTCAACGGTTCTCTCTGCATGGAGTGCACAGCCTGAGTTCACGCTTAACCTTACGCTT TTCGACAGGCGACCGCTGCACCCGCAAATCAACCAGATTCTGGGCGATTTCACCTCGCTGATGCTGCTGAGCTGGCATCC CGGCGAAAGCTGGCTGCACAGCGCGCAGTCACTACAGCAGCGGCTGAGCCAGAACCTCAACCACCGCGATGTGTCAGCCA TCCGCGTGATGCGTCAACTGGCGCAACGGCAAAACGTGCCTGCCGTTCCGATGCCCGTCGTCTTTACCAGCGCACTGGGC TTTGAGCAGGATAACTTCCTCGCCCGGCGTAATCTGCTCAAACCGGTCTGGGGCATCTCCCAGACGCCGCAGGTCTGGCT CGATCACCAGATTTATGAATCCGAAGGCGAACTGCGCTTTAACTGGGATTTTGTCGCCGCGCTGTTTCCTGCCGGGCAGG TGGAGCGCCAGTTTGAACAGTATTGCGCATTGCTAAACCGAATGGCCGAGGATGAAAGCGGCTGGCAACTGCCGCTCGCC GCGCTGGTGCCTCCCGTTAAACACGCAGGGCAATGCGCAGAGCGCTCACCGCGCGTATGCCCTGAGCACTCTCAGCCACA CATTGCGGCGGACGAGAGCACCGTCAGCCTGATTTGCGACGCCTTCCGCGAGGTGGTTGGCGAGTCTGTCACGCCCGCAG AAAACTTCTTTGAGGCGGGCGCAACGTCGCTGAATCTGGTGCAACTGCACGTTTTGTTACAACGTCACGAATTTTCCACC CTGACGTTGCTTGACCTCTTCACCCACCCTTCTCCTGCTGCCCTGGCCGATTATCTGGCCGGCGTCGCCACGGTGGAGAA AACAAAACGACCTCGCCCTGTTCGCCGTCGTCAGCGGCGGATATAG Protein sequence : MPSGGRMISGAPSQDSLLPDNRHAADYQQLRERLIQELNLTPQQLHEESNLIQAGLDSIRLMRWLHWFRKNGYRLTLREL YAAPTLAAWNQLMLSRSPENAEEETPPDESSWPNMTESTPFPLTPVQHAYLTGRMPGQTLGGVGCHLYQEFEGHCLTASQ LEQAITTLLQRHPMLHIAFRPDGQQVWLPQPYWNGVTVHDLRHNDAESRQAYLDALRQRLSHRLLRVEIGETFDFQLTLL PDNRHRLHVNIDLLIMDASSFTLFFDELNALLAGESLPAIDTRYDFRSYLLHQQKINQPLRDDARAYWLAKASTLPPAPV LPLACEPATLCEVRNTRRRMIVPATRWHAFSNRAGEYGVTPTMALATCFSAVLARWGGLTRLLLNITLFDRQPLHPAVGA MLADFTNILLLDTACDGDTVSNLARKNQLTFTEDWEHRHWSGVELLRELKRQQRYPHGAPVVFTSNLGRSLYSSRAESPL GEPEWGISQTPQVWIDHLAFEHHGEVWLQWDSNDALFPPALVETLFDAYCQLINQLCDDESAWQKSFADMMPASQRAIRE RVNATGAPIPEGLLHEGIFRIALQQPQALAVTDMRYQWNYHELTDYARRCAGRLIECGVQPGDNVAITMSKGAGQLVAVL AVLLAGAVYVPVSLDQPAARREKIYADASVRLVLICQHDASAGSDDIPVLAWQQAIEAEPIANPVVRAPTQPAYIIYTSG STGTPKGVVISHRGALNTCCDINTRYQVGPHDRVLALSALHFDLSVYDIFGVLRAGGALVMVMENQRRDPHAWCELIQRH QVTLWNSVPALFDMLLTWCEGFADATPENLRAVMLSGDWIGLDLPARYRAFRPQGQFIAMGGATEASIWSNACEIHDVPA HWRSIPYGFPLTNQRYRVVDEQGRDCPDWVPGELWIGGIGVAEGYFNDPLRSEQQFLTLPDERWYRTGDLGCYWPDGTIE FLGRRDKQVKVGGYRIELGEIESALSQLAGVKQATVLAIGEKEKTLAAYVVPQGEAFCVTDHRNPALPQAWHTLAGTLPC CAISPEISAEQVADFLQHRLLKLKPGHTAGADPLPLMNSLAIQPRWQAVVERWLAFLVTQRRLKPAAEGYQVCAGEERED EHPHFSGHDLTLSQILRGARNELSLLNDAQWSPESLAFNHPASAPYIQELATICQQLAQRLQRPVRLLEVGTRTGRAAES LLAQLNAGQIEYVGLEQSQEMLLSARQRLAPWPGARLSLWNADTLAAHAHSADIIWLNNALHRLLPEDPGLLATLQQLAV PGALLYVMEFRQLTPSALLSTLLLTNGQPEALLHNSADWAALFSAAAFNCQHGDEVAGLQRFLVQCPDRQVRRDPRQLQA ALAGRLPGWMVPQRIVFLDALPLTANGKIDYQALKRRHTPEAENPAEADLPQGDIEKQVAALWQQLLSTGNVTRETDFFQ QGGDSLLATRLTGQLHQAGYEAQLSDLFNHPRLADFAATLRKTDVPVEQPFVHSPEDRYQPFALTDVQQAYLVGRQPGFA LGGVGSHFFVEFEIADLDLTRLETVWNRLIARHDMLRAVVRDGQQQVLEQTPRWVIPAHILHTPEEALQVREKLAHQVLN PEVWPVFDLQVGYVDGMPARLWLCLDNLLLDGLSMQILLAELEHGYRYPQQLLPPLPVTFRDYLQQPSLQSPNPDSLAWW QAQLDDIPPAPALPLRCLPQEVETPRFARLNGALDSTRWHRLKKRAADAHLTPSAVLLSVWSTVLSAWSAQPEFTLNLTL FDRRPLHPQINQILGDFTSLMLLSWHPGESWLHSAQSLQQRLSQNLNHRDVSAIRVMRQLAQRQNVPAVPMPVVFTSALG FEQDNFLARRNLLKPVWGISQTPQVWLDHQIYESEGELRFNWDFVAALFPAGQVERQFEQYCALLNRMAEDESGWQLPLA ALVPPVKHAGQCAERSPRVCPEHSQPHIAADESTVSLICDAFREVVGESVTPAENFFEAGATSLNLVQLHVLLQRHEFST LTLLDLFTHPSPAALADYLAGVATVEKTKRPRPVRRRQRRI |
Gene | GenBank Accn | Product | Virulance or Resistance | PAI or REI | Alignment Type | E-val | Identity |
irp2 | YP_001006815.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | YP_002346902.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | NP_669706.1 | HMWP2 nonribosomal peptide synthetase | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | YP_070124.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | YP_853075.1 | yersiniabactin biosynthetic protein | Virulence | PAI IV APEC-O1 | Protein | 0.0 | 99 |
irp2 | NP_993007.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | CAA21390.1 | - | Virulence | HPI | Protein | 0.0 | 99 |
PMI2599 | YP_002152317.1 | non-ribosomal peptide synthase | Not tested | Not named | Protein | 0.0 | 41 |