Name : irp2 (ECABU_c22410) Accession : YP_006106297.1 Strain : Escherichia coli ABU 83972 Genome accession: NC_017631 Putative virulence/resistance : Virulence Product : yersiniabactin biosynthetic protein Function : - COG functional category : - COG ID : - EC number : - Position : 2202531 - 2208638 bp Length : 6108 bp Strand : + Note : HMWP2 nonribosomal peptidesynthetase DNA sequence : ATGATTTCTGGCGCACCATCTCAGGATTCGCTGTTACCGGACAACCGCCACGCGGCTGATTACCAACAATTACGCGAGCG GCTCATACAGGAACTGAATTTAACGCCGCAGCAGTTACATGAAGAGAGCAACCTGATCCAGGCCGGCCTGGATTCCATAA GATTGATGAGATGGTTACACTGGTTTCGTAAAAATGGCTACCGCCTTACCCTTCGCGAGCTGTATGCCGCCCCCACGCTG GCGGCATGGAACCAGTTAATGCTCAGCCGGTCGCCGGAGAACGCGGAAGAAGAAACGCCGCCCGACGAATCATCCTGGCC GAACATGACCGAAAGTACCCCCTTCCCATTGACGCCAGTACAGCACGCCTACCTGACGGGCCGCATGCCGGGGCAGACGC TTGGCGGCGTGGGTTGCCACCTGTATCAGGAGTTTGAAGGCCATTGTCTGACGGCGTCGCAGCTGGAGCAGGCCATCACG ACCTTGCTGCAACGCCACCCAATGCTGCATATCGCCTTTCGCCCCGACGGGCAGCAGGTCTGGCTACCGCAACCTTACTG GAACGGCGTCACCGTTCATGATTTACGCCATAACGACGCTGAAAGCCGCCAGGCCTATCTGGACGCACTGCGCCAGCGCC TGAGCCACCGTCTTTTACGCGTGGAAATCGGCGAAACGTTTGATTTTCAGCTGACGCTCTTGCCGGACAATCGCCACCGC CTCCATGTCAATATTGACCTGCTGATTATGGATGCCTCCAGCTTTACGCTTTTCTTCGATGAGCTTAACGCCCTGCTGGC CGGAGAATCGCTGCCGGCTATCGACACCCGCTATGATTTCCGCTCGTATTTGCTGCACCAACAGAAGATCAATCAACCAC TGAGAGACGACGCGCGCGCTTACTGGCTGGCGAAAGCATCGACGCTTCCCCCCGCGCCCGTCTTGCCGCTGGCCTGCGAA CCCGCCACGCTACGTGAAGTCCGTAATACCCGACGCCGCATGATTGTTCCGGCAACACGCTGGCACGCCTTTAGCAACCG GGCCGGCGAGTATGGCGTGACGCCGACAATGGCGCTGGCGACCTGTTTTTCTGCCGTGCTGGCTCGCTGGGGCGGCCTGA CGCGTCTGCTGCTTAACATCACCTTATTCGACCGCCAGCCGCTGCACCCGGCGGTTGGCGCGATGCTTGCCGACTTCACC AATATTCTTCTGCTGGACACCACCTGCGATGGCGATACCGTCAGCAACCTGGCGCGTAAAAACCAGCTCACGTTTACGGA GGACTGGGAGCATCGCCACTGGTCCGGCGTCGAATTACTGCGTGAACTCAAACGCCAGCAGCGCTACCCCCACGGCGCCC CGGTGGTATTTACCAGCAATCTGGGGCGTTCCCTCTACAGCAGCCGCGCAGAATCGCCGTTGGGCGAGCCGGAATGGGGC ATCTCGCAAACGCCGCAGGTCTGGATAGATCATCTGGCGTTCGAGCATCACGGCGAGGTCTGGCTACAATGGGACAGCAA CGACGCGCTGTTCCCTCCGGCGTTAGTCGAAACATTGTTCGACGCCTACTGCCAGTTGATTAACCAACTCTGCGATGACG AAAGCGCCTGGCAAAAGCCGTTCGCAGATATGATGCCCGCCAGCCAGCGCGCGATACGCGAACGGGTCAACGCCACCGGC GCCCCCATTCCCGAAGGCTTGCTGCATGAAGGCATTTTCCGTATCGCTCTGCAACAGCCACAGGCGCTGGCGGTAACGGA CATGCGTTATCAGTGGAATTATCATGAGCTGACAGACTATGCCCGCCGTTGCGCGGGCAGGTTAATCGAGTGCGGGGTTC AGCCCGGCGATAATGTGGCTATCACGATGTCGAAAGGCGCAGGACAACTTGTTGCGGTTCTGGCCGTCCTGCTGGCCGGG GCGGTTTACGTTCCGGTTTCGCTGGATCAGCCTGCCGCACGGCGCGAGAAAATCTACGCTGACGCCAGCGTCCGGCTGGT GCTCATTTGTCAGCACGACGCCAGCGCCGGGTCAGACGATATTCCCGTCCTTGCCTGGCAGCAGGCCATTGAGGCGGAGC CGATCGCCAACCCGGTAGTACGCGCCCCCACGCAACCGGCCTACATTATCTACACCTCCGGCTCTACCGGTACGCCGAAA GGGGTAGTCATTTCTCACCGGGGAGCGCTTAACACCTGTTGCGATATCAATACCCGCTATCAGGTTGGCCCGCATGACAG GGTGCTGGCCCTCTCCGCCCTACATTTTGATTTATCGGTTTACGACATTTTTGGCGTACTGCGCGCGGGCGGCGCGCTGG TGATGGTGATGGAAAATCAACGGCGCGATCCTCACGCATGGTGTGAGCTGATCCAGCGCCATCAGGTCACGCTCTGGAAC AGCGTCCCGGCGCTGTTCGATATGCTGCTGACCTGGTGTGAAGGTTTCGCCGACGCCACGCCGGAAAACCTGCGCGCAGT GATGCTTTCCGGCGACTGGATCGGGCTTGACCTCCCCGCCCGTTATCGGGCCTTCCGGCCACAAGGACAATTTATCGCGA TGGGCGGTGCCACCGAGGCGTCTATCTGGTCTAACGCCTGCGAAATTCACGACGTCCCCGCCCACTGGCGCTCCATCCCT TACGGTTTTCCGCTAACCAACCAACGCTACCGGGTGGTGGATGAACAGGGCCGGGACTGCCCTGACTGGGTGCCGGGTGA ATTATGGATTGGCGGCATTGGGGTCGCGGAAGGCTATTTCAACGATCCCCTGCGTAGCGAGCAGCAATTTTTGACGCTCC CGGACGAGCGCTGGTATCGCACCGGCGATCTCGGCTGCTACTGGCCAGATGGCACAATCGAGTTCCTCGGTCGTCGCGAC AAGCAGGTCAAAGTCGGAGGATATCGCATCGAGCTGGGCGAAATCGAAAGCGCGCTCAGCCAGCTGGCGGGGGTGAAACA AGCAACCGTTCTGGCGATCGGCGAAAAAGAAAAAACGCTGGCGGCATACGTTGTTCCTCAGGGCGAGGCTTTTTGCGTTA CCGATCATCGGAACCCGGCACTGCCGCAGGCGTGGCACACGCTTGCGGGAACGTTGCCCTGTTGCGCCATCTCGCCAGAG ATCTCCGCAGAACAGGTAGCCGATTTCCTTCAGCATCGCCTGCTAAAACTGAAGCCGGGTCACACCGCTGGCGCCGATCC TCTCCCCCTGATGAACTCACTCGCTATCCAGCCGCGCTGGCAGGCCGTGGTGGAACGCTGGTTAGCATTTCTGGTGACAC AACGGCGACTGAAGCCCGCTGCTGAAGGTTATCAGGTCTGCGCTGGTGAAGAACGCGAGGATGAGCACCCGCACTTCAGC GGACATGATTTAACGTTATCGCAAATTCTTCGCGGTGCCCGTAACGAACTGTCGTTACTGAACGACGCGCAGTGGTCGCC GGAAAGCCTGGCCTTTAACCATCCGGCCAGCGCCCCGTATATTCAGGAACTGGCGACAATTTGCCAACAGCTTGCACAGC GCTTACAGCGCCCGGTGCGCCTGCTTGAGGTGGGAACCCGCACTGGCCGCGCCGCAGAATCGCTGTTAGCACAGCTCAAC GCCGGACAGATTGAGTATGTCGGGCTTGAGCAGAGCCAGGAGATGCTGCTGAGCGCCCGGCAGAGGCTCGCCCCCTGGCC TGGCGCCCGTCTGTCCCTCTGGAATGCAGACACGCTGGCGGCGCACGCTCACTCGGCGGACATTATCTGGCTTAATAACG CCCTGCATCGTCTGCTGCCGGAAGATCCCGGGCTCCTTGCGACATTACAACAGCTTGCCGTTCCCGGCGCGCTGCTCTAC GTGATGGAGTTTCGCCAGTTAACGCCGTCCGCCCTACTCAGCACGCTCCTGTTAACCAATGGGCAGCCGGAGGCCTTGCT GCATAACAGCGCCGACTGGGCGGCATTATTTAGCGCGGCCGCCTTCAACTGTCAGCATGGCGATGAGGTCGCGGGGTTAC AACGCTTCCTCGTACAATGTCCTGACAGGCAGGTGCGCCGCGATCCCCGTCAACTTCAGGCCGCCCTCGCCGGGCGTCTG CCGGGGTGGATGGTGCCGCAACGGATCGTCTTCCTCGACGCCTTACCGCTGACGGCTAACGGGAAAATTGACTACCAGGC GCTGAAGCGTCGTCATACCCCTGAAGCGGAAAACCCGGCCGAAGCGGATTTACCCCAGGGCGACATTGAAAAACAGGTTG CCGCCCTCTGGCAGCAACTCTTATCAACTGGCAATGTCACCAGAGAAACCGACTTCTTCCAGCAAGGCGGCGATAGCCTG CTGGCGACCCGTCTGACCGGGCAACTTCATCAGGCAGGTTATGAAGCGCAATTAAGCGACCTGTTTAATCATCCCCGGCT GGCGGATTTTGCCGCCACGCTGCGGAAAACCGACGTCCCGGTCGAACAACCATTCGTCCACTCCCCTGAAGATCGCTACC AGCCCTTTGCGCTTACCGACGTGCAGCAGGCTTACCTGGTGGGGCGTCAGCCGGGCTTTGCCCTGGGCGGCGTCGGCTCA CATTTCTTTGTTGAATTTGAAATTGCCGATCTGGATCTCACCCGGCTGGAGACGGTCTGGAACCGATTAATCGCCCGCCA CGATATGCTACGCGCCGTCGTGCGTGATGGACAGCAACAGGTGCTCGAACAGACGCCTCGCTGGGTAATACCCGCACACA TCCTCCATACGCCTGAAGAGGCGTTGCAGGTGCGCGAAAAACTGGCCCATCAGGTACTCAACCCCGAAGTCTGGCCGGTA TTCGATCTCCAGGTCGGTTACGTTGACGGGATGCCCGCCCGCCTGTGGCTGTGTCTGGATAACCTGTTACTTGACGGGCT GAGCATGCAGATTCTGTTGGCGGAGCTGGAACATGGCTACCAATATCCGCAACAGTTGCCTCCGCCGCTACCCGTCACCT ACAGGGATTACCTGCAACAACCCGCGATCCAGTCGCTTAACGCAGATTCTCTGGCATGGTGGCAGGCGCAACTTGATGAT ATTCCTCCGGCGCCTGCGTTGCCGCTGCGCTGCTTGCCTCAGGAGGTTGAAACACCGCGCTTCGCCCGCCTGAACGGCGC GCTGGACAGCACGCGCTGGCATCGGCTGAAAAAACGGGCGGCTGACGCCCATCTCACCCCGTCGGCCGTACTGTTGTCGG TGTGGTCAACGGTTCTCTCTGCATGGAGTGCACAGCCTGAGTTCACGCTTAACCTTACGCTTTTCGACAGGCGACCGCTG CACCCGCAAATCAACCAGATTCTGGGCGATTTCACCTCGCTGATGCTGCTGAGCTGGCATCCCGGCGAAAGCTGGCTGCA CAGCGCGCAGTCACTACAGCAGCGGCTGAGCCAGAACCTCAACCACCGCGATGTGTCAGCCATCCGCGTGATGCGTCAAC TGGCGCAACGGCAAAACGTGCCTGCCGTTCCGATGCCCGTCGTCTTTACCAGCGCACTGGGCTTTGAGCAGGATAACTTC CTCGCCCGGCGTAATCTGCTCAAACCGGTCTGGGGCATCTCCCAGACGCCGCAGGTCTGGCTCGATCACCAGATTTATGA ATCCGAAGGCGAACTGCGCTTTAACTGGGATTTTGTCGCCGCGCTGTTTCCTGCCGGGCAGGTGGAGCGCCAGTTTGAAC AGTATTGCGCATTGCTAAACCGAATGGCCGAGGATGAAAGCGGCTGGCAACTGCCGCTCGCCGCGCTGGTGCCTCCCGTT AAACACGCAGGGCAATGCGCAGAGCGCTCACCGCGCGTATGCCCTGAGCACTCTCAGCCACACATTGCGGCGGACGAGAG CACCGTCAGCCTGATTTGCGACGCCTTCCGCGAGGTGGTTGGCGAGTCTGTCACGCCCGCAGAAAACTTCTTTGAGGCGG GCGCAACGTCGCTGAATCTGGTGCAACTGCACGTTTTGTTACAACGTCACGAATTTTCCACCCTGACGTTGCTTGACCTC TTCACCCACCCTTCTCCTGCTGCCCTGGCCGATTATCTGGCCGGCGTCGCCACGGTGGAGAAAACAAAACGACCTCGCCC TGTTCGCCGTCGTCAGCGGCGGATATAG Protein sequence : MISGAPSQDSLLPDNRHAADYQQLRERLIQELNLTPQQLHEESNLIQAGLDSIRLMRWLHWFRKNGYRLTLRELYAAPTL AAWNQLMLSRSPENAEEETPPDESSWPNMTESTPFPLTPVQHAYLTGRMPGQTLGGVGCHLYQEFEGHCLTASQLEQAIT TLLQRHPMLHIAFRPDGQQVWLPQPYWNGVTVHDLRHNDAESRQAYLDALRQRLSHRLLRVEIGETFDFQLTLLPDNRHR LHVNIDLLIMDASSFTLFFDELNALLAGESLPAIDTRYDFRSYLLHQQKINQPLRDDARAYWLAKASTLPPAPVLPLACE PATLREVRNTRRRMIVPATRWHAFSNRAGEYGVTPTMALATCFSAVLARWGGLTRLLLNITLFDRQPLHPAVGAMLADFT NILLLDTTCDGDTVSNLARKNQLTFTEDWEHRHWSGVELLRELKRQQRYPHGAPVVFTSNLGRSLYSSRAESPLGEPEWG ISQTPQVWIDHLAFEHHGEVWLQWDSNDALFPPALVETLFDAYCQLINQLCDDESAWQKPFADMMPASQRAIRERVNATG APIPEGLLHEGIFRIALQQPQALAVTDMRYQWNYHELTDYARRCAGRLIECGVQPGDNVAITMSKGAGQLVAVLAVLLAG AVYVPVSLDQPAARREKIYADASVRLVLICQHDASAGSDDIPVLAWQQAIEAEPIANPVVRAPTQPAYIIYTSGSTGTPK GVVISHRGALNTCCDINTRYQVGPHDRVLALSALHFDLSVYDIFGVLRAGGALVMVMENQRRDPHAWCELIQRHQVTLWN SVPALFDMLLTWCEGFADATPENLRAVMLSGDWIGLDLPARYRAFRPQGQFIAMGGATEASIWSNACEIHDVPAHWRSIP YGFPLTNQRYRVVDEQGRDCPDWVPGELWIGGIGVAEGYFNDPLRSEQQFLTLPDERWYRTGDLGCYWPDGTIEFLGRRD KQVKVGGYRIELGEIESALSQLAGVKQATVLAIGEKEKTLAAYVVPQGEAFCVTDHRNPALPQAWHTLAGTLPCCAISPE ISAEQVADFLQHRLLKLKPGHTAGADPLPLMNSLAIQPRWQAVVERWLAFLVTQRRLKPAAEGYQVCAGEEREDEHPHFS GHDLTLSQILRGARNELSLLNDAQWSPESLAFNHPASAPYIQELATICQQLAQRLQRPVRLLEVGTRTGRAAESLLAQLN AGQIEYVGLEQSQEMLLSARQRLAPWPGARLSLWNADTLAAHAHSADIIWLNNALHRLLPEDPGLLATLQQLAVPGALLY VMEFRQLTPSALLSTLLLTNGQPEALLHNSADWAALFSAAAFNCQHGDEVAGLQRFLVQCPDRQVRRDPRQLQAALAGRL PGWMVPQRIVFLDALPLTANGKIDYQALKRRHTPEAENPAEADLPQGDIEKQVAALWQQLLSTGNVTRETDFFQQGGDSL LATRLTGQLHQAGYEAQLSDLFNHPRLADFAATLRKTDVPVEQPFVHSPEDRYQPFALTDVQQAYLVGRQPGFALGGVGS HFFVEFEIADLDLTRLETVWNRLIARHDMLRAVVRDGQQQVLEQTPRWVIPAHILHTPEEALQVREKLAHQVLNPEVWPV FDLQVGYVDGMPARLWLCLDNLLLDGLSMQILLAELEHGYQYPQQLPPPLPVTYRDYLQQPAIQSLNADSLAWWQAQLDD IPPAPALPLRCLPQEVETPRFARLNGALDSTRWHRLKKRAADAHLTPSAVLLSVWSTVLSAWSAQPEFTLNLTLFDRRPL HPQINQILGDFTSLMLLSWHPGESWLHSAQSLQQRLSQNLNHRDVSAIRVMRQLAQRQNVPAVPMPVVFTSALGFEQDNF LARRNLLKPVWGISQTPQVWLDHQIYESEGELRFNWDFVAALFPAGQVERQFEQYCALLNRMAEDESGWQLPLAALVPPV KHAGQCAERSPRVCPEHSQPHIAADESTVSLICDAFREVVGESVTPAENFFEAGATSLNLVQLHVLLQRHEFSTLTLLDL FTHPSPAALADYLAGVATVEKTKRPRPVRRRQRRI |
Gene | GenBank Accn | Product | Virulance or Resistance | PAI or REI | Alignment Type | E-val | Identity |
irp2 | YP_001006815.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | YP_002346902.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | NP_669706.1 | HMWP2 nonribosomal peptide synthetase | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | YP_070124.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | YP_853075.1 | yersiniabactin biosynthetic protein | Virulence | PAI IV APEC-O1 | Protein | 0.0 | 99 |
irp2 | NP_993007.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | CAA21390.1 | - | Virulence | HPI | Protein | 0.0 | 99 |
PMI2599 | YP_002152317.1 | non-ribosomal peptide synthase | Not tested | Not named | Protein | 0.0 | 41 |