Name : ECP_1942 (ECP_1942) Accession : YP_669843.1 Strain : Escherichia coli 536 Genome accession: NC_008253 Putative virulence/resistance : Virulence Product : yersiniabactin biosynthetic protein Function : - COG functional category : Q : Secondary metabolites biosynthesis, transport and catabolism COG ID : COG1020 EC number : - Position : 1980287 - 1986394 bp Length : 6108 bp Strand : + Note : HMWP2 nonribosomal peptidesynthetase DNA sequence : ATGATTTCTGGCGCACCATCTCAGGATTCGCTGTTACCGGACAACCGCCACGCGGCTGATTACCAACAATTACGCGAGCG GCTCATACAGGAACTGAATTTAACGCCGCAGCAGTTACATGAAGAGAGCAACCTGATCCAGGCCGGCCTGGATTCCATAA GATTGATGAGATGGTTACACTGGTTTCGTAAAAATGGCTACCGCCTTACCCTTCGCGAGCTGTATGCCGCCCCCACGCTG GCGGCATGGAACCAGTTAATGCTCAGCCGGTCGCCGGAGAACGCGGAAGAAGAAACGCCGCCCGACGAATCATCCTGGCC GAACATGACCGAAAGTACCCCCTTCCCATTGACGCCAGTACAGCACGCCTACCTGACGGGCCGCATGCCGGGGCAGACGC TTGGCGGCGTGGGTTGCCACCTGTATCAGGAGTTTGAAGGCCATTGTCTGACGGCGTCGCAGCTGGAGCAGGCCATCACG ACCTTGCTGCAACGCCACCCAATGCTGCATATCGCCTTTCGCCCCGACGGGCAGCAGGTCTGGCTACCGCAACCTTACTG GAACGGCGTCACCGTTCATGATTTACGCCATAACGACGCTGAAAGCCGCCAGGCCTATCTGGACGCACTGCGCCAGCGCC TGAGCCACCGTCTTTTACGCGTGGAAATCGGCGAAACGTTTGATTTTCAGCTGACGCTCTTGCCGGACAATCGCCACCGC CTCCATGTCAATATTGACCTGCTGATTATGGATGCCTCCAGCTTTACGCTTTTCTTCGATGAGCTTAACGCCCTGCTGGC CGGAGAATCGCTGCCGGCTATCGACACCCGCTATGATTTCCGCTCGTATTTGCTGCACCAACAGAAGATCAATCAACCAC TGAGAGACGACGCGCGCGCTTACTGGCTGGCGAAAGCATCGACGCTTCCCCCCGCGCCCGTCTTGCCGCTGGCCTGCGAA CCCGCCACGCTATGTGAAGTCCGTAATACCCGACGCCGCATGATTGTTCCGGCAACACGCTGGCACGCCTTTAGCAACCG GGCCGGCGAGTATGGCGTGACGCCGACAATGGCGCTGGCGACCTGTTTTTCTGCCGTGCTGGCTCGCTGGGGCGGCCTGA CGCGTCTGCTGCTTAACATCACCTTATTCGACCGCCAGCCGCTGCACCCGGCGGTTGGCGCGATGCTTGCCGACTTCACC AATATTCTTCTGCTGGACACCGCCTGCGATGGCGATACCGTCAGCAACCTGGCGCGTAAAAACCAGCTCACGTTTACGGA GGACTGGGAGCATCGCCACTGGTCCGGCGTCGAATTACTCCGTGAACTCAAACGCCAGCAGCGCTACCCCCACGGCGCCC CGGTGGTATTTACCAGCAATCTGGGGCGTTCCCTCTACAGCAGCCGCGCAGAATCGCCGTTGGGCGAGCCGGAATGGGGC ATCTCGCAAACGCCGCAGGTCTGGATAGATCATCTGGCGTTCGAGCATCACGGCGAGGTCTGGCTACAATGGGACAGCAA 
CGACGCGCTGTTCCCTCCGGCGTTAGTCGAAACATTGTTCGACGCCTACTGCCAGTTGATTAACCAACTCTGCGATGACG AAAGCGCCTGGCAAAAGCCGTTCGCAGATATGATGCCCGCCAGCCAGCGCGCGATACGCGAACGGGTCAACGCCACCGGT GCCCCCATTCCCGAAGGCTTGCTGCATGAAGGCATTTTCCGTATCGCTCTGCAACAGCCGCAGGCGCTGGCGGTAACGGA CATGCGTTATCAGTGGAATTATCATGAGCTGACAGACTATGCCCGCCGTTGCGCGGGCAGGTTAATCGAGTGCGGGGTTC AGCCCGGCGATAATGTGGCTATCACGATGTCGAAAGGCGCAGGACAACTTGTTGCGGTTCTGGCCGTCCTGCTGGCCGGG GCGGTTTACGTTCCGGTTTCGCTGGATCAGCCTGCCGCACGGCGCGAGAAAATCTACGCTGACGCCAGCGTCCGGCTGGT GCTCATTTGTCAGCACGACGCCAGCGCCGGGTCAGACGATATTCCCGCCCTTGCCTGGCAGCAGGCCATTGAGGCGGAGC CGATCGCCAACCCGGTAGTACGCGCCCCCACGCAACCGGCCTACATTATCTACACCTCCGGCTCTACCGGTACGCCGAAA GGGGTAGTCATTTCTCACCGGGGAGCGCTTAACACCTGTTGCGATATCAATACCCGCTATCAGGTTGGCCCGCATGACAG GGTGCTGGCCCTCTCCGCCCTACATTTTGATTTATCGGTTTACGACATTTTTGGCGTACTGCGCGCGGGCGGCGCGCTGG TGATGGTGATGGAAAATCAACGGCGCGATCCTCACGCATGGTGTGAGCTGATCCAGCGCCATCAGGTCACGCTCTGGAAC AGCGTCCCGGCGCTGTTCGATATGCTGCTGACCTGGTGTGAAGGTTTCGCCGACGCCACGCCGGAAAACCTGCGCGCAGT GATGCTTTCCGGCGACTGGATCGGGCTTGACCTCCCCGCCCGTTATCGGGCCTTCCGGCCACAAGGACAATTTATCGCGA TGGGCGGCGCCACCGAGGCGTCTATCTGGTCTAACGCCTGCGAAATTCACGACGTCCCCGCCCACTGGCGCTCCATCCCT TACGGTTTTCCGCTAACCAACCAACGCTACCGGGTGGTGGATGAACAGGGCCGGGACTGCCCTGACTGGGTGCCGGGTGA ATTATGGATTGGCGGCATTGGGGTCGCGGAAGGCTATTTCAACGATCCCCTGCGTAGCGAGCAGCAATTTTTGACGCTCC CGGACGAGCGCTGGTATCGCACCGGCGATCTCGGCTGCTACTGGCCAGATGGCACAATCGAGTTCCTCGGTCGTCGCGAC AAGCAGGTCAAAGTCGGAGGATATCGCATCGAGCTGGGCGAAATCGAAAGCGCGCTCAGCCAGCTGGCGGGGGTGAAACA AGCAACCGTTCTGGCGATCGGCGAAAAAGAAAAAACGCTGGCGGCATACGTTGTTCCTCAGGGCGAGGCTTTTTGCGTTA CCGATCATCGGAACCCGGCACTGCCGCAGGCGTGGCACACGCTTGCGGGAACGTTGCCCTGTTGCGCCATCTCGCCAGAG ATCTCCGCAGAACAGGTAGCCGATTTCCTTCAGCATCGCCTGCTAAAACTGAAGCCGGGTCACACCGCTGGCGCCGATCC TCTCCCCCTGATGAACTCACTCGCTATCCAGCCGCGCTGGCAGGCCGTGGTGGAACGCTGGTTAGCATTTCTGGTGACAC AACGGCGACTGAAGCCCGCTGCTGAAGGTTATCAGGTCTGCGCTGGTGAAGAACGCGAGGATGAGCACCCGCACTTCAGC GGACATGATTTAACGTTATCGCAAATTCTTCGCGGTGCCCGTAACGAACTGTCGTTACTGAACGACGCGCAGTGGTCCCC 
GGAAAGCCTGGCCTTTAACCATCCGGCCAGCGCCCCGTATATTCAGGAACTGGCGACAATTTGCCAACAGCTTGCACAGC GCTTACAGCGCCCGGTACGCCTGCTTGAGGTGGGAACCCGCACTGGCCGCGCCGCAGAATCGCTGTTAGCACAGCTCAAC GCCGGACAGATTGAGTATGTCGGGCTTGAGCAGAGCCAGGAGATGCTGCTGAGCGCCCGGCAGAGGCTCGCCCCCTGGCC TGGCGCCCGTCTGTCCCTCTGGAATGCAGACACGCTGGCGGCGCACGCTCACTCGGCGGACATTATCTGGCTTAATAACG CCCTGCATCGTCTGCTGCCGGAAGATCCCGGGCTCCTTGCGACATTACAACAGCTTGCCGTTCCCGGCGCGCTGCTCTAC GTGATGGAGTTTCGCCAGTTAACGCCGTCCGCCCTACTCAGCACGCTCCTGTTAACCAATGGGCAGCCGGAGGCCTTGCT GCATAACAGCGCCGACTGGGCGGCATTATTTAGCGCGGCCGGCTTCAACTGTCAGCATGGCGATGAGGTCGCGGGGTTAC AACGCTTCCTCGTACAATGTCCTGACAGGCAGGTGCGCCGCGATCCCCGTCAACTTCAGGCCGCCCTCGCCGGGCGTCTG CCGGGGTGGATGGTGCCGCAACGGATCGTCTTCCTCGACGCCTTACCGCTGACGGCTAACGGGAAAATTGACTACCAGGC GCTGAAGCGTCGTCATACCCCTGAAGCGGAAAACCCGGCCGAAGCGGATTTACCCCAGGGCGACATTGAAAAACAGGTTG CCGCCCTCTGGCAGCAACTCTTATCAACTGGCAATGTCACCAGAGAAACCGACTTCTTCCAGCAAGGCGGCGATAGCCTG CTGGCGACCCGTCTGACCGGGCAACTTCATCAGGCAGGTTATGAAGCGCAATTAAGCGACCTGTTTAATCATCCCCGGCT GGCGGATTTTGCCGCCACGCTGCGGAAAACCGACGTCCCGGTCGAACAACCATTCGTCCACTCCCCTGAAGATCGCTACC AGCCCTTTGCGCTTACCGACGTGCAGCAGGCTTACCTGGTGGGGCGTCAGCCGGGCTTTGCCCTGGGCGGCGTCGGCTCA CATTTCTTTGTTGAATTTGAAATTGCCGATCTGGATCTCACCCGGTTGGAGACGGTCTGGAACCGATTAATCGCCCGCCA CGATATGCTACGCGCCGTCGTGCGTGATGGACAGCAACAGGTGCTCGAACAGACGCCTCGCTGGGTAATACCCGCACACA TCCTCCATACGCCTGAAGAGGCGTTGCAGGTGCGCGAAAAACTGGCCCATCAGGTACTCAACCCCGAAGTCTGGCCGGTA TTCGATCTCCAGGTCGGTTACGTTGACGGGATGCCCGCCCGCCTGTGGCTGTGTCTGGATAACCTGTTACTTGACGGGCT GAGCATGCAGATTCTGTTGGCGGAGCTGGAACATGGCTACCAATATCCGCAACAGTTGCCTCCGCCGCTACCCGTCACCT ACAGGGATTACCTGCAACAACCCGCGATCCAGTCGCTTAACGCAGATTCTCTGGCATGGTGGCAGGCGCAACTTGATGAT ATTCCTCCGGCTCCTGCATTGCCGCTACGTTGCATGCCTCAGGACGTCGAAACACCGCGCTTCGCCCGCCTGAACGGCGC GCTGGACAGCACGCGCTGGCATCGGCTGAAAAAACGGGCGGCTGACGCCCATCTCACCCCGTCGGCCGTACTGTTGTCGG TGTGGTCAACGGTTCTCTCTGCATGGAGTGCACAGCCTGAGTTCACGCTTAACCTTACGCTTTTCGACAGGCGACCGCTG CACCCGCAAATCAACCAGATTCTGGGCGATTTCACCTCGCTGATGCTGCTGAGCTGGCATCCCGGCGAAAGCTGGCTGCA 
CAGCGCGCAGTCACTACAGCAGCGGCTGAGCCAGAACCTCAACCACCGCGATGTGTCAGCCATCCGCGTGATGCGTCAAC TGGCGCAACGGCAAAACGTGTCTGCCGTTCCGATGCCCGTCGTCTTTACCAGCGCGCTGGGCTTTGAGCAGGATAACTTC CTCGCCCGGCGTAATCTGCTCAAACCGGTCTGGGGCATCTCCCAGACGCCGCAGGTCTGGCTCGATCACCAGGTTTATGA ATCCGAAGGCGAACTGCGCTTTAACTGGGATTTTGTCGCCGCGCTGTTTCCTGCCGGGCAGGTGGAACGTCAGTTTGAGC AGTATTGCACCTTGCTAAACCGAATGACCGAGGATGAAAACAGCTGGCATTTGCCACTCGCCGCGCTGGTGCCTCCCGTT AAGCAGGCGGAGCAAGGTACAGAGCGCACATCGCGCGTATGCCCTGAGCACTCTCAGTCACACATTGCGGCGGACGAGAG CACCGTCAGCCTGATTTGCGACGCCTTCCGCGAGGTGGTTGGCGAGTCTGTCACGCCCGCACAAAACTTCTTTGAGGCGG GCGCAACGTCGCTGAATCTGGTGCAACTGCACGTTTTGTTACAACGTCACGAATTTTCGACTCTGACGTTACTTGACCTC TTTACCCACCCTTCTCCTGCAGCCCTGGCCGCTTATCTGACCAGTGTCGCCACGGTGGAGAAAACCAAACGCTCTCGCCC TGTTCGCCGTCGTCAGCGGCGGATATAG Protein sequence : MISGAPSQDSLLPDNRHAADYQQLRERLIQELNLTPQQLHEESNLIQAGLDSIRLMRWLHWFRKNGYRLTLRELYAAPTL AAWNQLMLSRSPENAEEETPPDESSWPNMTESTPFPLTPVQHAYLTGRMPGQTLGGVGCHLYQEFEGHCLTASQLEQAIT TLLQRHPMLHIAFRPDGQQVWLPQPYWNGVTVHDLRHNDAESRQAYLDALRQRLSHRLLRVEIGETFDFQLTLLPDNRHR LHVNIDLLIMDASSFTLFFDELNALLAGESLPAIDTRYDFRSYLLHQQKINQPLRDDARAYWLAKASTLPPAPVLPLACE PATLCEVRNTRRRMIVPATRWHAFSNRAGEYGVTPTMALATCFSAVLARWGGLTRLLLNITLFDRQPLHPAVGAMLADFT NILLLDTACDGDTVSNLARKNQLTFTEDWEHRHWSGVELLRELKRQQRYPHGAPVVFTSNLGRSLYSSRAESPLGEPEWG ISQTPQVWIDHLAFEHHGEVWLQWDSNDALFPPALVETLFDAYCQLINQLCDDESAWQKPFADMMPASQRAIRERVNATG APIPEGLLHEGIFRIALQQPQALAVTDMRYQWNYHELTDYARRCAGRLIECGVQPGDNVAITMSKGAGQLVAVLAVLLAG AVYVPVSLDQPAARREKIYADASVRLVLICQHDASAGSDDIPALAWQQAIEAEPIANPVVRAPTQPAYIIYTSGSTGTPK GVVISHRGALNTCCDINTRYQVGPHDRVLALSALHFDLSVYDIFGVLRAGGALVMVMENQRRDPHAWCELIQRHQVTLWN SVPALFDMLLTWCEGFADATPENLRAVMLSGDWIGLDLPARYRAFRPQGQFIAMGGATEASIWSNACEIHDVPAHWRSIP YGFPLTNQRYRVVDEQGRDCPDWVPGELWIGGIGVAEGYFNDPLRSEQQFLTLPDERWYRTGDLGCYWPDGTIEFLGRRD KQVKVGGYRIELGEIESALSQLAGVKQATVLAIGEKEKTLAAYVVPQGEAFCVTDHRNPALPQAWHTLAGTLPCCAISPE ISAEQVADFLQHRLLKLKPGHTAGADPLPLMNSLAIQPRWQAVVERWLAFLVTQRRLKPAAEGYQVCAGEEREDEHPHFS GHDLTLSQILRGARNELSLLNDAQWSPESLAFNHPASAPYIQELATICQQLAQRLQRPVRLLEVGTRTGRAAESLLAQLN 
AGQIEYVGLEQSQEMLLSARQRLAPWPGARLSLWNADTLAAHAHSADIIWLNNALHRLLPEDPGLLATLQQLAVPGALLY VMEFRQLTPSALLSTLLLTNGQPEALLHNSADWAALFSAAGFNCQHGDEVAGLQRFLVQCPDRQVRRDPRQLQAALAGRL PGWMVPQRIVFLDALPLTANGKIDYQALKRRHTPEAENPAEADLPQGDIEKQVAALWQQLLSTGNVTRETDFFQQGGDSL LATRLTGQLHQAGYEAQLSDLFNHPRLADFAATLRKTDVPVEQPFVHSPEDRYQPFALTDVQQAYLVGRQPGFALGGVGS HFFVEFEIADLDLTRLETVWNRLIARHDMLRAVVRDGQQQVLEQTPRWVIPAHILHTPEEALQVREKLAHQVLNPEVWPV FDLQVGYVDGMPARLWLCLDNLLLDGLSMQILLAELEHGYQYPQQLPPPLPVTYRDYLQQPAIQSLNADSLAWWQAQLDD IPPAPALPLRCMPQDVETPRFARLNGALDSTRWHRLKKRAADAHLTPSAVLLSVWSTVLSAWSAQPEFTLNLTLFDRRPL HPQINQILGDFTSLMLLSWHPGESWLHSAQSLQQRLSQNLNHRDVSAIRVMRQLAQRQNVSAVPMPVVFTSALGFEQDNF LARRNLLKPVWGISQTPQVWLDHQVYESEGELRFNWDFVAALFPAGQVERQFEQYCTLLNRMTEDENSWHLPLAALVPPV KQAEQGTERTSRVCPEHSQSHIAADESTVSLICDAFREVVGESVTPAQNFFEAGATSLNLVQLHVLLQRHEFSTLTLLDL FTHPSPAALAAYLTSVATVEKTKRSRPVRRRQRRI |
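
The coordinates, length, and sequences listed in this record can be cross-checked with simple arithmetic. The sketch below assumes the Position field uses 1-based inclusive coordinates and that the coding sequence ends in a stop codon (the listed DNA ends in TAG); the function names are illustrative and are not part of any database tooling.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_bp: int, has_stop: bool = True) -> int:
    """Number of residues encoded by a CDS of cds_bp base pairs."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop else codons


# Values taken from the Position field of the record above.
start, end = 1980287, 1986394
length_bp = feature_length(start, end)
print(length_bp)                           # 6108, matching the Length field
print(expected_protein_length(length_bp))  # 2035
```

Under these assumptions, the 6108 bp reading frame corresponds to 2036 codons, that is, 2035 residues plus the stop codon.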
Gene | GenBank Accession | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%) |
irp2 | YP_002346902.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | NP_669706.1 | HMWP2 nonribosomal peptide synthetase | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | YP_070124.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | CAA21390.1 | - | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | YP_853075.1 | yersiniabactin biosynthetic protein | Virulence | PAI IV APEC-O1 | Protein | 0.0 | 99 |
irp2 | NP_993007.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | YP_001006815.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 98 |
PMI2599 | YP_002152317.1 | non-ribosomal peptide synthase | Not tested | Not named | Protein | 0.0 | 41 |
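
A homolog table in this pipe-delimited layout can be loaded and filtered with a few lines of Python. The following is a minimal sketch, not a tool provided by the database: the field names in `FIELDS`, the `parse_hits` helper, and the 90% identity cutoff are assumptions chosen for illustration, and only three of the rows above are included to keep the example short.

```python
# Three rows copied from the table above (header omitted).
HITS = """\
irp2 | YP_002346902.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | YP_001006815.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 98 |
PMI2599 | YP_002152317.1 | non-ribosomal peptide synthase | Not tested | Not named | Protein | 0.0 | 41 |
"""

# Hypothetical field names, one per table column.
FIELDS = ("gene", "accession", "product", "category",
          "island", "alignment_type", "e_value", "identity")


def parse_hits(text: str) -> list[dict]:
    """Parse pipe-delimited hit rows into dictionaries."""
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Drop the trailing delimiter, then split into cells.
        cells = [cell.strip() for cell in line.rstrip("|").split("|")]
        if len(cells) != len(FIELDS):
            continue  # skip malformed rows
        row = dict(zip(FIELDS, cells))
        row["e_value"] = float(row["e_value"])
        row["identity"] = int(row["identity"])
        rows.append(row)
    return rows


hits = parse_hits(HITS)
# Keep hits at or above an arbitrary 90% identity cutoff.
close_homologs = [h["accession"] for h in hits if h["identity"] >= 90]
print(close_homologs)  # ['YP_002346902.1', 'YP_001006815.1']
```

With this cutoff, the irp2 entries are retained and the PMI2599 hit at 41% identity is excluded.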