Name : ECP_1943 (ECP_1943) Accession : YP_669844.1 Strain : Escherichia coli 536 Genome accession: NC_008253 Putative virulence/resistance : Virulence Product : yersiniabactin biosynthetic protein Function : - COG functional category : Q : Secondary metabolites biosynthesis, transport and catabolism COG ID : COG3321 EC number : - Position : 1986482 - 1995973 bp Length : 9492 bp Strand : + Note : HMWP1nonribosomal peptide/polyketide synthase DNA sequence : ATGGATAACTTGCGCTTCTCTTCTGCGCCAACAGCAGATTCCATTGATGCATCGATCGTTCAACACTACCCGGACTGCGA ACCTGTCGCGGTTATCGGCTACGCCTGCCATTTTCCTGAATCGCCGGATGGCGAAACGTTCTGGCAAAATCTGCTGGAAG GTCGTGAATGCAGCCGACGCTTTACGCGCGAAGAGCTTCTGGCCGTCGGTCTGGATGCCGCCATCATTGACGATCCTCAT TATGTCAATATCGGTACGGTGTTAGACAACGCCGACTGCTTCGACGCCACCCTGTTTGGCTATTCGCGACAGGAAGCGGA GTCGATGGACCCGCAGCAGCGCCTGTTTTTGCAGGCGGTCTGGCATGCGCTGGAACATGCCGGTTATGCCCCCGGCGCCG TCCCCCATAAGACCGGCGTTTTCGCCTCTTCCCGGATGAGTACCTACCCCGGTCGCGAAGCATTGAACGTGACAGAAGTC GCGCAGGTAAAAGGTCTGCAATCTCTGATGGGCAATGATAAAGACTATATTGCCACCCGCGCCGCGTACAAACTCAACCT GCACGGCCCGGCGTTATCGGTACAGACCGCCTGCTCCAGCTCGCTGGTTGCCGTGCATCTGGCCTGTGAAAGCCTGCGCG CTGGCGAATCCGATATGGCGGTTGCCGGCGGCGTGGCGCTCTCTTTCCCCCAGCAGGCAGGCTACCGCTACCAGCCCGGA ATGATTTTCTCTCCTGATGGTCACTGTCGTCCCTTTGACGCCTCGGCTGAGGGCACCTGGGCCGGTAACGGTCTCGGCTG CGTGGTGCTGCGTCGCCTGAGAGACGCGCTGCTGTCAGGCGATCCGATTATCTCGGTGATCCTCTCCAGCGCGGTCAACA ACGACGGCAACAGAAAGGTCGGCTATACCGCCCCTTCCGTCGCAGGGCAACAGGCAGTCATCGAAGAGGCGTTAATGCTG GCGGCCATCGACGACAGGCAGGTAGGTTACATTGAAACCCACGGCACCGGCACACCGCTGGGCGACGCGATTGAAATTGA AGCGTTACGCAACGTCTATGCGCCTCGCCCGCAGGATCAGCGCTGTGCGCTCGGTTCCGTGAAAAGTAACATGGGCCATC TGGATACCGCGGCGGGCATTGCCGGACTGCTGAAAACCGTTCTGGCAGTCAGTCGCGGGCAAATTCCTCCCTTACTGAAT TTTCACACCCCCAACCCGGCGCTGAAACTTGAAGAGAGCCCCTTTACCATACCGGTGTCGGCACAGGCATGGCAGGACGA AATGCGCTATGCGGGCGTCTCCTCCTTTGGTATTGGCGGCACCAACTGCCATATGATCGTCGCCTCGCTGCCCGACGCGC TCAACGCGCGCCTCCCCAATACGGATAGCGGCAGAAAAAGTACCGCGCTGCTGCTCAGCGCCGCCAGCGACAGCGCGTTG CGGCGGCTGGCGACGGATTATGCCGGGGCGCTGAGAGAGAATGCGGATGCCAGCTCTCTGGCCTTCACAGCCCTGCACGC GCGCCGTCTCGATCTCCCCTTCCGCCTGGCGGCGCCATTAAACCGTGAAACCGCCGAGGCGCTCAGCGCCTGGGCCGGTG AGAAATCGGGGGCGCAGGTTTACAGCGGCCACGGCGCCAGCGGCAAGCAGGTGTGGCTGTTTACCGGCCAGGGCTCGCAC TGGCGCACTATGGGTCAAACGATGTACCAGCACTCAACGGCGTTTGCCGACACGCTGGATCGCTGTTTTTCCGCCTGTAG CGAAATGCTCACGCCGTCACTGCGCGAAGCGATGTTTAACCCCGATTCGGCGCAGCTGGACAATATGGCCTGGGCGCAGC CGGCGATTGTCGCGTTTGAAATCGCGATGGCGGCGCACTGGCGTGCTGAAGGACTGAAGCCAGACTTCGCCATTGGGCAT TCCGTCGGTGAATTTGCCGCTGCCGTTGTCTGCGGACACTATACGATTGAACAGGTCATGCCACTGGTTTGTCGGCGCGG CGCGCTAATGCAGCAGTGCGCAAGCGGCGCGATGGTGGCGGTATTTGCAGACGAAGACACGCTGATGCCGCTGGCTCGCC AGTTTGAGCTGGATCTCGCCGCCAACAACGGTACGCAACATACGGTATTTTCCGGGCCGGAAGCCCGTCTCGCGGTATTT TGCGCCACGCTCTCGCAGCATGACATTAACTATCGTCGCCTGAGCGTAACCGGTGCGGCGCACTCCGCTTTACTGGAGCC GATACTCGATCGGTTCCAGGACGCCTGCGCGGGACTGCACGCGGAGCCGGGGCAAATACCGATTATTTCCACGCTCACCG CCGACGTCATTGATGAGTCAACGCTCAACCAGGCGGATTACTGGCGCCGACACATGCGCCAGCCGGTGCGTTTTATCCAG AGTATTCAGGTGGCGCATCAACTCGGCACCCGCGTTTTTCTGGAGATGGGGCCCGATGCCCAGTTGGTTGCTTGCGGGCA GCGCGAATACCGCGATAACGCATACTGGATAGCCAGCGCCCGGCGTAACAAAGAGGCGAGCGTTGTCCTCAATCAGGCCC TGCTCCAGCTTTACGCTGCCGGCGTCGCTCTACCGTGGGCCGACCTACTGGCGGGCGATGGACAACGTATCGCTGCGCCA TGTTATCCGTTTGATACTGAGCGTTACTGGAAAGAGCGCGTCTCCCCGGCCTGCGAGCCTGCCGACGCAGCGCTGTCTGC CGGGCTGGAGGTGGCGAGTCGCGCCGCGACAGCGCTCGATCTCCCTCGCCTGGAAGCGCTTAAACAGTGCGCCACGCGAC TGCACGCCATCTACGTCGATCAACTGGTACAACGCTGTACCGGCGATGCCATTGAGAACGGCGTGGACGCCATGACCATC ATGCGCCGTGGACGTCTGCTGCCCCGCTACCAGCAGCTACTCCAGCGCCTGCTGAATAACTGCGTGGTCGACGGCGATTA CCGCTGCACCGACGGGCGATACGTCCGCGCCCGCCCCATTGAACATCAACAGCGGGAATCACTGCTGACGGAACTTGCCG GTTATTGTGAAGGTTTTCAGGCTATTCCCGACACCATCGCCCGTGCCGGCGATCGGTTATATGAAATGATGAGCGGCGCG GAAGAACCGGTGGCGATTATCTTCCCGCAAAGCGCCTCCGACGGCGTGGAAGTGCTGTATCAGGAATTCAGCTTTGGCCG CTATTTCAACCAAATCGCCGCCGGGGTATTACGTGGCATTGTCCAGACGCGTCAGCCCCGCCAGCCGTTGCGTATTCTTG AAGTTGGCGGCGGAACCGGCGGCACCACCGCGTGGCTGCTGCCGGAACTCAACGGCGTTCCGGCACTGGAGTACCATTTC ACCGATATCTCGGCGCTGTTCACCCGTCGCGCCCAGCAGAAATTCGCCGACTATGATTTTGTGAAGTATAGCGAGCTGGA TCTCGAAAAAGAGGCGCAGTCTCAGGGTTTCCAGGCACAGTCTTACGATCTTATCGTGGCAGCGAACGTGATTCACGCCA CCCGCCATATTGGCCGCACGCTCGATAATCTGCGCCCCCTGCTCAAGCCGGGCGGGCGCCTGCTGATGCGCGAAATCACC CAGCCAATGCGTCTGTTTGACTTCGTTTTCGGCCCGCTGGTTCTTCCGCTACAGGATCTCGACGCCCGCGAAGGTGAGTT ATTCCTCACCACCGCTCAGTGGCAACAACAGTGCCGCCACGCCGGATTCAGCAAAGTGGCGTGGCTACCGCAGGATGGCA GCCCGACCGCCGGGATGAGCGAACATATCATTCTCGCCACGCTGCCCGGTCAGGCGGTTAGCGCCGTAACATTCACCGCG CCATCAGAACCCGTGTTGGGGCAGGCGCTGACGGATAACGGTGATTATCTCGCCGACTGGTCTGATTGCGCAGGTCAGCC CGAACGGTTTAACGCCCGCTGGCAGGAGGCCTGGCGTCTGCTTTCACAGCGTCATGGCGACGCTCTGCCTGTGGAACCGC CCCCCGTCGCCGCCCCGGAGTGGCTAGGGAAGGTTCGCTTAAGCTGGCAAAACGAAGCCTTTTCCCGCGGTCAGATGCGC GTTGAAGCCCGTCATCCTGCTGGCGAGTGGCTGCCGCTATCGCCCGCCGAGCCTCTTCCTGCGCCGCAAACGCATTATCA ATGGCGCTGGACGCCCCTCAACGTCGCCAGCATTGACCATCCGCTTACCTTTAGCTTCAGCGCCGGTACGCTTGCGCGCA GCGACGAGCTGGCGCAATACGGCATCATTCACGATCCGCACGCCTCTTCACGACTGATGATTGTTGAGGAGAGCGAGGAT ACGCTGGCCTTAGCGGAGAAAGTGATAGCAGCGCTCACCGCCAGCGCAGCCGGATTGATTGTGGTTACTCGCCGCGCGTG GCGAGTCGAGGAAAATGAAGCACTCTCTGCATCCCATCACGCGCTATGGGCCTTGCTTCGCGTCGCGGCCAACGAACAGC CGGAACGGTTGCTTGCCGCCATCGATCTCGCCGAAAACACCCCGTGGGAAACGCTGCATCAAGGGTTGAGCGCAGTCTCA CTATCACAGCGCTGGCTCGCCGCACGGGGTGACACCCTTTGGCTTCCTTCACTGTCGCCCAATACGGGATGCGCCGCTGA ATTACCGGCAAACGTGTTTACCGGCGATAGCCGCTGGCATCTGGTGACCGGAGCGTTTGGCGGATTAGGCCGCCTTGCCG TGAACTGGCTCAGAGAAAAAGGGGCGCGACGCATCGCCCTGCTAGCGCCGCGCGTGGATGAGTCATGGCTACGCGACGTG GAGGGCGGGCAGACGCGCGTCTGCCGTTGTGATGTGGGCGATGCCGGGCAACTGGCCACGGTTCTTGACGATCTGGCGGC CAACGGCGGCATTGCCGGAGCGATTCATGCCGCTGGCGTATTGGCTGACGCGCCCTTGCAGGAGCTTGATGACCACCAGC TGGCTGCCGTTTTCGCGGTAAAAGCGCAGGCGGCAAGCCAGCTGTTGCAAACCCTGCGCAACCACGACGGACGCTATCTT ATTCTCTACTCTTCCGCTGCCGCCACCCTCGGCGCGCCGGGTCAGAGCGCCCATGCGCTGGCCTGCGGCTACCTGGACGG GCTGGCCCAGCAGTTTTCCACCCTTGATGCGCCGAAAACGCTCTCTGTCGCCTGGGGCGCATGGGGAGAAAGCGGTCGGG CGGCCACGCCGGAAATGCTGGCGACGCTCGCCAGCCGAGGTATGGGCGCGTTAAGCGATGCCGAAGGCTGCTGGCACCTG GAACAGGCGGTGATGCGCGGTGCCCCGTGGCGACTGGCGATGCGCGTTTTTACCGACAAAATGCCCCCGTTACAACAGGC TCTGTTTAACATCAGCGCCACAGAAAAAGCCGCAACGCCGGTCATTCCTCCTGCTGATGACAACGCCTTTAACGGCAGCC TGAGCGATGAAACAGCGGTGATGGCATGGCTGAAAAAGCGGATTGCGGTTCAGCTAAGGCTGAGCGATCCGGCGTCACTG CATCCAAACCAGGATCTGTTGCAACTCGGCATGGACTCGCTGCTCTTCCTTGAACTCAGTAGCGATATTCAGCACTACCT GGGCGTACGCATCAATGCGGAACGGGCGTGGCAGGATCTGTCTCCTCATGGACTCACGCAGCTTATCTGTTCTAAGCCAG AGGCGACGCCTGCCGCTTCGCAGCCGGAAGTGTTGCGGCACGACGCCGACGAGCGTTATGCGCCCTTCCCTTTGACGCCC ATTCAGCACGCCTACTGGCTGGGGCGAACCCACCTCATTGGCTATGGCGGCGTCGCCTGTCACGTCCTGTTTGAGTGGGA TAAACGCCACGATGAGTTCGATCTCGCCATACTGGAGAAAGCATGGAACCAGCTCATCGCACGCCACGATATGTTGCGTA TGGTGGTTGATGCCGACGGGCAGCAGCGAATCCTGGCGACAACGCCGGAGTATCACATCCCGCGTGACGATCTGCGCGCG CTTTCCCCGGAAGAACAGCGCATCGCGCTGGAAAAACGGCGGCATGAACTGAGCTATCGCGTTTTGCCTGCCGACCAGTG GCCTCTTTTTGAGCTGGTGGTCAGCGAAATCGACGATTGCCATTACCGTCTGCATATGAACCTCGACCTTTTGCAGTTTG ATGTGCAGAGTTTTAAAGTCATGATGGACGACCTGGCGCAGGTCTGGCGCGGTGAAACGCTGGCGCCGCTCGCTATTACC TTCCGTGATTATGTGATGGCTGAACAGGCGCGCCGACAGACATCGGCATGGCACGATGCCTGGGATTACTGGCAGGAAAA ACTGCCGCAACTGCCCTTAGCGCCAGAGCTGCCGGTGGTTGAGACGCCCCCGGAAACGCCACACTTCACCACCTTCAAAT CGACGATCGGCAAGACAGAATGGCAGGCCGTGAAACAGCGCTGGCAGCAGCAAGGCGTCACACCGTCTGCCGCGCTGCTC ACGCTGTTTGCCGCCACCCTTGAGCGCTGGAGCCGTACCACAACATTTACGCTGAACCTGACGTTCTTCAATCGCCAGCC GATCCATCCGCAAATCAACCAGTTGATTGGTGATTTTACCTCCGTCACGCTGGTTGATTTTAACTTCTCAGCGCCGGTGA CGTTGCAAGAGCAGATGCAACAGACCCAACAGCGCCTCTGGCAAAACATGGCGCACAGTGAAATGAACGGTGTTGAGGTG ATCCGTGAGCTGGGCCGCCTGCGCGGATCACAACGTCAACCGCTGATGCCGGTAGTGTTTACCAGTATGCTGGGGATGAC GCTGGAAGGCATGACTATCGATCAGGCGATGAGCCATCTGTTCGGCGAACCCTGCTATGTATTCACGCAAACGCCGCAGG TCTGGCTGGATCATCAGGTCATGGAGAGCGACGGCGAGTTGATGTTTAGCTGGTACTGCATGGACAACGTGCTGGAACCC GGCGCTGCCGAGGCGATGTTTAATGACTATTGCGCCATCCTGCAAGCCGTCATCGCCGCCCCTGAAAGCCTGAAGACTCT CGCCAGCGGCATCGCCGGGCACATTCCCCGCCGACGCTGGCCGCTGAACGCGCAGGCGGACTACGACCTGCGGGATATTG AGCAGGCGACGCTCGAATACCCCGGCATCCGGCAGGCCAGAGCGGAAATAACCGAACAGGGCGCGTTGACGCTGGATATC GTGATGGCCGACGATCCGTCGCCATCAGCGGCGATGCCTGATGAGCACGAACTTACCCAACTGGCGCTGCCGTTGCCTGA GCAGGCGCAGCTTGATGAGCTGGAGGCGACCTGGCGCTGGCTGGAGGCGCGTGCGCTACAGGGGATCGCGGCTACGCTAA ATCGTCACGGCCTGTTTACCACGCCGGAGATCGCCCATCGCTTTAGCGCAATAGTACAGGCGCTGTCCGCGCAAGCGTCT CACCAGCGTCTGCTGCGCCAGTGGCTACAGTGTCTGACGGAAAGAGAGTGGTTAATCCGCGAAGGTGAAAGCTGGCGCTG CCGCATTCCGCTCAGCGAGATTCCTGAGCCTCAGGAAGCGTGCCCGCAAAGCCAATGGAGCCAGGCGCTGGCGCAGTATC TGGAAACCTGCATCGCCCGGCACGACGCCCTCTTCTCCGGGCAGTGTTCTCCGCTGGAATTGCTGTTCAACGAGCAGCAT CGCGTTACCGACGCGCTGTATCGCGACAACCCCGCCAGCGCCTGTCTGAATCGCTATACCGCGCAGATTGCCGCCTTGTG CAGCGCAGAACGGATTCTGGAGGTTGGCGCCGGAACCGCAGCCACTACCGCGCCGGTGCTGAAGGCCACGCGGAACACGC GGCAGTCGTACCACTTCACGGACGTCTCCGCGCAGTTCCTCAATGACGCCAGAGCCCGTTTCCATGATGAATCGCAGGTG TCTTATGCCTTGTTCGACATCAACCAGCCGCTGGATTTCACCGCCCACCCGGAGGCGGGTTACGACCTGATCGTTGCCGT CAATGTGCTCCACGACGCCAGCCATGTCGTCCAGACGTTGCGCAGATTAAAACTGTTGCTGAAAGCCGGCGGACGTTTGC TGATCGTTGAAGCGACGGAGCGAAACAGCGTATTCCAGCTGGCGAGCGTGGGCTTTATTGAGGGATTAAGCGGATACCGC GATTTCCGCCGCCGGGATGAGAAACCGATGCTCACCCGCTCCGCATGGCAGGAGGTTCTCGTTCAGGCCGGGTTTGCAAA CGAGCTGGCGTGGCCCGCGCAGGAATCGTCGCCGCTGCGCCAGCATCTGCTGGTAGCGCGTTCGCCTGGCGTAAATCGCC CGGATAAAAAAGCCGTGAGCCGCTATTTACAGCAGCGCTTTGGCACCGGTCTGCCCATTTTACAGATCCGGCAAAGAGAA GCGTTATTTACGCCGCTGCATGCCCCGTCTGATGCGCCGACTGAGCCAGCCAAACCCACGCCAGTTGCCGGGGGGAATCC GGCGCTGGAAAAACAGGTGGCTGAACTCTGGCAATCGCTGCTGTCTCGCCCCGTGGCAAGGCATCACGACTTTTTCGAAC TGGGCGGCGACAGCCTGATGGCGACAAGGATGGTCGCGCAGCTAAACCGGAGAGGGATTGCCAGGGCTAACCTTCAGGAT CTGTTCAGCCATTCGACGCTGAGCGACTTCTGCGCCCATCTACAGGCGGCTACGTCAGGAGAGGACAACCCGATACCCCT TTGCCAGGGCGACGGTGAGGAAACCCTGTTTGTCTTCCACGCTTCAGACGGCGATATCAGCGCCTGGCTGCCGCTCGCTA GCGCGTTGAACAGGCGCGTTTTCGGCCTGCAAGCAAAATCGCCGCAGCGCTTTGCCACGCTCGACCAGATGATCGATGAG TATGTCGGGTGCATCCGTCGTCAGCAGCCTCACGGCCCTTATGTGCTGGCGGGTTGGTCGTATGGCGCGTTTCTTGCGGC GGGCGCCGCACAGCGCCTGTACGCCAAAGGCGAGCAGGTTCGGATGGTGTTAATCGATCCCGTGTGCCGACAGGATTTCT GTTGCGAAAACCGGGCGGCCCTGCTGCGCCTGTTAGCCGAAGGACAAACGCCTCTGGCACTGCCCGAACATTTCGACCAG CAGACGCCCGACAGCCAGCTTGCCGACTTTATCAGCCTCGCTAAAACGGCCGGTATGGTGTCGCAAAACCTGACGCTGCA AGCGGCAGAAACGTGGCTCGACAACATCGCGCATCTGCTGCGTTTACTGACTGAGCATACGCCGGGCGAAAGCGTTCCGG TCCCCTGTCTCATGGTGTATGCCGCCGGGAGACCCGCGCGCTGGACGCCAGCAGAAACCGAGTGGCAGGGCTGGATAAAC AACGCCGACGACGCTGTGATTGAAGCCAGCCACTGGCAAATCATGATGGAAGCCCCCCACGTTCAGGCTTGTGCGCAACA CATTACGCGCTGGCTTTGCGCAACCTCAACGCAACCGGAGAACACGTTATGA Protein sequence : MDNLRFSSAPTADSIDASIVQHYPDCEPVAVIGYACHFPESPDGETFWQNLLEGRECSRRFTREELLAVGLDAAIIDDPH YVNIGTVLDNADCFDATLFGYSRQEAESMDPQQRLFLQAVWHALEHAGYAPGAVPHKTGVFASSRMSTYPGREALNVTEV AQVKGLQSLMGNDKDYIATRAAYKLNLHGPALSVQTACSSSLVAVHLACESLRAGESDMAVAGGVALSFPQQAGYRYQPG MIFSPDGHCRPFDASAEGTWAGNGLGCVVLRRLRDALLSGDPIISVILSSAVNNDGNRKVGYTAPSVAGQQAVIEEALML AAIDDRQVGYIETHGTGTPLGDAIEIEALRNVYAPRPQDQRCALGSVKSNMGHLDTAAGIAGLLKTVLAVSRGQIPPLLN FHTPNPALKLEESPFTIPVSAQAWQDEMRYAGVSSFGIGGTNCHMIVASLPDALNARLPNTDSGRKSTALLLSAASDSAL RRLATDYAGALRENADASSLAFTALHARRLDLPFRLAAPLNRETAEALSAWAGEKSGAQVYSGHGASGKQVWLFTGQGSH WRTMGQTMYQHSTAFADTLDRCFSACSEMLTPSLREAMFNPDSAQLDNMAWAQPAIVAFEIAMAAHWRAEGLKPDFAIGH SVGEFAAAVVCGHYTIEQVMPLVCRRGALMQQCASGAMVAVFADEDTLMPLARQFELDLAANNGTQHTVFSGPEARLAVF CATLSQHDINYRRLSVTGAAHSALLEPILDRFQDACAGLHAEPGQIPIISTLTADVIDESTLNQADYWRRHMRQPVRFIQ SIQVAHQLGTRVFLEMGPDAQLVACGQREYRDNAYWIASARRNKEASVVLNQALLQLYAAGVALPWADLLAGDGQRIAAP CYPFDTERYWKERVSPACEPADAALSAGLEVASRAATALDLPRLEALKQCATRLHAIYVDQLVQRCTGDAIENGVDAMTI MRRGRLLPRYQQLLQRLLNNCVVDGDYRCTDGRYVRARPIEHQQRESLLTELAGYCEGFQAIPDTIARAGDRLYEMMSGA EEPVAIIFPQSASDGVEVLYQEFSFGRYFNQIAAGVLRGIVQTRQPRQPLRILEVGGGTGGTTAWLLPELNGVPALEYHF TDISALFTRRAQQKFADYDFVKYSELDLEKEAQSQGFQAQSYDLIVAANVIHATRHIGRTLDNLRPLLKPGGRLLMREIT QPMRLFDFVFGPLVLPLQDLDAREGELFLTTAQWQQQCRHAGFSKVAWLPQDGSPTAGMSEHIILATLPGQAVSAVTFTA PSEPVLGQALTDNGDYLADWSDCAGQPERFNARWQEAWRLLSQRHGDALPVEPPPVAAPEWLGKVRLSWQNEAFSRGQMR VEARHPAGEWLPLSPAEPLPAPQTHYQWRWTPLNVASIDHPLTFSFSAGTLARSDELAQYGIIHDPHASSRLMIVEESED TLALAEKVIAALTASAAGLIVVTRRAWRVEENEALSASHHALWALLRVAANEQPERLLAAIDLAENTPWETLHQGLSAVS LSQRWLAARGDTLWLPSLSPNTGCAAELPANVFTGDSRWHLVTGAFGGLGRLAVNWLREKGARRIALLAPRVDESWLRDV EGGQTRVCRCDVGDAGQLATVLDDLAANGGIAGAIHAAGVLADAPLQELDDHQLAAVFAVKAQAASQLLQTLRNHDGRYL ILYSSAAATLGAPGQSAHALACGYLDGLAQQFSTLDAPKTLSVAWGAWGESGRAATPEMLATLASRGMGALSDAEGCWHL EQAVMRGAPWRLAMRVFTDKMPPLQQALFNISATEKAATPVIPPADDNAFNGSLSDETAVMAWLKKRIAVQLRLSDPASL HPNQDLLQLGMDSLLFLELSSDIQHYLGVRINAERAWQDLSPHGLTQLICSKPEATPAASQPEVLRHDADERYAPFPLTP IQHAYWLGRTHLIGYGGVACHVLFEWDKRHDEFDLAILEKAWNQLIARHDMLRMVVDADGQQRILATTPEYHIPRDDLRA LSPEEQRIALEKRRHELSYRVLPADQWPLFELVVSEIDDCHYRLHMNLDLLQFDVQSFKVMMDDLAQVWRGETLAPLAIT FRDYVMAEQARRQTSAWHDAWDYWQEKLPQLPLAPELPVVETPPETPHFTTFKSTIGKTEWQAVKQRWQQQGVTPSAALL TLFAATLERWSRTTTFTLNLTFFNRQPIHPQINQLIGDFTSVTLVDFNFSAPVTLQEQMQQTQQRLWQNMAHSEMNGVEV IRELGRLRGSQRQPLMPVVFTSMLGMTLEGMTIDQAMSHLFGEPCYVFTQTPQVWLDHQVMESDGELMFSWYCMDNVLEP GAAEAMFNDYCAILQAVIAAPESLKTLASGIAGHIPRRRWPLNAQADYDLRDIEQATLEYPGIRQARAEITEQGALTLDI VMADDPSPSAAMPDEHELTQLALPLPEQAQLDELEATWRWLEARALQGIAATLNRHGLFTTPEIAHRFSAIVQALSAQAS HQRLLRQWLQCLTEREWLIREGESWRCRIPLSEIPEPQEACPQSQWSQALAQYLETCIARHDALFSGQCSPLELLFNEQH RVTDALYRDNPASACLNRYTAQIAALCSAERILEVGAGTAATTAPVLKATRNTRQSYHFTDVSAQFLNDARARFHDESQV SYALFDINQPLDFTAHPEAGYDLIVAVNVLHDASHVVQTLRRLKLLLKAGGRLLIVEATERNSVFQLASVGFIEGLSGYR DFRRRDEKPMLTRSAWQEVLVQAGFANELAWPAQESSPLRQHLLVARSPGVNRPDKKAVSRYLQQRFGTGLPILQIRQRE ALFTPLHAPSDAPTEPAKPTPVAGGNPALEKQVAELWQSLLSRPVARHHDFFELGGDSLMATRMVAQLNRRGIARANLQD LFSHSTLSDFCAHLQAATSGEDNPIPLCQGDGEETLFVFHASDGDISAWLPLASALNRRVFGLQAKSPQRFATLDQMIDE YVGCIRRQQPHGPYVLAGWSYGAFLAAGAAQRLYAKGEQVRMVLIDPVCRQDFCCENRAALLRLLAEGQTPLALPEHFDQ QTPDSQLADFISLAKTAGMVSQNLTLQAAETWLDNIAHLLRLLTEHTPGESVPVPCLMVYAAGRPARWTPAETEWQGWIN NADDAVIEASHWQIMMEAPHVQACAQHITRWLCATSTQPENTL |
Gene | GenBank Accn | Product | Virulance or Resistance | PAI or REI | Alignment Type | E-val | Identity |
irp1 | YP_002346901.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp1 | NP_669707.1 | HMWP1 nonribosomal peptide/polyketide synthase | Virulence | HPI | Protein | 0.0 | 99 |
irp1 | CAA21391.1 | - | Virulence | HPI | Protein | 0.0 | 99 |
irp1 | YP_070123.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp1 | NP_993006.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp1 | YP_853076.1 | yersiniabactin biosynthetic protein | Virulence | PAI IV APEC-O1 | Protein | 0.0 | 99 |
irp1 | CAA73127.1 | HMWP1 protein | Virulence | HPI | Protein | 0.0 | 98 |
irp1 | YP_001006816.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 98 |