Name : ECOK1_2150 (ECOK1_2150) Accession : YP_006101319.1 Strain : Escherichia coli IHE3034 Genome accession: NC_017628 Putative virulence/resistance : Virulence Product : putative polyketide synthetase Function : - COG functional category : - COG ID : - EC number : - Position : 2163313 - 2172804 bp Length : 9492 bp Strand : + Note : identified by match to protein family HMM PF00106; match to protein family HMM PF00109; match to protein family HMM PF00550; match to protein family HMM PF00668; match to protein family HMM PF00698; match to protein family HMM PF00975; match to protein fa DNA sequence : ATGGATAACTTGCGCTTCTCTTCTGCGCCGACAGCAGATTCCATTGATGCATCGATCGCTCAACACTACCCGGACTGCGA ACCTGTCGCGGTTATCGGCTACGCCTGCCATTTTCCTGAATCGCCGGATGGCGAAACGTTCTGGCAAAATCTGCTGGAAG GTCGTGAATGCAGCCGACGCTTTACGCGCGAAGAGCTTCTGGCCGTCGGTCTGGATGCCGCCATCATTGACGATCCTCAT TATGTCAATATCGGTACGGTGTTAGACAACGCCGACTGCTTCGACGCCACCCTGTTTGGCTATTCGCGACAGGAAGCGGA GTCGATGGACCCGCAGCAGCGCCTGTTTTTGCAGGCGGTCTGGCATGCGCTGGAACATGCCGGTTATGCCCCCGGCGCCG TCCCCCATAAGACCGGCGTTTTCGCCTCTTCCCGGATGAGTACCTACCCCGGTCGCGAAGCATTGAACGTGACAGAAGTC GCGCAGGTAAAAGGTCTGCAATCTCTGATGGGCAATGATAAAGACTATATTGCCACCCGCGCCGCGTACAAACTCAACCT GCACGGCCCGGCGTTATCGGTACAGACCGCCTGCTCCAGCTCGCTGGTTGCCGTGCATCTGGCCTGTGAAAGCCTGCGCG CAGGCGAATCCGATATGGCGGTTGCCGGCGGCGTGGCGCTCTCTTTCCCCCAGCAGGCAGGCTACCGCTACCAGCCCGGA ATGATTTTCTCTCCTGATGGTCACTGTCGTCCCTTTGACGCCTCGGCTGAGGGCACCTGGGCCGGTAACGGTCTCGGCTG CGTGGTGCTGCGTCGCCTGAAAGACGCGCTACTGTCAGGCGATCCGATTATCTCGGTGATCCTCTCCAGCGCGGTCAACA ACGACGGCAACAGAAAGGTCGGCTATACCGCCCCTTCCGTCGCAGGGCAACAGGCAGTCATCGAAGAGGCGTTAATGCTG GCGGCCATCGACGACAGGCAGGTAGGTTACATTGAAACCCACGGCACCGGCACACCGCTGGGCGACGCGATTGAAATTGA AGCGTTACGCAACGTCTATGCGCCTCGCCCGCAGGATCAGCGCTGTGCGCTCGGTTCCGTGAAAAGTAACATGGGCCATC TGGATACCGCGGCGGGCATTGCCGGACTGCTGAAAACCGTTCTGGCAGTCAGTCGCGGGCAAATTCCTCCCTTACTGAAT TTTCACACCCCCAACCCGGCGCTGAAACTTGAAGAGAGCCCCTTTACCATACCGGTGTCGGCACAGGCATGGCAGGACGA AATGCGCTATGCGGGCGTCTCCTCCTTTGGTATTGGCGGCACCAACTGCCATATGATCGTCGCCTCGCTGCCCGACGCGC TCAACGCGCGCCTCCCCAATACGGATAGCGGCAGAAAAAGTACCGCGCTGCTGCTCAGCGCCGCCAGCGACAGCGCGTTG CGGCGGCTGGCGACGGATTATGCCGGGGCGCTGAGAGAGAATGCGGATGCCAGCTCTCTGGCCTTCACAGCCCTGCACGC GCGCCGTCTCGATCTCCCCTTCCGCCTGGCGGCGCCATTAAACCGTGAAACCGCCGAGGCGCTCAGCGCCTGGGCCGGTG AGAAATCGGGGGCGCTGGTTTACAGCGGCCACGGCGCCAGCGGCAAGCAGGTGTGGCTGTTTACCGGCCAGGGCTCGCAC TGGCGCACTATGGGTCAAACGATGTACCATCACTCAACGGCGTTTGCCGACACGCTGGATCGCTGTTTTTCCGCCTGTAG CGAAATGCTCACGCCGTCACTGCGCGAAGCGATGTTTAACCCCGATTCGGCGCAGCTGGACAATATGGCCTGGGCGCAGC CGGCGATTGTCGCGTTTGAAATCGCGATGGCGGCGCACTGGCGTGCTGAAGGACTGAAGCCAGACTTCGCCATTGGGCAT TCCGTCGGTGAATTTGCCGCTGCCGTTGTCTGCGGACACTATACGATTGAACAGGTCATGCCACTGGTTTGTCGGCGCGG CGCGCTAATGCAGCAGTGCGCAAGCGGCGCGATGGTGGCGGTATTTGCAGACGAAGACACGCTGATGCCGCTGGCTCGCC AGTTTGAGCTGGATCTCGCCGCCAACAACGGTACGCAACATACGGTATTTTCCGGGCCGGAAGCCCGTCTCGCGGTATTT TGCGCCACGCTCTCGCAGCATGACATTAACTATCGTCGCCTGAGCGTAACCGGTGCGGCGCACTCCGCTTTACTGGAGCC GATACTCGATCGGTTCCAGGACGCCTGCGCGGGACTGCACGCGGAGCCGGGGCAAATACCGATTATTTCCACGCTCACCG CCGACGTCATTGATGAGTCAACGCTCAACCAGGCGGATTACTGGCGCCGACACATGCGCCAGCCGGTGCGTTTTATCCAG AGTATTCAGGTGGCGCATCAGCTCGGCGCCCGCGTTTTTCTGGAGATGGGGCCCGATGCCCAGTTGGTTGCTTGCGGGCA GCGCGAATACCGCGATAACGCATACTGGATAGCCAGCGCCCGGCGTAACAAAGAGGCGAGCGATGTCCTCAATCAGGCCC TGCTCCAGCTTTACGCTGCCGGCGTCGCCCTACCGTGGGCCGACCTGCTGGCGGGCGATGGACAACGTATCGCTGCGCCA TGTTATCCGTTTGATACTGAACGTTACTGGAAAGAGCGCGTCTCCCCGGCCTGCGAGCCTGCCGACGCAGCGCTGTCTGC CGGGCTGGAGGTGGCGAGTCGCGCCGCGACAGCGCTCGATCTCCCTCGCCTGGAAGCGCTTAAACAGTGCGCCACGCGAC TGCACGCCATCTACGTCGATCAACTGGTACAACGCTGTACCGGCTATGCCATTGAGAACGGCGTGGACGCCATGACCATC ATGCGCCGTGGACGTCTGCTGCCCCGCTACCAGCAGCTACTCCAGCGCCTGCTGAATAACTGCGTGGTCGACGGCGATTA CCGCTGCACCGACGGGCGATACGTCCGCGCCCGCCCCATTGAACATCAACAGCGGGAATCACTGCTGACGGAACTTGCCG GTTATTGTGAAGGTTTTCAGGCTATTCCCGACACCATCGCCCGTGCCGGCGATCGGTTATATGAAATGATGAGCGGCGCG GAAGAACCGGTGGCGATTATCTTCCCGCAAAGCGCCTCCGACGGCGTGGAAGTGCTGTATCAGGAATTCAGCTTTGGCCG CTATTTCAACCAAATCGCCGCCGGGGTATTACGCGGCATTGTCCAGACGCGTCAGCCCCGCCAGCCGTTGCGTATTCTTG AAGTTGGCGGCGGAACCGGCGGCACCACCGCGTGGCTGCTGCCGGAACTCAACGGCGTTCCGGCACTGGAGTACCATTTC ACCGATATCTCGGCGCTGTTCACCCGTCGCGCCCAGCAGAAATTCGCCGACTACGATTTTGTGAAGTATAGCGAGCTGGA TCTCGAAAAAGAGGCGCAGTCTCAGGGTTTCCAGGCACAGTCTTACGATCTTATCGTGGCAGCGAACGTGATTCACGCCA CCCGCCATATTGGCCGCACGCTCGATAATCTGCGCCCCCTGCTCAAGCCGGGCGGGCGCCTGCTGATGCGCGAAATCACC CAGCCAATGCGTCTGTTTGACTTCGTTTTCGGCCCGCTGGTTCTTCCGCTACAGGATCTCGACGCCCGCGAAGGTGAGTT ATTCCTCACCACCGCTCAGTGGCAACAACAGTGCCGCCACGCCGGATTCAGCAAAGTGGCGTGGCTACCGCAGGATGGCA GCCCGAACGCCGGGATGAGCGAACATATCATTCTCGCCACGCTGCCCGGTCAGGCGGTTAGCGCCGTAACATTCACCGCG CCATCAGAACCCGTGTTGGGGCAGGCGCTGACGGATAACGGTGATTATCTCGCCGACTGGTCTGATTGCGCAGGTCAGCC CGAACAGTTTAACGCTCGCTGGCAGGAGGCATGGCGTCTGCTTTCACAGCGTCATGGCGACGCTCTGCCTGTGGAACCGC CCCCCGTCGCCGCCCCGGAGTGGCTGGGGAAGGTTCGCTTAAGCTGGCAAAACGAAGCCTTTTCCCGCGGTCAGATGCGC GTTGAAGCCCGTCATCCTGCTGGCGAGTGGCTGCCGCTATCGCCCGCCGCGCCTCTTCCTGCGCCGCAAACGCATTATCA ATGGCGCTGGACGCCCCTCAACGTCGCCAGCATTGACCATCCGCTTACCTTTAGCTTCAGCGCCGGTACGCTTGCGCGCA GCGACGAGCTGGCGCAATACGGCATCATTCACGATCCGCACGCCTCTTCACGACTGATGATTGTTGAGGAGAGCGAGGAT ACGCTGGCCTTAGCGGAGAAAGTGATAGCAGCGCTCACCGCCAGCGCAGCCGGATTGATTGTGGTTACTCGCCGCGCGTG GCGAGTCGAGGAAAATGAAGCACTCTCTGCATCCCATCACGCGCTATGGGCCTTGCTTCGCGTCGCGGCCAACGAACAGC CGGAACGGTTGCTTGCCGCCATCGATCTCGCCGAAAACACCCCGTGGGAAACGCTGCATCAAGGGTTGAGCGCAGTCTCA CTATCACAGCGCTGGCTCGCCGCACGGGGTGACACCCTTTGGCTTCCTTCACTGTCGCCCAATACGGGATGCGCCGCTGA ATTACCGGCAAACGTGTTTACCGGCGATAGCCGCTGGCATCTGGTGACCGGAGCGTTTGGCGGATTAGGCCGCCTTGCCG TGAACTGGCTCAGAGAAAAAGGGGCGCGACGCATCGCCCTGCTGGCGCCGCGCGTGGATGAGTCATGGCTACGCGACGTG GAGGGCGGGCAGACGCGCGTCTGCCGTTGTGATGTGGGCGATGCCGGGCAACTGGCCACGGTTCTTGACGATCTGGCGGC CAACGGCGGCATTGCCGGAGCGATTCATGCCGCTGGCGTATTGGCTGACGCGCCCTTACAGGAGCTTGATGACCACCAGC TGGCTGCCGTTTTCGCGGTAAAAGCGCAGGCGGCAAGCCAGCTGTTGCAAACCCTGCGCAACCACGACGGACGCTATCTT ATTCTCTACTCTTCCGCTGCCGCCACCCTCGGCGCGCCGGGTCAGAGCGCCCATGCGCTGGCCTGCGGCTACCTGGACGG GCTGGCCCAGCAGTTTTCCACCCTTGATGCGCCGAAAACGCTCTCTGTCGCCTGGGGCGCATGGGGAGAAAGCGGTCGGG CGGCCACGCCGGAAATGCTGGCGACGCTCGCCAGCCGAGGTATGGGCGCGTTAAGCGATGCCGAAGGCTGCTGGCACCTG GAACAGGCGGTGATGCGCGGCGCCCCGTGGCGACTGGCGATGCGCGTTTTTACCGACAAAATGCCCCCGTTACAACAGGC TCTGTTTAACATCAGCGCCACAGAAAAAGCCGCAACGCCGGTCATTCCTCCTGCTGATGACAACGCCTTTAACGGCAGCC TGAGCGATGAAACAGCGGTGATGGCATGGCTGAAAAAGCGGATTGCGGTTCAGCTAAGGCTGAGCGATCCGGCGTCACTG CATCCAAACCAGGATCTGTTGCAACTCGGCATGGACTCGCTGCTCTTCCTTGAACTCAGTAGCGATATTCAGCACTACCT GGGTGTACGCATCAATGCGGAACGGGCGTGGCAGGATCTGTCTCCTCATGGACTCACGCAGCTTATCTGTTCTAAGCCAG AGGCGACGCCTGCCGCTTCGCAGCCGGAAGTGTTGCGGCACGACGCCGACGAGCGTTATGCGCCCTTCCCTTTGACGCCC ATTCAGCACGCCTACTGGCTGGGGCGAACCCACCTCATTGGCTATGGCGGCGTCGCCTGTCACGTCCTGTTTGAGTGGGA TAAACGCCACGATGAGTTCGATCTCGCCATACTGGAGAAAGCATGGAACCAGCTCATCGCACGCCACGATATGTTGCGTA TGGTGGTTGATGCCGACGGGCAGCAGCGAATCCTGGCGACAACGCCGGAGTATCACATCCCGCGTGACGATCTGCGCGCG CTTTCCCCGGAAGAACAGCGCATCGCGCTGGAAAAACGGCGGCATGAACTGAGCTATCGCGTTTTGCCTGCCGACCAGTG GCCTCTTTTTGAGCTGGTGGTCAGCGAAATCGACGATTGCCATTACCGTCTGCATATGAACCTCGACCTTTTGCAGTTTG ATGTGCAGAGTTTTAAAGTCATGATGGACGACCTGGCGCAGGTCTGGCGCGGTGAAACGCTGGCACCGCTCGCTATTACC TTCCGTGATTATGTGATGGCTGAACAGGCGCGCCGACAGACATCGGCATGGCACGATGCCTGGGATTACTGGCAGGAAAA ACTGCCGCAACTGCCCTTAGCGCCAGAGCTGCCGGTGGTTGAGACGCCCCCGGAAACGCCACACTTCACCACCTTCAAAT CGACGATCGGCAAGACAGAATGGCAGGCCGTGAAACAGCGCTGGCAGCAGCAAGGCGTCACACCGTCTGCCGCGCTGCTC ACGCTGTTTGCCGCCACCCTTGAGCGCTGGAGCCGTACCACAACATTTACGCTGAACCTGACGTTCTTCAATCGCCAGCC GATCCATCCGCAAATCAACCAGTTGATTGGTGATTTTACCTCCGTCACGCTGGTTGATTTTAACTTCTCAGCGCCGGTGA CGTTGCAAGAGCAGATGCAACAGACCCAACAGCGCCTCTGGCAAAACATGGCGCACAGTGAAATGAACGGTGTTGAGGTG ATCCGTGAGCTGGGCCGCCTGCGCGGATCACAACGTCAACCGCTGATGCCGGTAGTGTTTACCAGTATGCTGGGGATGAC GCTGGAAGGCATGACTATCGATCAGGCGATGAGCCATCTGTTCGGCGAACCCTGCTATGTATTCACGCAAACGCCGCAGG TCTGGCTGGATCATCAGGTCATGGAGAGCGACGGCGAGTTGATGTTTAGCTGGTACTGCATGGACAACGTGCTGGAACCC GGCGCTGCCGAGGCGATGTTTAATGACTATTGCGCCATCCTGCAAGCCGTCATCGCCGCCCCTGAAAGCCTGAAGACTCT CGCCAGCGGCATCGCCGGGCACATTCCCCGCCGACGCTGGCCGCTGAACGCGCAGGCGGACTACGACCTGCGGGATATTG AGCAGGCGACGCTCGAATACCCCGGCATCCGGCAGGCCAGAGCGGAAATAACCGAACAGGGCGCGTTGACGCTGGATATC GTGATGGCCGACGATCCGTCGCCATCAGCGGCGATGCCTGATGAGCACGAACTTACCCAACTGGCGCTGCCGTTGCCTGA GCAGGCGCAGCTTGATGAGCTGGAGGCGACCTGGCGCTGGCTGGAGGCGCGTGCGCTACAGGGGATCGCGGCTACGCTAA ATCGTCACGGCCTGTTTACCACGCCGGAGATCGCCCATCGCTTTAGCGCAGTAGTACAGGCGCTGTCCGCGCAAGCGTCT CACCAGCGTCTGCTGCGCCAGTGGCTACAGTGTCTGACGGAAAGAGAGTGGTTAATCCGCGAAGGTGAAAGCTGGCGCTG CCGCATTCCGCTCAGCGAGATTCCTGAGCCTCAGGAAGCGTGCCCGCAAAGCCAATGGAGCCAGGCGCTGGCGCAGTATC TGGAAACCTGCATCGCCCGGCACGACGCCCTCTTCTCCGGGCAGTGTTCTCCGCTGGAATTGCTGTTCAACGAGCAGCAT CGCGTTACCGACGCGCTGTATCGCGACAACCCCGCCAGCGCCTGTCTGAATCGCTATACCGCGCAGATTGCCGCCTTGTG CAGCGCAGAACGGATTCTGGAGGTTGGCGCCGGAACCGCAGCCACTACCGCGCCGGTGCTGAAGGCCACGCGGAACACGC GGCAGTCGTACCACTTCACGGACGTCTCCGCGCAGTTCCTCAATGACGCCAGAGCCCGTTTCCATGATGAATCGCAGGTG TCTTATGCCTTGTTCGACATCAACCAGCCGCTGGATTTCACCGCCCACCCGGAGGCGGGTTACGACCTGATCGTTGCCGT CAATGTGCTCCACGACGCCAGCCATGTCGTCCAGACGTTGCGCAGATTAAAACTGTTGCTGAAAGCCGGCGGACGTTTGC TGATCGTTGAAGCGACGGAGCGAAACAGCGTATTCCAGCTGGCGAGCGTGGGCTTTATTGAGGGATTAAGCGGATACCGC GATTTCCGCCGCCGGGATGAGAAACCGATGCTCACCCGCTCCGCATGGCAGGAGGTTCTCGTTCAGGCCGGGTTTGCAAA CGAGCTGGCGTGGCCCGCGCAGGAATCGTCGCCGCTGCGCCAGCATCTGCTGGTAGCGCGTTCGCCTGGCGTAAATCGCC CGGATAAAAAAGCCGTGAGCCGCTATTTACAGCAGCGCTTTGGCACCGGTCTGCCCATTTTACAGATCCGGCAAAGAGAA GCGTTATTTACGCCGCTGCATGCCCCGTCTGATGCGCCGACTGAGCCAGCCAAACCCACGCCAGTTGCCGGGGGGAATCC GGCGCTGGAAAAACAGGTGGCTGAACTCTGGCAATCGCTGCTGTCTCGCCCCGTGGCAAGGCATCACGACTTTTTCGAAC TGGGCGGCGACAGCCTGATGGCGACAAGGATGGTCGCGCAGCTAAACCGGAGAGGGATTGCCAGGGCTAACCTTCAGGAT CTGTTCAACCATTCGACGCTGAGCGACTTCTGCGCCCATCTACAGGCGGCTACGTCAGGAGAGGACAACCCGATACCCCT TTGCCAGGGCGACGGTGAGGAAACCCTGTTTGTCTTCCACGCTTCAGACGGCGATATCAGCGCCTGGCTGCCGCTCGCTA ACGCGTTGAACAGGCGCGTTTTCGGCCTGCAAGCAAAATCGCCGCAGCGCTTTGCCACGCTCGACCAGATGATCGATGAG TATGTCGGGTGCATCCGTCGTCAGCAGCCTCACGGCCCTTATGTGCTGGCGGGTTGGTCGTATGGCGCGTTTCTTGCGGC GGGCGCCGCACAGCGCCTGTACGCCAAAGGCAAGCAGGTTCGGATGGTGTTAATCGATCCCGTGTGCCGACAGGATTTCT GTTGCGAAAACCGGGCGGCCCTGCTGCGCCTGTTAGCCGAAGGACAAACGCCTCTGGCACTGCCCGAACATTTCGACCAG CAGACGCCCGACAGCCAGCTTGCCGACTTTATCAGCCTCGCTAAAACGGTCGGTATGGTGTCGCAAAACCTGACGCTGCA AGCGGCAGAAACGTGGCTCGACAACATCGCGCATCTGCTGCGTTTACTGACTGAGCATACGCCGGGCGAAAGCGTTCCGG TCCCCTGTCTCATGGTGTATGCCGCCGGGAGACCCGCGCGCTGGACGCCAGCAGAAACCGAGTGGCAGGGCTGGATAAAC AACGCCGACGACGCTGTGATTGAAGCCAGCCACTGGCAAATCATGATGGAAGCCCCCCACGTTCAGGCTTGTGCGCAACA CATTACGCGCTGGCTTTGCGCAACCTCAACGCAACCGGAGAACACGTTATGA Protein sequence : MDNLRFSSAPTADSIDASIAQHYPDCEPVAVIGYACHFPESPDGETFWQNLLEGRECSRRFTREELLAVGLDAAIIDDPH YVNIGTVLDNADCFDATLFGYSRQEAESMDPQQRLFLQAVWHALEHAGYAPGAVPHKTGVFASSRMSTYPGREALNVTEV AQVKGLQSLMGNDKDYIATRAAYKLNLHGPALSVQTACSSSLVAVHLACESLRAGESDMAVAGGVALSFPQQAGYRYQPG MIFSPDGHCRPFDASAEGTWAGNGLGCVVLRRLKDALLSGDPIISVILSSAVNNDGNRKVGYTAPSVAGQQAVIEEALML AAIDDRQVGYIETHGTGTPLGDAIEIEALRNVYAPRPQDQRCALGSVKSNMGHLDTAAGIAGLLKTVLAVSRGQIPPLLN FHTPNPALKLEESPFTIPVSAQAWQDEMRYAGVSSFGIGGTNCHMIVASLPDALNARLPNTDSGRKSTALLLSAASDSAL RRLATDYAGALRENADASSLAFTALHARRLDLPFRLAAPLNRETAEALSAWAGEKSGALVYSGHGASGKQVWLFTGQGSH WRTMGQTMYHHSTAFADTLDRCFSACSEMLTPSLREAMFNPDSAQLDNMAWAQPAIVAFEIAMAAHWRAEGLKPDFAIGH SVGEFAAAVVCGHYTIEQVMPLVCRRGALMQQCASGAMVAVFADEDTLMPLARQFELDLAANNGTQHTVFSGPEARLAVF CATLSQHDINYRRLSVTGAAHSALLEPILDRFQDACAGLHAEPGQIPIISTLTADVIDESTLNQADYWRRHMRQPVRFIQ SIQVAHQLGARVFLEMGPDAQLVACGQREYRDNAYWIASARRNKEASDVLNQALLQLYAAGVALPWADLLAGDGQRIAAP CYPFDTERYWKERVSPACEPADAALSAGLEVASRAATALDLPRLEALKQCATRLHAIYVDQLVQRCTGYAIENGVDAMTI MRRGRLLPRYQQLLQRLLNNCVVDGDYRCTDGRYVRARPIEHQQRESLLTELAGYCEGFQAIPDTIARAGDRLYEMMSGA EEPVAIIFPQSASDGVEVLYQEFSFGRYFNQIAAGVLRGIVQTRQPRQPLRILEVGGGTGGTTAWLLPELNGVPALEYHF TDISALFTRRAQQKFADYDFVKYSELDLEKEAQSQGFQAQSYDLIVAANVIHATRHIGRTLDNLRPLLKPGGRLLMREIT QPMRLFDFVFGPLVLPLQDLDAREGELFLTTAQWQQQCRHAGFSKVAWLPQDGSPNAGMSEHIILATLPGQAVSAVTFTA PSEPVLGQALTDNGDYLADWSDCAGQPEQFNARWQEAWRLLSQRHGDALPVEPPPVAAPEWLGKVRLSWQNEAFSRGQMR VEARHPAGEWLPLSPAAPLPAPQTHYQWRWTPLNVASIDHPLTFSFSAGTLARSDELAQYGIIHDPHASSRLMIVEESED TLALAEKVIAALTASAAGLIVVTRRAWRVEENEALSASHHALWALLRVAANEQPERLLAAIDLAENTPWETLHQGLSAVS LSQRWLAARGDTLWLPSLSPNTGCAAELPANVFTGDSRWHLVTGAFGGLGRLAVNWLREKGARRIALLAPRVDESWLRDV EGGQTRVCRCDVGDAGQLATVLDDLAANGGIAGAIHAAGVLADAPLQELDDHQLAAVFAVKAQAASQLLQTLRNHDGRYL ILYSSAAATLGAPGQSAHALACGYLDGLAQQFSTLDAPKTLSVAWGAWGESGRAATPEMLATLASRGMGALSDAEGCWHL EQAVMRGAPWRLAMRVFTDKMPPLQQALFNISATEKAATPVIPPADDNAFNGSLSDETAVMAWLKKRIAVQLRLSDPASL HPNQDLLQLGMDSLLFLELSSDIQHYLGVRINAERAWQDLSPHGLTQLICSKPEATPAASQPEVLRHDADERYAPFPLTP IQHAYWLGRTHLIGYGGVACHVLFEWDKRHDEFDLAILEKAWNQLIARHDMLRMVVDADGQQRILATTPEYHIPRDDLRA LSPEEQRIALEKRRHELSYRVLPADQWPLFELVVSEIDDCHYRLHMNLDLLQFDVQSFKVMMDDLAQVWRGETLAPLAIT FRDYVMAEQARRQTSAWHDAWDYWQEKLPQLPLAPELPVVETPPETPHFTTFKSTIGKTEWQAVKQRWQQQGVTPSAALL TLFAATLERWSRTTTFTLNLTFFNRQPIHPQINQLIGDFTSVTLVDFNFSAPVTLQEQMQQTQQRLWQNMAHSEMNGVEV IRELGRLRGSQRQPLMPVVFTSMLGMTLEGMTIDQAMSHLFGEPCYVFTQTPQVWLDHQVMESDGELMFSWYCMDNVLEP GAAEAMFNDYCAILQAVIAAPESLKTLASGIAGHIPRRRWPLNAQADYDLRDIEQATLEYPGIRQARAEITEQGALTLDI VMADDPSPSAAMPDEHELTQLALPLPEQAQLDELEATWRWLEARALQGIAATLNRHGLFTTPEIAHRFSAVVQALSAQAS HQRLLRQWLQCLTEREWLIREGESWRCRIPLSEIPEPQEACPQSQWSQALAQYLETCIARHDALFSGQCSPLELLFNEQH RVTDALYRDNPASACLNRYTAQIAALCSAERILEVGAGTAATTAPVLKATRNTRQSYHFTDVSAQFLNDARARFHDESQV SYALFDINQPLDFTAHPEAGYDLIVAVNVLHDASHVVQTLRRLKLLLKAGGRLLIVEATERNSVFQLASVGFIEGLSGYR DFRRRDEKPMLTRSAWQEVLVQAGFANELAWPAQESSPLRQHLLVARSPGVNRPDKKAVSRYLQQRFGTGLPILQIRQRE ALFTPLHAPSDAPTEPAKPTPVAGGNPALEKQVAELWQSLLSRPVARHHDFFELGGDSLMATRMVAQLNRRGIARANLQD LFNHSTLSDFCAHLQAATSGEDNPIPLCQGDGEETLFVFHASDGDISAWLPLANALNRRVFGLQAKSPQRFATLDQMIDE YVGCIRRQQPHGPYVLAGWSYGAFLAAGAAQRLYAKGKQVRMVLIDPVCRQDFCCENRAALLRLLAEGQTPLALPEHFDQ QTPDSQLADFISLAKTVGMVSQNLTLQAAETWLDNIAHLLRLLTEHTPGESVPVPCLMVYAAGRPARWTPAETEWQGWIN NADDAVIEASHWQIMMEAPHVQACAQHITRWLCATSTQPENTL |
Gene | GenBank Accn | Product | Virulance or Resistance | PAI or REI | Alignment Type | E-val | Identity |
irp1 | YP_070123.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp1 | NP_993006.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp1 | YP_002346901.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp1 | NP_669707.1 | HMWP1 nonribosomal peptide/polyketide synthase | Virulence | HPI | Protein | 0.0 | 99 |
irp1 | CAA21391.1 | - | Virulence | HPI | Protein | 0.0 | 99 |
irp1 | YP_853076.1 | yersiniabactin biosynthetic protein | Virulence | PAI IV APEC-O1 | Protein | 0.0 | 99 |
irp1 | CAA73127.1 | HMWP1 protein | Virulence | HPI | Protein | 0.0 | 98 |
irp1 | YP_001006816.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 98 |