Name : LF82_300 (LF82_300) Accession : YP_002556852.1 Strain : Escherichia coli LF82 Genome accession: NC_011993 Putative virulence/resistance : Virulence Product : peptide synthetase-like protein Function : - COG functional category : - COG ID : - EC number : - Position : 2016861 - 2022986 bp Length : 6126 bp Strand : + Note : corresponding to LF82_p300 in publication : Miquel et al., PLoS One; 536; 55989; APEC_01; CFT073; ED1a; IAI39; S88; UMN026; UTI89 DNA sequence : GTGCCATCAGGAGGAAGAATGATTTCTGGCGCACCATCTCAGGATTCGCTGTTACCGGACAACCGCCACGCGGCTGATTA CCAACAATTACGCGAGCGGCTCATACAGGAACTGAATTTAACGCCGCAGCAGTTACATGAAGAGAGCAACCTGATCCAGG CCGGCCTGGATTCCATAAGATTGATGAGATGGTTACACTGGTTTCGTAAAAATGGCTACCGCCTTACCCTTCGCGAGCTG TATGCCGCCCCCACGCTGGCGGCATGGAACCAGTTAATGCTCAGCCGGTCGCCGGAGAACGCGGAAGAAGAAACGCCGCC CGACGAATCATCCTGGCCGAACATGACCGAAAGTACCCCCTTCCCATTGACGCCAGTACAGCACGCCTACCTGACGGGCC GCATGCCGGGGCAGACGCTTGGCGGCGTGGGTTGCCACCTGTATCAGGAGTTTGAAGGCCATTGTCTGACGGCGTCGCAG CTGGAGCAGGCCATCACGACCTTGCTGCAACGCCACCCAATGCTGCATATCGCCTTTCGCCCCGACGGGCAGCAGGTCTG GCTACCGCAACCTTACTGGAACGGCGTCACCGTTCATGATTTACGCCATAACGACGCTGAAAGCCGCCAGGCCTATCTGG ACGCACTGCGCCAGCGCCTGAGCCACCGTCTTTTACGCGTGGAAATCGGCGAAACGTTTGATTTTCAGCTGACGCTCTTG CCGGACAATCGCCACCGCCTCCATGTCAATATTGACCTGCTGATTATGGATGCCTCCAGCTTTACGCTTTTCTTCGATGA GCTTAACGCCCTGCTGGCCGGAGAATCGCTGCCGGCTATCGACACCCGCTATGATTTCCGCTCGTATTTGCTGCACCAGC AGAAGATCAATCAACCACTGAGAGACGACGCGCGCGCTTACTGGCTGGCGAAAGCATCGACGCTTCCCCCCGCGCCCGTC TTGCCGCTGGCCTGCGAACCCGCCACGCTACGTGAAGTCCGTAATACCCGACGCCGCATGATTGTCCCGGCAACACGCTG GCACGCCTTTAGCAACCGGGCCGGCGAGTATGGCGTGACGCCGACAATGGCACTGGCGACCTGTTTTTCTGCCGTGCTGG CTCGCTGGGGCGGCCTGACGCGTCTGCTGCTTAACATCACCTTATTCGACCGCCAGCCGCTGCACCCGGCGGTTGGCGCG ATGCTTGCCGACTTCACCAATATTCTTCTGCTGGATACCGCCTGCGATGGCGATACCGTCAGCAACCTGGCGCGTAAAAA CCAGCTCACGTTTACGGAGGACTGGGAGCATCGCCACTGGTCCGGCGTCGAATTACTCCGTGAACTCAAACGCCAGCAGC GCTACCCCCACGGCGCCCCGGTGGTATTTACCAGCAATCTGGGGCGTTCCCTCTACAGCAGCCGCGCAGAATCGCCGTTG 
GGCGAGCCGGAATGGGGCATCTCGCAAACGCCGCAGGTCTGGATAGATCATCTGGCGTTCGAGCATCACGGCGAGGTCTG GCTACAATGGGACAGCAACGACGCGCTGTTCCCTCCGGCGTTAGTCGAAACATTGTTCGACGCCTACTGCCAGTTGATTA ACCAACTCTGCGATGACGAAAGCGCCTGGCAAAAGCCGTTCGCAGATATGATGCCCGCCAGCCAGCGCGCGATACGCGAA CGGGTCAACGCCACCGGCGCCCCCATTCCCGAAGGCTTGCTGCATGAAGGCATTTTCCGTATCGCTCTGCAACAGCCGCA GGCGCTGGCGGTAACGGACATGCGTTATCAGTGGAATTATCATGAGCTGACAGACTATGCCCGCCGTTGCGCGGGCAGGT TAATCGAGTGCGGGGTTCAGCCCGGCGATAATGTGGCTATCACGATGTCGAAAGGCGCAGGACAACTTGTTGCGGTTCTG GCCGTCCTGCTGGCCGGGGCGGTTTACGTTCCGGTTTCGCTGGATCAGCCTGCCGCACGGCGCGAGAAAATCTACGCTGA CGCCAGCGTCCGGCTGGTGCTCATTTGTCAGCACGACGCCAGCGCCGGGTCAGACGATATTCCCGTCCTTGCCTGGCAGC AGGCCATTGAGGCGGAGCCGATCGCCAACCCGGTAGTACGCGCCCCCACGCAACCGGCCTACATTATCTACACCTCCGGC TCTACCGGTACGCCGAAAGGGGTAGTCATTTCTCACCGGGGAGCGCTTAACACCTGTTGCGATATCAATACCCGCTATCA GGTTGGCCCGCATGACAAGGTGCTGGCCCTCTCCGCCCTACATTTTGATTTATCGGTTTACGACATTTTTGGCGTACTGC GCGCGGGCGGCGCGCTGGTGATGGTGATGGAAAATCAACGGCGCGATCCTCACGCATGGTGTGAGCTGATCCAGCGCCAT CAGGTCACGCTCTGGAACAGCGTCCCGGCGCTGTTCGATATGCTGCTGACCTGGTGTGAAGGTTTCGCCGACGCCACGCC GGAAAACCTGCGCGCAGTGATGCTTTCCGGCGACTGGATCGGGCTTGACCTCCCCGCCCGTTATCGGGCCTTCCGGCCAC AAGGACAATTTATCGCGATGGGCGGCGCCACCGAGGCGTCTATCTGGTCTAACGCCTGCGAAATTCACGACGTCCCTGCC CACTGGCGCTCCATCCCTTACGGTTTTCCGCTAACCAACCAACGCTACCGGGTGGTGGATGAACAGGGCCGGGACTGCCC TGACTGGGTGCCGGGTGAATTATGGATTGGCGGCATTGGGGTCGCGGAAGGCTATTTCAACGATCCCCTGCGTAGCGAGC AGCAATTTTTGACGCTCCCGGACGAGCGCTGGTATCGCACCGGCGATCTCGGCTGCTACTGGCCAGATGGCACAATCGAG TTCCTCGGTCGTCGCGACAAGCAGGTCAAAGTCGGAGGATATCGCATCGAGCTGGGCGAAATCGAAAGCGCGCTCAGCCA GCTGGCGGGGGTGAAACAAGCAACCGTTCTGGCGATCGGCGAAAAAGAAAAAACGCTGGCGGCATACGTTGTTCCTCAGG GCGAGGCTTTTTGCGTTACCGATCATCGGAACCCGGCACTGCCGCAGGCGTGGCACACGCTTGCGGGAACGTTGCCCTGT TGCGCCATCTCGCCAGAGATCTCCGCAGAACAGGTAGCCGATTTCCTTCAGCATCGCCTGCTAAAACTGAAGCCGGGTCA CACCGCTGGCGCCGATCCTCTCCCCCTGATGAACTCACTCGCTATCCAGCCGCGCTGGCAGGCCGTGGTGGAACGCTGGT TAGCATTTCTGGTGACACAACGGCGACTGAAGCCCGCTGCTGAAGGTTATCAGGTCTGCGCTGGTGAAGAACGCGAGGAT 
GAGCACCCGCACTTCAGCGGACATGATTTAACGTTATCGCAAATTCTTCGTGGTGCCCGTAACGAACTGTCGTTACTGAA CGACGCGCAGTGGTCGCCGGAAAGCCTGGCCTTTAACCATCCGGCCAGCGCCCCGTATATTCAGGAACTGGCGACAATTT GCCAACAGCTTGCACAGCGCTTACAGCGCCCGGTGCGCCTGCTTGAGGTGGGAACCCGCACTGGCCGCGCCGCAGAATCG CTGTTAGCACAGCTCAACGCCGGACAGATTGAGTATGTCGGGCTTGAGCAGAGCCAGGAGATGCTGCTGAGCGCCCGGCA GAGGCTCGTCCCCTGGCCTGGCGCCCGTCTGTCCCTCTGGAATGCAGACACGCTGGCGGCGCACGCTCACTCGGCGGACA TTATCTGGCTTAATAACGCCCTGCATCGTCTGCTGCCGGAAGATCCCGGGCTCCTTGCGACATTACAACAGCTTGCCGTT CCCGGCGCGCTGCTCTACGTGATGGAGTTTCGCCAGTTAACGCCGTCCGCCCTACTCAGCACGCTCCTGTTAACCAATGG GCAGCCGGAGGCCTTGCTGCATAACAGCGCCGACTGGGCGGCATTATTTAGCGCGGCCGCCTTCAACTGTCAGCATGGCG ATGAGGTCGCGGGGTTACAACGCTTCCTCGTACAATGTCCTGACAGGCAGGTGCGCCGCGATCCCCGTCAACTTCAGGCC GCCCTCGCCGGGCGTCTGCCGGGGTGGATGGTGCCGCAACGGATCGTCTTCCTCGACGCCTTACCGCTGACGGCTAACGG GAAAATTGACTACCAGGCGCTGAAGCGTCGTCATACCCCTGAAGCGGAAAACCCGGCCGAAGCGGATTTACCCCAGGGCG ACATTGAAAAACAGGTTGCCGCCCTCTGGCAGCAACTCTTATCAACTGGCAATGTCACCAGAGAAACCGACTTCTTCCAG CAAGGCGGCGATAGCCTGCTGGCGACCCGTCTGACCGGGCAACTTCATCAGGCAGGTTATGAAGCGCAATTAAGCGACCT GTTTAATCATCCCCGGCTGGCGGATTTTGCCGCCACGCTGCGGAAAACCGACGTCCCGGTCGAACAACCATTCGTCCACT CCCCTGAAGATCGCTACCAGCCCTTTGCGCTTACCGACGTGCAGCAGGCTTACCTGGTGGGGCGTCAGCCGGGCTTTGCC CTGGGCGGCGTCGGCTCACATTTCTTTGTTGAATTTGAAATTGCCGATCTGGATCTCACCCGGTTGGAGACGGTCTGGAA CCGATTAATCGCCCGCCACGATATGCTACGCGCCGTCGTGCGTGATGGACAGCAACAGGTGCTCGCACAGACGCCTCGCT GGGTAATACCCGCACACATCCTCCATACGCCTGAAGAGGCGTTGCAGGTGCGCGAAAAACTGGCGCATCAGGTACTCAAC CCCGAAGTGTGGCCGGTATTCGATCTCCAGGTCGGATACGTGGACGGGATGCCTGCCCGCCTGTGGCTGTGTCTGGATAA CCTGTTGCTTGACGGTCTGAGCATGCAGATCCTGCTGGCGGAGCTGGAGCACGGCTACCGCTACCCGCAACAGCTGCTTC CGCCGCTGCCCGTCACCTTCAGGGATTATCTGCAACAACCCTCGCTACAGTCGCCCAATCCAGATTCTCTGGCATGGTGG CAGGCGCAGCTTGATGATATTCCTCCGGCGCCTGCGTTGCCGCTGCGCTGCTTGCCTCAGGAGGTTGAAACACCGCGCTT CGCCCGCCTGAACGGCGCACTGGACAGCACGCGCTGGCATCGGCTGAAAAAACGGGCGGCTGACGCCCATCTCACCCCGT CGGCCGTACTGTTGTCGGTGTGGTCAACGGTTCTCTCTGCATGGAGTGCACAGCCTGAGTTCACGCTTAACCTTACGCTT 
TTCGACAGGCGACCGCTGCACCCGCAAATCAACCAGATTCTGGGCGATTTCACCTCGCTGATGCTGCTGAGCTGGCATCC CGGCGAAAGCTGGCTGCACAGCGCGCAGTCACTACAGCAGCGGCTGAGCCAGAACCTCAACCACCGCGATGTGTCAGCCA TCCGCGTGATGCGTCAACTGGCGCAACGGCAAAACGTGCCTGCCGTTCCGATGCCCGTCGTCTTTACCAGCGCACTGGGC TTTGAGCAGGATAACTTCCTCGCCCGGCGTAATCTGCTCAAACCGGTCTGGGGCATCTCCCAGACGCCGCAGGTCTGGCT CGATCACCAGATTTATGAATCCGAAGGCGAACTGCGCTTTAACTGGGATTTTGTCGCCGCGCTGTTTCCTGCCGGGCAGG TGGAGCGCCAGTTTGAACAGTATTGCGCATTGCTAAACCGAATGGCCGAGGATGAAAGCGGCTGGCAACTGCCGCTCGCC GCGCTGGTGCCTCCCGTTAAACACGCAGGGCAATGCGCAGAGCGCTCACCGCGCGTATGCCCTGAGCACTCTCAGCCACA CATTGCGGCGGACGAGAGCACCGTCAGCCTGATTTGCGACGCCTTCCGCGAGGTGGTTGGCGAGTCTGTCACGCCCGCAG AAAACTTCTTTGAGGCGGGCGCAACGTCGCTGAATCTGGTGCAACTGCACGTTTTGTTACAACGTCACGAATTTTCCACC CTGACGTTGCTTGACCTCTTCACCCACCCTTCTCCTGCTGCCCTGGCCGATTATCTGGCCGGCGTCGCCACGGTGGAGAA AACAAAACGACCTCGCCCTGTTCGCCGTCGTCAGCGGCGGATATAG Protein sequence : MPSGGRMISGAPSQDSLLPDNRHAADYQQLRERLIQELNLTPQQLHEESNLIQAGLDSIRLMRWLHWFRKNGYRLTLREL YAAPTLAAWNQLMLSRSPENAEEETPPDESSWPNMTESTPFPLTPVQHAYLTGRMPGQTLGGVGCHLYQEFEGHCLTASQ LEQAITTLLQRHPMLHIAFRPDGQQVWLPQPYWNGVTVHDLRHNDAESRQAYLDALRQRLSHRLLRVEIGETFDFQLTLL PDNRHRLHVNIDLLIMDASSFTLFFDELNALLAGESLPAIDTRYDFRSYLLHQQKINQPLRDDARAYWLAKASTLPPAPV LPLACEPATLREVRNTRRRMIVPATRWHAFSNRAGEYGVTPTMALATCFSAVLARWGGLTRLLLNITLFDRQPLHPAVGA MLADFTNILLLDTACDGDTVSNLARKNQLTFTEDWEHRHWSGVELLRELKRQQRYPHGAPVVFTSNLGRSLYSSRAESPL GEPEWGISQTPQVWIDHLAFEHHGEVWLQWDSNDALFPPALVETLFDAYCQLINQLCDDESAWQKPFADMMPASQRAIRE RVNATGAPIPEGLLHEGIFRIALQQPQALAVTDMRYQWNYHELTDYARRCAGRLIECGVQPGDNVAITMSKGAGQLVAVL AVLLAGAVYVPVSLDQPAARREKIYADASVRLVLICQHDASAGSDDIPVLAWQQAIEAEPIANPVVRAPTQPAYIIYTSG STGTPKGVVISHRGALNTCCDINTRYQVGPHDKVLALSALHFDLSVYDIFGVLRAGGALVMVMENQRRDPHAWCELIQRH QVTLWNSVPALFDMLLTWCEGFADATPENLRAVMLSGDWIGLDLPARYRAFRPQGQFIAMGGATEASIWSNACEIHDVPA HWRSIPYGFPLTNQRYRVVDEQGRDCPDWVPGELWIGGIGVAEGYFNDPLRSEQQFLTLPDERWYRTGDLGCYWPDGTIE FLGRRDKQVKVGGYRIELGEIESALSQLAGVKQATVLAIGEKEKTLAAYVVPQGEAFCVTDHRNPALPQAWHTLAGTLPC 
CAISPEISAEQVADFLQHRLLKLKPGHTAGADPLPLMNSLAIQPRWQAVVERWLAFLVTQRRLKPAAEGYQVCAGEERED EHPHFSGHDLTLSQILRGARNELSLLNDAQWSPESLAFNHPASAPYIQELATICQQLAQRLQRPVRLLEVGTRTGRAAES LLAQLNAGQIEYVGLEQSQEMLLSARQRLVPWPGARLSLWNADTLAAHAHSADIIWLNNALHRLLPEDPGLLATLQQLAV PGALLYVMEFRQLTPSALLSTLLLTNGQPEALLHNSADWAALFSAAAFNCQHGDEVAGLQRFLVQCPDRQVRRDPRQLQA ALAGRLPGWMVPQRIVFLDALPLTANGKIDYQALKRRHTPEAENPAEADLPQGDIEKQVAALWQQLLSTGNVTRETDFFQ QGGDSLLATRLTGQLHQAGYEAQLSDLFNHPRLADFAATLRKTDVPVEQPFVHSPEDRYQPFALTDVQQAYLVGRQPGFA LGGVGSHFFVEFEIADLDLTRLETVWNRLIARHDMLRAVVRDGQQQVLAQTPRWVIPAHILHTPEEALQVREKLAHQVLN PEVWPVFDLQVGYVDGMPARLWLCLDNLLLDGLSMQILLAELEHGYRYPQQLLPPLPVTFRDYLQQPSLQSPNPDSLAWW QAQLDDIPPAPALPLRCLPQEVETPRFARLNGALDSTRWHRLKKRAADAHLTPSAVLLSVWSTVLSAWSAQPEFTLNLTL FDRRPLHPQINQILGDFTSLMLLSWHPGESWLHSAQSLQQRLSQNLNHRDVSAIRVMRQLAQRQNVPAVPMPVVFTSALG FEQDNFLARRNLLKPVWGISQTPQVWLDHQIYESEGELRFNWDFVAALFPAGQVERQFEQYCALLNRMAEDESGWQLPLA ALVPPVKHAGQCAERSPRVCPEHSQPHIAADESTVSLICDAFREVVGESVTPAENFFEAGATSLNLVQLHVLLQRHEFST LTLLDLFTHPSPAALADYLAGVATVEKTKRPRPVRRRQRRI |
Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%) |
irp2 | YP_002346902.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | NP_669706.1 | HMWP2 nonribosomal peptide synthetase | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | YP_070124.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | YP_853075.1 | yersiniabactin biosynthetic protein | Virulence | PAI IV APEC-O1 | Protein | 0.0 | 99 |
irp2 | NP_993007.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | CAA21390.1 | - | Virulence | HPI | Protein | 0.0 | 99 |
irp2 | YP_001006815.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |
PMI2599 | YP_002152317.1 | non-ribosomal peptide synthase | Not tested | Not named | Protein | 0.0 | 41 |
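
The record's coordinates and the homolog table can be sanity-checked with a few lines of Python. This is a minimal sketch, not part of the database: the function names, the field keys, and the 90% identity cutoff are illustrative assumptions. It also assumes the Position field is 1-based and inclusive, and that the homolog rows are pipe-delimited with a trailing `|`, as they appear above.

```python
# Coordinates copied from the record's Position field
# (assumed to be 1-based and inclusive).
START, END = 2016861, 2022986

# Hypothetical field names for the eight table columns.
KEYS = ["gene", "accession", "product", "category",
        "island", "alignment", "evalue", "identity"]


def gene_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive interval."""
    return end - start + 1


def protein_length(dna_length: int) -> int:
    """Residue count for a single ORF ending in one stop codon."""
    return dna_length // 3 - 1


def parse_row(line: str) -> dict:
    """Split one pipe-delimited homolog row into named fields."""
    fields = [f.strip() for f in line.strip().strip("|").split("|")]
    row = dict(zip(KEYS, fields))
    row["evalue"] = float(row["evalue"])
    row["identity"] = int(row["identity"])
    return row


# Two rows copied verbatim from the table above.
rows = [parse_row(r) for r in [
    "irp2 | YP_002346902.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 99 |",
    "PMI2599 | YP_002152317.1 | non-ribosomal peptide synthase | Not tested | Not named | Protein | 0.0 | 41 |",
]]

# Illustrative cutoff: keep hits with at least 90% identity.
close_homologs = [r["gene"] for r in rows if r["identity"] >= 90]

print(gene_length(START, END))                  # 6126, matching the Length field
print(protein_length(gene_length(START, END)))  # 2041
print(close_homologs)                           # ['irp2']
```

Under these assumptions the computed length agrees with the 6126 bp stated in the record, and only the irp2 hit passes the identity cutoff among the two sample rows.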