Name : CFSAN001921_24640 (CFSAN001921_24640) Accession : YP_008258185.1 Strain : Genome accession: NC_021815 Putative virulence/resistance : Virulence Product : peptide synthetase Function : - COG functional category : - COG ID : - EC number : - Position : 181370 - 187477 bp Length : 6108 bp Strand : - Note : Derived by automated computational analysis using gene prediction method: GeneMarkS+. DNA sequence : ATGATTTCTGGCGCACCATCTCAGGATTCGCTGTTACCGGACAACCGCCACGCGGCTGATTACCAACAATTACGCGAGCG GCTTATACAGGAACTGAATTTAACGCCGCAGCAGTTACATGAAGAGAGCAATCTGATCCAGGCCGGCCTGGATTCCATAA GGTTGATGAGATGGTTACACTGGTTTCGTAAAAATGGCTACCGCCTTACCCTTCGCGAGCTGTATGCCGCCCCCACGCTG GCGGCATGGAACCAGTTAATACTCAGCCGGTCGCCGGAGAACGCGGAAGAAGAAACGCCGCCCGACGAATCATCCTGGCC GAACATGACCGAAAGTACCCCCTTCCCATTGACGCCGGTACAGCACGCCTACCTGACGGGCCGCATGCCGGGGCAGACGC TTGGCGGCGTGGGTTGCCACCTGTATCAGGAGTTTGAAGGCCATTGTCTGACAGCGTCGCAACTGGAGCAGGCCATTACG GCCTTGCTGCAACGCCACCCAATGCTGCATATCGCCTTTCGCCCCGACGGGCAGCAGATCTGGCTACCGCAACCTTACTG GAACGGCGTCACCGTTCATGATTTACGCCATAACGACGCTGAAAGCCGCCAGCCCTATCTGGAAGCACTGCGCCAGCGCC TGAGCCACCGTCTTTTACGCGTGGAGATCGGTGAAACGTTTGATTTTCAGCTGACGCTCTTGCCGGACAATCGCCACCGC CTCCATGTCAATATTGACCTGCTGATTATGGATGCCTCCAGCTTTACGCTTTTCTTCGATGAGCTTAACGCCCTGCTGGC CGGAGAATCGCTGTCGGCTATAGACACCCGCTATGATTTCCGTTCGTATTTGCTGCACCAGCAGAAGATCAATCAACCAC TGAGAGACAACGCACGCGCTTACTGGCTGGCGAAAGCATCGACGCTTCCCCCCGCACCCGTCTTGCCGCTGACCTGCGAA CCCGCCACACTGCGCGAAGTCCGTAATACCCGGCGCCGCATGATTGTCCCGGCAACACGCTGGCACGCCTTTAGCAACCG GGCCGGCGAGTATGGCGTGACGCCGACAATGGCGCTGGCGACCTGTTTTTCTGCCGTGCTGGCTCGCTGGGGCGGCCTGA CGCGTCTGCTGCTTAACATCACCTTATTCGACCGCCAGCCGCTGCACCCGTCGGTTGGCGCGATGCTTGCCGATTTCACC AATATTCTTCTGCTGGATACCGCCTGCGATGGCGATACCGTCAGCAACCTGGCGCGTAAAAACCAGCTCACGTTTACGGA GGACTGGGAGCATCGCCACTGGTCCGGCGTCGAATTACTCCGTGAACTCAAACGCCAGCAGCGCTACCCCCACGGTGCCC CGGTGGTATTTACCAGCAATCTGGGACGTTCCCTCTACAGCAGCCGCGCAGAATCGCCGTTGGGCGAGCCGGAATGGGGC ATCTCGCAAACGCCGCAGGTCTGGATAGATCATCTGGCGTTCGAGCATCACGGCGAGGTCTGGCTGCAATGGGACAGCAA CGACGCGCTGTTCCCTCCGGCGTTAGTCGAAACGTTGTTCGACGCCTACTGCCAGTTGATTAACCAACTCTGCGATGACG AAAGCGCCTGGCAAAAGCCGTTTGCAGATATGATGCCCGCCCGCCAGCGCGCAATACGCGAACGGGTCAACGCCACCGGT GCCCCCATTCCCGAAGGCTTGCTGCATGAAGGCATTTTCCGTATCGCTCTGCAACAGCCGCAGGCGCTGGCGGTAACGGA CATGCGTTATCAGTGGAATTATTATGAGCTGACAGACTATGCCCGCCGTTGCGCGGGCAGGTTAATCGAGTGCGGGGTTC AGCCCGGCGATAATGTGGCTATCACGATGTCGAAAGGCGCAGGACAACTTGTTGCGGTTCTGGCCGTCCTGCTGGCCGGG GCGGTTTACGTTCCGGTTTCGCTGGACCAGCCTGCCGCACGGCGCGAGAAAATCTACGCTGACGCCAGCGTCCGGCTGGT ACTCATTTGCCAGCACGACGCCAGCGCCTGGTCAGACGATATTCCCGTCCTTGCCTGGCAGCAGGCCATTGAGGCGGAGC CGATCGCCAACCCGGTGGTACGCGCCCCCACGCAACCGGCCTACATTATCTACACCTCCGGCTCTACCGGCACGCCGAAA GGGGTGGTCATTTCTCACCGGGGAGCGCTCAACACCTGTTGCGATATCAATACCCGCTATCAGGTTGGCCCCGGTGACAG GGTGCTGGCCCTCTCCGCCCTGCATTTTGATTTATCGGTTTACGACATTTTTGGCGTACTGCGCGCGGGCGGCGTGCTGG TGATGGTGATGGAAAATCAACGGCGCGATCCTCATGCATGGTGTGAGCTGATCCAGCGCCATCAGGTCACGCTCTGGAAC AGCGTCCCGGCGCTGTTCGATATGCTGCTGACCTGGTGTGAAGGTTTCGCCGACGCCACGCCGGAAAACCTGCGTGCAGT GATGCTTTCCGGCGACTGGATCGGGCTTGACCTCCCCGCCCGTTATCGGGCCTTCCGGCCACAAGGACAATTTATCGCGA TGGGCGGCGCCACCGAGGCGTCTATCTGGTCTAACGCCTGTGAAATTCACGACGTCCCCGCCCACTGGCGTTCCATCCCT TACGGTTTCCCGCTAACCAACCAACGCTACCGGGTGGTGGATGAATGGGGCCGGGACTGCCCTGACTGGGTACCGGGCGA ATTATGGATTGGCGGCATCGGGGTCGCGGAAGGCTATTTCAACGATCCCCTGCGCAGCGAGCAGCAATTTTTGACGCACC CGGACGAGCGCTGGTATCGCACCGGCGATCTCGGCTGCTACTGGCCAGACGGCACAATAGAGTTCCTCGGTCGTCGCGAC AAGCAGGTCAAAGTCGGAGGATATCGCATCGAGCTGGGCGAAATCGAAAGCGCACTCAGCCAGTTGGCGGGGGTGAAACA AGCAACCGTTCTGGCGATCGGCGAAAAAGAAAAAACGCTGGCGGCATACGTGGTTCCTCAGGGCGAGGCTTTTTGCGTTA CCGATCATCGGGACCCGGCACTGCCGCAGGCGTGGCACACGCTTGCGGGAACGTTGCCCTGTTGTGCTATCTCGCCAGAG ATCTCCGCAGAACAGGTAGCCGATTTCCTTCAGCATCGCCTGTTAAAACTGAAGCCGGGTCACACCGCTGGCGCCGATCC TCTCCCCCTGATGAACTCACTCGCTATCCAGCCGCGCTGGCAGGCCGTGGTGGAACGCTGGTTAGCATTTCTGGTGACGC AACGGCGACTGAAGCCCGCTGCTGAAGGTTATCAGGTCTGCGCTGGTGAAGAACGCAAGGATGAGCACCCGAATTTCAGC GGACATGATTTAACGTTATCGCAAATTCTTCGCGGCACCCGCGACGAACTGTCGTTACTGAACGACGCGCAGTGGTCGCC GGAAAGCCTGGCCTTCAACCATCCGGCCAGCGCCCCATATATTCAGGAACTGGCGACAATTTGCCAACAGCTTGCACAGA GCTTACAGCGCCCGGTACGCCTGCTTGAGGTGGGAACCCGCACCGGCCGCGCCGCAGAATCGCTGTTAGTACAGCTCAAC GCCGGACAGGTTGAGTATGTCGGGCTTGAGCAGAGCCAGGAGATGCTCCTGAGCGCCCGGCAGAGGCTCGTCCCCCGGCT TGGTGCCCGTCTGTCCCCCTGGAATGCAGACACGCTGGCGGTGCACGCTCACTCGGCGGACATTATCTGGCTCAATAACG CCCTGCATCGTCTGCTGCCGGAAGATCCCGGGCTCCTTGCGACATTACAACAGCTTGCCGTTCCCGGCGCGCTGCTCTAC GTGATGGAGTTTCGCCAGTTAACGCCGTCCGCCTTGCTCAGCACGCTCCTGTTAACCGATGGACAGCCGGAGGCCCTGCT GCATAACAGCGCCGACTGGGCGGCGTTATTTAGCGCAGCCGCCTTCAACTGTCAGCATGGCGATGAGGTCGCGGGGTTAC AACGCTTCCTCGTACAATGTCCCGATAGCCAGGTGCGCCGCGATCCCCGTCAACTTCAGGCCGCCCTCGCCGGACGCCTG CCGGGGTGGATGGTGCCGCAACGGATCGTCTTCCTCGACGCCTTACCGCTGACGGCTAACGGGAAGATTGACTATCAGGC GCTGAAGCGTCGTCATACTCCTGAAGCGGAAAACCAGGCCGAAGCGGATTTACCCCAGGGCGACACTGAAAAACAGGTTG CCGCCCTCTGGCAGCAACTCTTATCGACTGGCAATGTCACCAGAGAAACCGACTTCTTCCAGCAAGGCGGCGATAGCCTG CTGGCGACCCGTCTGACCGGACAACTTCATCAGGCAGGTTATGAAGCGCAATTAAGCGACCTGTTTAATCATCCCCGGCT GGCGGATTTTGCCGCCACGCTGCGTAAAATCGACGTCCCGGTCGAACAACCATTCGTCCACTCTCCTGAAGATCGCTACC AGCCCTTTGCGCTTACCGACGTGCAGCAGGCTTACCTGGTGGGGTGTCAGCCGGGCTTTGCCCTGGGCGGCGTCGGCTCA CATTTCTTTGTTGAATTTGAAATTGCCGATCTGGACCTCACCCGGCTGGAGACGGTCTGGAACCGATTAATCGCCCGCCA CGATATGCTCCGCGCCGTCGTGCGTGATGGACAGCAACAGGTGCTAGAACAGACGCCCCACTGGGTGATACCCGCACACA CCCTCCATACGCCTGAAGAGGCGTTGCGGGCGCGCGAAAAACTGGCACATCAGGTACTCAACCCCGAAGTGTGGCCGGTA TTCGATCTCCAGGTCGGATACGTGAACGGGATGCCCGCCCGCCTGTGGCTGTGTCTGGATAACCTGTTGCTTGACGGTCT GAGCATGCAGATCCTGCTGGCGGAGCTGGAGCACGGCTACCGCTACCCGCAGCAGTTGCCTCCGCCGCTGCCCGTCACCT TCAGGGATTATCTGCAACAACCCTCGCTACAGTCGCCCAATCCAGATTCTCTGGCATGGTGGCAGGCACAGCTTGATGAT ATTCCTCCGGCGCCTGCGTTGCCGCTGCGCTGCTTGCCTCAGGAGGTTGAAACACCGCGCTTCACCCGCCTGAACGGCGC GCTGGACAGCACGCGCTGGCATCGTCTGAAAAAACGGGCGGCTGACGCCCATCTCACCCCGTCGGCCGTACTGTTGTCGG TGTGGTCAACGGTTCTCTCTGCATGGAGTGCACAGCCTGAGTTCACGCTTAACCTTACGCTTTTCGACAGGCGGCCGCTG CACCCGCAAATCAACCAGATTCTGGGCGATTTCACCTCGCTGATGCTGCTGAGCTGGCATCCCGGCGAAAGCTGGCTGCA CAGCGCGCAGTCACTACAGCAGCGGCTGAGCCAGAACCTCAACCACCGCGATGTGTCGGCCATCCGCGTGATGCGTCAAC TGGCGCAACGGCAAAACGTGCCCGCCATTCCGATGCCCGTCGTCTTTACCAGCGCGCTGGGCTTTGAGCAGGATAACTTC CTCGCCCGGCGTAATCTGCTCAAACCGGTCTGGGGCATCTCCCAGACGCCGCAGGTCTGGCTCGATCACCAGGTTTATGA ATCCGAAGGCGAACTGCGCTTTAACTGGGATTTTGTCGCCGCGCTGTTTCCTGCCGGGCAGGTGGAGCGCCAGTTTGAAC AGTATTGCGCATTGCTAAACCGAATGGCCGAGGATGAAAGCGGCTGGCAACTGCCGCTCGCCGCGCTGGTGCCTCCCGTT AAATTCGCAGGGCAATGCGCAGAGCGCTCACCGCGCGTATGCCCTGAGCACTCTCAGCCACACATTGCGGCGGACGAGAG CACCGTCAGCCTGATTTGCGACGCCTTCCGCGAGGTGGTTGGCGAGTCTGTCACGCCCGCAGAAAACTTCTTTGAGGCGG GCGCAACGTCACTGAATCTGGTGCAACTGCACATTTTGTTACAACGTCACGAATTTTCCACCCTGACGTTGCTTGACCTC TTCACCCATCCTTCTCCTGCCGCCCTGGCCGATTATCTGGCCGGCGTCGTCACGGTGGAGAAAACAAAACATCCTCGCCC TGTTCGCCGTCGTCAGCGGCGGATATAG Protein sequence : MISGAPSQDSLLPDNRHAADYQQLRERLIQELNLTPQQLHEESNLIQAGLDSIRLMRWLHWFRKNGYRLTLRELYAAPTL AAWNQLILSRSPENAEEETPPDESSWPNMTESTPFPLTPVQHAYLTGRMPGQTLGGVGCHLYQEFEGHCLTASQLEQAIT ALLQRHPMLHIAFRPDGQQIWLPQPYWNGVTVHDLRHNDAESRQPYLEALRQRLSHRLLRVEIGETFDFQLTLLPDNRHR LHVNIDLLIMDASSFTLFFDELNALLAGESLSAIDTRYDFRSYLLHQQKINQPLRDNARAYWLAKASTLPPAPVLPLTCE PATLREVRNTRRRMIVPATRWHAFSNRAGEYGVTPTMALATCFSAVLARWGGLTRLLLNITLFDRQPLHPSVGAMLADFT NILLLDTACDGDTVSNLARKNQLTFTEDWEHRHWSGVELLRELKRQQRYPHGAPVVFTSNLGRSLYSSRAESPLGEPEWG ISQTPQVWIDHLAFEHHGEVWLQWDSNDALFPPALVETLFDAYCQLINQLCDDESAWQKPFADMMPARQRAIRERVNATG APIPEGLLHEGIFRIALQQPQALAVTDMRYQWNYYELTDYARRCAGRLIECGVQPGDNVAITMSKGAGQLVAVLAVLLAG AVYVPVSLDQPAARREKIYADASVRLVLICQHDASAWSDDIPVLAWQQAIEAEPIANPVVRAPTQPAYIIYTSGSTGTPK GVVISHRGALNTCCDINTRYQVGPGDRVLALSALHFDLSVYDIFGVLRAGGVLVMVMENQRRDPHAWCELIQRHQVTLWN SVPALFDMLLTWCEGFADATPENLRAVMLSGDWIGLDLPARYRAFRPQGQFIAMGGATEASIWSNACEIHDVPAHWRSIP YGFPLTNQRYRVVDEWGRDCPDWVPGELWIGGIGVAEGYFNDPLRSEQQFLTHPDERWYRTGDLGCYWPDGTIEFLGRRD KQVKVGGYRIELGEIESALSQLAGVKQATVLAIGEKEKTLAAYVVPQGEAFCVTDHRDPALPQAWHTLAGTLPCCAISPE ISAEQVADFLQHRLLKLKPGHTAGADPLPLMNSLAIQPRWQAVVERWLAFLVTQRRLKPAAEGYQVCAGEERKDEHPNFS GHDLTLSQILRGTRDELSLLNDAQWSPESLAFNHPASAPYIQELATICQQLAQSLQRPVRLLEVGTRTGRAAESLLVQLN AGQVEYVGLEQSQEMLLSARQRLVPRLGARLSPWNADTLAVHAHSADIIWLNNALHRLLPEDPGLLATLQQLAVPGALLY VMEFRQLTPSALLSTLLLTDGQPEALLHNSADWAALFSAAAFNCQHGDEVAGLQRFLVQCPDSQVRRDPRQLQAALAGRL PGWMVPQRIVFLDALPLTANGKIDYQALKRRHTPEAENQAEADLPQGDTEKQVAALWQQLLSTGNVTRETDFFQQGGDSL LATRLTGQLHQAGYEAQLSDLFNHPRLADFAATLRKIDVPVEQPFVHSPEDRYQPFALTDVQQAYLVGCQPGFALGGVGS HFFVEFEIADLDLTRLETVWNRLIARHDMLRAVVRDGQQQVLEQTPHWVIPAHTLHTPEEALRAREKLAHQVLNPEVWPV FDLQVGYVNGMPARLWLCLDNLLLDGLSMQILLAELEHGYRYPQQLPPPLPVTFRDYLQQPSLQSPNPDSLAWWQAQLDD IPPAPALPLRCLPQEVETPRFTRLNGALDSTRWHRLKKRAADAHLTPSAVLLSVWSTVLSAWSAQPEFTLNLTLFDRRPL HPQINQILGDFTSLMLLSWHPGESWLHSAQSLQQRLSQNLNHRDVSAIRVMRQLAQRQNVPAIPMPVVFTSALGFEQDNF LARRNLLKPVWGISQTPQVWLDHQVYESEGELRFNWDFVAALFPAGQVERQFEQYCALLNRMAEDESGWQLPLAALVPPV KFAGQCAERSPRVCPEHSQPHIAADESTVSLICDAFREVVGESVTPAENFFEAGATSLNLVQLHILLQRHEFSTLTLLDL FTHPSPAALADYLAGVVTVEKTKHPRPVRRRQRRI |
Gene | GenBank Accn | Product | Virulance or Resistance | PAI or REI | Alignment Type | E-val | Identity |
irp2 | YP_070124.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 98 |
irp2 | YP_002346902.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 98 |
irp2 | NP_669706.1 | HMWP2 nonribosomal peptide synthetase | Virulence | HPI | Protein | 0.0 | 98 |
irp2 | NP_993007.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 98 |
irp2 | CAA21390.1 | - | Virulence | HPI | Protein | 0.0 | 98 |
irp2 | YP_853075.1 | yersiniabactin biosynthetic protein | Virulence | PAI IV APEC-O1 | Protein | 0.0 | 98 |
irp2 | YP_001006815.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 97 |
PMI2599 | YP_002152317.1 | non-ribosomal peptide synthase | Not tested | Not named | Protein | 0.0 | 41 |