Gene Information

Name : CFSAN001921_24635 (CFSAN001921_24635)
Accession : YP_008258184.1
Strain :
Genome accession: NC_021815
Putative virulence/resistance : Virulence
Product : polyketide synthase
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 171785 - 181282 bp
Length : 9498 bp
Strand : -
Note : Derived by automated computational analysis using gene prediction method: GeneMarkS+.

DNA sequence :
ATGGATAACTTGCGCTTCTCTTCTGCGCCGACAGCAGATTCCATTGATGCATCGATCGCTCAACACTACCCGGACTGCGA
ACCTGTCGCGGTTATCGGCTACGCCTGCCATTTTCCTGAATCGCCGGATGGCGAAACGTTCTGGCAAAACCTGCTGGAAG
GTCGTGAATGCAGCCGACGCTTTACGCGCGAAGAGCTTCTGGCCGTCGGTCTGGATGCCGCCATCATTGACGATCCTCAT
TATGTCAATATCGGTACGGTGTTAGACAACGCCGACTGCTTCGACGCCACCCTGTTTGGCTATTCGCGACAGGAAGCGGA
GTCGATGGACCCGCAGCAGCGCCTGTTTTTGCAGGCGGTCTGGCATGCGCTGGAACATGCCGGTTATGCCCCCGGTGCCG
TCCCCCATAAGACCGGTGTTTTCGCCTCTTCCCGGATGAGTACCTACCCCGGTCGCGAAGCATTGAACGTAACAGAAGTC
GCGCAGGTAAAGGGGCTGCAATCTCTGATGGGCAATGATAAAGACTATATTGCCACCCGCGCCGCGTACAAACTCAACCT
GCACGGCCCGGCGTTATCGGTACAGACCGCCTGCTCCAGCTCGCTGGTTGCCGTACATCTGGCCTGTGAAAGCCTGCGCG
CAGGCGAATCCGATATGGCGGTTGCCGGCGGCGTGGCGCTCTCTTTCCCCCAGCAGGCAGGCTATCGCTACCAGCCCGGA
ATGATTTTCTCTCCTGATGGTCACTGTCGTCCCTTTGACGCCTCGGCTGAGGGCACCTGGGCCGGTAACGGTCTCGGCTG
CGTGGTGTTACGTCGCCTGAGAGACGCGCTGCTGTCAGGCGATCCGATTATCTCGGTGATCCTCTCCAGCGCGGTCAACA
ACGACGGCAACAGAAAGGTCGGCTATACCGCCCCTTCCGTCGCAGGGCAACAGGCGGTCATCGAAGAGGCGTTAATGCTG
GCGGCCATCGACGACAGGCAGGTAGGTTACATTGAAACCCACGGTACCGGCACACCGTTGGGCGACGCGATTGAAATTGA
AGCGTTACGCAACGTCTATGCGCCTCGCCCGCAGGATGAGCGCTGTGCGCTCGGTTCCGTGAAAAGTAATATGGGGCATC
TGGATACCGCGGCGGGCATTGCCGGACTGCTAAAAACCGTTCTGGCAGTCAGCCGCGGACAAATTCCACCCTTACTGAAT
TTTCACACCCCCAACCCGGCGCTGAAACTTGAAGAGAGCCCCTTTACCATACCGGTGTCAGCGCAGGCGTGGCAGGACGA
AATGCGTTATGCTGGCGTCTCCTCCTTTGGTATTGGCGGCACCAACTGCCATATGATCGTCGCCTCGCTGCCCGACGCGC
TCAACGCGCGCCTCCCCAACACGGATGGCGGCAGAAAAAGTACCGCGTTGCTGCTCAGCGCCGCCAGCGACAGCGCGTTG
CGGCGGCTGGCGACGGATTATGCCGGAGCGCTAAGGGATAATGCGGATGCCAGCTCTCTGGCCTTCACGGCCCTGCACGC
GCGCCGTCTCGATCTTCCCTTCCGCCTGGCAGCGCCATTAAACCGTGAAACCGCTGCGGCGCTCAGTGCCTGGGCCGGTG
AGAAATCGGGGGAGCTGGTTTACAGCGGCCACGGTGCCAGCGGCAAGCAGGTGTGGCTGTTTACCGGCCAGGGCTCGCAC
TGGCGCACTATGGGTCAGACAATGTACCGGCACTCAACGGCGTTTGCCGACACGCTGGATCGCTGTTTTTCCGCCTGTAG
CGAAATGCTCACGCCGTCACTGCGCGAAGCGATGTTTAACCCCGATTCGGCGCAGCTGGACAATATGGCCTGGGCGCAGC
CAGCGATTGTCGCGTTTGAAATCGCGATGGCGGCGCACTGGCGCGCTGAAGGACTGAAGCCAGATTTCGCCATTGGGCAT
TCCGTCGGTGAATTTGCCGCTGCCGTCGTCTGCGGACACTATACGATTGAACAGGTCATGCCACTGGTTTGTCGACGCGG
AGCACTGATGCAGCAGTGCGCAAGCGGCGCAATGGTGGCGGTATTTGCAGACGAAGACACGCTGATGCCGCTGGCTCGTC
AGTTTGAGCTGGATCTCGCCGCCAACAACGGTACGCAACATACGGTATTTTCCGGGCCGGAAGCCCGTCTCGCGATATTT
TGCGCCACGCTCTCGCAGCATGACATTGACTATCGTCGCCTGAGCGTAACCGGCGCGGCGCACTCCGCTTTACTGGAACC
GATACTCGATCGGTTCCAGGCCGCCTGCGCGGGGCTGCACGCAGAGCCGGGGCAAATACCGATTATTTCCACGCTCACCG
CCGACGTCATTGATGAGTCAACGCTCAACCAGGCGGATTACTGGCGCCGACACATGCGCCAGCCGGTGCGTTTTATCCAG
AGTATCCAGATGGCGCATCAGCTCGGCGCCCGCGTTTTTCTGGAGATGGGACCTGATGCCCAGTTGGTTGCTTCCGGGCA
GCGCGAATACCGCGATAACGCATACTGGATAGCCAGCGCCCGGCGTAACAAAGAGGCGAACGATGTCCTCAATCAGGCCC
TGCTCCAGCTTTACGCTGCCGGCGTCGCCCTACCGTGGGCCAACCTACTGGCGGGCGATGGACAACGTATCGCTGCGCCA
TGTTATCCGTTTGATACTGAGCGTTACTGGAAAGAGCGCGTTTCCCCGGCCTGCGAGCCTGCCGATGCAGCACTGTCTGC
CGGGCTGGAGGTGGCGAGTCGCGCCGCGGCAGCGCTCGACCACCCCCGTCTGGAAGCGCTTAAACAGTGCGCCACGCGAC
TGCACGCCATCTACGTCGATCAACTGGTACAACGCTGTACCGGCGATGCCATTGAAAACGGTGTGGACGCCATGACCATC
ATGCGCCGTGGACGTCTGCTGCCCCGCTACCAGCAGCTACTCCAGCGCCTGCTGAATAACTGCGTGGTCGACGGCGATTA
CCGCTGCACCGACGGGCGATACGCTCGCGCCCGCCCCATTGAACATCAACAGCGGGAATCACTGCTGACGGAACTTGCCG
GTTATTGTGAAGGTTTTCAGGCTATTCCCGACACCATCGCCCGTGCCGGCGATCGGTTATATGAAATGATGAGCGGCGCG
GAAGAACCGGTGGCGATTATCTTCCCGCAAAGCGCCTCCGACGGCGTGGAGGTGTTGTATCAGGAATTCAGCTTTGGTCG
CTATTTCAACCAAATCGCCGCCGGGGTATTACGCGGCATTGTCCAGACGCGTCAGCCCCGCCAGCCATTGCGTATTCTTG
AAGTTGGCGGCGGAACCGGCGGCACCACCGCGTGGCTGCTGCCGGAACTCAGCGGCGTTCCGGCGCTGGAGTACCATTTC
ACCGATATCTCGGCGCTGTTCACCCGCCGCGCCCAGCAGAAATTCGCCGACTATGATTTTGTGAAGTATAGCGAGCTGGA
TCTCGAAAAAGAGGCGCAATCGCAGGGTTTCCTGGCACAGTCTTACGATCTCATTGTGGCGGCGAACGTGATTCACGCCA
CCCGCCATATTGGCCGCACGCTCGATAATCTGCGCCCCCTGCTCAAGCCGGGCGGGCGTCTGCTGATGCGCGAAATTACC
CAGCCAATGCGTCTGTTTGATTTCGTTTTCGGCCCGCTGGTTCTTCCGCTACAGGATCTCGACGCCCGCGAAGGTGAGTT
ATTCCTCACCACCGCTCAGTGGCAGCAACAGTGCCGCCACGCCGGATTCAGCAAAGTGGCGTGGCTACCGCAGGACGACA
GCCCGACCGCCGGGATGAGCGAACATATCATTCTCGCCACGCTGCCCGGTCAGGCGGTTAGTGCCGTAACATTCACCGCA
CCATCAGAACCCGTGTTGGGGCAGGCGCTGACGGATAACGGTGACTATCTTGCCGACTGGTCTGATTGCGCAGGTCAGCC
TGAACAGTTTAACGCCCGCTGTCAGGAGGCCTGGCGTCTGCTCTCACAGCGTCATGGCGACGCTCTGCCTGTGGAACCGC
CCCTCGCCGCGGCCCCGGAGTGGCTGGGGGAGGTTCGCTTAAGCTGGCAAAACGAAGCCTTTTCCCGCGGTCAGATGCGC
GTTGAAGCCCGTCATCCTGATGGTAAGTGGCTGCCGCTATCGCCCGCCGCGCCTCTTCCTGCACCGCAAACGCATTATCA
ATGGCGCTGGACGCCCCTCAACATCGCCAGCGCTGACCATCCGCTTACCTTCAACTTCAGCGCCGGTATGCTTGCGCGCC
GCGACGAGCTGGCGCAATACGGCATTATTCACGATCCGCACGCCTCTTCGCGACTGATGATTGTTGAGGAGAGCGAGGAT
ACGCTGGCCTTAGCGGAGAAAGTGATAGCAGCGCTCACCGCCAGCGCAGCCGGATTGATTGTGGTCACTCGCCGCGCGTG
GCGAGTCGAGGAAAATGAAGCGCTCTCTGCGTCCCATCACGCGCTATGGGCCTTGCTTCGCGTCGCGGCCAACGAACAGC
CGGAACGGTTGATTGCCGCCATCGATCTCGCCGAAAACACCCCGTGGGAAACGCTACATCAAGGGTTGAGCGCAGTCTCA
CTATCACAGCGCTGGCTTGCCGCGCGGGGTAACACCCTCTGGCTCCCTTCACTGGCGCCCAATACGGGATGCGCCGCCGA
ATTACCGGCAAACGTGTTTACCGGCGATAACCGCTGGCATCTGGTGACCGGAGCGTTTGGCGGATTAGGCCGTCTTGCCG
TGAACTGGCTCAGAGAAAAAGGGGCGCGACGCATCGCCCTGCTGGCACCGCGCGTGGATGAGTCATGGCTACGCGATGTG
GAGGGCGGGCAGACGCGCGTCTGCCGTTGTGATGTGGGCGATACCGGGCAAATGACCACGGTTCTTGACGATCTGGCGGC
CAACGGCGGCATTGCCGGAGCGATTCATGCCGCTGGCGTATTGGCTGACGCGCCCTTGCAGGAGCTTGATGACCACCAGC
GGGCCGCCGTTTTTGCGGTAAAAGCGCAGGCGGCAAACCAACTGTTGCAAACCCTGCGCAACCACGACGGACGCTATCTT
ATTCTCTACTCTTCCGCTGCCGCCACCCTCGGCGCGCCGGGTCAGAGCGCCCATGCGCTGGCCTGCGGCTACCTGGACGG
GCTGGCCCAGCAGTTTTCCACCCTTGATGCGCCGAAAACGCTCTCTGTCGCCTGGGGCGCATGGGGAGAAAGCGGTCGGG
CGGCCACGCCGGAAATGCTGGCGACGCTCGCCAACCGTGGTATGGGCGCGTTAAGCGATGCCGAAGGCTGCTGGCACCTG
GAACAGGCGGTGATGCGCGGCACCCCGTGGCGACTGGCGATGCGCGTTTTTACCGACAAAATGCCCCCGTTACAACAGGC
TCTGTTTAACATCAGCGCCACAGAAAAAGCCGCCACGCCTGTCATTCCTCCTGCTGATGACAACGCCTTTAACGGCAGCC
TGAGCGATGAAACGGCGGTGATGGCATGGCTGAAAAAGCGGATTGCGGTTCAGCTAAGGCTGAGCAATCCGGCATCACTG
CGCCCAAACCAGGATCTGTTACAACTCGGCATGGACTCGCTGCTCTTCCTTGAACTCAGTAGTGATATTCAGCACTACCT
GGGCGTACGCATCAATGCGGAACGGGCGTGGCAGGATCTGTCTCCTCATGGACTCACGCAGCTTATCTGTTCTAAGCCAG
AGACGACGCCTGCCGCTTCGCAGCCGGAAGTGTTGCGGCACGACGCCGACGAGCGTTATGCGCCCTTCCCTTTGACGCCA
ATTCAGCACGCCTACTGGCTGGGGCGAACCCACCTCATTGGCTATGGCGGCGTCGCCTGTCACGTCCTGTTTGAGTGGGA
TAAACGCCACGATGAGTTTGATCTCGCTATACTGGAGAAAGCATGGAACCAGCTCATCGCTCGCCACGATATGTTGCGTA
TGGTGGTTGATGCCGACGGGCAGCAGCGAGTCCTGGCGACAACGCCTGAGTATCACATCCCTCGCGACGATCTGCGCGCG
CTTTCCCCGGAAGAACAGCGTATAGCGCTGGAAAAACGGCGGCATGAACTGAGCTATCGCGTTTTGCCTGCCGACCAGTG
GCCTCTTTTTGAGCTGGTGGTCAGCAAAATCGATGATTGCCATTACCGCCTGCACATGAACCTCGACCTTTTGCAGTTTG
ATGTGCAGAGTTTTAAAGTCATGATGGACGATCTGGCGCAAGTCTGGCGCGGCGAAACACTGGCGCCGCTCGATATCACC
TTCCGTGATTATGTGATGGCTGAACAGGCACGCCGACAGACATCGGCATGGCACGATGCCTGGGATTACTGGCAGGAAAA
ACTGCCGCAACTGCCCTTAGCGCCAGAGCTGCCGGTGGTTGAAACACCCCCGGAAACGCCACACTTCACCACTTTCAAAT
CGACGATCGGCAAGAAAGAATGGCAGGCCGTGAAACAGCGCTGGCAGCAGCAAGGCGTCACACCGTCTGCCGCGCTGCTC
ACGTTGTTTGCCGCCACCCTTGAGCGCTGGAGCCGCACCACGGCATTTACGCTGAACCTGACGTTCTTCAATCGTCAGCC
GATCCATCCGCAAATCAACCAGTTGATTGGTGATTTTACCTCCGTCACGCTGGTTGATTTTAACTTCTCAACGCCGGTGA
CGTTGCAAGAGCAGATGCAACAGACCCAACAGCGCCTCTGGCAAAACATGGCGCACAGTGAAATGAACGGTGTTGAGGTG
ATCCGTGAGCTGGGCCGCCTGCGCGGGTCACAACGTCAACCGCTGATGCCGGTGGTGTTTACCAGTATGCTGGGGATGAC
GCTGGAAGGCATGACTATCGATCAGGCAATGAGCCATCTGTTCGGCGAACCCTGCTATGTGTTCACGCAAACGCCGCAGG
TCTGGCTGGATCATCAGGTCATGGAGAGCGACGGCGAGTTGATGTTTAGCTGGTACTGCATGGACAACGTGCTGGAACCC
GGCGCTGCCGAGGCAATGTTTAATGACTATTGCGCTATCCTGCAAGCCGTCATTGCCGCTCCTGAAAGCCTGAAGACTCT
CGCCAGCGGCATCGCCGGGCACATTCCCCGTCGACGCTGGCCGCTGAACGCACAGACAGACTACGACCTGCGGGATATTG
AGCAGGCGACGCTCGAATACCCCGGCATCCGGCAGACCAGAGCGGAAATAGCCGAACAAGGTGCGTTGACGCTGGATATC
GTGATGGTCGATGATCCGTCGCCATCAGCGGCGACGCCCGATGAGCACGATCTCGCCCAGCTGGCGCTGGCGCTGCCGTT
GCCTGCGCAGGTGCAGCTTGATGAACTGGAGGCGACCTGGCGCTGGCTGGAGGCGCGTGCGCTACAGGGGATCGCGGCTA
CGCTAAATCGTCACGGCCTGTTTACCACGCCGGAGATCGCCCATCGCTTTAGCGCAATAGTACAGGCGCTGTCCGCGCAA
GCATCTCACCAGCGTCTGCTGCGCCAGTGGCTACAGTGTCTGACGGAAAGAGAGTGGTTAATCCGCGAGGGTGAAAGCTG
GCGCTGCCGCGTTCCGCTCAGCGAGATTCCTGAGCCTCAGGAAGCGTGCCCGCAAAGCCAATGGAGCCAGGCGCTGGCGC
AGTATCTGGAAACCTGCATCGCCCGGCACGACGCCCTCTTCTCCGGGCAGTGTTCACCGCTGGAATTGCTGTTCAACGAG
CAGCATCGCGTGACCGACGCGCTGTATCGCGACAACCCCGCCAGCGCCTGTCTGAATCGCTATATCGCGCAGATTGCCGC
CTTGTGCGGCGCAGAACGGATTCTGGAGGTTGGCGCCGGAACCGCAGCCACTACCGCGCCGGTACTGCAGGCCACGCGGA
ACACGCGGCAGTCGTACCACTTCACGGACGTCTCCGCACAGTTCCTCAATGACGCCAGAGCCCGTTTCCATGATGAATCG
CGGGTGTCTTATGCCTTATTCGACATCAACCAGCCGCTGGATTTCACCGCCCACCCGGAGGCGGGTTACGACCTGATCGT
TGCCGTCAATGTGCTCCACGACGCCAGCCATGTCGTCCAGACGTTGCGCAGATTAAAACTGTTGCTGAAAGCCGGCGGAC
GTTTGCTGATCGTTGAAGCGACGGAGCGAAACAGCGTATTCCAGCTGGCGAGCGTGGGCTTTATTGAGGGATTAAGCGGA
TACCGCGATTTCCGCCGCCGGGATGAGAAACCGATGCTCACCCGCTCCGCATGGCAGGAGGTTCTCGTTCAGGCCGGGTT
TACAAACGAGCTGGCGTGGCCCACGCAGGAATCGTCGCCGCTGCGCCAGCATCTGCTGCTGGCGCGTTCGCCTGGCGTAA
ATCGCCCGAATAAAGAAGCCGTGAGCCGCTATTTACAGCAGCGCTTTGGCACCGGTCTGCCCGTTTTACAGATCCGGCAA
AGAGAAACGTTATTTACGCCGCAGCATGCCCCGTCTGATGCGCTGATTGAGCCACCCAAACCCACGCCAGTTGCCGGGGG
GAATCCGGCGCTGGAAAAACAGGTGGCTGAACTCTGGCAATCGCTGCTGTCTCGCCCCGTGGCAAGGCATCACGACTTTT
TCGAACTGGGCGGCGACAGCCTGATGGCGACAAGGATGGTCGCGCAGCTAAACCGGAGAGGGATTGCCAGGGCTAACCTT
CAGGATCTGTTCAGCCATTCGACGCTGAGCGACTTCTGCGCCCATCTACAGGCGGCTACGTCAGGAGAGGACAACCCGGT
TCCCCTTTGTCAGGGCGACGGCGAGGAAACCCTGTTTGTCTTCCACGCTTCGGACGGCGATATCAGCGCCTGGCTACCGC
TCGCCAGCGCGCTGAACAGGCGCGTTTTCGGCCTGCAAGCAAAATCGCCGCTGCGCTTTGCCACGCTTGACCAGATGATC
GATGAGTATGTCGGGTGCATCCGTCGCCAGCAGCCTCACGGCCCTTATGTGCTGGCGGGTTGGTCGTATGGCGCGTTTCT
CGCGGCGGGTGCCGCACAGCGCCTGTACGCCAAAGGCGAGCAGGTTCGCATGGTGTTAATCGATCCCGTGTGCCGACAGG
ATTTCTGTTGCGACAACCGGGCGGCCCTGCTGCGCCTGTTAGCCGAAGAACAAACGCCTCTGGCGCTACCCGAACATTTC
GACCAGCAGACGCCCGACAGCCAGCTTGCTGACTTTATCGGTCTCGCTAAAACGGCCGGTATGGTGTCGCAAAACCTGAC
GCTGCAAGCGGCAGAAACGTGGCTCGACAACATCGCGCATCTGCTGCGTTTACTGACTGAGCATACGCCGGGCGAAAGCG
TTCCGGTCCCCTGTCTCATGGTGTATGCCGCCGGGAGACCCGCGCGCTGGACGCCAGCAGAAACCGAGTGGCAGGGCTGG
ATAAACAACGCCGACGGCTATGTGATTGAAGCCAGCCACTGGCAAATCATGATGGAAGCTCCCCACGTTCAGGCTTGTGC
GCAACACATTACGCGCTGGCTTTGCGCAACCTCAACGCAATTGGAGAACACGTTATGA

Protein sequence :
MDNLRFSSAPTADSIDASIAQHYPDCEPVAVIGYACHFPESPDGETFWQNLLEGRECSRRFTREELLAVGLDAAIIDDPH
YVNIGTVLDNADCFDATLFGYSRQEAESMDPQQRLFLQAVWHALEHAGYAPGAVPHKTGVFASSRMSTYPGREALNVTEV
AQVKGLQSLMGNDKDYIATRAAYKLNLHGPALSVQTACSSSLVAVHLACESLRAGESDMAVAGGVALSFPQQAGYRYQPG
MIFSPDGHCRPFDASAEGTWAGNGLGCVVLRRLRDALLSGDPIISVILSSAVNNDGNRKVGYTAPSVAGQQAVIEEALML
AAIDDRQVGYIETHGTGTPLGDAIEIEALRNVYAPRPQDERCALGSVKSNMGHLDTAAGIAGLLKTVLAVSRGQIPPLLN
FHTPNPALKLEESPFTIPVSAQAWQDEMRYAGVSSFGIGGTNCHMIVASLPDALNARLPNTDGGRKSTALLLSAASDSAL
RRLATDYAGALRDNADASSLAFTALHARRLDLPFRLAAPLNRETAAALSAWAGEKSGELVYSGHGASGKQVWLFTGQGSH
WRTMGQTMYRHSTAFADTLDRCFSACSEMLTPSLREAMFNPDSAQLDNMAWAQPAIVAFEIAMAAHWRAEGLKPDFAIGH
SVGEFAAAVVCGHYTIEQVMPLVCRRGALMQQCASGAMVAVFADEDTLMPLARQFELDLAANNGTQHTVFSGPEARLAIF
CATLSQHDIDYRRLSVTGAAHSALLEPILDRFQAACAGLHAEPGQIPIISTLTADVIDESTLNQADYWRRHMRQPVRFIQ
SIQMAHQLGARVFLEMGPDAQLVASGQREYRDNAYWIASARRNKEANDVLNQALLQLYAAGVALPWANLLAGDGQRIAAP
CYPFDTERYWKERVSPACEPADAALSAGLEVASRAAAALDHPRLEALKQCATRLHAIYVDQLVQRCTGDAIENGVDAMTI
MRRGRLLPRYQQLLQRLLNNCVVDGDYRCTDGRYARARPIEHQQRESLLTELAGYCEGFQAIPDTIARAGDRLYEMMSGA
EEPVAIIFPQSASDGVEVLYQEFSFGRYFNQIAAGVLRGIVQTRQPRQPLRILEVGGGTGGTTAWLLPELSGVPALEYHF
TDISALFTRRAQQKFADYDFVKYSELDLEKEAQSQGFLAQSYDLIVAANVIHATRHIGRTLDNLRPLLKPGGRLLMREIT
QPMRLFDFVFGPLVLPLQDLDAREGELFLTTAQWQQQCRHAGFSKVAWLPQDDSPTAGMSEHIILATLPGQAVSAVTFTA
PSEPVLGQALTDNGDYLADWSDCAGQPEQFNARCQEAWRLLSQRHGDALPVEPPLAAAPEWLGEVRLSWQNEAFSRGQMR
VEARHPDGKWLPLSPAAPLPAPQTHYQWRWTPLNIASADHPLTFNFSAGMLARRDELAQYGIIHDPHASSRLMIVEESED
TLALAEKVIAALTASAAGLIVVTRRAWRVEENEALSASHHALWALLRVAANEQPERLIAAIDLAENTPWETLHQGLSAVS
LSQRWLAARGNTLWLPSLAPNTGCAAELPANVFTGDNRWHLVTGAFGGLGRLAVNWLREKGARRIALLAPRVDESWLRDV
EGGQTRVCRCDVGDTGQMTTVLDDLAANGGIAGAIHAAGVLADAPLQELDDHQRAAVFAVKAQAANQLLQTLRNHDGRYL
ILYSSAAATLGAPGQSAHALACGYLDGLAQQFSTLDAPKTLSVAWGAWGESGRAATPEMLATLANRGMGALSDAEGCWHL
EQAVMRGTPWRLAMRVFTDKMPPLQQALFNISATEKAATPVIPPADDNAFNGSLSDETAVMAWLKKRIAVQLRLSNPASL
RPNQDLLQLGMDSLLFLELSSDIQHYLGVRINAERAWQDLSPHGLTQLICSKPETTPAASQPEVLRHDADERYAPFPLTP
IQHAYWLGRTHLIGYGGVACHVLFEWDKRHDEFDLAILEKAWNQLIARHDMLRMVVDADGQQRVLATTPEYHIPRDDLRA
LSPEEQRIALEKRRHELSYRVLPADQWPLFELVVSKIDDCHYRLHMNLDLLQFDVQSFKVMMDDLAQVWRGETLAPLDIT
FRDYVMAEQARRQTSAWHDAWDYWQEKLPQLPLAPELPVVETPPETPHFTTFKSTIGKKEWQAVKQRWQQQGVTPSAALL
TLFAATLERWSRTTAFTLNLTFFNRQPIHPQINQLIGDFTSVTLVDFNFSTPVTLQEQMQQTQQRLWQNMAHSEMNGVEV
IRELGRLRGSQRQPLMPVVFTSMLGMTLEGMTIDQAMSHLFGEPCYVFTQTPQVWLDHQVMESDGELMFSWYCMDNVLEP
GAAEAMFNDYCAILQAVIAAPESLKTLASGIAGHIPRRRWPLNAQTDYDLRDIEQATLEYPGIRQTRAEIAEQGALTLDI
VMVDDPSPSAATPDEHDLAQLALALPLPAQVQLDELEATWRWLEARALQGIAATLNRHGLFTTPEIAHRFSAIVQALSAQ
ASHQRLLRQWLQCLTEREWLIREGESWRCRVPLSEIPEPQEACPQSQWSQALAQYLETCIARHDALFSGQCSPLELLFNE
QHRVTDALYRDNPASACLNRYIAQIAALCGAERILEVGAGTAATTAPVLQATRNTRQSYHFTDVSAQFLNDARARFHDES
RVSYALFDINQPLDFTAHPEAGYDLIVAVNVLHDASHVVQTLRRLKLLLKAGGRLLIVEATERNSVFQLASVGFIEGLSG
YRDFRRRDEKPMLTRSAWQEVLVQAGFTNELAWPTQESSPLRQHLLLARSPGVNRPNKEAVSRYLQQRFGTGLPVLQIRQ
RETLFTPQHAPSDALIEPPKPTPVAGGNPALEKQVAELWQSLLSRPVARHHDFFELGGDSLMATRMVAQLNRRGIARANL
QDLFSHSTLSDFCAHLQAATSGEDNPVPLCQGDGEETLFVFHASDGDISAWLPLASALNRRVFGLQAKSPLRFATLDQMI
DEYVGCIRRQQPHGPYVLAGWSYGAFLAAGAAQRLYAKGEQVRMVLIDPVCRQDFCCDNRAALLRLLAEEQTPLALPEHF
DQQTPDSQLADFIGLAKTAGMVSQNLTLQAAETWLDNIAHLLRLLTEHTPGESVPVPCLMVYAAGRPARWTPAETEWQGW
INNADGYVIEASHWQIMMEAPHVQACAQHITRWLCATSTQLENTL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
irp1 YP_070123.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 98
irp1 NP_993006.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 98
irp1 YP_002346901.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 98
irp1 NP_669707.1 HMWP1 nonribosomal peptide/polyketide synthase Virulence HPI Protein 0.0 98
irp1 CAA21391.1 - Virulence HPI Protein 0.0 98
irp1 YP_001006816.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 97
irp1 CAA73127.1 HMWP1 protein Virulence HPI Protein 0.0 97
irp1 YP_853076.1 yersiniabactin biosynthetic protein Virulence PAI IV APEC-O1 Protein 0.0 97