Gene Information

Name : ETEC_2082 (ETEC_2082)
Accession : YP_006115648.1
Strain : Escherichia coli ETEC H10407
Genome accession: NC_017633
Putative virulence/resistance : Virulence
Product : non-ribosomal peptide synthase (yersiniabactin siderophore biosynthetic protein)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2235771 - 2245253 bp
Length : 9483 bp
Strand : +
Note : -

DNA sequence :
TTGCGCTTCTCTTCTGCGCCGACAGCAGATTCCATTGATGCATCGATCGCTCAACACTACCCGGACTGCGAACCTGTCGC
GGTTATCGGCTACGCCTGCCATTTTCCTGAATCGCCGGATGGCGAAACGTTCTGGCAAAATCTGCTGGAAGGTCGTGAAT
GCAGCCGACGCTTTACGCGCGAAGAGCTTCTGGCCGTCGGTCTGGATGCCGCCATCATTGACGATCCTCATTATGTCAAT
ATCGGTACGGTGTTAGACAACGCCGACTGCTTCGACGCCACCCTGTTTGGCTATTCGCGACAGGAAGCGGAGTCGATGGA
CCCGCAGCAGCGCCTGTTTTTGCAGGCGGTCTGGCATGCGCTGGAACATGCCGGTTATGCCCCCGGCGCCGTCCCCCATA
AGACCGGCGTTTTCGCCTCTTCCCGGATGAGTACCTACCCCGGTCGCGAAGCATTGAACGTGACAGAAGTCGCGCAGGTA
AAAGGTCTGCAATCTCTGATGGGCAATGATAAAGACTATATTGCCACCCGCGCCGCGTACAAACTCAACCTGCACGGCCC
GGCGTTATCGGTACAGACCGCCTGCTCCAGCTCGCTGGTTGCCGTGCATCTGGCCTGTGAAAGCCTGCGCGCAGGCGAAT
CCGATATGGCGGTGGCCGGCGGCGTGGCGCTCTCTTTCCCCCAGCAGGCAGGCTACCGCTACCAGCCCGGAATGATTTTC
TCTCCTGATGGTCACTGTCGTCCCTTTGACGCCTCGGCTGAGGGCACCTGGGCCGGTAACGGTCTCGGCTGCGTGGTGCT
GCGTCGCCTGAGAGACGCGCTGCTGTCAGGCGATCCGATTATCTCGGTGATCCTCTCCAGCGCGGTCAACAACGACGGCA
ACAGAAAGGTCGGCTATACCGCCCCTTCCGTCGCAGGGCAACAGGCAGTCATCGAAGAGGCGTTAATGCTGGCGGCCATC
GACGACAGGCAGGTAGGTTACATTGAAACCCACGGCACCGGCACACCGCTGGGCGACGCGATTGAAATTGAAGCGTTACG
CAACGTCTATGCGCCTCGCCCGCAGGATCAGCGCTGTGCGCTCGGTTCCGTGAAAAGTAACATGGGCCATCTGGATACCG
CGGCGGGCATTGCCGGACTGCTGAAAACCGTTCTGGCAGTCAGTCGCGGGCAAATTCCTCCCTTACTGAATTTTCACACC
CCCAACCCGGCGCTGAAACTTGAAGAGAGCCCCTTTACCATACCGGTGTCGGCACAGGCATGGCAGGACGAAATGCGCTA
TGCGGGCGTCTCCTCCTTTGGTATTGGCGGCACCAACTGCCATATGATCGTCGCCTCGCTGCCCGACGCGCTCAACGCGC
GCCTCCCCAATACGGATAGCGGCAGAAAAAGTACCGCGCTGCTGCTCAGCGCCGCCAGCGACAGCGCGTTGCGGCGGCTG
GCGACGGATTATGCCGGGGCGCTGAGAGAGAATGCGGATGCCAGCTCTCTGGCCTTCACAGCCCTGCACGCGCGCCGTCT
CGATCTCCCCTTCCGCCTGGCGGCGCCATTAAACCGTGAAACCGCCGAGGCGCTCAGCGCCTGGGCCGGTGAGAAATCGG
GGGCGCTGGTTTACAGCGGCCACGGCGCCAGCGGCAAGCAGGTGTGGCTGTTTACCGGCCAGGGCTCGCACTGGCGCACT
ATGGGTCAAACGATGTACCATCACTCAACGGCGTTTGCCGACACGCTGGATCGCTGTTTTTCCGCCTGTAGCGAAATGCT
CACGCCGTCACTGCGCGAAGCGATGTTTAACCCCGATTCGGCGCAGCTGGACAATATGGCCTGGGCGCAGCCGGCGATTG
TCGCGTTTGAAATCGCGATGGCGGCGCACTGGCGTGCTGAAGGACTGAAGCCAGACTTCGCCATTGGGCATTCCGTCGGT
GAATTTGCCGCTGCCGTTGTCTGCGGACACTATACGATTGAACAGGTCATGCCACTGGTTTGTCGGCGCGGCGCGCTAAT
GCAGCAGTGCGTAAGCGGCGCGATGGTGGCGGTATTTGCAGACGAAGACACGCTGATGCCGCTGGCTCGCCAGTTTGAGC
TGGATCTCGCCGCCAACAACGGTACGCAACATACGGTATTTTCCGGGCCGGAAGCCCGTCTCGCGGTATTTTGCGCCACG
CTCTCGCAGCATGACATTAACTATCGTCGCCTGAGCGTAACCGGTGCGGCGCACTCCGCTTTACTGGAGCCGATACTCGA
TCGGTTCCAGGACGCCTGCGCGGGACTGCACGCGGAGCCGGGGCAAATACCGATTATTTCCACGCTCACCGCCGACGTCA
TTGATGAGTCAACGCTCAACCAGGCGGATTACTGGCGCCGACACATGCGCCAGCCGGTGCGTTTTATCCAGAGTATTCAG
GTGGCGCATCAGCTCGGCGCCCGCGTTTTTCTGGAGATGGGGCCCGATGCCCAGTTGGTTGCTTGCGGGCAGCGCGAATA
CCGCGATAACGCATACTGGATAGCCAGCGCCCGGCGTAACAAAGAGGCGAGCGATGTCCTCAATCAGGCCCTGCTCCAGC
TTTACGCTGCCGGCGTCGCCCTACCGTGGGCCGACCTGCTGGCGGGCGATGGACAACGTATCGCTGCGCCATGTTATCCG
TTTGATACTGAACGTTACTGGAAAGAGCGCGTCTCCCCGGCCTGCGAGCCTGCCGACGCAGCGCTGTCTGCCGGGCTGGA
GGTGGCGAGTCGCGCCGCGACAGCGCTCGATCTCCCTCGCCTGGAAGCGCTTAAACAGTGCGCCACGCGACTGCACGCCA
TCTACGTCGATCAACTGGTACAACGCTGTACCGGCGATGCCATTGAGAACGGCGTGGACGCCATGACCATCATGCGCCGT
GGACGTCTGCTGCCCCGCTACCAGCAGCTACTCCAGCGCCTGCTGAATAACTGCGTGGTCGACGGCGATTACCGCTGCAC
CGACGGGCGATACGTCCGCGCCCGCCCCATTGAACATCAACAGCGGGAATCACTGCTGACGGAACTTGCCGGTTATTGTG
AAGGTTTTCAGGCTATTCCCGACACCATCGCCCGTGCCGGCGATCGGTTATATGAAATGATGAGCGGCGCGGAAGAACCG
GTGGCGATTATCTTCCCGCAAAGCGCCTCCGACGGCGTGGAAGTGCTGTATCAGGAATTCAGCTTTGGCCGCTATTTCAA
CCAAATCGCCGCCGGGGTATTACGCGGCATTGTCCAGACGCGTCAGCCCCGCCAGCCGTTGCGTATTCTTGAAGTTGGCG
GCGGAACCGGCGGCACCACCGCGTGGCTGCTGCCGGAACTCAACGGCGTTCCGGCACTGGAGTACCATTTCACCGATATC
TCGGCGCTGTTCACCCGTCGCGCCCAGCAGAAATTCGCCGACTACGATTTTGTGAAGTATAGCGAGCTGGATCTCGAAAA
AGAGGCGCAGTCTCAGGGTTTCCAGGCACAGTCTTACGATCTTATCGTGGCAGCGAACGTGATTCACGCCACCCGCCATA
TTGGCCGCACGCTCGATAATCTGCGCCCCCTGCTCAAGCCAGGCGGGCGCCTGCTGATGCGCGAAATCACCCAGCCAATG
CGTCTGTTTGACTTCGTTTTCGGCCCGCTGGTTCTTCCGCTACAGGATCTCGACGCCCGCGAAGGTGAGTTATTCCTCAC
CACCGCTCAGTGGCAACAACAGTGCCGCCACGCCGGATTCAGCAAAGTGGCGTGGCTACCGCAGGATGGCAGCCCGAACG
CCGGGATGAGCGAACATATCATTCTCGCCACGCTGCCCGGTCAGGCGGTTAGCGCCGTAACATTCACCGCGCCATCAGAA
CCCGTGTTGGGGCAGGCGCTGACGGATAACGGTGATTATCTCGCCGACTGGTCTGATTGCGCAGGTCAGTCCGAACAGTT
TAACGCCCGCTGGCAGGAGGCATGGCGTCTGCTTTCACAGCGTCATGGCGACGCTCTGCCTGTGGAACCGCCCCCCGTCG
CCGCCCCGGAGTGGCTGGGGAAGGTTCGCTTAAGCTGGCAAAACGAAGCCTTTTCCCGCGGTCAGATGCGCGTTGAAGCC
CGTCATCCTGCTGGCGAGTGGCTGCCGCTATCGCCCGCCGCGCCTCTTCCTGCGCCGCAAACGCATTATCAATGGCGCTG
GACGCCCCTCAACGTCGCCAGCATTGACCATCCGCTTACCTTTAGCTTCAGCGCCGGTACGCTTGCGCGCAGCGACGAGC
TGGCGCAATACGGCATCATTCACGATCCGCACGCCTCTTCACGACTGATGATTGTTGAGGAGAGCGAGGATACGCTGGCC
TTAGCGGAGAAAGTGATAGCAGCGCTCACCGCCAGCGCAGCCGGATTGATTGTGGTTACTCGCCGCGCGTGGCGAGTCGA
GGAAAATGAAGCACTCTCTGCATCCCATCACGCGCTATGGGCCTTGCTTCGCGTCGCGGCCAACGAACAGCCGGAACGGT
TGCTTGCCGCCATCGATCTCGCCGAAAACACCCCGTGGGAAACGCTGCATCAAGGGTTGAGCGCAGTCTCACTACCACAG
CGCTGGCTCGCCGCACGGGGTGACACCCTTTGGCTTCCTTCACTGTCGCCCAATACGGGATGCGCCGCTGAATTACCGGC
AAACGTGTTTACCGGCGATAGCCGCTGGCATCTGGTGACCGGAGCGTTTGGCGGATTAGGCCGCCTTGCCGTGAACTGGC
TCAGAGAAAAAGGGGCGCGACGCATCGCCCTGCTGGCGCCGCGCGTGGATGAGTCATGGCTACGCGACGTGGAGGGCGGG
CAGACGCGCGTCTGCCGTTGTGATGTGGGCGATGCCGGGCAACTGGCCACGGTTCTTGACGATCTGGCGGCCAACGGCGG
CATTGCCGGAGCGATTCATGCCGCTGGCGTATTGGCTGACGCGCCCTTGCAGGAGCTTGATGACCACCAGCTGGCTGCCG
TTTTCGCGGTAAAAGCGCAGGCGGCAAGCCAGCTGTTGCAAACCCTGCGCAACCACGACGGACGCTATCTTATTCTCTAC
TCTTCCGCTGCCGCCACCCTCGGCGCGCCGGGTCAGAGCGCCCATGCGCTGGCCTGCGGCTACCTGGACGGGCTGGCCCA
GCAGTTTTCCACCCTTGATGCGCCGAAAACGCTCTCTGTCGCCTGGGGCGCATGGGGAGAAAGCGGTCGGGCGGCCACGC
CGGAAATGCTGGCGACGCTCGCCAGCCGAGGTATGGGCGCGTTAAGCGATGCCGAAGGCTGCTGGCACCTGGAACAGGCG
GTGATGCGCGGCGCCCCGTGGCGACTGGCGATGCGCGTTTTTACCGACAAAATGCCCCCGTTACAACAGGCTCTGTTTAA
CATCAGCGCCACAGAAAAAGCCGCAACGCCGGTCATTCCTCCTGCTGATGACAACGCCTTTAACGGCAGCCTGAGCGATG
AAACAGCGGTGATGGCATGGCTGAAAAAGCGGATTGCGGTTCAGCTAAGGTTGAGCGATCCGGCGTCACTGCATCCAAAC
CAGGATCTGTTGCAACTCGGCATGGACTCGCTGCTCTTCCTTGAACTCAGTAGCGATATTCAGCACTACCTGGGTGTACG
CATCAATGCGGAACGGGCGTGGCAGGATCTGTCTCCTCATGGACTCACGCAGCTTATCTGTTCTAAGCCAGAGGCGACGC
CTGCCGCTTCGCAGCCGGAAGTGTTGCGGCACGACGCCGACGAGCGTTATGCGCCCTTCCCTTTGACGCCCATTCAGCAC
GCCTACTGGCTGGGGCGAACCCACTTCATTGGCTATGGCGGCGTCGCCTGTCACGTCCTGTTTGAGTGGGATAAACGCCA
CGATGAGTTCGATCTCGCCATACTGGAGAAAGCATGGAACCAGCTCATCGCACGCCACGATATGTTGCGTATGGTGGTTG
ATGCCGACGGGCAGCAGCGAATCCTGGCGACAACGCCGGAGTATCACATCCCGCGTGACGATCTGCGCGCGCTTTCCCCG
GAAGAACAGCGCATCGCGCTGGAAAAACGGCGGCATGAACTGAGCTATCGCGTTTTGCCTGCCGACCAGTGGCCTCTTTT
TGAGCTGGTGGTCAGCGAAATCGACGATTGCCATTACCGTCTGCATATGAACCTCGACCTTTTGCAGTTTGATGTGCAGA
GTTTTAAAGTCATGATGGACGACCTGGCGCAGGCCTGGCGCGGTGAAACGCTGGCACCGCTCGCTATTACCTTCCGTGAT
TATGTGATGGCTGAACAGGCGCGCCGACAGACATCGGCATGGCACGATGCCTGGGATTACTGGCAGGAAAAACTGCCGCA
ACTGCCCTTAGCGCCAGAGCTGCCGGTGGTTGAGACGCCCCCGGAAACGCCACACTTCACCACCTTCAAATCGACGATCG
GCAAGACAGAATGGCAGGCCGTGAAACAGCGCTGGCAGCAGCAAGGCGTCACACCGTCTGCCGCGCTGCTCACGCTGTTT
GCCGCCACCCTTGAGCGCTGGAGCCGTACCACAACATTTACGCTGAACCTGACGTTCTTCAATCGCCAGCCGATCCATCC
GCAAATCAACCAGTTGATTGGTGATTTTACCTCCGTCACGCTGGTTGATTTTAACTTCTCAGCGCCGGTGACGTTGCAAG
AGCAGATGCAACAGACCCAACAGCGCCTCTGGCAAAACATGGCGCACAGTGAAATGAACGGTGTTGAGGTGATCCGTGAG
CTGGGCCGCCTGCGCGGATCACAACGTCAACCGCTGATGCCGGTAGTGTTTACCAGTATGCTGGGGATGACGCTGGAAGG
CATGACTATCGATCAGGCGATGAGCCATCTGTTCGGCGAACCCTGCTATGTATTCACGCAAACGCCGCAGGTCTGGCTGG
ATCATCAGGTCATGGAGAGCGACGGCGAGTTGATGTTTAGCTGGTACTGCATGGACAACGTGCTGGAACCCGGCGCTGCC
GAGGCGATGTTTAATGACTATTGCGCCATCCTGCAAGCCGTCATCGCCGCCCCTGAAAGCCTGAAGACTCTCGCCAGCGG
CATCGCCAGGCACATTCCCCGCCGACGCTGGCCGCTGAACGCGCAGGCGGACTACGACCTGCGGGATATTGAGCAGGCGA
CGCTCGAATACCCCGGCATCCGGCAGGCCAGAGCGGAAATAACCGAACAGGGCGCGTTGACGCTGGATATCGTGATGGCC
GACGATCCTTCGCCATCAGCGGCGATGCCTGATGAGCACGAACTTACCCAACTGGCGCTGCCGTTGCCTGAGCAGGCGCA
GCTTGATGAGCTGGAGGCGACCTGGCGCTGGCTGGAGGCGCGTGCGCTACAGGGGATCGCGGCTACGCTAAATCGTCACG
GCCTGTTTACCACGCCGGAGATCGCCCATCGCTTTAGCGCAATAGTACAGGCGCTGTCCGCGCAAGCGTCTCACCAGCGT
CTGCTGCGCCAGTGGCTACAGTGTCTGACGGAAAGAGAGTGGTTAATCCGCGAAGGTGAAAGCTGGCGCTGCCGCATTCC
GCTCAGCGAGATTCCTGAGCCTCAGGAAGCGTGCCCGCAAAGCCAATGGAGCCGGGCGCTGGCGCAGTATCTGGAAACCT
GCATCGCCCGGCACGACGCCCTTTTCTCCGGGCAGTGTTCTCCGCTGGAATTGCTGTTCAACGAGCAGCATCGCGTTACC
GACGCGCTGTATCGCGACAACCCCGCCAGCGCCTGTCTGAATCGCTATACCGCGCAGATTGCCGCCTTGTGCAGCGCAGA
ACGGATTCTGGAGGTTGGCGCCGGAACCGCAGCCACTACCGCGCCGGTGCTGAAGGCCACGCGGAACACGCGACAGTCGT
ACCACTTCACGGACGTCTCCGCGCAGTTCCTCAATGACGCCAGAGCCCGTTTCCATGATGAATCGCAGGTGTCTTATGCC
TTGTTCGACATCAACCAGCCGCTGGATTTCACCGCCCACCCGGAGGCGGGTTACGACCTGATCGTTGCCGTCAATGTGCT
CCACGACGCCAGCCATGTCGTCCAGACGTTGCGCAGATTAAAACTGTTGCTGAAAGCCGGCGGACGTTTGCTGATCGTTG
AAGCGACGGAGCGAAACAGCGTATTCCAGCTGGCGAGCGTGGGCTTTATTGAGGGATTAAGCGGATACCGCGATTTCCGC
CGCCGGGATGAGAAACCGATGCTCACCCGCTCCGCATGGCAGGAGGTTCTCGTTCAGGCCGGGTTTGCAAACGAGCTGGC
GTGGCCCGCGCAGGAATCGTCGCCGCTGCGCCAGCATCTGCTGGTAGCGCGTTCGCCTGGCGTAAATCGCCCGGATAAAA
AAGCCGTGAGCCGCTATTTACAGCAGCGCTTTGGCACCGGTCTGCCCATTTTACAGATCCGGCAAAGAGAAGCGTTATTT
ACGCCGCTGCATGCCCCGTCTGATGCGCCGACTGAGCCAGCCAAACCCACGCCAGTTGCCGGGGGGAATCCGGCGCTGGA
AAAACAGGTGGCTGAACTCTGGCAATCGCTGCTGTCTCGCCCCGTGGCAAGGCATCACGACTTTTTCGAACTGGGCGGCG
ACAGCCTGATGGCGACAAGGATGGTCGCGCAGCTAAACCGGAGAGGGATTGCCAGGGCTAACCTTCAGGATCTGTTCAGC
CATTCGACGCTGAGCGACTTCTGCGCCCATCTACAGGCGGCTACGTCAGGAGAGGACAACCCGATACCCCTTTGCCAGGG
CGACGGTGAGGAAACCCTGTTTGTCTTCCACGCTTCAGACGGCGATATCAGCGCCTGGCTGCCGCTCGCTAGCGCGTTGA
ACAGGCGCGTTTTCGGCCTGCAAGCAAAATCGCCGCAGCGCTTTGCCACGCTCGACCAGATGATCGATGAGTATGTCGGG
TGCATCCGTCGTCAGCAGCCTCACGGCCCTTATGTGCTGGCGGGTTGGTCGTATGGCGCGTTTCTTGCGGCGGGCGCCGC
ACAGCGCCTGTACGCCAAAGGCAAGCAGGTTCGGATGGTGTTAATCGATCCCGTGTGCCGACAGGATTTCTGTTGCGAAA
ACCGGGCGGCCCTGCTGCGCCTGTTAGCCGAAGGACAAACGCCTCTGGCACTGCCCGAACATTTCGACCAGCAGACGCCC
GACAGCCAGCTTGCCGACTTTATCAGCCTCGCTAAAACGGCCGGTATGGTGTCGCAAAACCTGACGCTGCAAGCGGCAGA
AACGTGGCTCGACAACATCGCGCATCTGCTGCGTTTACTGACTGAGCATACGCCGGGCGAAAGCGTTCCGGTCCCCTGTC
TCATGGTGTATGCCGCCGGGAGACCCGAGCGCTGGACGCCAGCAGAAACCGAGTGGCAGGGCTGGATAAACAACGCCGAC
GACGCTGTGATTGAAGCCAGCCACTGGCAAATCATGATGGAAGCCCCCCACGTTCAGGCTTGTGCGCAACACATTACGCG
CTGGCTTTGCGCAACCTCAACGCAACCGGAGAACACGTTATGA

Protein sequence :
MRFSSAPTADSIDASIAQHYPDCEPVAVIGYACHFPESPDGETFWQNLLEGRECSRRFTREELLAVGLDAAIIDDPHYVN
IGTVLDNADCFDATLFGYSRQEAESMDPQQRLFLQAVWHALEHAGYAPGAVPHKTGVFASSRMSTYPGREALNVTEVAQV
KGLQSLMGNDKDYIATRAAYKLNLHGPALSVQTACSSSLVAVHLACESLRAGESDMAVAGGVALSFPQQAGYRYQPGMIF
SPDGHCRPFDASAEGTWAGNGLGCVVLRRLRDALLSGDPIISVILSSAVNNDGNRKVGYTAPSVAGQQAVIEEALMLAAI
DDRQVGYIETHGTGTPLGDAIEIEALRNVYAPRPQDQRCALGSVKSNMGHLDTAAGIAGLLKTVLAVSRGQIPPLLNFHT
PNPALKLEESPFTIPVSAQAWQDEMRYAGVSSFGIGGTNCHMIVASLPDALNARLPNTDSGRKSTALLLSAASDSALRRL
ATDYAGALRENADASSLAFTALHARRLDLPFRLAAPLNRETAEALSAWAGEKSGALVYSGHGASGKQVWLFTGQGSHWRT
MGQTMYHHSTAFADTLDRCFSACSEMLTPSLREAMFNPDSAQLDNMAWAQPAIVAFEIAMAAHWRAEGLKPDFAIGHSVG
EFAAAVVCGHYTIEQVMPLVCRRGALMQQCVSGAMVAVFADEDTLMPLARQFELDLAANNGTQHTVFSGPEARLAVFCAT
LSQHDINYRRLSVTGAAHSALLEPILDRFQDACAGLHAEPGQIPIISTLTADVIDESTLNQADYWRRHMRQPVRFIQSIQ
VAHQLGARVFLEMGPDAQLVACGQREYRDNAYWIASARRNKEASDVLNQALLQLYAAGVALPWADLLAGDGQRIAAPCYP
FDTERYWKERVSPACEPADAALSAGLEVASRAATALDLPRLEALKQCATRLHAIYVDQLVQRCTGDAIENGVDAMTIMRR
GRLLPRYQQLLQRLLNNCVVDGDYRCTDGRYVRARPIEHQQRESLLTELAGYCEGFQAIPDTIARAGDRLYEMMSGAEEP
VAIIFPQSASDGVEVLYQEFSFGRYFNQIAAGVLRGIVQTRQPRQPLRILEVGGGTGGTTAWLLPELNGVPALEYHFTDI
SALFTRRAQQKFADYDFVKYSELDLEKEAQSQGFQAQSYDLIVAANVIHATRHIGRTLDNLRPLLKPGGRLLMREITQPM
RLFDFVFGPLVLPLQDLDAREGELFLTTAQWQQQCRHAGFSKVAWLPQDGSPNAGMSEHIILATLPGQAVSAVTFTAPSE
PVLGQALTDNGDYLADWSDCAGQSEQFNARWQEAWRLLSQRHGDALPVEPPPVAAPEWLGKVRLSWQNEAFSRGQMRVEA
RHPAGEWLPLSPAAPLPAPQTHYQWRWTPLNVASIDHPLTFSFSAGTLARSDELAQYGIIHDPHASSRLMIVEESEDTLA
LAEKVIAALTASAAGLIVVTRRAWRVEENEALSASHHALWALLRVAANEQPERLLAAIDLAENTPWETLHQGLSAVSLPQ
RWLAARGDTLWLPSLSPNTGCAAELPANVFTGDSRWHLVTGAFGGLGRLAVNWLREKGARRIALLAPRVDESWLRDVEGG
QTRVCRCDVGDAGQLATVLDDLAANGGIAGAIHAAGVLADAPLQELDDHQLAAVFAVKAQAASQLLQTLRNHDGRYLILY
SSAAATLGAPGQSAHALACGYLDGLAQQFSTLDAPKTLSVAWGAWGESGRAATPEMLATLASRGMGALSDAEGCWHLEQA
VMRGAPWRLAMRVFTDKMPPLQQALFNISATEKAATPVIPPADDNAFNGSLSDETAVMAWLKKRIAVQLRLSDPASLHPN
QDLLQLGMDSLLFLELSSDIQHYLGVRINAERAWQDLSPHGLTQLICSKPEATPAASQPEVLRHDADERYAPFPLTPIQH
AYWLGRTHFIGYGGVACHVLFEWDKRHDEFDLAILEKAWNQLIARHDMLRMVVDADGQQRILATTPEYHIPRDDLRALSP
EEQRIALEKRRHELSYRVLPADQWPLFELVVSEIDDCHYRLHMNLDLLQFDVQSFKVMMDDLAQAWRGETLAPLAITFRD
YVMAEQARRQTSAWHDAWDYWQEKLPQLPLAPELPVVETPPETPHFTTFKSTIGKTEWQAVKQRWQQQGVTPSAALLTLF
AATLERWSRTTTFTLNLTFFNRQPIHPQINQLIGDFTSVTLVDFNFSAPVTLQEQMQQTQQRLWQNMAHSEMNGVEVIRE
LGRLRGSQRQPLMPVVFTSMLGMTLEGMTIDQAMSHLFGEPCYVFTQTPQVWLDHQVMESDGELMFSWYCMDNVLEPGAA
EAMFNDYCAILQAVIAAPESLKTLASGIARHIPRRRWPLNAQADYDLRDIEQATLEYPGIRQARAEITEQGALTLDIVMA
DDPSPSAAMPDEHELTQLALPLPEQAQLDELEATWRWLEARALQGIAATLNRHGLFTTPEIAHRFSAIVQALSAQASHQR
LLRQWLQCLTEREWLIREGESWRCRIPLSEIPEPQEACPQSQWSRALAQYLETCIARHDALFSGQCSPLELLFNEQHRVT
DALYRDNPASACLNRYTAQIAALCSAERILEVGAGTAATTAPVLKATRNTRQSYHFTDVSAQFLNDARARFHDESQVSYA
LFDINQPLDFTAHPEAGYDLIVAVNVLHDASHVVQTLRRLKLLLKAGGRLLIVEATERNSVFQLASVGFIEGLSGYRDFR
RRDEKPMLTRSAWQEVLVQAGFANELAWPAQESSPLRQHLLVARSPGVNRPDKKAVSRYLQQRFGTGLPILQIRQREALF
TPLHAPSDAPTEPAKPTPVAGGNPALEKQVAELWQSLLSRPVARHHDFFELGGDSLMATRMVAQLNRRGIARANLQDLFS
HSTLSDFCAHLQAATSGEDNPIPLCQGDGEETLFVFHASDGDISAWLPLASALNRRVFGLQAKSPQRFATLDQMIDEYVG
CIRRQQPHGPYVLAGWSYGAFLAAGAAQRLYAKGKQVRMVLIDPVCRQDFCCENRAALLRLLAEGQTPLALPEHFDQQTP
DSQLADFISLAKTAGMVSQNLTLQAAETWLDNIAHLLRLLTEHTPGESVPVPCLMVYAAGRPERWTPAETEWQGWINNAD
DAVIEASHWQIMMEAPHVQACAQHITRWLCATSTQPENTL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
irp1 YP_070123.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 99
irp1 NP_993006.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 99
irp1 YP_002346901.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 99
irp1 NP_669707.1 HMWP1 nonribosomal peptide/polyketide synthase Virulence HPI Protein 0.0 99
irp1 CAA21391.1 - Virulence HPI Protein 0.0 99
irp1 YP_853076.1 yersiniabactin biosynthetic protein Virulence PAI IV APEC-O1 Protein 0.0 99
irp1 YP_001006816.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 98
irp1 CAA73127.1 HMWP1 protein Virulence HPI Protein 0.0 98