Gene Information

Name : RORB6_02500 (RORB6_02500)
Accession : YP_007872748.1
Strain : Raoultella ornithinolytica B6
Genome accession: NC_021066
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 578935 - 588426 bp
Length : 9492 bp
Strand : -
Note : COG3321 Polyketide synthase modules and related proteins
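
The Position and Length fields above can be cross-checked with a little arithmetic. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not say so, but that reading agrees with the stated Length) and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Coordinates copied from the Position field (assumed 1-based, inclusive).
start, end = 578935, 588426

# Inclusive span: both end points count as bases of the gene.
length_bp = end - start + 1

# A complete coding sequence should be a whole number of codons.
codons, remainder = divmod(length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_len = codons - 1

print(length_bp, remainder, protein_len)
```

Under these assumptions the span works out to 9492 bp, matching the Length field, and the expected protein length is 3163 residues.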

DNA sequence :
ATGGATAACTTGCGCTTCTCTTCTGCGCCGACAGCAGATTCCATTGATGCATCGATCGCTCAACACTTCCCGGACTGCGA
ACCTGTCGCGGTTATCGGCTATGCCTGCCATTTTCCTGAATCGCCGGATGGCGAAACGTTCTGGAAAAACCTGCTGGAAG
GTCGTGAATGCAGCCGACGCTTTGCGCGCGAAGAGCTTCTGGCCGTCGGTCTGGAAGCCGCCACCGTTGACGATCCTCAT
TACGTCAATATCGGTACGGTTTTAGAAAATGCCGATTGCTTCGACGCCACTCTGTTTGGCTATTCACGACAGGAAGCGGA
ATCGATGGACCCGCAGCAGCGCCTGTTTTTACAGGCGGTCTGGCATGCGCTGGAGCATGCCGGTTATGCCCCCGGCGCCG
TCCCGCATAAGACCGGCGTTTTCGCCTCTTCCCGGATGAGTACCTATCCCGGTCGTGAGGCGCTGAACGTGACTGAAGTC
GCGCAGGTAAAGGGCCTGCAATCTCTGATGGGCAACGACAAGGACTATATTGCCACCCGCGCCGCCTACAAACTCAATTT
GCACGGTCCGGCACTCTCAGTACAGACCGCCTGCTCCAGTTCGCTGGTTGCCGTGCATCTGGCTTGCGAAAGCCTGCGCG
CAGGTGAATCCGATATGGCGGTTGCCGGCGGCGTGGCGCTCTCTTTCCCCCAGCAGGCGGGCTACCGCTACCAGCCCGGG
ATGATTTTTTCTCCCGATGGTCACTGTCGTCCTTTTGACGCCTCAGCGGAAGGTACCTGGGCCGGTAACGGTCTGGGTTG
CGTGGTTTTGCGTCGCCTGAGAGATGCGCTGCTGTCAGGCGATCCGATTATCTCGGTGATCCTCTCCAGCGCGGTCAACA
ACGACGGCAACAGAAAGGTCGGTTATACCGCCCCTTCCGTCGCAGGACAGCAGGCGGTCATCGAAGAGGCGTTAATGCTG
GCGGCCATCGACGACAGGCAGATCGGTTACATTGAAACGCACGGCACCGGCACACCGCTGGGCGACGCGATTGAAATTGA
AGCGCTACGCAACGTCTACGCGCCACGTCCGCAGGAGCAACGCTGTGCGCTGGGTTCCGTGAAAAGTAACATGGGCCACC
TGGATACCGCAGCGGGTATTGCCGGATTGCTGAAAACCGTTCTGGCGGTAAGCCGCGGCCAAATTCCGCCAATGCTGCAT
TTTCACTCGCCCAACCCGGCGCTTAAGCTTGAAGAGAGCCCTTTTACCATACCGGTGTCGGCACAGGCGTGGCAGGACGA
AATACGCTATGCGGGCGTCTCCTCCTTTGGTATTGGTGGCACCAACTGCCATATGATCGTCGCCTCGCTGCCCGACGCGC
TCAAAACGCGCCTCCCCAATGCCGATGGCGACAGAAAGAGCACCGCGCTACTACTCAGCGCCGCCAGCGACAGCGCGCTG
CGGCAACTGGCGACCAATTATGCCGGGGCGCTGGCGCAGAATGCAGATGCCAACAATCTGGCCTTTACGGCCATGCACGC
GCGCCGTCTCGATCTTCCCTTCCGCCTGGCGGTGCCATTAAACCGTGAAACCGCCGCTGCGCTCAGTGCCTGGGCTACTG
ATAAATCGGGGCCGCCTGTTTACAGCGGCCACGGCGCCAGCGGCAAGCAGACATGGCTGTTTACCGGGCAGGGTTCGCAC
TGGCGCACGATGGGCCAGGCAATGTACCGACATTCGACGGCGTTTGCCGACACGCTGGATCGCTGTTTTTCCGCCTGTAG
CGAAATGCTCACGCCATCACTGCGGGAAGCGATGTTTAACCCCGACTCGACGCAGCTGGACAATATGGCCTGGGCCCAGC
CGGCGATTGTCGCGTTTGAAATCGCAATGGCGGCGCACTGGCGCGCTGAAGGGCTAAAACCAGACTTCGCTATGGGTCAT
TCCGTCGGTGAATTTGCCGCTGCCGTCGTCTGCGGACACTATACGATTGAACAGGTCATGCCACTGGTTTGTCGGCGCGG
CGCGCTCATGCAGCAGTGCGCGAGCGGCGCCATGGTGGCGGTATTTGCAAACGAAGACACGCTAATGCCGCTGGCCCGCC
GCTTTGAGCTGGATCTCGCCGCCAACAATGGCACGCAACATACGGTATTTTCCGGGCCGGAAGCCCGTCTCGCGGAATTT
TGCGCCGCACTTTCTCAGCATGACATCAACTATCGTCGTCTGAGCGTAACCGGCGCCGCGCACTCCGCTTTACTGGAACC
CATACTCGACCGCTTTCAGGAAGCCTGCGCGGGGCTGCACGCTGAGCCGGGACAAATACCGCTTATCTCCACGCTCATCG
CCGACGTCATCGATGAGTCAACGCTCAATCAGGCGGACTACTGGCGCCGACACATGCGCCAGCCGGTACGTTTTATCCAG
AGTATTCAGGTGGCGCATGAACTCGGCGCCCGCATTTTTCTTGAGACAGGGCCTGATGCTCAGCTGGTTGCTTGCGGGCA
GCGCGAATATCGCGATAACGCATACTGGATAGCCAGCGCCCGACGTAGCAAAGAGGCGAGCGATGTCCTCAATCAGGCCC
TGCTCCAGCTTTACGCGGCCGGCGTCGCCTTACCGTGGGCCGACCTGCTGGCGGGCGATGGACAACGTATCGCTGCCCCC
TGCTATCCGTTTGATACCGAGCGTTACTGGAAAGATCGCGCCTCCCGCGCCAACGAGCCTGCCGACGCAGCGCTGTCCGC
CGGGCTGGCGGTGGCGAGTCGCGCCGCGACAGCGCTCGATCTCCCCCGTCTGGAAGCGCTTAAACAGTGCGCCACGCGAC
TGCACGCCATCTATGTCGATCGGCTGGTTCAACGCTGTACCGGCGATGCCATTGAGCACGGCGTGGACGCCATAACCATC
ATGCGCCGTGGCCGTCTGCTGCCCCGCTACCAGCAGCTGCTCCAGCGCCTGCTCAACAACTGCGTGGTCGACGGCGATTA
CCGCTGCGCCGACGGACGTTACACCCGCGCCCGTCCCATAGATCATCAACAGCGGGAATCACTGCTGACGGAACTTGCCA
GTTATTGTGAAGGTTTTCAGGCCATTCCCGACACCATCGCCCGTGCCGGCGATCGGTTATATGAAATGATGAGCGGCGCG
GAAGAACCGGTGGCGATTATCTTCCCGCAAAGCGCCTCCGACGGCGTGGAGGTGTTGTACCAGGAGTTCAGCTTTGGCCG
CTATTTCAACCAAATCGCCGCTGGGGTATTACGCGGTATCCTCCAGACGCGTCAGCCCCGCCAGCCATTGCGCATTCTTG
AAGTCGGCGGCGGAACCGGCGGGACCACCGCGTGGCTACTGCCGGAACTCAGAGGCGTTCCAGCGCTGGAGTACCATTTC
ACCGATATCTCCGCGCTGTTCACCCGCCGCGCGCAGCAGAAATTCGCCGACTATGAGTTTGTGCATTACAACGAGCTGGA
TCTCGAAAAAGAGGCCCAATCTCAGGGTTTCCAGGCGCAGTCTTACGATCTTATCGTGGCGGCGAACGTGATTCATGCCA
CGCGCCATATTGGCCACACGCTCGACAATCTGCGCCCCCTGCTCAAACCGGGCGGACGCCTGCTGATGCGCGAAATCACC
CAGCCAATGCGTCTGTTTGATTTCGTTTTCGGTCCGCTGGTCCTCCCGCTACAGGATATCGGCGCCCGCGAAGGTGAGTT
ATTCCTCACCACCGATCAGTGGCAACAACAGTGTCGCCGGGCCGGATTCAGCAAAGTGGAGTGGCTACCGCAGGATGGCA
GCCCAACCGCCGGGATGAGCGAACATATCATTCTTGCCACGCTGCCCGGTCAGGCGGTTTGTGCAACCTCAGGCAGCGCG
CCATCAGACCCGGTGTTGGGACAGGCGTTGACGGATAGCGCTGACTACCTCGCCGACTGGTCTGATTGTGCTGGTCAGCC
TGAGCGGTTTAACGCCCGCTGGCAGGAGGCCTGGCGCCGGCTCTCACAGCGTCATGGCGATGATTTTCCCGGGGAGCCGC
CCCTCGTCACCGCCCCGGAATGGCTGGGGGACGTTCGTTTAAGCTGGCAAAACAACGCGTTTTCCCGCGGTCAGATGCGC
GTTGAAGCCCGTCGTCCTGATGGCGAGTGGCTGCCGCTTTCGCCCGCCGCGCCTCTTCCTGCCCCGCAAACGCATTATCA
ATGGCGCTGGACGCCCTGCAACGTCGCCAGCGTTGACCATCCTCTTACCTTCAGCTTCAGCGCCAATACGCTTGCGCGCG
GCGACAAGCTGGCGCAATACGGCATCATCCACGATCCGCAGGCCTCTTTGCTCCTGATGGTTATTGAGGAGAGCGAGGAT
ACTCTGGCCTTAGCGGAGAAAGTGATGGAGGCACTTACCGCCAGCACAGCCGGTTTGATTGTGGTTACTCACCGCGCGTG
GCGAATCGAGGAAAATGAGGCGCTTTCTGCCTCCCATCATGCTCTATGGGCCTTGCTTCGTGTCGCGGCCAACGAACAGC
CAGAACGGTTGATTGCCGCCATCGATCTCGCCGAAAACACCCCGGGAGAAACGCTGCATCGAGGGTTGAGCGCCGTCTCC
CTCTCTCAGCGCTGGCTCGCCGCGCGGGGTAACACCCTCTGGCTCCCTTCACTGGCGCCCAATACGGGATGCGCCGCCGA
ATTACCGGCAAACGTGTTTGCTGGCGATAACCGCTGGCATCTGGTAACGGGAGCGTTTGGCGGATTAGGCCGTCTTGCCG
TGAACTGGCTCAGAGAAAAAGGGGCGCGACGCATCGCCCTGCTGGCGCCGCGCGTGGATGAATCATGGCTACGCGACATC
GAGGGTGAGCAGACCCGCGTCTGCCGTTGTGATGTGGGCGATGCCGGACAACTGGCCACGGTTCTTGATGAACTGGCGGC
CAACGGCGGCATTGCCGGGGCGATTCATGCCGCTGGCGTATTGGCTGATGCCCCCTTGCAGGAGCTTGATCACCCCCCGC
TGGCCGCCGTTTTCGCGGTAAAAGCGCAGGCGGCAACCCAGCTATTGCAAACCCTGCGCAACCACGGCGGACGTTATCTT
ATTCTCTACTCCTCCGCTGCCGCCACCCTCGGCGCCTCGGGTCAAAGCGCCCATGCGCTGGCCTGCGGTTATCTCGACGG
GCTGGCCCAACAGTTTTCCACCCTTGATGCGCCGAAAACGCTCTCTGTCGCCTGGGGCGCATGGGGGGAAAGCGGTCGGG
CGGCCACGCCGGAAATGTTGGCGGCGCTCGCCAGCCGTGGCATGGGTGCGTTAAGCGATGCTGAAGGCTGCTGGCACCTG
GAACAGGCGGTGATGCGCGGTACTCCGTGGCGGCTGGCGATGCGCGTTTTCATCGACAAAATGCCCCCGTTACAACAGGC
TCTGTTTAACGTCGGTGCTGCCGGAAAAACCGCAACGCCCGTCATCCCTCCTGCTGATGACAACGCCTTTAACGGCAGCC
TGAGCGATGAAACGGCGGTGATGGCATGGCTGAAAAAGCGGATTGCGGTTCAGCTACGACTAAACGATCCGGCGTCATTG
CTCCCTAACCAGGATCTGTTGCAACTTGGCATGGACTCGCTGCTCTTCCTTGAACTCAGTAGCGATATTCAGCACTACCT
GGGCGTACGCATCAATGCGGAACGGGCATGGCAGGATCTGTCGCCTCATGGCCTCGCGCAGCTTATCTGCGGTAAGCCCG
AGGCGACGCCTGGCGCTTCGCAGCCGGAAGTATTGCGGCACGACGCCGGTGAGCGTTATATGCCCTTCCCGCTGACACCC
ATTCAGCACGCCTACTGGCTGGGGCGCACACATCTCATTGGCTATGGTGGCGTTGCCTGCCACGTCCTGTTTGAGTGGGA
TAAACGCCACGATGAGTTCGATCTCACCGTGCTGGAGAAAGCCTGGAACCAGCTCATTGCCCGTCACGATATGTTGCGCA
TGGTAGTGGATGCCGACGGGCAGCAGCGAATCCTGGCGACAACGCCGGAATATCGCATCGCCCATGATGATCTGCGCATG
CTTTCCCCGACAGAACAGCGCATAGCGCTGGAAAAACGGCGGCATGAACTTAGCTATCGCGTTTTGCCTGCCGACCAGTG
GCCTCTCTTTGAGCTGGCGGTCAGCGAAATCGACGATTGCCACTACCGCCTGCACATGAACCTCGACCTTTTGCAGTTCG
ATGTGCAGAGTTTTAAAGTCATGATGGACGACCTGGCGCAGGCCTGGCGCGGCGAAACGCTGGCGCCGCTCGATATTACC
TTCCGTGATTATGTGATGGCTGAACAGGCGCGTCGACAGACCCCGGCATGGCACGATGCCTGGGACTACTGGCAGGAAAA
ACTGCCGCAATTGCCCTTAGCGCCCGAGCTCCCGTTGGTTGAAACAGCCGCGGAAACGCCACGCTTCACCACCTTCAAAT
CGACGATCGGCAAGAAAGAATGGCAGGCAGTAAAACAGCGCTGGCAGCAGCAAGGTGTCACGCCGTCTGCCGCGCTGCTC
ACGCTGTTTGCCACCGCCCTTGAACGCTGGAGTCGCACCCCGGCCTTTACGCTAAACCTGACCTTCTTCAATCGCCAGCC
GATCCATCCGCAAATCAACCAGTTGATTGGTGATTTTACCTCCGTCACGCTGGTCGATTTTAACTTCTCAACACCGGTGA
CGCTGCAGGAGCAAATGCAACAGACCCAACGGCGCCTCTGGCAAAACATGGCACACAGTGAAATGAACGGTGTTGAGGTG
ATCCGTGAGCTGGGTCGCCTGCGCGGGTCACAACGTCAACCGCTCATGCCGATAGTCTTTACCAGCATGCTGGGGATGAC
GCTGGAAGGGATGACTATCGATCGGGCAATGAGCCATCTGTTTGGCGAACCCTGCTATGTGTTCACCCAAACGCCGCAGG
TCTGGCTGGATCATCAGGTCATGGAGAGCGACGACGAGTTGATGTTTAGCTGGTACTGCATGGACAACGTACTGGAGCCA
GGCGCTGCTGAAGCGATGTTCAATGACTACTGCGCCATTCTGCAGGCCGTCATGACCAGCCCTGAAAGCCCGAAGACTAG
CACCAGTGGCATCGCCGGGCATATTCCCCGCCGACGCTGGCCGCTGAGCGCGCAGGCCGACTACGATCTGCGGGATATTG
AGCAGGCGACGCTCGAACATCCGCACATCCAACAGGCCAGGGCGGAAATAAGCGATACAGGCGCGTTAACGCTGGATATT
GTGATGGCCGACGTTCCGCCGCGCTTAACCGCCGCTTCTGACGGGCACGATCTCGCCCGGCTGGCGCTGCCGTTGCCTGA
GCAGGCGCAACTTGATGAGCTGGAGGCGACCTGGCGCTGGCTGGAAGCGCGTGCGCTGCAGGGTATTGCGGCCACGCTAA
ATCGCCACGGCCTGTTCACCACGCCGGAGATCGCCCATCGCTTTACCGGGATAGCAGAGGCGCTGTCCGCGCAAGCGTCT
CACCAGCGCCTGCTGCGCCAGTGGCTACAGTGTCTGACGGAAAGAGAATGGTTAATCCGCGAAGGTGAAAGCTGGCGCTG
CCGTATTCCGCTCAGCGAGATTCCTGAACCTCAGCAGGCATGCCCGCAAAGCCAATGGAGCCAGGCGCTGGCGCAGTATC
TGGACACCTGCATCGCCCGGCACGACGATCTCTTCTCCGGACGCTGTTCTCCGCTGGAATTGCTGTTCAACGAGTCGCTG
CGCGTCACTGACGCACTGTATCGGGACAACCCTGCCAGCGCCTGTCTGAATCGCTACACCGCGCAGATTGCCGCCTTGTG
CGGCGCAGAACGGATTCTGGAGGTTGGCGCCGGAACCGCGGCCACTGCCGCACCGGTGCTGGAAGCCACGCGGAACACGC
GGCAGTCATACCACTTTACCGACGTCTCTGCGCAATTCCTCAACGATGCCAGAGGCCGTTTCCACGATCAATCATGCGTC
ACTTACGCATTGTTCGATATCAACCAGCCGCTGGATTTCAGCGCCCACCCGGAGGCGGGTTACGACCTGATCATTGCCGT
CAACGTACTCCACGACGCCAGCCATGTCGTCCAGACGTTGCGCCGATTAAAACTGTTGCTGAAAGCCGGCGGACGCTTGC
TGATCGTTGAGGCGACGGAGCGAAACAGTGTATTCCAGCTGGCGAGCGTGGGCTTTATTGAGGGATTAAGCGGATACCGC
GATTTCCGCCGTCGGGATGAGAAGCCGATGCTCACCCTTTCGGCGTGGCAGGAGGTTCTTGTTCAGGCCGGGTTTGCCAA
CGAGCTGGCGTGGCCCGCGCAGGAGTCGTCACCGCTGCGCCAGCATCTGCTGGTGGCCCGTTCACCCGGTGTAAATCGCC
CGGACAAAGAAGCCGTGCGCCGCAATTTACAGCAGCGTTTCGGCAGCGGTCTGCCCGTTTTACAGATCCGGCAAAGAGAA
ACGCTGTTTGCGCCGCTGCATGCTCCATCTGATGCGCTGGTTGAACCTGCTGAACCCATGCCAGCTGCCGGGGGGAACCC
GGCGCTGGAAAAACAGGTGGCTGAACTCTGGCAATCGCTGCTGTCTCGCCCTGTGGCAAGACATCACGACTTTTTCGAGT
CGGGAGGCGACAGTCTGATGGCGACAAGGATGGTGGCGCAGTTGAACCAAAGAGGGATTGCGAGGGCCAACCTTCAGGAT
CTGTTCACCCATTCAACGCTGAGAGACTTCTGCGCCCACCTGCAGGCCTGTTCGGCGGGAGAGGACAACCCGATCCCCAT
TTGCCAGGGCGATGGCGACGAAACCCTGTTCGTCTTCCACGCCTCGGACGGCGATGTCAGCGCCTGGCTCCCGCTCGCCA
GCACGCTGAATATGCGCGTTTTCGGCCTGCAGGCAAAATCACCGCAGCGCTTTGCAACGCTTGACCAGATGATCGACGAA
TATGTCGAGTGCATCCGCCGCCAGCAGCCTCACGGCCCGTACGTGCTGGCGGGTTGGTCGTACGGCGCTTTTCTTGCGGC
GGGCGCCGCACAGCGCCTGTACGCCAGGGGCGAGCCGGTTCGGATCGCGTTAATCGATCCCGTGTGTCGACAGGATTTCT
GTTGCCAAAACCGGACAGCCCTGCTGCGCCTGTTAGCTGCAGGACAAACGCCTCTGGCGCTGGCGGAACATTTCGACCAA
CAGGCGCCCGACAGCCAGCTGGCCGACTTTATCAGCCTCGCTAAAACGGCCGGTATGGTGTCGCCAAACCTGACGCAGCA
AGCGGCAGAAGCGTGGCTCGACAACATCGCGCATCTGCTGCGTTTGCTGGTTGAGCATACGCCAGGCGAAAGCGTTCCGA
TCCCCTGTCTCATGGTCTATGCCGCCGGGAGACCCGCGCGCTGGACGCCGGCAGAAACCGAATGGCAGGACTGGATAAAC
AACGCTGACGACTATGTGATTGAAGCCAGCCACTGGCAAATCATGCTGGAGCCTCCCCACGTTCAGGCCTGCGCACGACA
CATTAAGCGCTGGCTTGGCGCCACCTCAACGCAACCGGAGAACACGTTATGA

Protein sequence :
MDNLRFSSAPTADSIDASIAQHFPDCEPVAVIGYACHFPESPDGETFWKNLLEGRECSRRFAREELLAVGLEAATVDDPH
YVNIGTVLENADCFDATLFGYSRQEAESMDPQQRLFLQAVWHALEHAGYAPGAVPHKTGVFASSRMSTYPGREALNVTEV
AQVKGLQSLMGNDKDYIATRAAYKLNLHGPALSVQTACSSSLVAVHLACESLRAGESDMAVAGGVALSFPQQAGYRYQPG
MIFSPDGHCRPFDASAEGTWAGNGLGCVVLRRLRDALLSGDPIISVILSSAVNNDGNRKVGYTAPSVAGQQAVIEEALML
AAIDDRQIGYIETHGTGTPLGDAIEIEALRNVYAPRPQEQRCALGSVKSNMGHLDTAAGIAGLLKTVLAVSRGQIPPMLH
FHSPNPALKLEESPFTIPVSAQAWQDEIRYAGVSSFGIGGTNCHMIVASLPDALKTRLPNADGDRKSTALLLSAASDSAL
RQLATNYAGALAQNADANNLAFTAMHARRLDLPFRLAVPLNRETAAALSAWATDKSGPPVYSGHGASGKQTWLFTGQGSH
WRTMGQAMYRHSTAFADTLDRCFSACSEMLTPSLREAMFNPDSTQLDNMAWAQPAIVAFEIAMAAHWRAEGLKPDFAMGH
SVGEFAAAVVCGHYTIEQVMPLVCRRGALMQQCASGAMVAVFANEDTLMPLARRFELDLAANNGTQHTVFSGPEARLAEF
CAALSQHDINYRRLSVTGAAHSALLEPILDRFQEACAGLHAEPGQIPLISTLIADVIDESTLNQADYWRRHMRQPVRFIQ
SIQVAHELGARIFLETGPDAQLVACGQREYRDNAYWIASARRSKEASDVLNQALLQLYAAGVALPWADLLAGDGQRIAAP
CYPFDTERYWKDRASRANEPADAALSAGLAVASRAATALDLPRLEALKQCATRLHAIYVDRLVQRCTGDAIEHGVDAITI
MRRGRLLPRYQQLLQRLLNNCVVDGDYRCADGRYTRARPIDHQQRESLLTELASYCEGFQAIPDTIARAGDRLYEMMSGA
EEPVAIIFPQSASDGVEVLYQEFSFGRYFNQIAAGVLRGILQTRQPRQPLRILEVGGGTGGTTAWLLPELRGVPALEYHF
TDISALFTRRAQQKFADYEFVHYNELDLEKEAQSQGFQAQSYDLIVAANVIHATRHIGHTLDNLRPLLKPGGRLLMREIT
QPMRLFDFVFGPLVLPLQDIGAREGELFLTTDQWQQQCRRAGFSKVEWLPQDGSPTAGMSEHIILATLPGQAVCATSGSA
PSDPVLGQALTDSADYLADWSDCAGQPERFNARWQEAWRRLSQRHGDDFPGEPPLVTAPEWLGDVRLSWQNNAFSRGQMR
VEARRPDGEWLPLSPAAPLPAPQTHYQWRWTPCNVASVDHPLTFSFSANTLARGDKLAQYGIIHDPQASLLLMVIEESED
TLALAEKVMEALTASTAGLIVVTHRAWRIEENEALSASHHALWALLRVAANEQPERLIAAIDLAENTPGETLHRGLSAVS
LSQRWLAARGNTLWLPSLAPNTGCAAELPANVFAGDNRWHLVTGAFGGLGRLAVNWLREKGARRIALLAPRVDESWLRDI
EGEQTRVCRCDVGDAGQLATVLDELAANGGIAGAIHAAGVLADAPLQELDHPPLAAVFAVKAQAATQLLQTLRNHGGRYL
ILYSSAAATLGASGQSAHALACGYLDGLAQQFSTLDAPKTLSVAWGAWGESGRAATPEMLAALASRGMGALSDAEGCWHL
EQAVMRGTPWRLAMRVFIDKMPPLQQALFNVGAAGKTATPVIPPADDNAFNGSLSDETAVMAWLKKRIAVQLRLNDPASL
LPNQDLLQLGMDSLLFLELSSDIQHYLGVRINAERAWQDLSPHGLAQLICGKPEATPGASQPEVLRHDAGERYMPFPLTP
IQHAYWLGRTHLIGYGGVACHVLFEWDKRHDEFDLTVLEKAWNQLIARHDMLRMVVDADGQQRILATTPEYRIAHDDLRM
LSPTEQRIALEKRRHELSYRVLPADQWPLFELAVSEIDDCHYRLHMNLDLLQFDVQSFKVMMDDLAQAWRGETLAPLDIT
FRDYVMAEQARRQTPAWHDAWDYWQEKLPQLPLAPELPLVETAAETPRFTTFKSTIGKKEWQAVKQRWQQQGVTPSAALL
TLFATALERWSRTPAFTLNLTFFNRQPIHPQINQLIGDFTSVTLVDFNFSTPVTLQEQMQQTQRRLWQNMAHSEMNGVEV
IRELGRLRGSQRQPLMPIVFTSMLGMTLEGMTIDRAMSHLFGEPCYVFTQTPQVWLDHQVMESDDELMFSWYCMDNVLEP
GAAEAMFNDYCAILQAVMTSPESPKTSTSGIAGHIPRRRWPLSAQADYDLRDIEQATLEHPHIQQARAEISDTGALTLDI
VMADVPPRLTAASDGHDLARLALPLPEQAQLDELEATWRWLEARALQGIAATLNRHGLFTTPEIAHRFTGIAEALSAQAS
HQRLLRQWLQCLTEREWLIREGESWRCRIPLSEIPEPQQACPQSQWSQALAQYLDTCIARHDDLFSGRCSPLELLFNESL
RVTDALYRDNPASACLNRYTAQIAALCGAERILEVGAGTAATAAPVLEATRNTRQSYHFTDVSAQFLNDARGRFHDQSCV
TYALFDINQPLDFSAHPEAGYDLIIAVNVLHDASHVVQTLRRLKLLLKAGGRLLIVEATERNSVFQLASVGFIEGLSGYR
DFRRRDEKPMLTLSAWQEVLVQAGFANELAWPAQESSPLRQHLLVARSPGVNRPDKEAVRRNLQQRFGSGLPVLQIRQRE
TLFAPLHAPSDALVEPAEPMPAAGGNPALEKQVAELWQSLLSRPVARHHDFFESGGDSLMATRMVAQLNQRGIARANLQD
LFTHSTLRDFCAHLQACSAGEDNPIPICQGDGDETLFVFHASDGDVSAWLPLASTLNMRVFGLQAKSPQRFATLDQMIDE
YVECIRRQQPHGPYVLAGWSYGAFLAAGAAQRLYARGEPVRIALIDPVCRQDFCCQNRTALLRLLAAGQTPLALAEHFDQ
QAPDSQLADFISLAKTAGMVSPNLTQQAAEAWLDNIAHLLRLLVEHTPGESVPIPCLMVYAAGRPARWTPAETEWQDWIN
NADDYVIEASHWQIMLEPPHVQACARHIKRWLGATSTQPENTL
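
The relationship between the DNA and protein sequences above can be spot-checked by translating the two ends of the coding sequence. This is a minimal sketch, assuming the standard genetic code (which gives the same amino acids as the bacterial table for these codons) and that the DNA is listed in the coding orientation, as the leading ATG suggests. The `translate` helper is hypothetical and written for this example; only the first 30 and last 12 nucleotides are copied from the record.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; unknown codons become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 and last 12 nucleotides of the DNA sequence listed above.
first_30 = "ATGGATAACTTGCGCTTCTCTTCTGCGCCG"
last_12 = "AACACGTTATGA"

print(translate(first_30))  # should match the start of the protein sequence
print(translate(last_12))   # should end with '*', the stop codon
```

The first fragment translates to MDNLRFSSAP and the last to NTL followed by a stop, which agrees with the first and last residues of the protein sequence shown.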

• Homologs from PAI DB

Gene  GenBank Accn    Product                                         Virulence or Resistance  PAI or REI      Alignment Type  E-val  Identity (%)
irp1  CAA21391.1      -                                               Virulence                HPI             Protein         0.0    93
irp1  YP_070123.1     yersiniabactin biosynthetic protein             Virulence                HPI             Protein         0.0    93
irp1  NP_993006.1     yersiniabactin biosynthetic protein             Virulence                HPI             Protein         0.0    93
irp1  YP_002346901.1  yersiniabactin biosynthetic protein             Virulence                HPI             Protein         0.0    93
irp1  NP_669707.1     HMWP1 nonribosomal peptide/polyketide synthase  Virulence                HPI             Protein         0.0    93
irp1  YP_853076.1     yersiniabactin biosynthetic protein             Virulence                PAI IV APEC-O1  Protein         0.0    93
irp1  YP_001006816.1  yersiniabactin biosynthetic protein             Virulence                HPI             Protein         0.0    92
irp1  CAA73127.1      HMWP1 protein                                   Virulence                HPI             Protein         0.0    92