Gene Information

Name : KOX_20630 (KOX_20630)
Accession : YP_005020087.1
Strain : Klebsiella oxytoca KCTC 1686
Genome accession: NC_016612
Putative virulence/resistance : Virulence
Product : yersiniabactin synthetase, HMWP2 component
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4458862 - 4464960 bp
Length : 6099 bp
Strand : +
Note : COG1020 Non-ribosomal peptide synthetase modules and related proteins

DNA sequence :
ATGATTTCTGGCGCCCCCTCTCAGGATCCGCTGTTATCGGACAACGGCGAAGCGGCCGATTATCAACAATTACGCGAACT
GTTAATACAGGAACTGAATGTAGCGCCGCAGCAGTTACAGGAAGAGAGCAATCTGATCCAGGCGGGCCTGGACTCCATAA
GATTAATGAGATGGTTACACTGGTTTCGTAAAAAGGGCTACCGCCTCACCCTTCGCGAGCTGTATGCCGCCCCCACGTTG
GCAGCATGGCGCCAGCTGATGCGCAGCCGTTCGGGTGAAAAACCGGATGATGCAAGCTCCCCGGCGGAGGCGGCGTGGCC
AGTCATGAGCGAGGGTACCCCCTTCCCGCTGACGCCGGTACAGCATGCCTATCTGACGGGCCGAATGCCCGGACAGACGC
TGGGCGGCGTCGGTTGCCATCTGTATCAGGAGTTTGCCGGCCATTATCTGACGGCGCCAAAGCTGGAGCAGGCTATCACG
ATCTTACTGCAACGCCACCCGATGCTGCATATCGCCTTTCGCGCCGACGGGCAGCAGGTCTGGCTACCGCAGCCATACTG
GAACGGCGTCACCGTTCATGATTTACGCCAGACCGACGAGGCAAGCCGCCAGGCCTATCTGGAAACATTGCGCCAGCGCC
TGAGCCACCGTCTTTTACGCGTGGAAATGGGCGAAACCTTTGATTTCCAGTTGACGCTCTTGCCGGACAATTGCCACCGC
CTCCATGTCAATATCGACCTGCTGATTATGGATGCTTCCAGCTTTACGCTGTTTTTCGATGAACTTAACGCCCTGCTGGC
CGGAGAGTCGCTGCCGCCCGGCGACCCCCGCTATGATTTCCGCGCTTATCTGCTGCACCAGCAGAAGATCAATCAGCCGC
TGCTCGACAAAGCGCGCGCGTACTGGCTGGCGAAAGCATCAATGCTGCCACCCGCGCCGGTTTTACCGCTAGCCTGCGAA
CCCGCCACGCTGCGTGAAGTCCGTAATACCCGACGCCGCATGATTGTCCCGACAACACGCTGGAACGCCTTTAGCCAACG
GGCCGGTGAGAATGGCGTGACGCCGACAATGGCGCTGGCGACCTGTTTTGCTGCCGTTCTGGGGCGCTGGGGCGGCCTGA
CGCGGCTGCTGCTCAATATCACGTTATTTGACCGCCAGCCGCTGCACCCGGCGGTTGACGAAATGCTTGCCGACTTCACC
AATATTCTTCTGCTGGATACCGCCTGCGATGGCGATACCGTCAGCAACCTGGCGCGTAAAAACCAGCTCACGTTTACTGA
GGACTGGGAGCATCGCCACTGGTCCGGCGTCGAGCTCCTTCGTGAACTCAAACGCCAGCAAAGCCATCCCCACGGCGCCC
CGGTGGTATTTACCAGCAATCTGGGGCGCTCCCTGTACAGCAGCCGCCCCGAGTCGCCGCTGGGCGAACCGGAATGGGGC
ATCTCGCAGACGCCGCAGGTCTGGATAGATCATCTGGCGTTCGAGCATCGGGGCGAGGTCTGGCTACAATGGGATAGCAA
TGACGCGTTGTTCCCTCCGGCGCTGGTCGAGACGTTGTTCAACGCCTACTGCCAGCTGATTAACCAACTCTGCGATGATG
AAAGCGCCTGGAAGAAGCCGTTCGCGGACAGGATGCCCCAAAGCCAGCGGGAGATACGTCAACGGGTTAACGCAACCGAC
GCACCCGTTCCTCAGGGCTTGCTGCATGAAGGCATTTTCCGCATCGCCCTCCGGCAACCGCAGGCGCTGGCGGTAACGGA
CGCGCATTATCAGTGGAATTATCGTGAGCTGACCGAGAACGCACGCCGCTGCGCTGGCAGGTTAATCGCGTGCGGGGTTC
AGCCCGGCGACAATGTCGCTATTACAATGTCGAAAGGCGCGGGGCAGCTGGTTGCGGTCCTCGCCGTTCTGCTGTCCGGG
GCGGTGTACGTTCCGGTTTCGCTCGACCAGCCCGCCGCGCGGCGCGGGAAAATCTACGCTGACGCCAACGTCCGGCTGGT
GCTCACCTGCCAGCACGACGCCAGCGCCTGGTCAGACGATATTCCCCACCTGACCTGGCAGCAGGCCATTGAGGCGGAGC
CGCTAGCCGACCAGGCGGCGCACGCGCCGACGCAGCCGGCCTACATTATCTATACCTCCGGGTCGACCGGCACGCCGAAA
GGCGTGGTCATTTCTCACCGTGCAGCGCTGAATACCTGCTGCGATATCAATAGCCGTTACCAGGTGGGTCCCGGAGACAG
AGTGCTGGCCCTCTCCGCCCTGCATTTTGATTTATCGGTTTACGACATTTTTGGCGTTCTGAGCGCCGGCGGCTCGCTGG
TTATCGTTATGGAAAATCAACGGCGCGATCCCCGCGCATGGTGCGAATTGATCCAGCGTCATCAGGTTACGCTCTGGAAC
AGCGTCCCGGCGCTGTTCGATATGCTGTTGACCTGGTGTGAAGGTTTCGCCGACGCCGCGCCGGAGAAACTGCGGGCGGT
CATGCTTTCCGGCGACTGGATCGGCCTCGATCTCCCAGCCCGCTATCACGCCTTCCGCCCCCAGGGACAGTTTATCGCCA
TGGGCGGCGCCACCGAGGCGTCTATCTGGTCTAACGCCTGTGAAATTAACCGGGTCCCCGACCACTGGCGCGCCATCCCT
TATGGCTTTCCGCTGGCCAACCAACGTTACCGGGTGGTGGATGAACTGGGCCGGGACTGCCCGGACTGGGTACCGGGCGA
ATTGTGGATTGGCGGTATCGGGGTCGCGGAAGGCTACTTTAACGATCCTGTACGCAGCGAGCAGCAGTTTGTAACGCAAT
CGAACGCGCGCTGGTATCGCACCGGCGATCTCGGGTGCTACTGGCCAGACGGTACGTTAGAGTTCCTCGGTCGTCGCGAT
AAGCAGGTCAAAGTCGGGGGATATCGCATCGAGCTGGGTGAAATCGAAAGCGCGCTCAGCCAGCTGGCGGGAGTGAAACA
GTCAACCGTTGTGGCGATCGGCGAAAAAGAAAAGACCCTGGCGGCCTGGGTTGTGCCTCAGGGTTCGGCTTTCTGTGTTA
CCCATCATCGGGACCCGGCGCTGCCCCAGGCGTGGCGCGGGCTTGCCGGAACGCTGCCCTGCTGCGTCTGCCCCCCGGAA
ATCTCCGCCGGCCAGGTCGCTGATTTTCTTCAGCATCGCCTGTTAAAACTGAAACCGGGTCAAACTCCTGGCGCCGATCC
TCTTCCCCTGATGAATGCTCTCGCTATCCAGCCTCGCTGGCGGGCCGTGGTGGAACGCTGGTTAGCCTTTCTCGTGACCC
AACAGCGGCTGCAGCCCGCCGCTGAGGGTTATCAGGTTTGTGCGGGGGAAGCGCCGGAGAATGACCCTCCTTCTTTCAGC
GGGCATGACCTCACCTTAACGCAGATCCTTCGCGGCGCTCGCCACGAACTGTCATTACTGAACGATGCGCGGTGGTCGCC
GGAAAGTCTGGCTTTCGACCATCCGGCCAGCGCCCTGTATATCGAGGAACTGGCGACAATCTGTCAACAGCTTTCCCGGC
GTTTACAGCGCCCGGTACGCCTGCTAGAGGTGGGTGTCCGCACCGCCCGCGCCGCAGAGTGCCTGCTAACCCGGCTCAGC
GCTGACGAGATTGAGTATGTCGGGCTTGAGCACAGCCAGGAGCTGCTGCTGAGCGCCCGACAGCGGCTTGCACCGTGGTC
TGACGCCCGTCTGGCCCTCTGGAGTGCAGACACGCTGACGGCGCATGCTCATTCGGCGGACATTATCTGGCTGAATAACG
CCCTGCATCGTCTGCTGCCGGAAGAGCCCGGGCTCCTCGCGGCATTACAGCAGCTGGCCGTTCCAGGCGCGCTGCTCTAC
GTGTTGGAGTTTCGTCAGTTAACGCCCTCCGCCCTGCTCAGCACGCTTCTGTTAACCGATGGTCAGCCCGAAGCCTTACT
TCACAATAGCGCTGACTGGGGGGCGATATTTACCGCCGCCGCTTTCAACTGCCAGCACGGCGATGAAGTCGAGGGATTGC
AGCGCTTCCTCGTACAATGCCCCGTTAGCCAGGTACGCCGCGATCCTCGTCAGCTGCAGTCCGCTCTCGCCGAACGTCTG
CCGGGGTGGATGGTACCGAAACGGATCTTCTTACTTGACGCCTTACCGCTGACGGCGAACGGCAAAATTGACTATCAGAC
GCTAAAGCGCTGTCATACCCCGGAAGCTGAAAACCGGACTGAAGCGGATTTACCCCTGGGCGATATTGAGAAGCAGGTTG
CCGTCATCTGGCAGCCGCTCTTGTCGATGGGCGCCGTCAGCAGAGAAACCGACTTCTTCCAGCATGGCGGTGATAGCCTG
CTAGCCACCAGGCTGATCGGACAGCTCCACCAGGCGGGTTATGAAGCACGATTAAGCGACCTGTTTAATCACCCCCGGCT
GGCTGATTTTGCCGCCACGCTGCGTAAAACCGACCTCCCTGTTGAACAACCCTTTGTCCACTCCCCTGAAGAACGCTACC
GGCCTTTCGCGCTGACTGATGTGCAGCAGGCTTACCTGGTCGGGCGTCAGCCGGGCTTTGCCCTTGGCGGCGTAGGCTCA
CACTTCTTTGTTGAATTTGAAATTGCCGATCTGGATATTCACCGGCTGGAGAAGGTCTGGAACCGGTTAATCGCCCGCCA
CGATATGCTGCGCGCCGTGGTGCGGGATGGACAACAGCGGGTTCTCGAACAGACTCCCCCGTGGGTGATACCCGCGCACA
TCCTGCACAGCCCCGAAGAGGCGCTGCAGGTACGCGACAGACTGGCGCATCAGGTTCTCAACCCTGAAGTATGGCCGGTA
TTCGATCTCCAGGTCGGGTTCGTCGACGGGATGCCCGCCCGCCTGTGGCTGTGTCTGGATAACCTGTTGCTTGACGGGTT
GAGCATGCAGATCCTGTTGTCAGAGCTGGAACACGGCTATCGCTATCCGCAGCAGCTGCCGCCGCCGCTTCCCGTTACCT
TCAGGGATTATCTGCAGCAACCCGCGCTGCGGACGCCCAATCCCGACTCTCTGGCATGGTGGCAAACGCAGCTTGATGAT
ATTCCTCCGGCGCCTGCGTTACCGCTACGCTGCTTGCCTCAGGACGTTGAGACGCCGCGCTTCGCACGGCTCTATGGGGC
AATGGACAGCGCGCGCTGGCGTCGCCTGAAACAGCGGGCGGCTGACGCTCATCTCACCCCGTCGGCCGTACTGTTATCCG
TGTGGTCAACGGTTCTTGCTGCATGGAGCGCACAGCCCGATTTCACGCTTAACCTTACGCTTTTCGACAGACGACCGCTG
CACCCGCAAATCAACCAGATCCTGGGCGACTTTACTTCGCTGATGCTGCTGAGCTGGCATCCCGGCGAGAGCTGGCTGCA
AAGCGCGCGGTTATTACAGCAGCGACTAAGCGAGAGCCTCAACCACCGCGATGTGTCGGCAATCCGCGTGATGCGGCAAC
TGGCGCGGCGGCAAAACGTACCGGCTGTCCCAATGCCCGTCGTCTTTACCAGCGCGCTGGGCTTTGAGCAGGATAACTTC
CTTGCCCGGCGCAATCTGCTCAAACCGGTGTGGGGTATATCCCAGACGCCGCAGGTCTGGCTCGATCACCAGGTTTATGA
ATCCGAAGGGGAACTGCGCTTTAACTGGGATTTTGTCGCCGCCCTGTTTCCTGACGGGCAGGTGGAGCGCCAGTTTGCAC
AGTATTGCGCGTTGCTTAACCGAATGGCGGAGGATGACAGCAGCTGGCAGCTGCCGCTCGCCGACCTGGTTCCCCCGCTT
AAGGTCACGGAGCGACGCGCCCGCCGGCTGCGCCCTGAGCGCGCTCAGCCGCGGATCGCGGCGGACAAGAGCAGCGTCAG
CCTGATTTGCGACACCTTTCGCGAGGTGGTTGGCGAGCCTGTCGCACCCGCAGAGAACTTTTTTGAGGCGGGCGCCACGT
CGCTGAATCTGGTGCAGCTGCACGTCTTGTTACAACGTCACGAATTCGCCACCCTGACGCTGCTTGACCTGTTCACTCAC
CCTTCTCCTGTCGCGCTGGCCAATTATCTCGCCGGCGTAGCCCTGAAGGAGAAAACCAAACGCGTGCGGCCGGTTCGCCG
TCGTCAGCGGCGCATTTGA

Protein sequence :
MISGAPSQDPLLSDNGEAADYQQLRELLIQELNVAPQQLQEESNLIQAGLDSIRLMRWLHWFRKKGYRLTLRELYAAPTL
AAWRQLMRSRSGEKPDDASSPAEAAWPVMSEGTPFPLTPVQHAYLTGRMPGQTLGGVGCHLYQEFAGHYLTAPKLEQAIT
ILLQRHPMLHIAFRADGQQVWLPQPYWNGVTVHDLRQTDEASRQAYLETLRQRLSHRLLRVEMGETFDFQLTLLPDNCHR
LHVNIDLLIMDASSFTLFFDELNALLAGESLPPGDPRYDFRAYLLHQQKINQPLLDKARAYWLAKASMLPPAPVLPLACE
PATLREVRNTRRRMIVPTTRWNAFSQRAGENGVTPTMALATCFAAVLGRWGGLTRLLLNITLFDRQPLHPAVDEMLADFT
NILLLDTACDGDTVSNLARKNQLTFTEDWEHRHWSGVELLRELKRQQSHPHGAPVVFTSNLGRSLYSSRPESPLGEPEWG
ISQTPQVWIDHLAFEHRGEVWLQWDSNDALFPPALVETLFNAYCQLINQLCDDESAWKKPFADRMPQSQREIRQRVNATD
APVPQGLLHEGIFRIALRQPQALAVTDAHYQWNYRELTENARRCAGRLIACGVQPGDNVAITMSKGAGQLVAVLAVLLSG
AVYVPVSLDQPAARRGKIYADANVRLVLTCQHDASAWSDDIPHLTWQQAIEAEPLADQAAHAPTQPAYIIYTSGSTGTPK
GVVISHRAALNTCCDINSRYQVGPGDRVLALSALHFDLSVYDIFGVLSAGGSLVIVMENQRRDPRAWCELIQRHQVTLWN
SVPALFDMLLTWCEGFADAAPEKLRAVMLSGDWIGLDLPARYHAFRPQGQFIAMGGATEASIWSNACEINRVPDHWRAIP
YGFPLANQRYRVVDELGRDCPDWVPGELWIGGIGVAEGYFNDPVRSEQQFVTQSNARWYRTGDLGCYWPDGTLEFLGRRD
KQVKVGGYRIELGEIESALSQLAGVKQSTVVAIGEKEKTLAAWVVPQGSAFCVTHHRDPALPQAWRGLAGTLPCCVCPPE
ISAGQVADFLQHRLLKLKPGQTPGADPLPLMNALAIQPRWRAVVERWLAFLVTQQRLQPAAEGYQVCAGEAPENDPPSFS
GHDLTLTQILRGARHELSLLNDARWSPESLAFDHPASALYIEELATICQQLSRRLQRPVRLLEVGVRTARAAECLLTRLS
ADEIEYVGLEHSQELLLSARQRLAPWSDARLALWSADTLTAHAHSADIIWLNNALHRLLPEEPGLLAALQQLAVPGALLY
VLEFRQLTPSALLSTLLLTDGQPEALLHNSADWGAIFTAAAFNCQHGDEVEGLQRFLVQCPVSQVRRDPRQLQSALAERL
PGWMVPKRIFLLDALPLTANGKIDYQTLKRCHTPEAENRTEADLPLGDIEKQVAVIWQPLLSMGAVSRETDFFQHGGDSL
LATRLIGQLHQAGYEARLSDLFNHPRLADFAATLRKTDLPVEQPFVHSPEERYRPFALTDVQQAYLVGRQPGFALGGVGS
HFFVEFEIADLDIHRLEKVWNRLIARHDMLRAVVRDGQQRVLEQTPPWVIPAHILHSPEEALQVRDRLAHQVLNPEVWPV
FDLQVGFVDGMPARLWLCLDNLLLDGLSMQILLSELEHGYRYPQQLPPPLPVTFRDYLQQPALRTPNPDSLAWWQTQLDD
IPPAPALPLRCLPQDVETPRFARLYGAMDSARWRRLKQRAADAHLTPSAVLLSVWSTVLAAWSAQPDFTLNLTLFDRRPL
HPQINQILGDFTSLMLLSWHPGESWLQSARLLQQRLSESLNHRDVSAIRVMRQLARRQNVPAVPMPVVFTSALGFEQDNF
LARRNLLKPVWGISQTPQVWLDHQVYESEGELRFNWDFVAALFPDGQVERQFAQYCALLNRMAEDDSSWQLPLADLVPPL
KVTERRARRLRPERAQPRIAADKSSVSLICDTFREVVGEPVAPAENFFEAGATSLNLVQLHVLLQRHEFATLTLLDLFTH
PSPVALANYLAGVALKEKTKRVRPVRRRQRRI

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
irp2 YP_070124.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 88
irp2 YP_002346902.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 88
irp2 NP_669706.1 HMWP2 nonribosomal peptide synthetase Virulence HPI Protein 0.0 88
irp2 NP_993007.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 88
irp2 CAA21390.1 - Virulence HPI Protein 0.0 88
irp2 YP_853075.1 yersiniabactin biosynthetic protein Virulence PAI IV APEC-O1 Protein 0.0 88
irp2 YP_001006815.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 87
PMI2599 YP_002152317.1 non-ribosomal peptide synthase Not tested Not named Protein 0.0 41