Gene Information

Name : A225_2992 (A225_2992)
Accession : YP_006498703.1
Strain : Klebsiella oxytoca E718
Genome accession: NC_018106
Putative virulence/resistance : Virulence
Product : iron aquisition yersiniabactin synthesis enzyme (Irp2)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3164832 - 3170930 bp
Length : 6099 bp
Strand : +
Note : -

DNA sequence :
ATGATTTCTGGCGCCCCCTCTCAGGACCCGCTGTTATCGGACAACGGCGAAGCGGCCGATTATCAACAATTACGCGAACT
GTTATTACAGGAACTGAATGTAGCGCCGCAGCAGCTACAGGAAGAGAGCAATCTGATCCAGGCGGGCCTGGACTCCATAA
GATTAATGAGATGGTTACACTGGTTTCGTAAAAAGGGCTACCGCCTCACCCTTCGCGAGCTGTATGCCGCCCCCACGTTG
GCAGCATGGCGCCAGCTGATGCGCAGCCGTTCGGGTGAAAAACCGGATGATGCAAGCTCCCCGGCGGACGCGGCGTGGCC
AGTCATGAGCGAGGGTACCCCCTTCCCGCTGACGCCGGTACAGCATGCCTATCTGACGGGCCGAATGCCCGGACAGACGC
TGGGCGGCGTCGGTTGCCATCTGTATCAGGAGTTTGCCGGCCATTATCTGACGGCGCCAAAGCTGGAGCAGGCCATCACG
ATCTTACTGCAACGCCACCCGATGCTGCATATCGCCTTTCGCGCCGACGGGCAGCAGGTCTGGCTACCGCAGCCATACTG
GAACGGCGTCACCGTTCATGATTTACGCCAGACCGACGAGGCAAGCCGCCAGGCCTATCTGGAAACATTGCGCCAGCGCC
TGAGCCACCGTCTTTTACGCGTGGAAATGGGCGAAACCTTTGATTTCCAGTTGACGCTCTTGCCGGACAATCGCCACCGC
CTCCATGTCAATATCGACCTGCTGATTATGGATGCTTCCAGCTTTACGCTGTTTTTCGATGAACTTAACGCCCTGCTGGC
CGGAGAGTCGCTGCCGCCCGGCGACCCCCGCTATGATTTCCGCGCTTATCTGCTGCACCAGCAGAAGATCAAACAGCCGC
TGCTCGACAAAGCGCGCGCGTACTGGCTGGCGAAAGCATCAATGCTGCCACCCGCGCCGGTTTTACCGCTAGCCTGCGAA
CCCGCCACGCTGCGTGAAGTCCGTAATACCCGACGCCGCATGATTGTCCCGACAACACGCTGGAACGCCTTTAGCCAACG
GGCCGGTGAGAATGGCGTAACGCCGACAATGGCGCTGGCGACCTGTTTTGCTGCCGTTCTGGGGCGCTGGGGCGGCCTGA
CGCGGCTGCTGCTCAATATCACGTTATTTGACCGCCAGCCGCTGCACCCGGCGGTTGACGAAATGCTTGCCGACTTCACC
AATATTCTTCTGCTGGATACCGCCTGCGATGGCGATACCGTCAGCAACCTGGCGCGTAAAAACCAGCTCACGTTTACTGA
GGACTGGGAGCATCGCCACTGGTCCGGCGTCGAGCTCCTTCGTGAACTCAAACGCCAGCAAAGCCATCCCCACGGCGCCC
CGGTGGTATTTACCAGCAATCTGGGGCGCTCCCTGTACAGCAGCCGCGCCGAGTCGCCGCTGGGCGAACCGGAATGGGGC
ATCTCGCAGACGCCGCAGGTCTGGATAGATCATCTGGCGTTCGAGCATCGGGGCGAGGTCTGGCTACAATGGGATAGCAA
TGACGCGTTGTTCCCTCCGGCGCTGGTCGAGACGTTGTTCAACGCCTACTGCCAGCTGATTAACCAACTCTGCGATGATG
AAAGCGCCTGGAAGAAGTCGTTCGCGGACAGGATGCCCCAAAGCCAGCGGGAGATACGTCAACGGGTTAACGCAACCGAC
GCCCCCGTTCCTCAGGGCTTGCTGCATGAAGGCATTTTCCGCATCGCCCTCCGGCAACCGCAGGCGCTGGCGGTAACGGA
CGCGCATTATCGGTGGAATTATCGTGAGCTGACCGAGAACGCGCGCCGCTGCGCTGGCAGGTTAATCGCGTGCGGGGTTC
AGCCCGGCGACAATGTCGCTATTACAATGTCGAAAGGCGCGGGGCAGCTGGTTGCGGTCCTCGCCGTGCTGCTGTCCGGG
GCGGTGTACGTTCCGGTTTCGCTCGACCAGCCCGCCGCGCGGCGCGGGAAAATCTACGCTGACGCCAACGTCCGGCTGGT
GCTCACCTGCCAGCACGACGCCAGCGCCTGGTCAGACGATATTCCCCACCTGACCTGGCAGCAGGCCATTGAGGCGGAGC
CGCTAGCCGACCAGGCGGCGCGCGCGCCGACGCAGCCGGCCTACATTATCTATACCTCCGGGTCGACCGGCACGCCGAAA
GGCGTGGTCATTTCTCACCGTGCAGCGCTGAATACCTGCTGCGATATCAATAGCCGTTACCAGGTGGGTCCCGGAGACAG
AGTGCTGGCCCTCTCCGCCCTGCATTTTGATTTATCGGTTTACGACATTTTTGGCGTTCTGAGCGCCGGCGGCTCGCTGG
TTATCGTTATGGAAAATCAACGGCGCGATCCCCGCGCATGGTGCGAATTGATCCAGCGTCATCAGGTTACGCTCTGGAAC
AGCGTCCCGGCGCTGTTCGATATGCTGTTGACCTGGTGTGAAGGTTTCGCCGACGCCGCGCCGGAGAAACTGCGGGCGGT
CATGCTTTCCGGCGACTGGATCGGCCTCGATCTTCCTGCCCGCTATCACGCCTTCCGCCCCCAGGGACAGTTTATCGCCA
TGGGCGGCGCCACCGAGGCGTCTATCTGGTCTAACGCCTGTGAAATTAACCGGGTCCCCGACCACTGGCGCGCCATCCCT
TATGGTTTTCCGCTGGCCAACCAACGTTACCGGGTGGTGGATGAACTGGGCCGGGACTGCCCGGACTGGGTTCCGGGCGA
ATTGTGGATTGGCGGTATCGGGGTCGCGGAAGGCTACTTTAACGATCCTGTACGCAGCGAGCAGCAGTTTGTAACGCAAT
CGAACGCGCGCTGGTATCGCACCGGCGATCTCGGGTGCTACTGGCCAGACGGTACGTTAGAGTTCCTCGGTCGTCGCGAT
AAGCAGGTCAAAGTCGGGGGATATCGCATCGAGCTGGGTGAAATCGAAAGCGCGCTCAGCCAGCTGGCGGGAGTGAAACA
GTCAACCGTTGTGGCGATCGGCGAAAAAGAAAAGACCCTGGCGGCCTGGGTTGTGCCTCAGGGTTCGGCTTTCTGTGTTA
CCCATCATCGGGACCCGGCGCTGCCCCAGGCGTGGCGCGGGCTTGCCGGAACGCTGCCCTGCTGCGTCTGCCCCCCGGAA
ATCTCCGCCGGCCAGGTCGCTGATTTTCTTCAGCATCGCCTGTTAAAACTGAAACCGGGTCAAACTCCTGGCGCCGATCC
TCTTCCCCTGATGAATGCTCTCGCTATCCAGCCTCGCTGGCGGGCCGTGGTGGAACGCTGGTTAGCCTTTCTCGTGACCC
AACAGCGGCTGCAGCCCGCCGCTGAGGGTTATCAGGTTTGTGCGGGGGAAGCGCCGGAGAATGACCCTCCTTCTTTCAGC
GGGCATGACCTCACCTTAACGCAGATCCTTCGCGGCGCTCGCCACGAACTGTCATTACTGAACGATGCGCGGTGGTCGCC
GGAAAGTCTGGCTTTCGACCATCCGGCCAGCGCCCTGTATATCGAGGAACTGGCGACAATCTGTCAACAGCTTTCCCGGC
GTTTACAGCGCCCGGTACGCCTGCTAGAGGTGGGTGTCCGCACCGCCCGCGCCGCAGAGTGCCTGCTAACCCGGCTCAGC
GCTGACGAGATTGAGTATGTCGGGCTTGAGCACAGCCAGGAGCTGCTGCTGAGCGCCCGACAGCGGCTTGCACCGTGGTC
TGACGCCCGTCTGGCCCTCTGGAGTGCAGACACGCTGGCGGCGCATGCTCATTCGGCGGACATTATCTGGCTGAATAACG
CCCTGCATCGTCTGCTGCCGGAAGAGCCCGGGCTCCTCGCGGCATTACAGCAGCTGGCCGTTCCAGGCGCGCTGCTCTAT
GTGTTGGAGTTTCGTCAGTTAACGCCCTCCGCCCTGCTCAGCACGCTTCTGTTAACCGATGGTCAGCCCGAAGCCTTACT
TCACAATAGCGCTGACTGGGGGGCGATATTTACCGCCGCCGCTTTCAACTGCCAGCACGGCGATGAAGTCGAGGGATTGC
AGCGCTTCCTCGTACAATGCCCCGTTAGCCAGGTACGCCGCGATCCTCGTCAGCTGCAGTCCGCTCTCGCCGAACGTCTG
CCGGGGTGGATGGTACCGAAACGGATCTTCTTACTTGACGCCTTACCGCTGACGGCGAACGGCAAAATTGACTATCAGAC
GCTAAAACGCTGTCATACCCCGGAAGCTGAAAACCGGACTGAAGCGGATTTACCCCTGGGCGATATTGAGAAGCAGGTTG
CCGTCATCTGGCAGCCGCTCTTGTCGATGGGCGCCGTCAGCAGAGAAACCGACTTCTTCCAGCATGGCGGTGATAGCCTG
CTAGCCACCAGGCTGATCGGACAGCTCCACCAGGCGGGTTATGAAGCACGATTAAGCGACCTGTTTAATCACCCCCGGCT
GGCTGATTTTGCCGCCACGCTGCGTAAAACCGACCTCCCTGTTGAACAACCCTTTGTCCACTCCCCTGAAGAACGCTACC
GGCCTTTCGCGCTGACTGATGTGCAGCAGGCTTACCTGGTCGGGCGTCAGCCGGGCTTTGCCCTTGGCGGCGTAGGCTCA
CACTTCTTTGTTGAATTTGAAATTGCCGATCTGGATATTCACCGGCTGGAGAAGGTCTGGAACCGGTTAATCGCCCGCCA
CGATATGCTGCGCGCCGTGGTGCGGGATGGACAACAGCGGGTTCTCGAACAGACTCCCCCATGGGTGATACCCGCGCACA
TCCTGCACAGCCCCGAAGAGGCGCTGCAGGTACGCGACAGACTGGCGCATCAGGTTCTCAACCCTGAAGTATGGCCGGTA
TTCGATCTTCAGGTCGGGTTCGTCGACGGGATGCCCGCCCGCCTGTGGCTGTGTCTGGATAACCTGTTGCTTGACGGGTT
GAGCATGCAGATCCTGTTGTCAGAGCTGGAACACGGCTATCGCTATCCGCAGCAGCTGCCGCCGCCGCTTCCCATTACCT
TCAGGGATTATCTGCAGCAACCCGCGCTGCGGACGCCCAATCCCGACTCTCTGGCATGGTGGCAAACGCAGCTTGATGAT
ATTCCTCCGGCACCTGCGTTACCGCTACGCTGCTTGCCTCAGGACGTTGAGACGCCGCGCTTCGCACGGCTCTATGGGGC
AATGGACAGCGCGCGCTGGCGTCGCCTGAAACAGCGGGCGGCTGATGCTCATCTCACCCCGTCGGCCGTACTGTTATCCG
TGTGGTCAACGGTTCTTGCTGCATGGAGCGCACAGCCCGATTTCACGCTTAACCTTACGCTTTTCGACAGACGACCTCTG
CACCCGCAAATCAACCAGATCCTGGGCGACTTTACTTCGCTGATGCTGCTGAGTTGGCATCCCGGCGAGAGCTGGCTGCA
AAGCGCGCGGTTATTACAGCAGCGACTAAGCGAGAGCCTCAACCACCGCGATGTGTCGGCAATCCGCGTGATGCGGCAAC
TGGCGCGGCGGCAAAACGTACCGGCTGTCCCAATGCCCGTCGTCTTTACCAGCGCGCTGGGCTTTGAGCAGGATAACTTC
CTTGCCCGGCGCAATCTGCTCAAACCGGTGTGGGGTATATCCCAGACGCCGCAGGTCTGGCTCGATCACCAGGTTTATGA
ATCCGAAGGGGAACTGCGCTTTAACTGGGATTTTGTCGCCGCCCTGTTTCCTGACGGGCAGGTGGAGCGCCAGTTTGCAC
AGTATTGCGCGTTGCTTAACCGAATGGCGGAGGATGACAGCAGCTGGCAGCTGCCGCTCGCCGACCTGGTTCCCCCGCTT
AAGGTCACGGAGCGACGCGCCCGCCGGCTGCGCCCTGAGCGCGCTCAGCCGCGGATCGCGGCGGACAAGAGCAGCGTCAG
CCTGATTTGCGACACCTTTCGCGAGGTGGTTGGCGAGCCAGTCGCACCCGCAGAGAACTTTTTTGAGGCGGGCGCCACGT
CGCTGAATCTGGTGCAGCTGCACGTCTTGTTACAACGTCACGAATTCGCCACCCTGACGCTGCTTGACCTGTTCACTCAC
CCTTCTCCTGTCGCGCTGGCCAATTATCTCGCCGGCGTAGCCCTGGAGGAGAAAACCAAACGCGTGCGGCCGGTTCGCCG
TCGTCAGCGGCGCATTTGA

Protein sequence :
MISGAPSQDPLLSDNGEAADYQQLRELLLQELNVAPQQLQEESNLIQAGLDSIRLMRWLHWFRKKGYRLTLRELYAAPTL
AAWRQLMRSRSGEKPDDASSPADAAWPVMSEGTPFPLTPVQHAYLTGRMPGQTLGGVGCHLYQEFAGHYLTAPKLEQAIT
ILLQRHPMLHIAFRADGQQVWLPQPYWNGVTVHDLRQTDEASRQAYLETLRQRLSHRLLRVEMGETFDFQLTLLPDNRHR
LHVNIDLLIMDASSFTLFFDELNALLAGESLPPGDPRYDFRAYLLHQQKIKQPLLDKARAYWLAKASMLPPAPVLPLACE
PATLREVRNTRRRMIVPTTRWNAFSQRAGENGVTPTMALATCFAAVLGRWGGLTRLLLNITLFDRQPLHPAVDEMLADFT
NILLLDTACDGDTVSNLARKNQLTFTEDWEHRHWSGVELLRELKRQQSHPHGAPVVFTSNLGRSLYSSRAESPLGEPEWG
ISQTPQVWIDHLAFEHRGEVWLQWDSNDALFPPALVETLFNAYCQLINQLCDDESAWKKSFADRMPQSQREIRQRVNATD
APVPQGLLHEGIFRIALRQPQALAVTDAHYRWNYRELTENARRCAGRLIACGVQPGDNVAITMSKGAGQLVAVLAVLLSG
AVYVPVSLDQPAARRGKIYADANVRLVLTCQHDASAWSDDIPHLTWQQAIEAEPLADQAARAPTQPAYIIYTSGSTGTPK
GVVISHRAALNTCCDINSRYQVGPGDRVLALSALHFDLSVYDIFGVLSAGGSLVIVMENQRRDPRAWCELIQRHQVTLWN
SVPALFDMLLTWCEGFADAAPEKLRAVMLSGDWIGLDLPARYHAFRPQGQFIAMGGATEASIWSNACEINRVPDHWRAIP
YGFPLANQRYRVVDELGRDCPDWVPGELWIGGIGVAEGYFNDPVRSEQQFVTQSNARWYRTGDLGCYWPDGTLEFLGRRD
KQVKVGGYRIELGEIESALSQLAGVKQSTVVAIGEKEKTLAAWVVPQGSAFCVTHHRDPALPQAWRGLAGTLPCCVCPPE
ISAGQVADFLQHRLLKLKPGQTPGADPLPLMNALAIQPRWRAVVERWLAFLVTQQRLQPAAEGYQVCAGEAPENDPPSFS
GHDLTLTQILRGARHELSLLNDARWSPESLAFDHPASALYIEELATICQQLSRRLQRPVRLLEVGVRTARAAECLLTRLS
ADEIEYVGLEHSQELLLSARQRLAPWSDARLALWSADTLAAHAHSADIIWLNNALHRLLPEEPGLLAALQQLAVPGALLY
VLEFRQLTPSALLSTLLLTDGQPEALLHNSADWGAIFTAAAFNCQHGDEVEGLQRFLVQCPVSQVRRDPRQLQSALAERL
PGWMVPKRIFLLDALPLTANGKIDYQTLKRCHTPEAENRTEADLPLGDIEKQVAVIWQPLLSMGAVSRETDFFQHGGDSL
LATRLIGQLHQAGYEARLSDLFNHPRLADFAATLRKTDLPVEQPFVHSPEERYRPFALTDVQQAYLVGRQPGFALGGVGS
HFFVEFEIADLDIHRLEKVWNRLIARHDMLRAVVRDGQQRVLEQTPPWVIPAHILHSPEEALQVRDRLAHQVLNPEVWPV
FDLQVGFVDGMPARLWLCLDNLLLDGLSMQILLSELEHGYRYPQQLPPPLPITFRDYLQQPALRTPNPDSLAWWQTQLDD
IPPAPALPLRCLPQDVETPRFARLYGAMDSARWRRLKQRAADAHLTPSAVLLSVWSTVLAAWSAQPDFTLNLTLFDRRPL
HPQINQILGDFTSLMLLSWHPGESWLQSARLLQQRLSESLNHRDVSAIRVMRQLARRQNVPAVPMPVVFTSALGFEQDNF
LARRNLLKPVWGISQTPQVWLDHQVYESEGELRFNWDFVAALFPDGQVERQFAQYCALLNRMAEDDSSWQLPLADLVPPL
KVTERRARRLRPERAQPRIAADKSSVSLICDTFREVVGEPVAPAENFFEAGATSLNLVQLHVLLQRHEFATLTLLDLFTH
PSPVALANYLAGVALEEKTKRVRPVRRRQRRI

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
irp2 YP_002346902.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 88
irp2 NP_669706.1 HMWP2 nonribosomal peptide synthetase Virulence HPI Protein 0.0 88
irp2 YP_070124.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 88
irp2 YP_853075.1 yersiniabactin biosynthetic protein Virulence PAI IV APEC-O1 Protein 0.0 88
irp2 NP_993007.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 88
irp2 CAA21390.1 - Virulence HPI Protein 0.0 88
irp2 YP_001006815.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 87
PMI2599 YP_002152317.1 non-ribosomal peptide synthase Not tested Not named Protein 0.0 41