Gene Information

Name : irp2 (YpAngola_A2098)
Accession : YP_001606556.1
Strain : Yersinia pestis Angola
Genome accession: NC_010159
Putative virulence/resistance : Virulence
Product : yersiniabactin synthetase, HMWP2 component
Function : -
COG functional category : Q : Secondary metabolites biosynthesis, transport and catabolism
COG ID : COG1020
EC number : -
Position : 2182760 - 2188867 bp
Length : 6108 bp
Strand : -
Note : identified by similarity to GB:CAA21390.1; match to protein family HMM PF00501; match to protein family HMM PF00550; match to protein family HMM PF00668; match to protein family HMM PF08241; match to protein family HMM PF08242; match to protein family HMM

DNA sequence :
ATGATTTCTGGCGCACCATCTCAGGATTCGCTGTTACCGGACAACCGCCACGCGGCTGATTACCAACAATTACGCGAGCG
GCTCATACAGGAACTGAATTTAACGCCGCAGCAGTTACATGAAGAGAGCAACCTGATCCAGGCCGGCCTGGATTCCATAA
GATTGATGAGATGGTTACACTGGTTTCGTAAAAATGGCTACCGCCTTACCCTTCGCGAGCTGTATGCCGCCCCCACGCTG
GCGGCATGGAACCAGTTAATGCTCAGCCGGTCGCCTGAGAACGCGGAAGAAGAAACGCCGCCCGACGAATCATCCTGGCC
GAACATGACCGAAAGTACCCCCTTCCCATTGACGCCAGTACAGCACGCCTACCTGACGGGCCGCATGCCGGGGCAGACGC
TTGGCGGCGTGGGTTGCCACCTGTATCAGGAGTTTGAAGGCCATTGTCTGACGGCGTCGCAGCTGGAGCAGGCCATCACG
ACCTTGCTGCAACGCCACCCAATGCTGCATATCGCCTTTCGCCCCGACGGGCAGCAGGTCTGGCTACCGCAACCTTACTG
GAACGGCGTCACCGTTCATGATTTACGCCATAACGACGCTGAAAGCCGCCAGGCCTATCTGGACGCACTGCGCCAGCGCC
TGAGCCACCGTCTTTTACGCGTGGAAATCGGCGAAACGTTTGATTTTCAGCTGACGCTCTTGCCGGACAATCGCCACCGC
CTCCATGTCAATATTGACCTGCTGATTATGGATGCCTCCAGCTTTACGCTTTTCTTCGATGAGCTTAACGCCCTGCTGGC
CGGAGAATCGCTGCCGGCTATCGACACCCGCTATGATTTCCGCTCGTATTTGCTGCACCAGCAGAAGATCAATCAACCAC
TGAGAGACGACGCGCGCGCTTACTGGCTGGCGAAAGCATCGACGCTTCCCCCCGCGCCCGTCTTGCCGCTGGCCTGCGAA
CCCGCCACGCTACGTGAAGTCCGTAATACCCGACGCCGCATGATTGTCCCGGCAACACGCTGGCACGCCTTTAGCAACCG
GGCCGGCGAGTATGGCGTGACGCCGACAATGGCGCTGGCGACCTGTTTTTCTGCCGTGCTGGCTCGCTGGGGCGGCCTGA
CGCGTCTGCTGCTTAACATCACCTTATTCGACCGCCAGCCGCTGCACCCGGCGGTTGGCGCGATGCTTGCCGACTTCACC
AATATTCTTCTGCTGGATACCGCCTGCGATGGCGATACCGTCAGCAACCTGGCGCGTAAAAACCAGCTCACGTTTACGGA
GGACTGGGAGCATCGCCACTGGTCCGGCGTCGAATTACTCCGTGAACTCAAACGCCAGCAGCGCTACCCCCACGGCGCCC
CGGTGGTATTTACCAGCAATCTGGGGCGTTCCCTCTACAGCAGCCGCGCAGAATCGCCGTTGGGCGAGCCGGAATGGGGC
ATCTCGCAAACGCCGCAGGTCTGGATAGATCATCTGGCGTTCGAGCATCACGGCGAGGTCTGGCTACAATGGGACAGCAA
CGACGCGCTGTTCCCTCCGGCGTTAGTCGAAACATTGTTCGACGCCTACTGCCAGTTGATTAACCAACTCTGCGATGACG
AAAGCGCCTGGCAAAAGCCGTTCGCAGATATGATGCCCGCCAGCCAGCGCGCGATACGCGAACGGGTCAACGCCACCGGC
GCCCCCATTCCCGAAGGCTTGCTGCATGAAGGCATTTTCCGTATCGCTCTGCAACAGCCGCAGGCGCTGGCGGTAACGGA
CATGCGTTATCAGTGGAATTATCATGAGCTGACAGACTATGCCCGCCGTTGCGCGGGCAGGTTAATCGAGTGCGGGGTTC
AGCCCGGCGATAATGTGGCTATCACGATGTCGAAAGGCGCAGGACAACTTGTTGCGGTTCTGGCCGTCCTGCTGGCCGGG
GCGGTTTACGTTCCGGTTTCGCTGGATCAGCCTGCCGCACGGCGCGAGAAAATCTACGCTGACGCCAGCGTCCGGCTGGT
GCTCATTTGTCAGCACGACGCCAGCGCCGGGTCAGACGATATTCCCGTCCTTGCCTGGCAGCAGGCCATTGAGGCGGAGC
CGATCGCCAACCCGGTAGTACGCGCCCCCACGCAACCGGCCTACATTATCTACACCTCCGGCTCTACCGGTACGCCGAAA
GGGGTAGTCATTTCTCACCGGGGAGCGCTTAACACCTGTTGCGATATCAATACCCGCTATCAGGTTGGCCCGCATGACAG
GGTGCTGGCCCTCTCCGCCCTACATTTTGATTTATCGGTTTACGACATTTTTGGCGTACTGCGCGCGGGCGGCGCGCTGG
TGATGGTGATGGAAAATCAACGGCGCGATCCTCACGCATGGTGTGAGCTGATCCAGCGCCATCAGGTCACGCTCTGGAAC
AGCGTCCCGGCGCTGTTCGATATGCTGCTGACCTGGTGTGAAGGTTTCGCCGACGCCACGCCGGAAAACCTGCGCGCAGT
GATGCTTTCCGGCGACTGGATCGGGCTTGACCTCCCCGCCCGTTATCGGGCCTTCCGGCCACAAGGACAATTTATCGCGA
TGGGCGGCGCCACCGAGGCGTCTATCTGGTCTAACGCCTGCGAAATTCACGACGTCCCCGCCCACTGGCGCTCCATCCCT
TACGGTTTTCCGCTAACCAACCAACGCTACCGGGTGGTGGATGAACAGGGCCGGGACTGCCCTGACTGGGTGCCGGGTGA
ATTATGGATTGGCGGCATTGGGGTCGCGGAAGGCTATTTCAACGATCCCCTGCGTAGCGAGCAGCAATTTTTGACGCTCC
CGGACGAGCGCTGGTATCGCACCGGCGATCTCGGCTGCTACTGGCCAGATGGCACAATCGAGTTCCTCGGTCGTCGCGAC
AAGCAGGTCAAAGTCGGAGGATATCGCATCGAGCTGGGCGAAATCGAAAGCGCGCTCAGCCAGCTGGCGGGGGTGAAACA
AGCAACCGTTCTGGCGATCGGCGAAAAAGAAAAAACGCTGGCGGCATACGTTGTTCCTCAGGGCGAGGCTTTTTGCGTTA
CCGATCATCGGAACCCGGCACTGCCGCAGGCGTGGCACACGCTTGCGGGAACGTTGCCCTGTTGCGCCATCTCGCCAGAG
ATCTCCGCAGAACAGGTAGCCGATTTCCTTCAGCATCGCCTGCTAAAACTGAAGCCGGGTCACACCGCTGGCGCCGATCC
TCTCCCCCTGATGAACTCACTCGCTATCCAGCCGCGCTGGCAGGCCGTGGTGGAACGCTGGTTAGCATTTCTGGTGACAC
AACGGCGACTGAAGCCCGCTGCTGAAGGTTATCAGGTCTGCGCTGGTGAAGAACGCGAGGATGAGCACCCGCACTTCAGC
GGACATGATTTAACGTTATCGCAAATTCTTCGCGGTGCCCGTAACGAACTGTCGTTACTGAACGACGCGCAGTGGTCGCC
GGAAAGCCTGGCCTTTAACCATCCGGCCAGCGCCCCGTATATTCAGGAACTGGCGACAATTTGCCAACAGCTTGCACAGC
GCTTACAGCGCCCGGTACGCCTGCTTGAGGTGGGAACCCGCACTGGCCGCGCCGCAGAATCGCTGTTAGCACAGCTCAAC
GCCGGACAGATTGAGTATGTCGGGCTTGAGCAGAGCCAGGAGATGCTGCTGAGCGCCCGGCAGAGGCTCGCCCCCTGGCC
TGGCGCCCGTCTGTCCCTCTGGAATGCAGACACGCTGGCGGCGCACGCTCACTCGGCGGACATTATCTGGCTTAATAACG
CCCTGCATCGTCTGCTGCCGGAAGATCCCGGGCTCCTTGCGACATTACAACAGCTTGCCGTTCCCGGCGCGCTGCTCTAC
GTGATGGAGTTTCGCCAGTTAACGCCGTCCGCCCTGCTCAGCACGCTCCTGTTAACCAATGGGCAGCCGGAGGCCTTGCT
GCATAACAGCGCCGACTGGGCGGCATTATTTAGCGCGGCCGCCTTCAACTGTCAGCATGGCGATGAGGTCGCGGGGTTAC
AACGCTTCCTCGTACAATGTCCTGACAGGCAGGTGCGCCGCGATCCCCGTCAACTTCAGGCCGCCCTCGCCGGACGTCTG
CCGGGGTGGATGGTGCCGCAACGGATCGTCTTCCTCGACGCCTTACCGCTGACGGCTAACGGGAAAATTGACTACCAGGC
GCTGAAGCGTCGTCATACCCCTGAAGCGGAAAACCCGGCCGAAGCGGATTTACCCCAGGGCGACATTGAAAAACAGGTTG
CCGCCCTCTGGCAGCAACTCTTATCAACTGGCAATGTCACCAGAGAAACCGACTTCTTCCAGCAAGGCGGCGATAGCCTG
CTGGCGACCCGTCTGACCGGGCAACTTCATCAGGCAGGTTATGAAGCGCAATTAAGCGACCTGTTTAATCATCCCCGGCT
GGCGGATTTTGCCGCCACGCTGCGGAAAACCGACGTCCCGGTCGAACAACCATTCGTCCACTCCCCTGAAGATCGCTACC
AGCCCTTTGCGCTTACCGACGTGCAGCAGGCTTACCTTGTGGGGCGTCAGCCGGGCTTTGCCCTGGGCGGCGTCGGCTCA
CATTTCTTTGTTGAATTTGAAATTGCCGATCTGGACATCACCCGGCTGGAGACGGTCTGGAACCGATTAATCGCCCGCCA
CGATATGCTGCGCGCCATCGTGCGTGATGGACAGCAACAGGTGCTCGAACAGACGCCCCCCTGGGTGATACCCGCACACA
CCCTCCATACGCCTGAAGAGGCGTTGCGGGTGCGCGAAAAACTGGCGCATCAGGTACTCAACCCCGAAGTGTGGCCGGTA
TTCGATCTCCAGGTCGGATACGTGGACGGGATGCCTGCCCGCCTGTGGCTGTGTCTGGATAACCTGTTGCTTGACGGTCT
GAGCATGCAGATCCTGCTGGCGGAGCTGGAGCACGGCTACCGCTACCCGCAACAGCTGCTTCCGCCGCTGCCCGTCACCT
TCAGGGATTATCTGCAACAACCCTCGCTACAGTCGCCCAATCCAGATTCTCTGGCATGGTGGCAGGCGCAGCTTGATGAT
ATTCCTCCGGCGCCTGCGTTGCCGCTGCGCTGCTTGCCTCAGGAGGTTGAAACACCGCGCTTCGCCCGCCTGAACGGCGC
GCTGGACAGCACGCGCTGGCATCGGCTGAAAAAACGGGCGGCTGACGCCCATCTCACCCCGTCGGCCGTACTGTTGTCGG
TGTGGTCAACGGTTCTCTCTGCATGGAGTGCACAGCCTGAGTTCACGCTTAACCTTACGCTTTTCGACAGGCGACCGCTG
CACCCGCAAATCAACCAGATTCTGGGCGATTTCACCTCGCTGATGCTGCTGAGCTGGCATCCCGGCGAAAGCTGGCTGCA
CAGCGCGCAGTCACTACAGCAGCGGCTGAGCCAGAACCTCAACCACCGCGATGTGTCAGCCATCCGCGTGATGCGTCAAC
TGGCGCAACGGCAAAACGTGCCTGCCGTTCCGATGCCCGTCGTCTTTACCAGCGCGCTGGGCTTTGAGCAGGATAACTTC
CTCGCCCGGCGTAATCTGCTCAAACCGGTCTGGGGCATCTCCCAGACGCCGCAGGTCTGGCTCGATCACCAGATTTATGA
ATCCGAAGGCGAACTGCGCTTTAACTGGGATTTTGTCGCCGCGCTGTTTCCTGCCGGGCAGGTGGAGCGCCAGTTTGAAC
AGTATTGCGCATTGCTAAACCGAATGGCCGAGGATGAAAGCGGCTGGCAACTGCCGCTCGCCGCGCTGGTGCCTCCCGTT
AAACACGCAGGGCAATGCGCAGAGCGCTCACCGCGCGTATGCCCTGAGCACTCTCAGCCACACATTGCGGCGGACGAGAG
CACCGTCAGCCTGATTTGCGACGCCTTCCGCGAGGTGGTTGGCGAGTCTGTCACGCCCGCAGAAAACTTCTTTGAGGCAG
GCGCAACGTCGCTGAATCTGGTGCAACTGCACGTTTTGTTACAACGTCACGAATTTTCCACCCTGACGTTGCTTGACCTC
TTCACCCACCCTTCTCCTGCTGCCCTGGCCGATTATCTGGCCGGCGTCGCCACGGTGGAGAAAACAAAACGACCTCGCCC
TGTTCGCCGTCGTCAGCGGCGGATATAG

Protein sequence :
MISGAPSQDSLLPDNRHAADYQQLRERLIQELNLTPQQLHEESNLIQAGLDSIRLMRWLHWFRKNGYRLTLRELYAAPTL
AAWNQLMLSRSPENAEEETPPDESSWPNMTESTPFPLTPVQHAYLTGRMPGQTLGGVGCHLYQEFEGHCLTASQLEQAIT
TLLQRHPMLHIAFRPDGQQVWLPQPYWNGVTVHDLRHNDAESRQAYLDALRQRLSHRLLRVEIGETFDFQLTLLPDNRHR
LHVNIDLLIMDASSFTLFFDELNALLAGESLPAIDTRYDFRSYLLHQQKINQPLRDDARAYWLAKASTLPPAPVLPLACE
PATLREVRNTRRRMIVPATRWHAFSNRAGEYGVTPTMALATCFSAVLARWGGLTRLLLNITLFDRQPLHPAVGAMLADFT
NILLLDTACDGDTVSNLARKNQLTFTEDWEHRHWSGVELLRELKRQQRYPHGAPVVFTSNLGRSLYSSRAESPLGEPEWG
ISQTPQVWIDHLAFEHHGEVWLQWDSNDALFPPALVETLFDAYCQLINQLCDDESAWQKPFADMMPASQRAIRERVNATG
APIPEGLLHEGIFRIALQQPQALAVTDMRYQWNYHELTDYARRCAGRLIECGVQPGDNVAITMSKGAGQLVAVLAVLLAG
AVYVPVSLDQPAARREKIYADASVRLVLICQHDASAGSDDIPVLAWQQAIEAEPIANPVVRAPTQPAYIIYTSGSTGTPK
GVVISHRGALNTCCDINTRYQVGPHDRVLALSALHFDLSVYDIFGVLRAGGALVMVMENQRRDPHAWCELIQRHQVTLWN
SVPALFDMLLTWCEGFADATPENLRAVMLSGDWIGLDLPARYRAFRPQGQFIAMGGATEASIWSNACEIHDVPAHWRSIP
YGFPLTNQRYRVVDEQGRDCPDWVPGELWIGGIGVAEGYFNDPLRSEQQFLTLPDERWYRTGDLGCYWPDGTIEFLGRRD
KQVKVGGYRIELGEIESALSQLAGVKQATVLAIGEKEKTLAAYVVPQGEAFCVTDHRNPALPQAWHTLAGTLPCCAISPE
ISAEQVADFLQHRLLKLKPGHTAGADPLPLMNSLAIQPRWQAVVERWLAFLVTQRRLKPAAEGYQVCAGEEREDEHPHFS
GHDLTLSQILRGARNELSLLNDAQWSPESLAFNHPASAPYIQELATICQQLAQRLQRPVRLLEVGTRTGRAAESLLAQLN
AGQIEYVGLEQSQEMLLSARQRLAPWPGARLSLWNADTLAAHAHSADIIWLNNALHRLLPEDPGLLATLQQLAVPGALLY
VMEFRQLTPSALLSTLLLTNGQPEALLHNSADWAALFSAAAFNCQHGDEVAGLQRFLVQCPDRQVRRDPRQLQAALAGRL
PGWMVPQRIVFLDALPLTANGKIDYQALKRRHTPEAENPAEADLPQGDIEKQVAALWQQLLSTGNVTRETDFFQQGGDSL
LATRLTGQLHQAGYEAQLSDLFNHPRLADFAATLRKTDVPVEQPFVHSPEDRYQPFALTDVQQAYLVGRQPGFALGGVGS
HFFVEFEIADLDITRLETVWNRLIARHDMLRAIVRDGQQQVLEQTPPWVIPAHTLHTPEEALRVREKLAHQVLNPEVWPV
FDLQVGYVDGMPARLWLCLDNLLLDGLSMQILLAELEHGYRYPQQLLPPLPVTFRDYLQQPSLQSPNPDSLAWWQAQLDD
IPPAPALPLRCLPQEVETPRFARLNGALDSTRWHRLKKRAADAHLTPSAVLLSVWSTVLSAWSAQPEFTLNLTLFDRRPL
HPQINQILGDFTSLMLLSWHPGESWLHSAQSLQQRLSQNLNHRDVSAIRVMRQLAQRQNVPAVPMPVVFTSALGFEQDNF
LARRNLLKPVWGISQTPQVWLDHQIYESEGELRFNWDFVAALFPAGQVERQFEQYCALLNRMAEDESGWQLPLAALVPPV
KHAGQCAERSPRVCPEHSQPHIAADESTVSLICDAFREVVGESVTPAENFFEAGATSLNLVQLHVLLQRHEFSTLTLLDL
FTHPSPAALADYLAGVATVEKTKRPRPVRRRQRRI

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
irp2 YP_001006815.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 99
irp2 YP_002346902.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 99
irp2 NP_669706.1 HMWP2 nonribosomal peptide synthetase Virulence HPI Protein 0.0 99
irp2 YP_070124.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 99
irp2 NP_993007.1 yersiniabactin biosynthetic protein Virulence HPI Protein 0.0 99
irp2 CAA21390.1 - Virulence HPI Protein 0.0 99
irp2 YP_853075.1 yersiniabactin biosynthetic protein Virulence PAI IV APEC-O1 Protein 0.0 99
PMI2599 YP_002152317.1 non-ribosomal peptide synthase Not tested Not named Protein 0.0 41