Gene Information

Name : PSPPH_2897 (PSPPH_2897)
Accession : YP_275079.1
Strain : Pseudomonas syringae 1448A
Genome accession : NC_005773
Putative virulence/resistance : Virulence
Product : yersiniabactin non-ribosomal peptide synthetase
Function : -
COG functional category : Q : Secondary metabolites biosynthesis, transport and catabolism
COG ID : COG1020
EC number : -
Position : 3350681 - 3356836 bp
Length : 6156 bp
Strand : +
Note : identified by similarity to GB:AAO56106.1; match to protein family HMM PF00501; match to protein family HMM PF00550; match to protein family HMM PF00668; match to protein family HMM TIGR01733
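The Position and Length fields above are consistent only if the coordinates are read as 1-based and inclusive. A minimal sketch of that check follows, assuming inclusive coordinates; the function name `feature_length` is hypothetical and is not part of any database tooling.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(3350681, 3356836)

# A complete coding sequence should be a whole number of codons;
# the last codon is the stop codon and is not part of the protein.
codons = length_bp // 3
residues = codons - 1

print(length_bp, codons, residues)  # 6156 2052 2051
```

The result agrees with the Length field (6156 bp) and implies a 2051-residue product once the stop codon is excluded.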

DNA sequence :
ATGTACACATCGACCAGAAAAGCCTCGACTGCTCGCGGTACGCCACCTGAAGCATTCGACCCTGCCCTGCTGGGCGAGGA
AATCGCCAGGCAAATGCGTCTGCCCCCCGAGTCACTGACACAGAACGCCAGCCTGTTGAAACTGGGCATGGACTCGATGC
ACCTGATGGCCTGGCTCAACCGGTTTCGCCGCATGGGTTTCAAAGTGACCCTCAGAGACCTCTATGACCAGCCGACGCTG
CAGGGCTGGCAGCAACTGCTTGGCAGCGTAGCCGTTCAGATCAACATCGTCGAGCCTGTCGATAAAGATGAAACGCTGCC
GCTCATGACTGACGGCCAGCCCTTCGCTCTGACCAGCGTACAGCACGCCTACCTGGTGGGGCGTTCCAGCGAACAACCGT
TGGGTGGAGTCGGCTGTCATCTGTATCAGGAATTCGACGGCCACGGCCTGACACCGCAAGTGCTGGAAGCGGCGATCTAT
CGGTTGATCGAACGGCATCCGATGCTCAAGGCACGTTTTCTCGCCGACGGTCGTCAGCAGTGGCAAGCGCGCTCGGCATG
GCCGGGCCTCAAGGTTCATGACCTGCGCGACATTAATGAGGCCTTTCGCCAGCAAAGCCTGATGAATCTGCGCGATCAAC
TGGGCCATCGTCGCCTGGATGTCGAAAATGGCGAAACCTTCGATTTCCAGCTTTGCCTGTTGCCAAACGGGCAAAACCGG
CTGTTTGTCAACATTGACCTGCTGGTCGCGGATGCCGCCAGCTTCAATCAGTTGTTTGAAGAACTGATCGCGCTGATCGA
AGACCGTCCGCTGCCGCCCTCACCGATCGATTACGATTTTTGCAGTTACCTAACGCAGGTGAAGCGCAATGATCAGGCGC
GCATCGAGCAAGCCAGAGCCTGGTGGCTGACGCGTCTGGACGACCTGCCCCTGGCACCGATTCTGCCGCTGGCGCAGGAC
CCTGAGCGCATCGAGAAGGTTCGTATCAGCCGCCGTCGCGACCAGTTGGACACGGCACAGTGGCAGACTTTCAAGGAGCA
CGCGGGAGCGGTCGGTGTAACACCGACCATGGCGCTGGCAACGCTGTTCAGTGCGGTGCTGGGACGCTGGAGCGGGCAAC
AGGCGCTTTTGCTCAACCTCACCCTGTTCGACCGCCAGCCTTTGAACCCGGCAGTCGCCAGCATGATCGCCGACTTCACC
AATATTCTGGCCCTGCCAGTGATCTGCTATGGGCAATCCTTTGCCGAGCTGGCAAGGGCCAATCAGCAGTCGTTTGCCGA
GGTTCACGAACACAGCGCGTGGTCGGGGGTCGAGGTGCTGCGCGAGCTGAAGAAACGCCAGCGCCACCCGCACGGTGCGC
CACTGGTGTTTACCAGCAACCTGGGCCGCCCGCTGTTTGGCGAAGCGGCAGAGAACAAGTTGGGGGCACCCGGCTGGGGC
ATTTCCCAGACGCCGCAAGTGTGGATCGACCACCTGGCCTTCGAGCATCAGGGGCGGATTTATCTGCAATGGGACAGCAA
TGACCAGCTGTTCCCCGAGGGGCTGACCGACACGCTGTTCGACGTCTATTTCGAACAGGTACTGGCGTTGGTGGCAAACC
CTGCGCACTGGTCAAGCCCCTTGTCGGACCTGATGCCGTTCGCCCAGCGTGCCGTTCGCGAACAGGTCAATGCTACCGGG
CAGGCCATTCCTGATGGTTTGCTGCACGATGCAATCTTCATCCAGGCCGAACGTGCCCCGCAGGCCACCGCAGTGATTCA
GGATCAGCGGCGTCTGAGCTTCGACCAACTGGCCGATCAAGCCAGTCGGCTGGCCGGCTGCCTGCAGGCCCTGGGCGTCG
AAGCAGGCGACACGGTTGCTGTCAGCATGCCCAAGGACATCGGCCAGATCGTCAGCGTGCTGGGCATTCTCAAGGCGGGC
GCGATCTATGTGCCAGTACCGCCGGACCAGCCCTTGGCACGGCGTATCGGCATCTATAAAGGTGCCGGGGTGAAGTGCGT
GCTGACCAGCGCCGATGAGCCTGACGAGCACGACATCGACAAGGCCCTGACCTGGCAGCAGGCAATCCGCAGCGAGCCGC
TTCAGCATCAGGTGGCGGTCAGCGCACAGCAACCGGCCTACATCATCTACACCTCAGGCTCGACCGGTGAACCCAAGGGC
GTGGTGATCAGCCATCAAAGCGCCCTGAACACCTGCATGGACATCAGCCAGCGGCATGGCGTCGGCCCCGACGACTGCGT
GCTGGCGCTATCTGCGCTGCATTTTGACCTGTCGGTGTATGACATCTTCGGCGTACTCGGCGCAGGCGCTTCGCTGGTGC
TGGTCAACGAGCAGCAACGGCGCGACCCGGCGCTGTGGTGCAGGCTGATCGATGAACACGGCATCACCCTGTGGAACAGC
GTGCCTGCCCTGCTGGACATGCTGCTGACCTACAGCGAAGGCTTCGATCTGCATAGCCCGGCCGGCCTGAGGCTGGTGAT
GCTGTCCGGCGACTGGATCGGGCTGGACTTGCCGGAGCGCTATCACCAGTACCGCAGCGACGGTCAGTTTGTCGCCATGG
GCGGTGCGACGGAAGCGGCCATCTGGTCCAACATCTGCACCGTGAGCCGGGTTGAGCCGCACTGGCGCTCGATCCCCTAT
GGCACGCCGCTGGCCAACCAGCGTTATCGCGTGGTCGACGAAGCAGGGCGCGATTGCCCGGACTGGGTCGCCGGTGAGCT
GTGGATCGGCGGTACAGGTGTTGCCCAAGGCTATTTCAATGACCCCGAACGCAGTGCCCGGCAGTTCGTCGAAGTGGCTG
GTGAACGCTGGTATCGCACCGGCGACATGGGCTGTTACTGGCCGGACGGTACGCTTGAGTTTCTCGGCCGCCGCGACAAG
CAGGTCAAGGTCGGTGGCTATCGTATCGAGCTGGGTGAAATCGATGCGGCATTCAACCGACTGGAAGGTGTCAAGGCAGC
CATCAGCATGGCGCCGGGTGAACGGGAAAAGAGTCTGGTGGCCTTTGTCGTGCCACAGGGTTCCAGCCTGTGCAGTAGCA
TCGAAGCCGACCCCCGTACACCGGCCAGCTACGCCGACTGGCTGACGCCTGAAACGGCAGCGACAGATACTGCGGAGAGT
TCGCTGGGCGATTTCCTGCTCCAGCATCTGGCATTCAACGGCGTACATTTCAGCGTCCCTGTGACCACCGCAACGGTGCT
GCAGACCTACGCCTGCCAGGCACACTGGCAACCACTGCTGGACCGCTGGTTGACATGGCTCGCAACGCACGGCCGCCTGC
AACGCGATGCAAGGGGAACATGGCAGCTCAAGCACCCGAGCGCAAAGGTAACGTCGCATCACGATTCTGCGTATGCGCTG
ACAGAAGCCCTGTATGCGCACCACGACACGCTGCGCGAAATACTGCACGGTCAGCGCAGCCCTCAGACTCTGCTCGAACA
TCCGCACTGGGCTCCAGAACAGTTGTTGATGCACGCTAGCGGCAGCCGAGAAACCCTCAGCGCGCTGGCAACGTCGCTGG
CCGACCTGAGTCGTCTGTTGCAACGCCCGGTGCGTCTAATCGAAGCCAGCGCCCGCAGCGCCGTGACCGGACTGTTTTTA
CTCGAACGGCTAGATGCATCACAGCTGCAATACATCGCGCTGGATGCCTCACAGGCAATGGTCCTCAAGGCACGCGAACG
CCTCGCCGGGTTCGCGCACGGCAGCGCGCGGCGCGACACCGACAGCGAGCGCCAAGCGCTTGCCTACAGCGCCGACGTGC
TGCTGATCAACAACCAGCTGCACCGCCTTGAAACCCCGCACGCGGTACTGACGCAGTGGAGACAACTATGCGCGCCCGGT
GCTGCGGTCTGTGTGCTGGAACTGGCACAGGCCTCGGCGCTGGCCATGATCAGCGCCGACCTGCTCAACGAAGGCGACAG
TGCTGCAGAACAATTGCGTGACAGCGGGCATTGGCAGGCGTTGCTGACGCAGGCCGGTTTGGCAGCCCAAGGAGTGGATC
GACTGGGCGACATGCAGCGTCTTATCCTGCGTGCCCCCGATGTGCTGCAACGCCCCGATCCTGCGCGACTGACAAAGGCG
CTGGCCAACGAATTGCCCGGTTACATGATTCCACAGCGCCTGTATTTTATCGACGCGCTGCCGCTGACCGCCAACGGCAA
GGTCGACCATCAGGCACTGGCGCAGTTGTGCAAACCGTCCAGCGCACATGCCGAGCAGTGCCAGGCACCTGAAACCGCTC
GCGAACAGATGGTTGCGCAACTCTGGCAGGCGCTGCTGCAAACCGGCGCATTGCATCGCCACAGCCACTTCTTCCAGCTT
GGCGGCGACAGCCTGCTGGCGACCCGGCTGATCGGCGAACTGGCCAGCCACGGCTATCGTGCCCGGCTCGATGATCTGTT
CAACTTTCCGAGCCTACATGCCTTTGCCGCAGCATTGCTGGAAGATGCGCCGCCAGCGGCACGACTTGAGCATGATCCGG
CAGGCCGCTATCAACCGTTTGCCCTGAGCGAAGTGCAGCAGGCTTATCTGGTCGGACGTCAGCCCGGCTTCGTGCTGGGC
GGCGTGGGCGCGCATTTTTTTGTCGAGTTCAGCGTTGAACGGCTTGATATTCAGTTGTTTCAAAATGCCTGGCAGCGTCT
CATCGAGCGTCACGACATCCTGCGTGCCGTGGTGCGTGACGGCGCGTTGCAGGTTCTGCAACAGGTCCCGGCTTTCATTC
CTCGACGCCACCGGGTGTCCGCGCTGAATGGCAAGGAAGCACTTGCACTGCGCGAACGGCTCTCTCATCAGGTACTCGAT
GCCGGGCAATGGCCGCTGTTCGATGCACAGATAGCCGAAGACGGCAGCGCACACAGCCGGATTTTCGTCAGCCTCGACAA
CTTGCTGCTCGACGGCATGAGCATGCAGATTTTGCTGGCCGAGCTAGAAACGCTGTACCTGCACCCGGAACAGGTACTGG
ACCCGATCAATATCAGCTTCCGTGACTATCAATGCCTGCGCGCCAGCCAGCCGCCCTCGCCCGGTGCTCAAGCGTACTGG
CAGCGACGTCTCGACGACCTGCCGCCGGCGCCGCGTCTGCCGCTGCGCTGCGAGCCGGCTGACGCCGGTACGCCACGCTT
CGTACGCCTTGCCGAGCGCCTGCCGGCGGCGCGCTGGGAAAGCCTGAAAGCCCAGGCCCATCAGCACCAGTTGACGCCGT
CGGGGCTGCTGCTCAGCGCTTTTTCCGCCGTGCTGTCGGCGTGGAGCAGCGAGCAAGAGCTGAGCCTTAACCTGACTCTG
TTTGATCGTCAGCCACTGCACCCGCATATCGACCGGGTACTGGGTGATTTCACCTCATTACTGCTTCTGGCCTGGCAACC
TTCTGCTGACTGGCTCGGCAGTGCCCAGCGTTTGCAACAGCGGCTATGGCGCGATCTGGCCCAGCGCGATCACTCGGCAA
TCCGCGTCATGCGAGAACTGGCCAGCCGCCGCGGCATGGCTGCCGCGCAAATGCCGGTGGTCTTCACCAGTGCGCTGGGG
TTCGACAAAGGCCGCTTCATGGCCCAGTCCTCGTGGCTCAAGCCGGTCTGGGGAATCTCGCAAACCCCGCAAGTCTGGCT
CGATCATCAAGTGTACGAGTCCGAAGGCGACCTGTGCCTGAACTGGGATGCGGTCGAGGCGCTGTTTGAGCCTAAAGTCT
TGCGCAGCATGTTCGATCAGTACCTTGCCCTGCTCAATCGTCTGGCTGAGCAACCCCAGGCCTGGGTCTTGCCTTTGGCC
CGGTTGGTTGTCCCCGCAGAGCCTGACGCTGGCGTGGCAACGCTGCCTAGTAGGCCACAGCAGCACGAGCCTGGGCAACC
GGCCGACGAACAACTCGTCGAGCAGATCCGGCATGCCTTTCATGAGGTCGTCGGCCTGAAGCTGCAAGACTGTCGGCAGA
ACTTTTTCGACGCTGGCGCCAGCTCGCTGAAACTGGTGCAACTGCATGTGAAGCTGACCCAGCAGGGCCATCGTCATTTG
CAGGCCACCGATCTGTTCGGCTACCCGAATGCCCGTGCGCTGGCGCAACACCTGAACCACACGCAACCGGCAAACGATAC
GCGGGAACAACCCCGGCAAACCCGGTTGGTGCAGAGAAACGCCAGAAGGTTGCGGCGCAGCGGAGGTGGTTCATGA

Protein sequence :
MYTSTRKASTARGTPPEAFDPALLGEEIARQMRLPPESLTQNASLLKLGMDSMHLMAWLNRFRRMGFKVTLRDLYDQPTL
QGWQQLLGSVAVQINIVEPVDKDETLPLMTDGQPFALTSVQHAYLVGRSSEQPLGGVGCHLYQEFDGHGLTPQVLEAAIY
RLIERHPMLKARFLADGRQQWQARSAWPGLKVHDLRDINEAFRQQSLMNLRDQLGHRRLDVENGETFDFQLCLLPNGQNR
LFVNIDLLVADAASFNQLFEELIALIEDRPLPPSPIDYDFCSYLTQVKRNDQARIEQARAWWLTRLDDLPLAPILPLAQD
PERIEKVRISRRRDQLDTAQWQTFKEHAGAVGVTPTMALATLFSAVLGRWSGQQALLLNLTLFDRQPLNPAVASMIADFT
NILALPVICYGQSFAELARANQQSFAEVHEHSAWSGVEVLRELKKRQRHPHGAPLVFTSNLGRPLFGEAAENKLGAPGWG
ISQTPQVWIDHLAFEHQGRIYLQWDSNDQLFPEGLTDTLFDVYFEQVLALVANPAHWSSPLSDLMPFAQRAVREQVNATG
QAIPDGLLHDAIFIQAERAPQATAVIQDQRRLSFDQLADQASRLAGCLQALGVEAGDTVAVSMPKDIGQIVSVLGILKAG
AIYVPVPPDQPLARRIGIYKGAGVKCVLTSADEPDEHDIDKALTWQQAIRSEPLQHQVAVSAQQPAYIIYTSGSTGEPKG
VVISHQSALNTCMDISQRHGVGPDDCVLALSALHFDLSVYDIFGVLGAGASLVLVNEQQRRDPALWCRLIDEHGITLWNS
VPALLDMLLTYSEGFDLHSPAGLRLVMLSGDWIGLDLPERYHQYRSDGQFVAMGGATEAAIWSNICTVSRVEPHWRSIPY
GTPLANQRYRVVDEAGRDCPDWVAGELWIGGTGVAQGYFNDPERSARQFVEVAGERWYRTGDMGCYWPDGTLEFLGRRDK
QVKVGGYRIELGEIDAAFNRLEGVKAAISMAPGEREKSLVAFVVPQGSSLCSSIEADPRTPASYADWLTPETAATDTAES
SLGDFLLQHLAFNGVHFSVPVTTATVLQTYACQAHWQPLLDRWLTWLATHGRLQRDARGTWQLKHPSAKVTSHHDSAYAL
TEALYAHHDTLREILHGQRSPQTLLEHPHWAPEQLLMHASGSRETLSALATSLADLSRLLQRPVRLIEASARSAVTGLFL
LERLDASQLQYIALDASQAMVLKARERLAGFAHGSARRDTDSERQALAYSADVLLINNQLHRLETPHAVLTQWRQLCAPG
AAVCVLELAQASALAMISADLLNEGDSAAEQLRDSGHWQALLTQAGLAAQGVDRLGDMQRLILRAPDVLQRPDPARLTKA
LANELPGYMIPQRLYFIDALPLTANGKVDHQALAQLCKPSSAHAEQCQAPETAREQMVAQLWQALLQTGALHRHSHFFQL
GGDSLLATRLIGELASHGYRARLDDLFNFPSLHAFAAALLEDAPPAARLEHDPAGRYQPFALSEVQQAYLVGRQPGFVLG
GVGAHFFVEFSVERLDIQLFQNAWQRLIERHDILRAVVRDGALQVLQQVPAFIPRRHRVSALNGKEALALRERLSHQVLD
AGQWPLFDAQIAEDGSAHSRIFVSLDNLLLDGMSMQILLAELETLYLHPEQVLDPINISFRDYQCLRASQPPSPGAQAYW
QRRLDDLPPAPRLPLRCEPADAGTPRFVRLAERLPAARWESLKAQAHQHQLTPSGLLLSAFSAVLSAWSSEQELSLNLTL
FDRQPLHPHIDRVLGDFTSLLLLAWQPSADWLGSAQRLQQRLWRDLAQRDHSAIRVMRELASRRGMAAAQMPVVFTSALG
FDKGRFMAQSSWLKPVWGISQTPQVWLDHQVYESEGDLCLNWDAVEALFEPKVLRSMFDQYLALLNRLAEQPQAWVLPLA
RLVVPAEPDAGVATLPSRPQQHEPGQPADEQLVEQIRHAFHEVVGLKLQDCRQNFFDAGASSLKLVQLHVKLTQQGHRHL
QATDLFGYPNARALAQHLNHTQPANDTREQPRQTRLVQRNARRLRRSGGGS
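
The protein sequence above can be spot-checked against the DNA sequence by translating codons in frame. The sketch below assumes the standard codon assignments and translates only the first 24 and last 12 bases of the DNA sequence rather than the full gene; `translate`, `head`, and `tail` are illustrative names, not part of the record.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


head = "ATGTACACATCGACCAGAAAAGCC"  # first 24 bases of the DNA sequence
tail = "GGTGGTTCATGA"              # last 12 bases, ending in the TGA stop codon

print(translate(head))  # MYTSTRKA
print(translate(tail))  # GGS
```

Both fragments match the start (MYTSTRKA) and end (GGS) of the listed protein sequence.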

Homologs from PAI DB

Gene     GenBank Accn    Product                                Virulence or Resistance  PAI or REI      Alignment Type  E-val  Identity (%)
irp2     YP_070124.1     yersiniabactin biosynthetic protein    Virulence                HPI             Protein         0.0    52
irp2     YP_001006815.1  yersiniabactin biosynthetic protein    Virulence                HPI             Protein         0.0    52
irp2     YP_002346902.1  yersiniabactin biosynthetic protein    Virulence                HPI             Protein         0.0    52
irp2     NP_669706.1     HMWP2 nonribosomal peptide synthetase  Virulence                HPI             Protein         0.0    52
irp2     NP_993007.1     yersiniabactin biosynthetic protein    Virulence                HPI             Protein         0.0    52
irp2     CAA21390.1      -                                      Virulence                HPI             Protein         0.0    52
irp2     YP_853075.1     yersiniabactin biosynthetic protein    Virulence                PAI IV APEC-O1  Protein         0.0    52
PMI2599  YP_002152317.1  non-ribosomal peptide synthase         Not tested               Not named       Protein         0.0    43