Gene Information

Name : BTI_4537 (BTI_4537)
Accession : YP_007920988.1
Strain :
Genome accession: NC_021174
Putative virulence/resistance : Virulence
Product : methyltransferase domain protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1177706 - 1187443 bp
Length : 9738 bp
Strand : +
Note : -
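
The Position and Length fields above can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention, not stated in this record); the function name feature_length is hypothetical and used only for illustration.

```python
# Minimal sketch: check that the Position and Length fields agree.
# Assumption: coordinates are 1-based and inclusive (GenBank-style).

def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature spanning start..end, both ends included."""
    return end - start + 1

start, end = 1177706, 1187443          # Position field of this record
length_bp = feature_length(start, end)

# A complete coding sequence is a whole number of codons; the last codon
# is the stop codon and contributes no residue to the protein.
codons = length_bp // 3
protein_len = codons - 1

print(length_bp, length_bp % 3, protein_len)
```

Under that assumption the computed length matches the Length field (9738 bp), and the expected protein length is one residue fewer than the number of codons.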

DNA sequence :
ATGGATAAAGCCACCTCGTCCGCTCGCCCGGCCGCGGATCTCGACGCGCTGCTCGCGACGCATTACCCGAACGGCGAGCC
GATCGCCGTGATCGGCCACGCGTGCCGCTTTCCGGAAGCGGACGACAGCGACGCGTTCTGGCGCAACCTGCTCGCCGGCG
CTGAATGCAGCCGCCGCTTCACGCGGCAGGCGTTGCTCGACGCGGGCCTCGACGCCGCGACGATCGACGCGCCGAACTTC
GTCAACGTCGGCTCCGTCGTGCGCGACGCCGACGCGTTCGATGCCGCGCTGTTCGGCTATTCCCGTCAGGAAGCCGAATC
GATCGACCCGCAGCAGCGCCTCTTCCTGCAGATCGTCTGGCACGCGCTCGAGCACGCGGGCTACGCGCCGCGCGACGTCC
CGCACCGCACCGGCGTGTTCGGGTCCGCGCGCATCAGCACGTATCCGGGCAGGGAGCCGCTGCGGATCGCCGAAGTCGCG
CAGGTCAAGGGCCTGCAGTCGCTGATGGGCAACGACAAGGACTATGTCGCGACGCGCGTCGCGTACAAGCTGAACCTGCG
CGGACCGGCGCTCGCCGTGCAGACCGCGTGCTCGAGCTCGCTCGTCGCCGCGCACCTGGCTTGCGAAAGCCTGCGCGCCG
GCGAATGCGACATGGCCGTCGCGGGCGGCGTGGCCGTATCGTTTCCGCAGCACGCGGGCTATCTGCATCAGCCGGGCATG
ATCTTCTCGCCGGACGGCCGCTGCCGGCCGTTCGACGCCGACGCGCAAGGCACGTTCGCCGGCAACGGCGTCGGCGCGGT
CGTGCTGCGTCGGCTCGGCGACGCGCTGCGCGACGGCGACCCCGTCGTCGCGGTGCTGCTCGGCAGCGCGATCAACAACG
ACGGCGACCGCAAGGTCGGCTACACCGCGCCGTCGGCGGCCGGACAGCGCGACGCGATCCGCGACGCGCTGATGCTCGCG
GGCGTCGACAGCACGCAGATCGGCCTCGTCGAGGCGCACGGCACCGGCACGCCGCTCGGCGATCCGGTCGAGCTCGAAGC
GCTGCGCGGCGTCTTTCATCGGGCGGGCGAAGGACCGCGCTGCGCGCTCGGCTCGGTCAAGAGCAACATCGGGCATCTCG
ATACGGCGGCCGGCATCGCGAGCCTCCTGAAAGCCGTGCTCGCCGTCGAGCGGCGCGCGATTCCGCCGAGCCTGCACTTC
CGCAAGCCGAATCCGGCGCTCGGCCTCGACGACAGCCCGTTTTACGTGCCCACCGAAGCGCAGCCCTGGGACGACGCATC
GCGCGTCGCGGGCGTGTCGTCGTTCGGCATCGGCGGCACGAACTGCCATGTGGTCGTCGCGTCGCTGCCGGACGCGTTGC
GCGCGGCGGTCGGCGGCGACGACCGTGCGCAGCCGGACGCGGGCGCGGCGCTGCTGCTGAGCGCGGCGAGCGAGCCCGCG
CTGCAACGCCTCGCGCGCGGCTACGCGGACGCGCTGCGGCATGCGGGCGCGCGCGACTTGCTCCATACCGCGCTGCGCGG
ACGGCAGCTCGACCTGCCGCACCGGCTCGCGGTGCCGTTCTGCGAAGAAACCGTCGCGGCGCTCGACGCGTTCGCGCTGG
GCGAAGACGATGCGCTCGTCCATCGCGGCAGCGGCGAAGCCGGGCAAATGGTCTGGCTTTTCACGGGCCAGGGCTCGCAC
TGGCCCGGCATGGGACAGGCGCTCTATCGGCAATCGCCCGCGTTCGCCGCGTGTCTCGACCGCTGCTTCGCGGCCTGCGA
CGGCGAGCTCGACGTCCCGCTGCGCGACGCGATGTCCGGCGAGCGCGGCGATCTGCTCGAACGGATGGACTACGCGCAAC
CCGCGATCGTCGCATTCGAGCTCGCGATGGCCGCGCACTGGCGCGCACTCGGTCTCGAGCCGCAGATCGTGATCGGTCAT
TCGGTCGGCGAATACGCGGCCGCGGTCGTCGCCGGACACTACGAGATCGAGCAGGCGATGCCGCTCGTCCGGCTGCGCGG
CGCGCTGATGCAGCGCTGCGCGGAGGGCGCGATGCTCGCCGCATTCGCGTGCGCCGACGAATTGCTGCCGCTCGCCAGGC
AAGCCGGCGTCGACGTCGCCGCGCATAACGGCGAGCGTCATCTCGTGTTCTCCGGCCGTCGCGATGCAGTCGACGCGCTC
GCCGCCGCGCTCGCCGCGAAAGACATCCGCCACGCGCGGCTCACGGTCCCGGGCGCCGCGCATTCGGCGCTGCTCGATCC
GGTGCTCGACACGTTCGAACGCGCGGCTGCGCAACTGCATGCGGCGCCCGGCCGCGTGCAGCTCGTGTCGACGCTGCTCG
GCGGGCCGATCGACGCCGATGGGCTCAACGCGCACGGCTATTGGCGCCGGCACATGCGCGAACCGGTCCGTTACGCCGAC
GCGGTCCGTCACGCGATCGCACAGGGCGCGGGCGTGTTTCTCGAACTCGGGCCCGATGCGCAACTGACGGGGATCGGCCT
GCGCGAGTCGCCTCAACGCGCGCGCTGGATCGCGAGCGCGCGCCGACAACAGCCCGCGCTCGCGCAAACGCGGCAAGCGA
CGCTCGAACTCTATGCGGCGGGCGTCGCGCTGCCGTGGGCGAACGTGCTGCCTTCGTCCGGCCGCAAGCTGCACGCGCCG
CGCTATCCGTTCGATGCCGAACGCTATTGGCGCGATGCGCAGCCGGCGGCCGTCACACCGCCCGCGGCGCACGGCGGCGA
TGTCGACCCGGCGCTCGCCGAAGGCCGGCGGGTCGCGGCCGCAGCCGCCGCATCGCTCGATCTGCCGCGCCTGCAACGGC
TCTACGACTGTGTCACGCAGTTGCATGCGATCTACGTCGATCGGCTCGTGCGCCGCTGCATCGGCGAGCGCTTCGACGAG
GGCGCGACCGCGCTCGACATCCTGCGCGCGGGCCGTCTGCTGCCGCGCCATCGCCAGTTGCTCGTGCGGCTCCTGAATGC
GTGCGTCGAAGATGGCTACTACCGCCGCGACAACGACCGCTACGCGCCCGCGCTCGCGGCGCCCCACGCGGAACGCGACG
CGCTGCTGCAGATCCTGCGCGACTGCTGCGAAGGCTTCGACGTGATCGCCGATACCGTCGCGCGCGCGGGAGACAGCCTG
CACGCGATGATGAGCGGCGACATCGAGCCGGTCGCGGTGATCTTCCCGGACAGCGCGTCGAGCGGCGTCGAAGTGCTGTA
TCAGGAATTCAGCTTCGGGCGCTATTTCAACCAGATCGCAGCGGGCGTCGTCGCGGGGCTCGTCCGCGAGCGGCGGACGA
ACCGTCGCCGGCACCGCCCATTCCGGATTCTCGAAGTCGGCGGCGGCACCGGCGGCACGACCGCGTGGCTGCTGCCGGAA
CTCGACGGCGAGCCGAACGTCCGCTACGACTTCACCGACATCTCGCCGATTTTCACGCGGCGCGCCGAGCAGAAATTCGC
CGCTCACGAAGGCGTCGACTATCGCGTGTTCGATCTGCAAAAAGACGCGCATGCGCAAGGCTTCGAAGCGGGCGCATACG
ACCTGATCGTCGCGGCCAACGTGATCCATGCGACGCAGCACGTCGGCCGCGCGCTCGCGAACCTCGCGCCGCTGCTGAAG
CCGGGCGGCCGTCTGCTGATGCGCGAGATCACGCGGCCGATGCGCCTCTTCGATTTCGTATTCGGCCCGCTCGTGCTGCC
GCTGCACGACGAGGACGCGCGCGGCGGCGAGCTGTTCCTGTCGACCGCGCGCTGGAAGGAACAATGCGTCGCGGCGGGCT
TCGAGCGCATCGACTGGCTGCCGGACGACGGCGCGCCGACGTCGGGCATCAGCGAGCACATCGTGCTCGCGACCGCGCCG
GGCCGCTCGGCGGGCGTCGCGCCGTGGCTCGCCGACGACGCCGATCCGCTGCTCGGTCAGCCGCTCACCGACGACGGCGT
GTATCTCGCCGACTGGTCCGATTGCGCGGGCCGGCGCGAGGCATGGCAGCAACGGCTCGCGCGGGGCGGCGCGGAACTGG
CGGGCCGCCACGGCGGCGGCCAGGCGGCTCCGACGATTCGCGCGCCCGAGCGCGCGCCGGCGTGGCTGACGCTCGTGCGC
CTGCGCTGGTGCGCCAGTCCGTTCGGCGCGGCGCGGATCGCGCTCGACGCGCGCGACGAATCCGGCGCATGGCGGCCGCT
CGACGCCGACTCCCCCGAAGACGGCCTGCCCGCGCCGCTGCCTGCGCGCGATACGCATTACGGCTGGCAGTGGCGGCCCG
TGTCCGATGCGTCGCCAGATGCGAACGGCATCGCATTGCACGCCGCGTCGACGCCGTTCGCCGACTCGCTGCGCGACGCG
GGCATGCCGATCGCGCCGCACGCGGATCGCCGATTGTTCATCCTCGATCCGGACGAGAAGCCGCTACAGGCGATCGCGCC
GGCATTGCTCGACGCGCTGTCCGAAGCAAGCCGCGCGCCGCTCGTCGTCGTGACCCGCGGCGCATGGAAGGTCCACGCCG
ACGATCTCGTCGACCCCGCGCATCGCGCCGCGTGGGGGCTGCTGCGCGTCGCGGCCGCCGAACGGCCGGACCGGATGCTC
GCCGCGATCGACCTGCATCCGGCCGCCGCGTGGCGCGATCTGCTGCCGGCGCTCGATGCGCTCGGCAGCGGCGCTCGCTG
GCTCGCGGTCCGCGACGGCCGCGCGCATGCGCCATCGCTCGCGGCGGAACCCTACGTCGCGCCCGCGCTGCCCGCCGGCG
CGCTCGCCGGCGAACGCTGGCACGTCGTCACCGGCGCGTTCGGCGGCCTCGGCCGGCTGAGCGTGCGCTGGCTCGCGCGC
CACGGCGCGCGCCGCATCGCGCTCGTCGCGCCTCGCGCGCATGACGACTGGTCTGCGTTCCAGCATGAAGTCGAAGCGCT
CCATGCATGCAAATTGCGCTGGGTGCGCTGCGACATCGCCGAACCCGCGCAGTTGACGGCGGCGCTGCACGCGCTGCACG
CGGACGGCGGCGTCGCGGGCGCGATCCATGCGGCCGGCATCCTCGACGATGCGCCGCTCGCCACGCTCGACGCCGAACGC
ATCGCGCCCGTGCTCGCGGTGAAGGCCGACGCGGCGCGCGTGCTGCGCGACTGGCTCGGCGCGCACGACGCGCGCTATCT
GCTTTTCTATTCGTCGGCCGCCGCCGCGCTCGGCGCGCCGGGACAAGGCGCGCACGCGTTCGCGAGCGCCTATCTCGACG
GGCTCGCCGAAGCGCGCGCAAGCGGCGAAACGCCAGCGGTGATCTCCATCGCGTGGGGCGCATGGGGCGAAGCCGGACGC
GCGGCCGAGCCCGCATTGCAACACAAGCTCGCGGAGAGCGGGATGGGCGTGCTGTCGAGCGCGGAAGGCCTGTGGCATCT
GGAACAGGCCGTGATGCGCGGCGCGCCATACCGGCTCGCGATGCGCGTGCTGCGCGAGCGGCTCGACGCCGGCCGCCGCG
CGCTGTTCGACGCCGGCGCCGATACGCCATCCGCGCGGCTCGCGCGCTCGCCGACGGCGGCCTCCCGCGCCGTAGCGGCG
TCGGCGCCCGCCGCCGCGCGCCCCGATCCGCGCGATGCCGATGCCGTCCGCCAATGGTTGACGGCGCGCATCGCCGCGCA
ACTGAAGCTCGACGATCTCGCTCAACTCACGCCGAAACGCGATCTGCTGAAGCTCGGCCTCGATTCGCTGCTGTTCCTCG
AATTGCGGAGCGCGGTCGAATCGCAGCTCGGCGTCAAGCTCGACGCCGAGCGCGCGTATCGCGACATGACGGTCGCCGGA
ATCGGCCGGCTGATCGTCGAATCCGCGCCCGCCGACGCCAGCGCGCCCGCTTCGGATGCCGGCGCGCTCATCCACGATCC
CGCGAACCGCTTCGAGCCGTTCCCGTTGACGCCGATCCAGCACGCGTACTGGCTCGGCCGCACCGATCTGATCGCGTACG
GCGGCGTCGCGTGTCACGTGCTGTTCGAATGGGACATGCGGCGCGACACGTTCGACCTCACACGCTTCGAGGCCGCCTGG
AACGCGCTCGTCGCGCGTCACGACATGCTGCGGATGATCGTCGATTCGGACGGCCGCCAGCGCATCCTGCGCGACGTGCC
CGTCTATCGGCCGCAGCGGCGCGACCTGAAGCGCCTCGCCGCCGACGAGCAGGCGCGCGCGCTCGAACACACCCGCGACG
AACTGTCGTACCGCGTGCTGCCCGCCGACCGCTGGCCGCTGTTCGAGCTCGTCGTGACCGAGCTCGACGATGCGCGCTAC
CGGCTCCACATGAACCTCGACCTGCTGCTGTTCGACGTGCAGAGCTTCAAGGTGATGATGGACGACCTCGCGCGCGCCTA
TCGCGGCGCGGCGCTCGAACCGCTGCAGATCACGTTCCGCGACTACGTGCTCGCCGATCAGGCGCGCCGCGACGCGCCCG
ACTGGCAGGCATCGTGGCGCTACTGGCAACGCACGCTCCCTCAGTTGCCGCCCGCGCCCCTGCTGCCGCTCGACCCCGAA
CGCGCCGACCGCGCGCGGCCCCGCTTCACCACATATCAGGCGCGCCTCGAACGCGCGGACTGGGACAAGCTCAAACGCGA
ATGGCAGCGCTGGGGCGCGACGCCGTCGGCCGCGCTGCTCGCGCTCTTCGCGCACACGCTCGAACGCTGGAGCCGGCATC
CGGATTTCACGCTGAACCTGACGTTCTTCAACCGGCGTCCCGATCATCCGCAGGTCTCCCAACTGATCGGCGATTTCACG
TCGGTGCTGCTGATCGACTTCGCGCTGAACGGCGCGCCGACGCTGCGCGACACGATCGAGCGCACGCAGGAGCGGCTCTG
GCAGCGCCTCGCGCACAGCCAGGTCAACGGCGTCGAGCTGATGCGCGAGCTGTCGCGCGGCCGCGCGCATGACCCTCGCC
GGCCGCTGATGCCGGTCGTGTTCACGAGCATGCTGGGGATGTCGCTCGACGGCCTGAGCATCGATCAGGCGATGACGAGC
CTGTTCGGCGAGCCGGTCCACGTGTTCACTCAGACGCCGCAGGTCTGGCTCGATCATCAGGTGATGGAAGTCGACGGCGA
TCTCGTGTTCAGTTGGTACTGCATGGACGACGTGCTCGCGCACGGCGCCGCGCAGGCGATGTTCGACGATTATCGATGCC
TGCTGCGCGGAATCGCCGCGCAACCGGAACGGATGACGCAGCCCGGGCTCGCGAAACTGCGCGACGACGGCGCATGGGTG
GATTTCGCGCGTCGCCGCTGGCCGCTGCATGCCGGCGATGCGGGCATGGATCTGCGCGACATCGAAGACCTGCTGCGCGC
GCAGGACGGCGTATCCGACGCCGCCGCGACGCTCGCCGAAGACGGTCGCACGCTCGACATCGTCGTGTCGGCTGCTGGCG
CGCAGGTCGCGCCGCCGCCCGAGACCGGCGCGCCGCTGGCCCTGACGTCTGCGCTGCCGATGCTCGACGCGTCGCAGCTC
GCGGAAATCGATGCGACTTGGCACTGGCTCGAGGCGCGCGCGCTGCGCGGCATCGCCCGGACGCTGCATCGCCACGGCCT
GTTCGCGCATGCCGGGCAACGCCACGATCTCGACGAAGTGCAGTCGCGCCTGCGCGCGCGGCCGCAGCATCGCCGGCTCG
CGCGGCAATGGCTGCTCGCGCTGGCCGAGCGCGGCTGGCTGCGCCGCGAAGGCGACGCGTTCGTCGGCGAACGTGCGCTC
GATACGGTTCCCGACCCGGCCGAGGCGCTACCGCAGGCCGGCTGGAGCCGCACGCTCGGCGCCTATCTGGATACGTGCAT
CGCGCGTCACGATGCGCTGTTCGACGGCACGCAAGCGCCGCTCGCGCTGCTGTTCGACGACGACGATGCGATCACCCGCG
CGCTGTACAGCGACAATCCGGCGATCGACTGCCTGAATCGCGGCGCCGCGCAGATCGCGCGCGCGCTCGGCGAGCGTTCG
GGCGGGCTGCGCGTGCTCGAAGTCGGCGCCGGCACCGCCGCGACGACGCGCCACCTCGTGCCGGCGCTCGACGGCCGGCT
GCACAGCTACCGCTTCACCGACGTGTCGACGCTGTTCCTCGACGCCGCCCGCGAGCGCTTCGCCGGCCACGCGCAGCTCG
ACTACGCGCGCTTCGACATCAACGCGCCAGTCGATTTCGATGCGCATCCGGAAGCAGGCTACGACATCGTCGTCGCGGTC
AACGTGCTGCACGACGCGAGCGACGTCGTGCGGTCGCTGCGCCGCCTCGGGCAACTGCTGAGACCGGGCGGCCGCCTGTT
GATGATCGAGGCGACCGAGCGCGACAGCGCGTTGCAGATGGCGAGCATCGGCTTCATCGAAGGCCTGAACGGCTACGACG
ATTTCCGCACGGCCGACGACAAGCCGATGCTCGATCTGCCGACGTGGCGCGACGCGCTCGGGCAGGCGGGCTTTTCGGTC
GAACTCGCATGGCCGGAGCAGGAATGCAGCCCGCTGCGTCAGCACCTGGTGCTCGCACGCGCGACGCACGTCGGCCGGCT
CGATCTCGGGGCGCTCGAACGCGGCCTGCGCGCATGCTGCGGCGACGCGCTGCCGCCGGTGCGCATCCGGCAATGCGAAC
GCATCGATCGACATGCGGACCGCCACGCGGCGCGACGCGACGAAGCGAACGAAGGTGCGCCGAGCCGCGCGGCTCGCGCG
TCGCACACGTCGACCGCGCCTGTCGCGCCCATCGCGCCGCCGCGCGAGCCGCAAGCGCAAGCCGCGCTCGAACGCAGCGT
CGGCGCGGTGTGGCAGGCGCTGCTGAAGTGCCCTATCCATCGCGACAGCGATTTCTTCCAATCCGGCGGCGACAGTCTGA
TCGCGACGCGAATGATCGCGCAGCTCAATCGCGACGGCATGCGCGGCGCGAGTCTGCAGGCGCTCTTCGCCGAACCGACG
CTCGGCGCGTTCTGCGCGACGCTGAACGCGCCGCCCGCGTCCGAAGCGGCCGTGGCAGCCGATGCGGGCGGCTGCCTGGT
GGCGCTCGCCGAAGGGCGCGATCCGGCGCGCGTGTTCATGTTCCACGCGTCCGACGGCGAGCTCGCCGCGTACATGCCGC
TCGCGCGGCATCTGGACTGCCGCGTTCACGGCTTGCGCGCGACGGACGCGACGCTGCCCGACGATCTCGGCGCGCTCGCC
GATCGCTACGTGCAAGCCATCCGCGCGTCGCAGCCGCACGGGCCGTATACGTTGATCGGGTGGTCCTATGGCGCGTCCGT
CGCCGCCGAGGCCGCGCGCTTGCTGCACGAGCGCGGCGAAACGGTCGAACTCGCGCTGCTCGATCCCGTCTGCCGGGCCG
ATTTCGATCACGACGATCGCACGTCGCTGCTGCGTCTGCTCGCACAAGGACGCGCCGTCGTGCCGCTGCCCGACGACCTC
GAACAGCTCGACGCCGACGAACAGACCGCGTGCTTCGTGCGCGCCGCGCAAACGGCCGGGCTGCTGCCGGAGCGCACGAG
CGCCGCCGATGCGCAGCGCTGGCTGACGCGCGTCGGCGACCTGCTCGGCCTGCTCGCGCGCCACCACGCGCCCGCCCCGC
TGCCGATCCGCTGCCTGTGGATCGCCGCCGCGCACCGCCCGCCCCGCTGGCGACCGGCCGAACTCGACTGGCAAGGCTGG
GACATGCACGCGGAACGCCACACGCTCGACGCAGACCACTGGACGCTCGTGATGGACGACGCACGAGCACAGAACGTCGC
CGCCCTGTTCCGGCAGTGGCGCGACAACCCTCGCCGCCCGCAGGAGAAAGTCGCATGA

Protein sequence :
MDKATSSARPAADLDALLATHYPNGEPIAVIGHACRFPEADDSDAFWRNLLAGAECSRRFTRQALLDAGLDAATIDAPNF
VNVGSVVRDADAFDAALFGYSRQEAESIDPQQRLFLQIVWHALEHAGYAPRDVPHRTGVFGSARISTYPGREPLRIAEVA
QVKGLQSLMGNDKDYVATRVAYKLNLRGPALAVQTACSSSLVAAHLACESLRAGECDMAVAGGVAVSFPQHAGYLHQPGM
IFSPDGRCRPFDADAQGTFAGNGVGAVVLRRLGDALRDGDPVVAVLLGSAINNDGDRKVGYTAPSAAGQRDAIRDALMLA
GVDSTQIGLVEAHGTGTPLGDPVELEALRGVFHRAGEGPRCALGSVKSNIGHLDTAAGIASLLKAVLAVERRAIPPSLHF
RKPNPALGLDDSPFYVPTEAQPWDDASRVAGVSSFGIGGTNCHVVVASLPDALRAAVGGDDRAQPDAGAALLLSAASEPA
LQRLARGYADALRHAGARDLLHTALRGRQLDLPHRLAVPFCEETVAALDAFALGEDDALVHRGSGEAGQMVWLFTGQGSH
WPGMGQALYRQSPAFAACLDRCFAACDGELDVPLRDAMSGERGDLLERMDYAQPAIVAFELAMAAHWRALGLEPQIVIGH
SVGEYAAAVVAGHYEIEQAMPLVRLRGALMQRCAEGAMLAAFACADELLPLARQAGVDVAAHNGERHLVFSGRRDAVDAL
AAALAAKDIRHARLTVPGAAHSALLDPVLDTFERAAAQLHAAPGRVQLVSTLLGGPIDADGLNAHGYWRRHMREPVRYAD
AVRHAIAQGAGVFLELGPDAQLTGIGLRESPQRARWIASARRQQPALAQTRQATLELYAAGVALPWANVLPSSGRKLHAP
RYPFDAERYWRDAQPAAVTPPAAHGGDVDPALAEGRRVAAAAAASLDLPRLQRLYDCVTQLHAIYVDRLVRRCIGERFDE
GATALDILRAGRLLPRHRQLLVRLLNACVEDGYYRRDNDRYAPALAAPHAERDALLQILRDCCEGFDVIADTVARAGDSL
HAMMSGDIEPVAVIFPDSASSGVEVLYQEFSFGRYFNQIAAGVVAGLVRERRTNRRRHRPFRILEVGGGTGGTTAWLLPE
LDGEPNVRYDFTDISPIFTRRAEQKFAAHEGVDYRVFDLQKDAHAQGFEAGAYDLIVAANVIHATQHVGRALANLAPLLK
PGGRLLMREITRPMRLFDFVFGPLVLPLHDEDARGGELFLSTARWKEQCVAAGFERIDWLPDDGAPTSGISEHIVLATAP
GRSAGVAPWLADDADPLLGQPLTDDGVYLADWSDCAGRREAWQQRLARGGAELAGRHGGGQAAPTIRAPERAPAWLTLVR
LRWCASPFGAARIALDARDESGAWRPLDADSPEDGLPAPLPARDTHYGWQWRPVSDASPDANGIALHAASTPFADSLRDA
GMPIAPHADRRLFILDPDEKPLQAIAPALLDALSEASRAPLVVVTRGAWKVHADDLVDPAHRAAWGLLRVAAAERPDRML
AAIDLHPAAAWRDLLPALDALGSGARWLAVRDGRAHAPSLAAEPYVAPALPAGALAGERWHVVTGAFGGLGRLSVRWLAR
HGARRIALVAPRAHDDWSAFQHEVEALHACKLRWVRCDIAEPAQLTAALHALHADGGVAGAIHAAGILDDAPLATLDAER
IAPVLAVKADAARVLRDWLGAHDARYLLFYSSAAAALGAPGQGAHAFASAYLDGLAEARASGETPAVISIAWGAWGEAGR
AAEPALQHKLAESGMGVLSSAEGLWHLEQAVMRGAPYRLAMRVLRERLDAGRRALFDAGADTPSARLARSPTAASRAVAA
SAPAAARPDPRDADAVRQWLTARIAAQLKLDDLAQLTPKRDLLKLGLDSLLFLELRSAVESQLGVKLDAERAYRDMTVAG
IGRLIVESAPADASAPASDAGALIHDPANRFEPFPLTPIQHAYWLGRTDLIAYGGVACHVLFEWDMRRDTFDLTRFEAAW
NALVARHDMLRMIVDSDGRQRILRDVPVYRPQRRDLKRLAADEQARALEHTRDELSYRVLPADRWPLFELVVTELDDARY
RLHMNLDLLLFDVQSFKVMMDDLARAYRGAALEPLQITFRDYVLADQARRDAPDWQASWRYWQRTLPQLPPAPLLPLDPE
RADRARPRFTTYQARLERADWDKLKREWQRWGATPSAALLALFAHTLERWSRHPDFTLNLTFFNRRPDHPQVSQLIGDFT
SVLLIDFALNGAPTLRDTIERTQERLWQRLAHSQVNGVELMRELSRGRAHDPRRPLMPVVFTSMLGMSLDGLSIDQAMTS
LFGEPVHVFTQTPQVWLDHQVMEVDGDLVFSWYCMDDVLAHGAAQAMFDDYRCLLRGIAAQPERMTQPGLAKLRDDGAWV
DFARRRWPLHAGDAGMDLRDIEDLLRAQDGVSDAAATLAEDGRTLDIVVSAAGAQVAPPPETGAPLALTSALPMLDASQL
AEIDATWHWLEARALRGIARTLHRHGLFAHAGQRHDLDEVQSRLRARPQHRRLARQWLLALAERGWLRREGDAFVGERAL
DTVPDPAEALPQAGWSRTLGAYLDTCIARHDALFDGTQAPLALLFDDDDAITRALYSDNPAIDCLNRGAAQIARALGERS
GGLRVLEVGAGTAATTRHLVPALDGRLHSYRFTDVSTLFLDAARERFAGHAQLDYARFDINAPVDFDAHPEAGYDIVVAV
NVLHDASDVVRSLRRLGQLLRPGGRLLMIEATERDSALQMASIGFIEGLNGYDDFRTADDKPMLDLPTWRDALGQAGFSV
ELAWPEQECSPLRQHLVLARATHVGRLDLGALERGLRACCGDALPPVRIRQCERIDRHADRHAARRDEANEGAPSRAARA
SHTSTAPVAPIAPPREPQAQAALERSVGAVWQALLKCPIHRDSDFFQSGGDSLIATRMIAQLNRDGMRGASLQALFAEPT
LGAFCATLNAPPASEAAVAADAGGCLVALAEGRDPARVFMFHASDGELAAYMPLARHLDCRVHGLRATDATLPDDLGALA
DRYVQAIRASQPHGPYTLIGWSYGASVAAEAARLLHERGETVELALLDPVCRADFDHDDRTSLLRLLAQGRAVVPLPDDL
EQLDADEQTACFVRAAQTAGLLPERTSAADAQRWLTRVGDLLGLLARHHAPAPLPIRCLWIAAAHRPPRWRPAELDWQGW
DMHAERHTLDADHWTLVMDDARAQNVAALFRQWRDNPRRPQEKVA
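
The protein sequence corresponds to a codon-by-codon translation of the DNA sequence. As a rough illustration, the sketch below translates only the first and last few codons copied from the DNA sequence above and compares them with the ends of the protein sequence. It assumes the standard amino-acid assignments (which the bacterial genetic code shares); the helper name translate is hypothetical.

```python
# Minimal sketch: translate short DNA fragments with the standard codon table.
# Assumption: standard amino-acid assignments apply to this gene.
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))

first_codons = "ATGGATAAAGCCACC"   # start of the DNA sequence in this record
last_codons = "GAGAAAGTCGCATGA"    # end of the DNA sequence in this record

print(translate(first_codons))  # compare with the start of the protein sequence
print(translate(last_codons))   # compare with the end; '*' is the stop codon
```

Running the same function over the full DNA sequence (with line breaks removed) is one way to confirm that the two sequences in this record agree.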

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
irp1 | YP_853076.1 | yersiniabactin biosynthetic protein | Virulence | PAI IV APEC-O1 | Protein | 0.0 | 57
irp1 | YP_070123.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 56
irp1 | YP_001006816.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 56
irp1 | CAA73127.1 | HMWP1 protein | Virulence | HPI | Protein | 0.0 | 56
irp1 | NP_993006.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 56
irp1 | YP_002346901.1 | yersiniabactin biosynthetic protein | Virulence | HPI | Protein | 0.0 | 56
irp1 | NP_669707.1 | HMWP1 nonribosomal peptide/polyketide synthase | Virulence | HPI | Protein | 0.0 | 56
irp1 | CAA21391.1 | - | Virulence | HPI | Protein | 0.0 | 56