Gene Information

Name : O3K_26432 (O3K_26432)
Accession : YP_006792611.1
Strain :
Genome accession: NC_018666
Putative virulence/resistance : Virulence
Product : protease IgA1
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 55669 - 59763 bp
Length : 4095 bp
Strand : +
Note : COG3468 Type V secretory pathway, adhesin AidA
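
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it reproduces the listed length) and that the final codon is a stop codon; the function name `feature_length` is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


# Values taken from the Position field of this record.
start, end = 55669, 59763

length_bp = feature_length(start, end)
codons = length_bp // 3
residues = codons - 1  # assumes the last codon is a stop codon

print(length_bp, codons, residues)  # prints: 4095 1365 1364
```

Under these assumptions the gene spans 4095 bp, which is divisible by three and corresponds to a 1364-residue protein.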

DNA sequence :
ATGAACAAAATATATTATCTTAAGTATTGCCATATAACCAAAAGCCTGATTGCTGTCTCCGAACTGGCCCGCAGGGTTAC
CTGCAAAAGCCATCGCAGACTTTCACGTCGGGTTATTCTTACGTCTGTTGCAGCTTTATCACTTTCTTCGGCATGGCCAG
CTCTGTCAGCAACGGTCAGTGCAGAGATCCCTTATCAGATATTCCGCGACTTTGCCGAGAATAAAGGTCAGTTTACGCCG
GGGACCACGAATATTTCCATTTACGACAAGCAGGGCAATCTGGTCGGTAAACTTGATAAAGCCCCGATGGCTGATTTCAG
TAGTGCCACTATTACTACCGGTAGCCTGCCTCCGGGAAACCATACACTTTACTCCCCTCAGTATGTGGTTACCGCCAAAC
ATGTCAGCGGCTCTGACACGATGAGCTTCGGATACGCCAAAAACACCTATACTGCGGTTGGTACAAATAATAATAGCGGT
CTTGATATTAAAACCCGCCGCCTGAGTAAACTGGTGACAGAGGTGGCGCCGGCAGAAGTGTCTGACATCGGGGCTGTGAG
CGGTGCCTATCAGGCCGGCGGACGCTTTACTGCATTCTATCGCCTTGGGGGGGGGATGCAGTACGTCAAAGACAAAAATG
GTAATCGTACCCAAGTGTATACCAATGGCGGATTTCTTGTTGGGGGAACCGTCAGCGCGCTGAACTCCTATAACAACGGA
CAGATGATTACAGCTCAGACCGGTGACATTTTTAATCCCGCGAATGGACCTCTGGCTAATTATCTGAATATGGGGGACAG
CGGCTCCCCCCTGTTTGCTTATGATTCCCTGCAAAAAAAATGGGTACTGATTGGCGTACTTTCCTCAGGAACTAACTATG
GAAATAACTGGGTCGTCACCACTCAGGATTTCCTTGGTCAGCAACCGCAAAATGATTTTGACAAAACCATAGCCTACACC
TCTGGTGAGGGAGTACTGCAGTGGAAATATGATGCAGCTAATGGCACCGGCACACTGACTCAGGGAAACACGACCTGGGA
TATGCATGGAAAGAAAGGAAATGACCTGAACGCAGGAAAAAACCTCCTGTTCACCGGCAATAATGGCGAGGTCGTTCTGC
AGAATTCCGTTAATCAGGGAGCTGGGTATCTGCAGTTTGCCGGCGATTACAGGGTGTCCGCCCTGAACGGCCAGACATGG
ATGGGGGGCGGGATTATCACCGATAAAGGAACCCACGTTCTGTGGCAGGTGAACGGAGTTGCCGGTGATAACCTGCATAA
AACTGGCGAAGGTACCCTGACAGTAAACGGGACCGGTGTGAATGCAGGCGGACTCAAGGTAGGGGACGGTACTGTTATTC
TCAACCAGCAGGCGGATGCTGACGGAAAGGTACAGGCTTTCAGCTCTGTTGGTATTGCCAGCGGTCGTCCGACAGTTGTG
CTTTCTGACTCACAGCAAGTTAATCCGGATAATATTTCGTGGGGATACCGTGGTGGACGTCTGGAACTGAATGGTAATAA
CCTGACCTTTACCCGTCTTCAGGCGGCTGACTACGGGGCCATCATTACCAATAACAGTGAGAAAAAATCAACGGTAACAC
TCGACCTTCAGACATTAAAAGCCAGTGACATCAATGTACCGGTTAATACTGTCAGTATTTTTGGGGGGAGAGGAGCGCCG
GGAGACCTGTATTATGACAGTTCTACCAAGCAATATTTCATCCTGAAAGCGAGTTCATATTCCCCGTTTTTCTCCGATCT
GAACAACAGCAGTGTCTGGCAGAATGTCGGAAAAGATCGTAATAAAGCCATTGATACTGTGAAGCAACAAAAAATTGAGG
CCAGCAGCCAGCCTTATATGTATCACGGCCAACTAAACGGCAATATGGATGTGAATATTCCGCAGCTCTCAGGTAAGGAT
GTACTGGCTCTTGATGGTTCAGTTAACCTGCCAGAAGGTTCGATAACCAAGAAGTCCGGCACGCTGATATTCCAGGGTCA
TCCAGTTATTCATGCTGGAACGACGACTTCTTCCAGCCAGAGCGACTGGGAAACCCGTCAGTTTACGCTGGAAAAACTGA
AACTCGATGCGGCAACATTCCATCTGTCCAGAAACGGCAAGATGCAGGGAGATATTAACGCCACGAATGGAAGTACAGTC
ATTCTGGGAAGTAGCCGTGTCTTTACTGACAGGAGTGACGGAACCGGTAATGCGGTCTCCTCTGTTGAAGGGAGTGCCAC
TGCAACCACAGTTGGTGACCAGAGTGATTACAGCGGAAATGTTACGCTGGAAAATAAATCATCCCTGCAAATCATGGAGA
GATTCACCGGTGGTATTGAGGCTTATGATAGCACCGTGAGTGTGACCTCTCAGAATGCTGTTTTTGACCGTGTTGGCAGC
TTTGTCAACAGCAGCCTGACCCTCGGAAAAGGAGCAAAACTTACGGCTCAGAGCGGTATTTTCAGCACCGGGGCTGTGGA
TGTAAAAGAAAATGCTTCTCTGACCCTGACGGGGATGCCTTCTGCGCAGAAACAGGGGTATTACTCACCCGTAATTTCTA
CAACAGAAGGGATTAACCTCGAAGATAAGGCCAGCTTTTCTGTTAAAAATATGGGCTATCTGAGTTCGGATATTCATGCA
GGAACCACTGCGGCAACCATTAATCTGGGAGACAGTGATGCTGATGCAGGGAAGACGGACTCTCCGTTATTCAGCTCCTT
AATGAAGGGATATAACGCTGTTTTGAGAGGCAGTATTACGGGGGCACAGAGTACGGTAAATATGATCAATGCTCTGTGGT
ACTCTGACGGAAAATCAGAGGCCGGTGCACTGAAGGCTAAGGGCTCGCGAATTGAACTGGGGGACGGGAAACACTTTGCC
ACTTTACAAGTAAAAGAGCTTTCTGCAGATAATACCACGTTCCTGATGCATACCAACAACAGCTGGGCTGACCAGTTGAA
TGTCACAGACAAACTGTCAGGCAGTAATAATAGCGTCCTGGTTGACTTTTTAAACAAACCAGCCAGTGAAATGAGCGTGA
CGTTAATTACCGCACCGAAAGGGAGTGATGAGAAAACGTTCACTGCCGGTACGCAGCAGATTGGTTTCAGTAACGTTACG
CCAGTAATCAGCACGGAAAAAACGAATGATGCCACAAAATGGGTGCTGACAGGATATCAGACTACCGCTGATGCCGGAGC
CTCGAAAGCCGCAAAAGACTTTATGGCATCAGGTTATAAGTCCTTCCTTACAGAGGTCAATAACCTGAACAAACGTATGG
GTGACCTGCGGGATACTCAGGGGGATGCCGGTGTCTGGGCACGCATAATGAATGGTACCGGTTCGGCAGATGGTGACTAC
AGCGATAACTACACTCACGTTCAGATTGGTGTCGACAGAAAGCATGAGCTGGACGGTGTGGATTTATTTACGGGGGCATT
GCTGACCTATACGGACAGCAATGCAAGCAGCCACGCATTCAGTGGAAAAACCAAATCCGTGGGTGGCGGTCTGTATGCCT
CTGCACTCTTTAATTCCGGAGCTTATTTTGACCTGATTGGTAAATATCTCCATCATGATAATCAGCACACGGCGAATTTT
GCCTCACTGGGAACAAAAGACTACAGCTCTCATTCCTGGTATGCCGGTGCTGAAGTTGGTTATCGTTACCACCTGACGAA
AGAGTCTTGGGTGGAGCCACAGATAGAGCTGGTTTACGGTTCTGTATCAGGAAAAGCTTTTAGCTGGGAAGACCGGGGAA
TGGCTCTGAGCATGAAAGACAAGGATTATAACCCACTGATTGGCCGTACTGGTGTTGACGTGGGAAGAGCCTTCTCCGGA
GACGACTGGAAAATCACAGCTCGAGCCGGGCTGGGTTATCAGTTCGACCTGCTGGCGAACGGAGAAACGGTTCTGCAGGA
TGCTTCCGGAGAGAAACGTTTCGAAGGTGAAAAAGATAGCAGGATGCTGATGACGGTAGGGATGAATGCGGAAATTAAGG
ATAATATGCGTTTGGGACTGGAGCTGGAGAAATCAGCGTTCGGGAAATATAATGTGGATAATGCGATAAACGCCAACTTC
CGTTATGTTTTCTGA

Protein sequence :
MNKIYYLKYCHITKSLIAVSELARRVTCKSHRRLSRRVILTSVAALSLSSAWPALSATVSAEIPYQIFRDFAENKGQFTP
GTTNISIYDKQGNLVGKLDKAPMADFSSATITTGSLPPGNHTLYSPQYVVTAKHVSGSDTMSFGYAKNTYTAVGTNNNSG
LDIKTRRLSKLVTEVAPAEVSDIGAVSGAYQAGGRFTAFYRLGGGMQYVKDKNGNRTQVYTNGGFLVGGTVSALNSYNNG
QMITAQTGDIFNPANGPLANYLNMGDSGSPLFAYDSLQKKWVLIGVLSSGTNYGNNWVVTTQDFLGQQPQNDFDKTIAYT
SGEGVLQWKYDAANGTGTLTQGNTTWDMHGKKGNDLNAGKNLLFTGNNGEVVLQNSVNQGAGYLQFAGDYRVSALNGQTW
MGGGIITDKGTHVLWQVNGVAGDNLHKTGEGTLTVNGTGVNAGGLKVGDGTVILNQQADADGKVQAFSSVGIASGRPTVV
LSDSQQVNPDNISWGYRGGRLELNGNNLTFTRLQAADYGAIITNNSEKKSTVTLDLQTLKASDINVPVNTVSIFGGRGAP
GDLYYDSSTKQYFILKASSYSPFFSDLNNSSVWQNVGKDRNKAIDTVKQQKIEASSQPYMYHGQLNGNMDVNIPQLSGKD
VLALDGSVNLPEGSITKKSGTLIFQGHPVIHAGTTTSSSQSDWETRQFTLEKLKLDAATFHLSRNGKMQGDINATNGSTV
ILGSSRVFTDRSDGTGNAVSSVEGSATATTVGDQSDYSGNVTLENKSSLQIMERFTGGIEAYDSTVSVTSQNAVFDRVGS
FVNSSLTLGKGAKLTAQSGIFSTGAVDVKENASLTLTGMPSAQKQGYYSPVISTTEGINLEDKASFSVKNMGYLSSDIHA
GTTAATINLGDSDADAGKTDSPLFSSLMKGYNAVLRGSITGAQSTVNMINALWYSDGKSEAGALKAKGSRIELGDGKHFA
TLQVKELSADNTTFLMHTNNSWADQLNVTDKLSGSNNSVLVDFLNKPASEMSVTLITAPKGSDEKTFTAGTQQIGFSNVT
PVISTEKTNDATKWVLTGYQTTADAGASKAAKDFMASGYKSFLTEVNNLNKRMGDLRDTQGDAGVWARIMNGTGSADGDY
SDNYTHVQIGVDRKHELDGVDLFTGALLTYTDSNASSHAFSGKTKSVGGGLYASALFNSGAYFDLIGKYLHHDNQHTANF
ASLGTKDYSSHSWYAGAEVGYRYHLTKESWVEPQIELVYGSVSGKAFSWEDRGMALSMKDKDYNPLIGRTGVDVGRAFSG
DDWKITARAGLGYQFDLLANGETVLQDASGEKRFEGEKDSRMLMTVGMNAEIKDNMRLGLELEKSAFGKYNVDNAINANF
RYVF
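
The relationship between the DNA and protein sequences above can be illustrated with a minimal translation sketch. It assumes the standard genetic code and a forward-strand reading frame starting at the first base; the names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored. Ambiguous bases (e.g. N)
    are not handled and raise KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 48 bases of the DNA sequence listed in this record.
print(translate("ATGAACAAAATATATTATCTTAAGTATTGCCATATAACCAAAAGCCTG"))
# prints: MNKIYYLKYCHITKSL
```

The output matches the first 16 residues of the protein sequence shown above; passing the full DNA sequence should, under the same assumptions, yield the full protein.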

• Homologs from PAI DB

Gene     GenBank Accn  Product                            Virulence or Resistance  PAI or REI       Alignment Type  E-val  Identity (%)
pic      NP_838464.1   serine protease precursor          Virulence                SHI-1            Protein         0.0    58
she      AAB58244.1    mucinase                           Virulence                SHI-1            Protein         0.0    56
pic      NP_708747.3   serine protease                    Not tested               SHI-1            Protein         0.0    56
pic      AAK00464.1    Pic                                Virulence                SHI-1            Protein         0.0    56
unnamed  CAC39286.1    hypothetical protein               Not tested               LPA              Protein         0.0    54
unnamed  CAD66214.1    putative hemoglobin protease       Not tested               PAI III 536      Protein         0.0    47
vat      YP_851472.1   vacuolating autotransporter        Not tested               PAI III APEC-O1  Protein         0.0    47
vat      AAO21903.1    vacuolating autotransporter toxin  Virulence                Not named        Protein         0.0    46

• Homologs from VFDB (virulence genes)

Gene       GenBank Accn    Product        ID of source DB  Alignment Type  E-val  Identity (%)
O3K_26432  YP_006792611.1  protease IgA1  VFG0635          Protein         0.0    56
O3K_26432  YP_006792611.1  protease IgA1  VFG0861          Protein         0.0    56
O3K_26432  YP_006792611.1  protease IgA1  VFG0903          Protein         0.0    56
O3K_26432  YP_006792611.1  protease IgA1  VFG1689          Protein         0.0    47
O3K_26432  YP_006792611.1  protease IgA1  VFG0904          Protein         0.0    47