Gene Information

Name : O3K_23005 (O3K_23005)
Accession : YP_006781255.1
Strain : Escherichia coli 2011C-3493
Genome accession: NC_018658
Putative virulence/resistance : Virulence
Product : Serine protease pet precursor (Plasmid-encoded toxin pet)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4743678 - 4747535 bp
Length : 3858 bp
Strand : +
Note : COG3468 Type V secretory pathway, adhesin AidA
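
The Position and Length fields above agree if the coordinates are read as 1-based and inclusive. The short Python check below is a sketch under that assumption; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the Position field (assumed 1-based, inclusive).
start, end = 4743678, 4747535

# Inclusive interval length: both endpoints count.
length_bp = end - start + 1

# A coding sequence of this length holds length_bp / 3 codons;
# the final codon is the stop, so the protein is one residue shorter.
codons = length_bp // 3
residues = codons - 1

print(length_bp, codons, residues)
```

The computed length matches the Length field (3858 bp), and the residue count matches the protein sequence listed in this record (16 lines of 80 residues plus 5).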

DNA sequence :
ATGAATAAAATTTATTCACTGAAATATAGTCATATTACAGGTGGATTAGTTGCTGTTTCTGAACTGACCCGGAAAGTTAG
TGTCGGTACATCAAGAAAGAAAGTTATCCTCGGTATTATTTTATCCTCAATATATGGAAGTTATGGCGAAACAGCATTTG
CAGCAATGCTGGATATAAATAATATATGGACCCGCGATTATCTTGACCTTGCTCAAAACAGAGGAGAGTTCAGACCGGGT
GCAACAAATGTTCAATTAATGATGAAAGATGGAAAGATATTTCATTTTCCAGAACTACCTGTACCTGATTTTTCTGCTGT
TTCCAACAAAGGTGCAACAACATCAATTGGAGGTGCGTACAGTGTTACTGCGACTCATAACGGTACACAGCATCATGCAA
TAACAACACAGTCATGGGATCAGACAGCATATAAAGCAAGTAACAGAGTATCATCTGGCGACTTTTCGGTTCATCGTCTG
AATAAATTCGTCGTGGAAACAACAGGGGTTACGGAGAGTGCCGACTTCTCACTTTCTCCCGAAGATGCGATGAAAAGATA
TGGCGTAAACTACAACGGTAAGGAACAAATAATTGGCTTCAGAGCAGGTGCCGGAACAACCTCAACGATATTAAACGGCA
AACAATATCTGTTTGGACAAAACTATAATCCCGACTTGTTAAGCGCAAGTCTTTTTAATCTGGACTGGAAAAACAAGAGT
TACATTTATACCAACAGAACCCCTTTTAAAAACTCACCAATTTTTGGCGATAGTGGTTCTGGTTCTTATCTATATGATAA
AGAACAACAAAAATGGGTTTTCCATGGTGTTACCAGTACAGTTGGTTTTATCAGTAGTACCAATATAGCCTGGACAAACT
ACTCGTTATTTAATAATATTCTGGTAAACAATTTAAAAAAGAATTTCACAAACACTATGCAGCTGGATGGTAAAAAACAA
GAGTTATCATCGATTATAAAAGATAAGGACCTGTCTGTCTCAGGAGGAGGGGTATTAACGCTCAAGCAGGATACCGATCT
TGGCATTGGCGGGCTTATATTCGATAAGAACCAGACATATAAAGTGTACGGAAAAGATAAGTCTTATAAAGGTGCCGGGA
TAGATATTGATAATAATACCACCGTTGAATGGAATGTTAAGGGCGTTGCCGGAGATAATCTGCATAAAATAGGTAGTGGT
ACTCTGGATGTAAAAATAGCACAGGGAAATAACCTTAAAATAGGTAATGGGACTGTCATCCTTAGTGCTGAAAAAGCCTT
CAATAAAATTTACATGGCCGGAGGTAAAGGTACGGTAAAAATAAATGCCAAAGACGCTTTAAGCGAAAGCGGTAATGGCG
AAATCTATTTTACCAGAAATGGCGGAACACTGGATCTAAACGGCTATGACCAGTCATTTCAGAAAATCGCAGCAACAGAT
GCGGGAACAACCGTAACGAACTCAAACGTGAAGCAATCAACATTATCACTTACTAATACTGATGCATATATGTACCATGG
GAATGTATCAGGTAATATAAGCATAAATCATATTATCAATACTACCCAGCAACATAACAATAATGCCAATCTGATCTTTG
ATGGCTCAGTCGATATCAAAAACGATATCTCTGTCCGGAATGCACAGTTAACATTACAAGGACATGCGACAGAACATGCC
ATATTTAAAGAAGGCAATAACAACTGTCCAATTCCTTTTTTATGTCAAAAAGACTATTCTGCTGCCATAAAGGACCAGGA
AAGCACTGTAAATAAACGTTACAATACGGAATATAAGTCCAACAATCAGATAGCCTCTTTTTCCCAGCCCGACTGGGAAA
GTCGTAAATTTAATTTCCGGAAATTAAATTTAGAAAACGCAACCCTGAGTATAGGCCGGGATGCTAATGTAAAAGGACAC
ATAGAGGCTAAAAACTCTCAAATTGTTCTGGGAAATAAAACTGCATACATTGACATGTTCTCAGGAAGAAACATTACTGG
CGAAGGTTTTGGATTCAGACAACAGCTTCGCTCCGGGGATTCAGCAGGCGAAAGTAGTTTCAACGGCAGTCTGAGTGCTC
AAAACAGCAAAATAACTGTTGGTGATAAATCAACTGTTACTATGACTGGTGCATTATCCTTAATTAATACAGACCTGATT
ATCAACAAAGGAGCTACTGTTACCGCCCAGGGAAAAATGTATGTAGATAAAGCTATTGAACTGGCCGGAACCCTGACATT
AACAGGCACCCCTACAGAAAATAATAAATACAGCCCGGCAATCTATATGTCAGATGGATATAATATGACAGAAGATGGTG
CCACGTTAAAGGCTCAAAATTATGCCTGGGTCAATGGTAATATAAAATCAGACAAAAAAGCATCTATTCTGTTTGGTGTT
GACCAGTATAAAGAAGATAACCTGGACAAAACCACACACACACCGCTGGCTACAGGTTTGCTGGGTGGCTTTGATACTTC
TTATACCGGAGGTATTGATGCTCCTGCAGCCTCAGCCAGCATGTATAACACCTTATGGAGAGTAAACGGACAGTCAGCCC
TGCAATCATTAAAAACCCGCGACAGTCTTTTGTTGTTTAGTAACATAGAGAATTCGGGTTTCCATACTGTGACAGTAAAC
ACACTGGATGCCACTAATACTGCTGTGATTATGCGGGCTGATCTGAGCCAGTCTGTAAATCAATCGGATAAACTCATTGT
TAAAAATCAGTTAACCGGAAGCAATAACAGTCTGTCGGTCGATATACAGAAAGTGGGAAATAATAACTCAGGATTAAACG
TTGACCTGATAACAGCCCCAAAAGGAAGCAATAAAGAGATATTTAAAGCCAGTACTCAGGCCATAGGTTTCAGCAACATA
TCTCCTGTGATCAGCACGAAAGAGGATCAGGAACATACCACGTGGACCCTGACCGGATATAAGGTGGCTGAAAATACAGC
ATCTTCCGGTGCAGCAAAATCGTATATGTCCGGTAATTACAAAGCCTTCCTGACAGAAGTCAACAACCTGAATAAACGAA
TGGGGGATCTGCGTGACACCAATGGCGAGGCCGGTGCATGGGCCCGCATCATGAGCGGAGCAGGTTCAGCTTCTGGTGGA
TACAGTGACAACTACACCCATGTGCAGATTGGTGTGGATAAAAAACATGAGCTGGATGGACTTGACCTTTTCACTGGTCT
GACTATGACGTATACCGACAGTCATGCCAGCAGTAATGCATTCAGTGGCAAGACGAAGTCCGTCGGGGCAGGTCTGTATG
CTTCCGCTATATTTGACTCTGGTGCCTATATCGACCTGATTAGTAAGTATGTTCACCATGATAATGAGTACTCGGCGACC
TTTGCTGGACTCGGAACAAAAGACTACAGTTCTCATTCCTTGTATGTGGGTGCTGAAGCAGGCTACCGCTATCATGTAAC
AGAAGACTCCTGGATTGAGCCGCAGGCAGAACTGGTTTATGGGGCCGTATCAGGTAAACGGTTCGACTGGCAGGATCGCG
GAATGAGCGTGACCATGAAGGATAAGGACTTTAATCCGCTGATTGGGCGTACCGGTGTTGATGTGGGTAAATCCTTCTCC
GGTAAGGACTGGAAAGTCACAGCCCGCGCCGGCCTTGGCTACCAGTTTGACCTGTTTGCCAACGGTGAAACCGTACTGCG
TGATGCGTCCGGTGAGAAACGTATCAAAGGTGAAAAAGACGGTCGTATTCTCATGAATGTTGGTCTCAACGCCGAAATTC
GCGATAATCTTCGCTTCGGTCTTGAGTTTGAGAAATCGGCATTTGGTAAATACAACGTGGATAACGCGATCAACGCCAAC
TTCCGTTACTCTTTCTGA

Protein sequence :
MNKIYSLKYSHITGGLVAVSELTRKVSVGTSRKKVILGIILSSIYGSYGETAFAAMLDINNIWTRDYLDLAQNRGEFRPG
ATNVQLMMKDGKIFHFPELPVPDFSAVSNKGATTSIGGAYSVTATHNGTQHHAITTQSWDQTAYKASNRVSSGDFSVHRL
NKFVVETTGVTESADFSLSPEDAMKRYGVNYNGKEQIIGFRAGAGTTSTILNGKQYLFGQNYNPDLLSASLFNLDWKNKS
YIYTNRTPFKNSPIFGDSGSGSYLYDKEQQKWVFHGVTSTVGFISSTNIAWTNYSLFNNILVNNLKKNFTNTMQLDGKKQ
ELSSIIKDKDLSVSGGGVLTLKQDTDLGIGGLIFDKNQTYKVYGKDKSYKGAGIDIDNNTTVEWNVKGVAGDNLHKIGSG
TLDVKIAQGNNLKIGNGTVILSAEKAFNKIYMAGGKGTVKINAKDALSESGNGEIYFTRNGGTLDLNGYDQSFQKIAATD
AGTTVTNSNVKQSTLSLTNTDAYMYHGNVSGNISINHIINTTQQHNNNANLIFDGSVDIKNDISVRNAQLTLQGHATEHA
IFKEGNNNCPIPFLCQKDYSAAIKDQESTVNKRYNTEYKSNNQIASFSQPDWESRKFNFRKLNLENATLSIGRDANVKGH
IEAKNSQIVLGNKTAYIDMFSGRNITGEGFGFRQQLRSGDSAGESSFNGSLSAQNSKITVGDKSTVTMTGALSLINTDLI
INKGATVTAQGKMYVDKAIELAGTLTLTGTPTENNKYSPAIYMSDGYNMTEDGATLKAQNYAWVNGNIKSDKKASILFGV
DQYKEDNLDKTTHTPLATGLLGGFDTSYTGGIDAPAASASMYNTLWRVNGQSALQSLKTRDSLLLFSNIENSGFHTVTVN
TLDATNTAVIMRADLSQSVNQSDKLIVKNQLTGSNNSLSVDIQKVGNNNSGLNVDLITAPKGSNKEIFKASTQAIGFSNI
SPVISTKEDQEHTTWTLTGYKVAENTASSGAAKSYMSGNYKAFLTEVNNLNKRMGDLRDTNGEAGAWARIMSGAGSASGG
YSDNYTHVQIGVDKKHELDGLDLFTGLTMTYTDSHASSNAFSGKTKSVGAGLYASAIFDSGAYIDLISKYVHHDNEYSAT
FAGLGTKDYSSHSLYVGAEAGYRYHVTEDSWIEPQAELVYGAVSGKRFDWQDRGMSVTMKDKDFNPLIGRTGVDVGKSFS
GKDWKVTARAGLGYQFDLFANGETVLRDASGEKRIKGEKDGRILMNVGLNAEIRDNLRFGLEFEKSAFGKYNVDNAINAN
FRYSF
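
The protein sequence corresponds to a direct translation of the DNA sequence on the + strand. A minimal Python sketch of that translation follows, assuming the standard genetic code; the `translate` helper and the table construction are illustrative and are not taken from the database. Only the first 30 and the last 18 bases of the DNA sequence are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop whitespace and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the DNA sequence in this record.
n_term = translate("ATGAATAAAATTTATTCACTGAAATATAGT")
c_term = translate("TTCCGTTACTCTTTCTGA")
print(n_term, c_term)  # MNKIYSLKYS FRYSF
```

Both fragments agree with the start and end of the listed protein sequence.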

• Homologs from PAI DB

Gene  GenBank Accn    Product                        Virulence or Resistance  PAI or REI  Alignment Type  E-value  Identity (%)
sigA  AAF67320.1      exported serine protease SigA  Virulence                SHI-1       Protein         0.0      100
sigA  NP_838462.1     serine protease                Virulence                SHI-1       Protein         0.0      100
sigA  NP_708742.1     serine protease                Virulence                SHI-1       Protein         0.0      100
sat   YP_002414040.1  Serine protease                Not tested               Not named   Protein         0.0      55
espC  AAG37043.1      enterotoxin EspC               Virulence                espC PAI    Protein         0.0      54
pic   NP_838464.1     serine protease precursor      Virulence                SHI-1       Protein         1e-172   42

• Homologs from VFDB (virulence genes)

Gene       GenBank Accn    Product                                                    ID of source DB  Alignment Type  E-value  Identity (%)
O3K_23005  YP_006781255.1  Serine protease pet precursor (Plasmid-encoded toxin pet)  VFG0630          Protein         0.0      100
O3K_23005  YP_006781255.1  Serine protease pet precursor (Plasmid-encoded toxin pet)  VFG0862          Protein         0.0      57
O3K_23005  YP_006781255.1  Serine protease pet precursor (Plasmid-encoded toxin pet)  VFG0844          Protein         0.0      56
O3K_23005  YP_006781255.1  Serine protease pet precursor (Plasmid-encoded toxin pet)  VFG0902          Protein         0.0      55
O3K_23005  YP_006781255.1  Serine protease pet precursor (Plasmid-encoded toxin pet)  VFG0772          Protein         0.0      54
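
Rows of the VFDB table can be read programmatically by splitting from the right, because the product name contains spaces while the last four fields do not. The Python sketch below assumes whitespace-separated rows laid out as in this record; `parse_vfdb_row` and its dictionary keys are hypothetical names chosen for illustration.

```python
def parse_vfdb_row(line):
    """Split one VFDB homolog row into named fields (assumed column layout)."""
    # The last four fields never contain spaces, so peel them off from the right.
    head, source_id, alignment, evalue, identity = line.rsplit(maxsplit=4)
    # The first two fields are single tokens; the remainder is the product name.
    gene, accession, product_name = head.split(maxsplit=2)
    return {
        "gene": gene,
        "accession": accession,
        "product": product_name.strip(),
        "source_id": source_id,
        "alignment": alignment,
        "evalue": float(evalue),
        "identity": float(identity),
    }


row = parse_vfdb_row(
    "O3K_23005 YP_006781255.1 Serine protease pet precursor "
    "(Plasmid-encoded toxin pet) VFG0630 Protein 0.0 100"
)
print(row["source_id"], row["identity"])
```

The same right-to-left split does not carry over directly to the PAI DB table, where the "Virulence or Resistance" and "PAI or REI" columns can themselves contain spaces.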