Gene Information

Name : O3K_23140 (O3K_23140)
Accession : YP_006781278.1
Strain : Escherichia coli 2011C-3493
Genome accession: NC_018658
Putative virulence/resistance : Virulence
Product : Serine protease pic precursor (ShMu)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4772682 - 4776800 bp
Length : 4119 bp
Strand : -
Note : COG3468 Type V secretory pathway, adhesin AidA

DNA sequence :
GTGAATAAAGTTTATTCTCTTAAATATTGCCCCGTCACCGGGGGGCTTATTGCTGTCTCTGAACTTGCCCGCAGGGTAAT
AAAAAAGACATGCCGAAGATTAACGCATATTCTTCTGGCTGGCATTCCAGCAATCTGTCTGTGTTACTCTCAGATATCCC
AGGCGGGTATTGTCCGTTCCGATATTGCCTATCAGATTTATCGTGATTTCGCCGAAAACAAAGGGCTTTTTGTACCTGGT
GCCAATGATATTCCGGTATATGATAAGGACGGAAAACTTGTGGGAAGACTGGGTAAAGCCCCAATGGCCGATTTCAGCAG
TGTGAGCTCAAATGGCGTTGCTACGCTTGTATCGCCTCAGTATATCGTCAGCGTAAAGCATAACGGAGGATATCGGAGTG
TGAGCTTTGGTAATGGGAAAAATACATATTCCCTTGTTGACCGTAATAACCACCCTTCTATTGACTTCCATGCTCCACGT
CTGAATAAACTGGTTACAGAAGTTATTCCCTCAGCGGTAACATCAGAAGGAACCAAAGCCAATGCTTATAAATACACTGA
ACGTTACACCGCTTTTTATCGGGTGGGTAGTGGTACGCAGTACACTAAGGACAAGGACGGAAATTTAGTTAAGGTTGCCG
GTGGATATGCTTTTAAAACAGGAGGAACCACAGGAGTTCCTCTGATATCTGATGCAACAATAGTCTCTAATCCCGGGCAA
ACTTATAATCCTGTAAACGGCCCTTTACCTGACTATGGAGCCCCTGGGGATAGTGGTTCTCCTTTGTTTGCTTATGATAA
ACAACAAAAAAAATGGGTTATTGTTGCTGTATTAAGAGCATATGCAGGTATTAATGGTGCTACGAACTGGTGGAATGTCA
TACCAACAGATTATCTGAACCAGGTTATGCAGGACGATTTCGATGCCCCCGTAGACTTTGTTTCCGGACTGGGCCCCCTG
AACTGGACATACGACAAAACATCAGGCACAGGTACCCTGAGCCAGGGCAGTAAAAACTGGACCATGCACGGGCAGAAAGA
CAATGACCTCAATGCCGGTAAAAATCTGGTATTCAGCGGGCAGAATGGTGCAATTATCCTGAAAGACAGTGTGACTCAGG
GTGCCGGTTATCTCGAATTTAAAGACAGTTACACCGTATCTGCTGAATCCGGAAAAACATGGACGGGTGCCGGCATTATT
ACTGACAAGGGGACGAATGTAACCTGGAAGGTCAACGGCGTTGCCGGTGACAACTTGCATAAGCTGGGGGAAGGAACCCT
GACCATAAACGGAACAGGTGTAAACCCGGGAGGACTGAAAACGGGAGACGGTATCGTTGTACTTAACCAGCAGGCAGACA
CTGCAGGTAATATCCAGGCCTTCAGTTCAGTGAACCTCGCCAGCGGACGTCCGACCGTGGTGCTCGGGGATGCCCGTCAG
GTCAATCCGGATAACATTTCATGGGGATACCGGGGAGGTAAGCTTGACCTTAATGGTAATGCCGTTACCTTCACCCGACT
GCAGGCTGCTGATTACGGGGCGGTGATTACAAATAATGCACAGCAAAAATCCCAGCTTTTACTGGATCTTAAGGCTCAGG
ATACAAATGTCAGTGAACCGACGATTGGAAATATATCCCCCTTTGGTGGTACCGGCACTCCAGGAAACCTGTACAGCATG
ATACTCAACAGCCAGACCCGCTTCTATATTCTGAAATCTGCCAGCTATGGTAACACTCTGTGGGGGAACAGCCTGAATGA
TCCGGCTCAGTGGGAGTTTGTTGGCATGAACAAAAACAAAGCAGTTCAGACAGTAAAAGATAGGATCCTGGCCGGGCGGG
CAAAACAACCCGTTATCTTTCATGGTCAGCTGACCGGGAATATGGATGTCGCCATTCCACAGGTGCCGGGGGGAAGAAAG
GTAATCTTTGATGGTAGCGTGAACCTGCCGGAAGGTACCCTGAGTCAGGACAGTGGCACCCTGATATTCCAGGGACATCC
GGTTATCCATGCCTCCATCAGTGGCAGTGCACCGGTCAGCCTGAACCAGAAAGACTGGGAAAACCGTCAGTTTACAATGA
AAACACTGTCGCTGAAAGACGCTGACTTCCATCTTTCACGTAACGCCTCGCTGAACAGTGACATTAAGTCGGATAACAGC
CATATCACACTGGGAAGTGACAGGGCATTTGTGGATAAAAATGACGGAACAGGAAATTATGTCATTCCGGAGGAAGGTAC
CTCTGTCCCGGACACCGTGAATGACAGGAGCCAGTATGAAGGGAATATTACGCTGAACCATAACTCAGCCCTGGATATCG
GCAGCAGGTTCACCGGGGGGATTGACGCTTATGACAGTGCCGTCAGCATCACCTCTCCGGACGTCCTGTTGACAGCCCCG
GGTGCTTTTGCCGGCAGTTCACTGACAGTGCATGATGGCGGTCATCTTACAGCACTGAACGGTCTTTTCAGCGACGGGCA
TATTCAGGCCGGTAAGAACGGCAAAATCACCCTGAGCGGTACACCGGTTAAAGATACGGCTAATCAGTATGCCCCTGCTG
TATATCTGACGGACGGATATGACCTGACCGGCGATAACGCAGCACTGGAAATTACCCGTGGAGCACATGCTTCCGGTGAT
ATTCATGCCTCTGCGGCATCAACAGTTACCATCGGGTCTGACACGCCGGCAGAACTGGCTTCTGCGGAAACGGCTGCATC
GGCGTTTGCCGGCAGTCTTCTTGAGGGCTATAACGCAGCATTCAATGGTGCCATAACCGGTGGCAGGGCTGATGTCAGTA
TGCATAATGCACTGTGGACTCTGGGTGGGGACTCTGCCATCCACAGTCTTACCGTCAGAAACAGCCGTATTAGTTCTGAA
GGAGACCGTACATTCCGTACCCTGACGGTGAATAAACTGGATGCAACAGGCAGTGATTTTGTTTTGCGTACGGACCTGAA
AAATGCCGATAAAATTAATGTGACTGAAAAAGCCACTGGTTCAGATAACAGCCTGAACGTCAGCTTTATGAATAATCCTG
CTCAGGGACAGGCCCTGAATATTCCTCTGGTCACGGCACCGGCGGGAACTTCAGCAGAGATGTTTAAGGCCGGCACCCGG
GTGACAGGTTTCAGTCGGGTGACCCCAACCCTGCATGTTGATACCAGTGGTGGCAATACGAAGTGGATACTGGATGGTTT
TAAAGCGGAGGCTGATAAAGCCGCTGCCGCGAAGGCTGACAGTTTCATGAATGCCGGGTATAAAAACTTCATGACGGAAG
TTAACAATCTGAACAAACGTATGGGTGACCTGCGTGACACAAACGGTGATGCCGGTGCCTGGGCGCGCATCATGAGTGGT
GCCGGTTCTGCAGACGGTGGTTACAGTGATAATTACACCCATGTTCAGGTCGGCTTTGACAAAAAACATGAACTGGACGG
TGTGGACCTGTTTACCGGTGTCACGATGACCTATACCGACAGCAGTGCAGACAGCCATGCATTCAGCGGAAAGACGAAAT
CGGTGGGGGGCGGTCTGTATGCTTCAGCATTGTTTGAGTCCGGTGCCTATATCGATTTGATTGGTAAATATATTCACCAT
GACAATGATTACACAGGTAACTTTGCTAGCCTGGGAACGAAACACTACAACACCCATTCCTGGTATGCCGGTGCTGAAAC
GGGTTACCGCTATCACCTGACAGAGGACACGTTCATTGAGCCGCAGGCTGAACTGGTTTACGGCGCCGTGTCCGGGAAAA
CATTCCGCTGGAAAGACGGTGATATGGACCTGAGCATGAAGAACAGGGACTTCAGTCCGCTGGTTGGAAGAACAGGGGTT
GAACTGGGCAAGACCTTCAGTGGTAAGGACTGGAGTGTGACGGCCCGTGCCGGAACCAGCTGGCAGTTTGACCTGCTGAA
TAATGGAGAGACCGTACTGCGTGATGCGTCCGGGGAGAAACGGATAAAAGGAGAGAAGGACAGCCGGATGCTGTTTAATG
TTGGTATGAATGCGCAGATAAAGGACAATATGCGCTTTGGTCTGGAGTTTGAGAAGTCAGCCTTTGGTAAATATAACGTG
GATAATGCGGTAAACGCGAATTTCCGGTATATGTTCTGA

Protein sequence :
MNKVYSLKYCPVTGGLIAVSELARRVIKKTCRRLTHILLAGIPAICLCYSQISQAGIVRSDIAYQIYRDFAENKGLFVPG
ANDIPVYDKDGKLVGRLGKAPMADFSSVSSNGVATLVSPQYIVSVKHNGGYRSVSFGNGKNTYSLVDRNNHPSIDFHAPR
LNKLVTEVIPSAVTSEGTKANAYKYTERYTAFYRVGSGTQYTKDKDGNLVKVAGGYAFKTGGTTGVPLISDATIVSNPGQ
TYNPVNGPLPDYGAPGDSGSPLFAYDKQQKKWVIVAVLRAYAGINGATNWWNVIPTDYLNQVMQDDFDAPVDFVSGLGPL
NWTYDKTSGTGTLSQGSKNWTMHGQKDNDLNAGKNLVFSGQNGAIILKDSVTQGAGYLEFKDSYTVSAESGKTWTGAGII
TDKGTNVTWKVNGVAGDNLHKLGEGTLTINGTGVNPGGLKTGDGIVVLNQQADTAGNIQAFSSVNLASGRPTVVLGDARQ
VNPDNISWGYRGGKLDLNGNAVTFTRLQAADYGAVITNNAQQKSQLLLDLKAQDTNVSEPTIGNISPFGGTGTPGNLYSM
ILNSQTRFYILKSASYGNTLWGNSLNDPAQWEFVGMNKNKAVQTVKDRILAGRAKQPVIFHGQLTGNMDVAIPQVPGGRK
VIFDGSVNLPEGTLSQDSGTLIFQGHPVIHASISGSAPVSLNQKDWENRQFTMKTLSLKDADFHLSRNASLNSDIKSDNS
HITLGSDRAFVDKNDGTGNYVIPEEGTSVPDTVNDRSQYEGNITLNHNSALDIGSRFTGGIDAYDSAVSITSPDVLLTAP
GAFAGSSLTVHDGGHLTALNGLFSDGHIQAGKNGKITLSGTPVKDTANQYAPAVYLTDGYDLTGDNAALEITRGAHASGD
IHASAASTVTIGSDTPAELASAETAASAFAGSLLEGYNAAFNGAITGGRADVSMHNALWTLGGDSAIHSLTVRNSRISSE
GDRTFRTLTVNKLDATGSDFVLRTDLKNADKINVTEKATGSDNSLNVSFMNNPAQGQALNIPLVTAPAGTSAEMFKAGTR
VTGFSRVTPTLHVDTSGGNTKWILDGFKAEADKAAAAKADSFMNAGYKNFMTEVNNLNKRMGDLRDTNGDAGAWARIMSG
AGSADGGYSDNYTHVQVGFDKKHELDGVDLFTGVTMTYTDSSADSHAFSGKTKSVGGGLYASALFESGAYIDLIGKYIHH
DNDYTGNFASLGTKHYNTHSWYAGAETGYRYHLTEDTFIEPQAELVYGAVSGKTFRWKDGDMDLSMKNRDFSPLVGRTGV
ELGKTFSGKDWSVTARAGTSWQFDLLNNGETVLRDASGEKRIKGEKDSRMLFNVGMNAQIKDNMRFGLEFEKSAFGKYNV
DNAVNANFRYMF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
pic NP_708747.3 serine protease Not tested SHI-1 Protein 0.0 100
pic NP_838464.1 serine protease precurser Virulence SHI-1 Protein 0.0 100
she AAB58244.1 mucinase Virulence SHI-1 Protein 0.0 99
pic AAK00464.1 Pic Virulence SHI-1 Protein 0.0 99
unnamed CAC39286.1 hypothetical protein Not tested LPA Protein 0.0 53
vat AAO21903.1 vacuolating autotransporter toxin Virulence Not named Protein 0.0 48
unnamed CAD66214.1 putative hemoglobin protease Not tested PAI III 536 Protein 0.0 47
vat YP_851472.1 vacuolating autotransporter Not tested PAI III APEC-O1 Protein 0.0 47

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
O3K_23140 YP_006781278.1 Serine protease pic precursor (ShMu) VFG0635 Protein 0.0 100
O3K_23140 YP_006781278.1 Serine protease pic precursor (ShMu) VFG0861 Protein 0.0 99
O3K_23140 YP_006781278.1 Serine protease pic precursor (ShMu) VFG0903 Protein 0.0 97
O3K_23140 YP_006781278.1 Serine protease pic precursor (ShMu) VFG1689 Protein 0.0 47
O3K_23140 YP_006781278.1 Serine protease pic precursor (ShMu) VFG0904 Protein 0.0 47