Gene Information

Name : O3K_04520 (O3K_04520)
Accession : YP_006777626.1
Strain : Escherichia coli 2011C-3493
Genome accession: NC_018658
Putative virulence/resistance : Virulence
Product : Serine protease pic precursor (ShMu)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 919718 - 923836 bp
Length : 4119 bp
Strand : -
Note : COG3468 Type V secretory pathway, adhesin AidA
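
The Position and Length fields above agree if the coordinates are read as 1-based and inclusive. The following is a minimal sketch of that arithmetic; the function names are hypothetical and not part of the source database, and the residue count assumes the coding sequence ends with a single stop codon.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = feature_length(919718, 923836)
print(length_bp)                           # 4119
print(expected_protein_length(length_bp))  # 1372
```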

DNA sequence :
GTGAATAAAGTTTATTCTCTTAAATATTGCCCCGTCACCGGGGGGCTTATTGCTGTCTCTGAACTTGCCCGCAGGGTAAT
AAAAAAGACATGCCGAAGATTAACGCATATTCTTCTGGCTGGCATTCCAGCAATCTGTCTGTGTTACTCTCAGATATCCC
AGGCGGGTATTGTCCGTTCCGATATTGCCTATCAGATTTATCGTGATTTCGCCGAAAACAAAGGGCTTTTTGTACCTGGT
GCCAATGATATTCCGGTATATGATAAGGACGGAAAACTTGTGGGAAGACTGGGTAAAGCCCCAATGGCCGATTTCAGCAG
TGTGAGCTCAAATGGCGTTGCTACGCTTGTATCGCCTCAGTATATCGTCAGCGTAAAGCATAACGGAGGATATCGGAGTG
TGAGCTTTGGTAATGGGAAAAATACATATTCCCTTGTTGACCGTAATAACCACCCTTCTATTGACTTCCATGCTCCACGT
CTGAATAAACTGGTTACAGAAGTTATTCCCTCAGCGGTAACATCAGAAGGAACCAAAGCCAATGCTTATAAATACACTGA
ACGTTACACCGCTTTTTATCGGGTGGGTAGTGGTACGCAGTACACTAAGGACAAGGACGGAAATTTAGTTAAGGTTGCCG
GTGGATATGCTTTTAAAACAGGAGGAACCACAGGAGTTCCTCTGATATCTGATGCAACAATAGTCTCTAATCCCGGGCAA
ACTTATAATCCTGTAAACGGCCCTTTACCTGACTATGGAGCCCCTGGGGATAGTGGTTCTCCTTTGTTTGCTTATGATAA
ACAACAAAAAAAATGGGTTATTGTTGCTGTATTAAGAGCATATGCAGGTATTAATGGTGCTACGAACTGGTGGAATGTCA
TACCAACAGATTATCTGAACCAGGTTATGCAGGACGATTTCGATGCCCCCGTAGACTTTGTTTCCGGACTGGGCCCCCTG
AACTGGACATACGACAAAACATCAGGCACAGGTACCCTGAGCCAGGGCAGTAAAAACTGGACCATGCACGGGCAGAAAGA
CAATGACCTCAATGCCGGTAAAAATCTGGTATTCAGCGGGCAGAATGGTGCAATTATCCTGAAAGACAGTGTGACTCAGG
GTGCCGGTTATCTCGAATTTAAAGACAGTTACACCGTATCTGCTGAATCCGGAAAAACATGGACGGGTGCCGGCATTATT
ACTGACAAGGGGACGAATGTAACCTGGAAGGTCAACGGCGTTGCCGGTGACAACTTGCATAAGCTGGGGGAAGGAACCCT
GACCATAAACGGAACAGGTGTAAACCCGGGAGGACTGAAAACGGGAGACGGTATCGTTGTACTTAACCAGCAGGCAGACA
CTGCAGGTAATATCCAGGCCTTCAGTTCAGTGAACCTCGCCAGCGGACGTCCGACCGTGGTGCTCGGGGATGCCCGTCAG
GTCAATCCGGATAACATTTCATGGGGATACCGGGGAGGTAAGCTTGACCTTAATGGTAATGCCGTTACCTTCACCCGACT
GCAGGCTGCTGATTACGGGGCGGTGATTACAAATAATGCACAGCAAAAATCCCAGCTTTTACTGGATCTTAAGGCTCAGG
ATACAAATGTCAGTGAACCGACGATTGGAAATATATCCCCCTTTGGTGGTACCGGCACTCCAGGAAACCTGTACAGCATG
ATACTCAACAGCCAGACCCGCTTCTATATTCTGAAATCTGCCAGCTATGGTAACACTCTGTGGGGGAACAGCCTGAATGA
TCCGGCTCAGTGGGAGTTTGTTGGCATGAACAAAAACAAAGCAGTTCAGACAGTAAAAGATAGGATCCTGGCCGGGCGGG
CAAAACAACCCGTTATCTTTCATGGTCAGCTGACCGGGAATATGGATGTCGCCATTCCACAGGTGCCGGGGGGAAGAAAG
GTAATCTTTGATGGTAGCGTGAACCTGCCGGAAGGTACCCTGAGTCAGGACAGTGGCACCCTGATATTCCAGGGACATCC
GGTTATCCATGCCTCCATCAGTGGCAGTGCACCGGTCAGCCTGAACCAGAAAGACTGGGAAAACCGTCAGTTTACAATGA
AAACACTGTCGCTGAAAGACGCTGACTTCCATCTTTCACGTAACGCCTCGCTGAACAGTGACATTAAGTCGGATAACAGC
CATATCACACTGGGAAGTGACAGGGCATTTGTGGATAAAAATGACGGAACAGGAAATTATGTCATTCCGGAGGAAGGTAC
CTCTGTCCCGGACACCGTGAATGACAGGAGCCAGTATGAAGGGAATATTACGCTGAACCATAACTCAGCCCTGGATATCG
GCAGCAGGTTCACCGGGGGGATTGACGCTTATGACAGTGCCGTCAGCATCACCTCTCCGGACGTCCTGTTGACAGCCCCG
GGTGCTTTTGCCGGCAGTTCACTGACAGTGCATGATGGCGGTCATCTTACAGCACTGAACGGTCTTTTCAGCGACGGGCA
TATTCAGGCCGGTAAGAACGGCAAAATCACCCTGAGCGGTACACCGGTTAAAGATACGGCTAATCAGTATGCCCCTGCTG
TATATCTGACGGACGGATATGACCTGACCGGCGATAACGCAGCACTGGAAATTACCCGTGGAGCACATGCTTCCGGTGAT
ATTCATGCCTCTGCGGCATCAACAGTTACCATCGGGTCTGACACGCCGGCAGAACTGGCTTCTGCGGAAACGGCTGCATC
GGCGTTTGCCGGCAGTCTTCTTGAGGGCTATAACGCAGCATTCAATGGTGCCATAACCGGTGGCAGGGCTGATGTCAGTA
TGCATAATGCACTGTGGACTCTGGGTGGGGACTCTGCCATCCACAGTCTTACCGTCAGAAACAGCCGTATTAGTTCTGAA
GGAGACCGTACATTCCGTACCCTGACGGTGAATAAACTGGATGCAACAGGCAGTGATTTTGTTTTGCGTACGGACCTGAA
AAATGCCGATAAAATTAATGTGACTGAAAAAGCCACTGGTTCAGATAACAGCCTGAACGTCAGCTTTATGAATAATCCTG
CTCAGGGACAGGCCCTGAATATTCCTCTGGTCACGGCACCGGCGGGAACTTCAGCAGAGATGTTTAAGGCCGGCACCCGG
GTGACAGGTTTCAGTCGGGTGACCCCAACCCTGCATGTTGATACCAGTGGTGGCAATACGAAGTGGATACTGGATGGTTT
TAAAGCGGAGGCTGATAAAGCCGCTGCCGCGAAGGCTGACAGTTTCATGAATGCCGGGTATAAAAACTTCATGACGGAAG
TTAACAATCTGAACAAACGTATGGGTGACCTGCGTGACACAAACGGTGATGCCGGTGCCTGGGCGCGCATCATGAGTGGT
GCCGGTTCTGCAGACGGTGGTTACAGTGATAATTACACCCATGTTCAGGTCGGCTTTGACAAAAAACATGAACTGGACGG
TGTGGACCTGTTTACCGGTGTCACGATGACCTATACCGACAGCAGTGCAGACAGCCATGCATTCAGCGGAAAGACGAAAT
CGGTGGGGGGCGGTCTGTATGCTTCAGCATTGTTTGAGTCCGGTGCCTATATCGATTTGATTGGTAAATATATTCACCAT
GACAATGATTACACAGGTAACTTTGCTAGCCTGGGAACGAAACACTACAACACCCATTCCTGGTATGCCGGTGCTGAAAC
GGGTTACCGCTATCACCTGACAGAGGACACGTTCATTGAGCCGCAGGCTGAACTGGTTTACGGCGCCGTGTCCGGGAAAA
CATTCCGCTGGAAAGACGGTGATATGGACCTGAGCATGAAGAACAGGGACTTCAGTCCGCTGGTTGGAAGAACAGGGGTT
GAACTGGGCAAGACCTTCAGTGGTAAGGACTGGAGTGTGACGGCCCGTGCCGGAACCAGCTGGCAGTTTGACCTGCTGAA
TAATGGAGAGACCGTACTGCGTGATGCGTCCGGGGAGAAACGGATAAAAGGAGAGAAGGACAGCCGGATGCTGTTTAATG
TTGGTATGAATGCGCAGATAAAGGACAATATGCGCTTTGGTCTGGAGTTTGAGAAGTCAGCCTTTGGTAAATATAACGTG
GATAATGCGGTAAACGCGAATTTCCGGTATATGTTCTGA

Protein sequence :
MNKVYSLKYCPVTGGLIAVSELARRVIKKTCRRLTHILLAGIPAICLCYSQISQAGIVRSDIAYQIYRDFAENKGLFVPG
ANDIPVYDKDGKLVGRLGKAPMADFSSVSSNGVATLVSPQYIVSVKHNGGYRSVSFGNGKNTYSLVDRNNHPSIDFHAPR
LNKLVTEVIPSAVTSEGTKANAYKYTERYTAFYRVGSGTQYTKDKDGNLVKVAGGYAFKTGGTTGVPLISDATIVSNPGQ
TYNPVNGPLPDYGAPGDSGSPLFAYDKQQKKWVIVAVLRAYAGINGATNWWNVIPTDYLNQVMQDDFDAPVDFVSGLGPL
NWTYDKTSGTGTLSQGSKNWTMHGQKDNDLNAGKNLVFSGQNGAIILKDSVTQGAGYLEFKDSYTVSAESGKTWTGAGII
TDKGTNVTWKVNGVAGDNLHKLGEGTLTINGTGVNPGGLKTGDGIVVLNQQADTAGNIQAFSSVNLASGRPTVVLGDARQ
VNPDNISWGYRGGKLDLNGNAVTFTRLQAADYGAVITNNAQQKSQLLLDLKAQDTNVSEPTIGNISPFGGTGTPGNLYSM
ILNSQTRFYILKSASYGNTLWGNSLNDPAQWEFVGMNKNKAVQTVKDRILAGRAKQPVIFHGQLTGNMDVAIPQVPGGRK
VIFDGSVNLPEGTLSQDSGTLIFQGHPVIHASISGSAPVSLNQKDWENRQFTMKTLSLKDADFHLSRNASLNSDIKSDNS
HITLGSDRAFVDKNDGTGNYVIPEEGTSVPDTVNDRSQYEGNITLNHNSALDIGSRFTGGIDAYDSAVSITSPDVLLTAP
GAFAGSSLTVHDGGHLTALNGLFSDGHIQAGKNGKITLSGTPVKDTANQYAPAVYLTDGYDLTGDNAALEITRGAHASGD
IHASAASTVTIGSDTPAELASAETAASAFAGSLLEGYNAAFNGAITGGRADVSMHNALWTLGGDSAIHSLTVRNSRISSE
GDRTFRTLTVNKLDATGSDFVLRTDLKNADKINVTEKATGSDNSLNVSFMNNPAQGQALNIPLVTAPAGTSAEMFKAGTR
VTGFSRVTPTLHVDTSGGNTKWILDGFKAEADKAAAAKADSFMNAGYKNFMTEVNNLNKRMGDLRDTNGDAGAWARIMSG
AGSADGGYSDNYTHVQVGFDKKHELDGVDLFTGVTMTYTDSSADSHAFSGKTKSVGGGLYASALFESGAYIDLIGKYIHH
DNDYTGNFASLGTKHYNTHSWYAGAETGYRYHLTEDTFIEPQAELVYGAVSGKTFRWKDGDMDLSMKNRDFSPLVGRTGV
ELGKTFSGKDWSVTARAGTSWQFDLLNNGETVLRDASGEKRIKGEKDSRMLFNVGMNAQIKDNMRFGLEFEKSAFGKYNV
DNAVNANFRYMF
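
The listed DNA sequence begins with GTG and ends with TGA, so it appears to be given in coding orientation even though the Strand field is "-". The relationship between the two sequences can be sketched as a plain codon-by-codon translation. This is an illustration only: `translate_cds` is a hypothetical helper, the codon assignments are those of the standard genetic code, and an initial GTG or TTG is read as methionine, as is usual for bacterial start codons.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
START_CODONS = {"ATG", "GTG", "TTG"}  # assumed bacterial initiators


def translate_cds(dna: str) -> str:
    """Translate a coding-strand CDS; stops at the first stop codon.

    Raises KeyError on codons with ambiguous bases (e.g. N).
    """
    dna = "".join(dna.split()).upper()  # tolerate line-wrapped input
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the DNA sequence listed in this record.
print(translate_cds("GTGAATAAAGTTTATTCTCTTAAATATTGC"))  # MNKVYSLKYC
```

The output matches the first ten residues of the protein sequence in this record.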

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
pic | NP_838464.1 | serine protease precursor | Virulence | SHI-1 | Protein | 0.0 | 100
pic | NP_708747.3 | serine protease | Not tested | SHI-1 | Protein | 0.0 | 100
she | AAB58244.1 | mucinase | Virulence | SHI-1 | Protein | 0.0 | 99
pic | AAK00464.1 | Pic | Virulence | SHI-1 | Protein | 0.0 | 99
unnamed | CAC39286.1 | hypothetical protein | Not tested | LPA | Protein | 0.0 | 53
vat | AAO21903.1 | vacuolating autotransporter toxin | Virulence | Not named | Protein | 0.0 | 48
vat | YP_851472.1 | vacuolating autotransporter | Not tested | PAI III APEC-O1 | Protein | 0.0 | 47
unnamed | CAD66214.1 | putative hemoglobin protease | Not tested | PAI III 536 | Protein | 0.0 | 47

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
O3K_04520 | YP_006777626.1 | Serine protease pic precursor (ShMu) | VFG0635 | Protein | 0.0 | 100
O3K_04520 | YP_006777626.1 | Serine protease pic precursor (ShMu) | VFG0861 | Protein | 0.0 | 99
O3K_04520 | YP_006777626.1 | Serine protease pic precursor (ShMu) | VFG0903 | Protein | 0.0 | 97
O3K_04520 | YP_006777626.1 | Serine protease pic precursor (ShMu) | VFG0904 | Protein | 0.0 | 47
O3K_04520 | YP_006777626.1 | Serine protease pic precursor (ShMu) | VFG1689 | Protein | 0.0 | 47