Name : c0350 (c0350) Accession : NP_752289.1 Strain : Escherichia coli CFT073 Genome accession: NC_004431 Putative virulence/resistance : Virulence Product : Pic serine protease Function : - COG functional category : M : Cell wall/membrane/envelope biogenesis COG ID : COG3468 EC number : - Position : 326209 - 330324 bp Length : 4116 bp Strand : - Note : Residues 1 to 1371 of 1371 are 96.28 pct identical to residues 1 to 1372 of 1372 from GenPept.129 : >gb|AAD23953.1|AF097644_1 (AF097644) Pic serine protease precursor [Escherichia coli] DNA sequence : GTGAATAAAGTTTATTCTCTTAAATATTGCCCTGTCACCGGGGGACTTATTGTTGTCTCTGAACTTGCCAGCAGGGTAAT AAAAAAGACATGCCGAAGATTAACGCATATTCTTCTGGCTGGCATTCCAGCAGTCTATCTGTATTACCCTCAGATATCCC AGGCGGGCATTGTCCGTTCCGATATTGCCTATCAGATTTATCGTGATTTCGCTGAAAACAAAGGGCTTTTTGTACCTGGT GCCACAGATATTCCGGTATATGATAAGGACGGAAAACTTGTGGGGAGACTGGATAAAGCCCCAATGGCCGATTTCAGCAG TGTGAGCTCAAATGGCGTTGCTACGCTTGTATCACCTCAGTATATCGTCAGCGTAAAGCATAACGGAGGATATCAGAGTG TGAGCTTTGGTAATGGGAAAAATACATATTCCCTTGTTGACCGTAATAACCACTCTTCTGTTGACTTCCATGCTCCACGT CTGAATAAACTGGTTACAGAAGTTATTCCCTCAGCGATAACATCAGAAGGAACCAAAGCCAATGCTTATAAAGACACTGA ACGTTACACCGCTTTTTATCGGGTGGGTAGTGGTACGCAGTACACTAAGGACAAGGACGGAAATTTAGTTAAGGTTGCCG GCGGATATGCTTTTAAAACAGGAGGAACCACAGGAGTTCCTCTGATATCTGATGCAACAATAGTCTCTAATCCCGGGCAA ACCTATAATCCTGTAAACGGACCTTTACCTGACTATGGTGCCCCTGGGGACAGTGGTTCTCCTTTGTTTGCTTATGATGA ACAACAAAAAAAATGGGTTATTGTTGCTGTATTAAGAGCATATGCAGGTATTAATGGTGCTACGAACTGGTGGAATGTCA TACCAACAGATTATCTGAACCAGGTTATGCAGGACGATTTCGATGCCCCCGTGGACTTTGTTTCCGGACTGCCCCCCCTG AACTGGACATACGACAAAACATCAGGCACAGGCACCCTGAGCCAGGGCAGTAAAAACTGGACCATGCACGGGCAGAAAGA CAATGATCTCAATGCCGGTAAAAATCTGGTATTCAGCGGGCAGAACGGTGCAATTGTCCTGAAAGACAGTGTGACTCAGG GGGCCGGTTATCTCGAATTTAAAGACAGTTACACCGTATCTGCTGAATCCGGAAAAACCTGGACGGGTGCCGGCATTATT ACTGACAAGGGGACGAATGTGACCTGGAAGGTCAACGGGGTTGCCGGTGACAACCTGCATAAATTGGGGGAAGGAACCCT GACCATAAACGGAACAGGTGTAAACCCGGGAGGACTGAAAACGGGAGACGGTACCGTTGTACTTAACCAGCAGGCAGACA 
CTGCAGGTAATGTTCAGGCCTTCAGTTCCGTGAACCTCGCCAGCGGACGACCGACCGTGGTGCTCGGAGATGCCCGTCAG GTCAATCCGGATAACATTTCATGGGGATACCGGGGAGGTAAGCTTGACCTTAATGGTAATGCCGTTACCTTCACCCGACT GCAGGCTGCCGATTACGGGGCGGTGATTACAAATAATGCACAGCAAAAATCCCGGCTTTTACTGGATCTTAAGGCTCAGG ATACAAATGTCAGTGTTCCGATTGGCAGTATATCCCCCTTTGGCGGTACCGGCACACCGGGAAACCTATACAGCATGATA CTCAACGGCCAGACCCGCTTCTATATTCTGAAATCTGCCAGCTATGGTAACACCCTGTGGGGGAACAGCCTGAATGACCC GGCTCAGTGGGAATTTGTTGGCACGGACAAAAACAAAGCAGTTCAGACAGTAAAAGACCGGATCCTGGCCGGGCGGGCAA AACAACCCGTTATCTTTCATGGTCAGCTGACCGGGAATATGGATGTCACCATTCCACAGCTGCCGGGGGGAAGAAAGGTC ATCCTTGATGGTAGCGTGAACCTGCCGGAAGGTACCCTGAGTGAGGACAGTGGCACCCTGATATTCCAGGGGCATCCGGT TATCCACGCCTCCGTCAGTGGCAGTGCGCCGGTCAGCCTGAACCAGAAAGACTGGGAAAACCGCCAGTTCATAATGAAAA CACTGTCGCTGAAAGATGCTGACTTCCATCTTTCACGTAACGCCTCGCTGAACAGTGACATTAAGTCGGATAACAGCCAT ATCACACTGGGAAGTGACAGGGTATTTGTGGATAAAAATGACGGAACAGGAAATTATGTCATTCTGGAGGAAGGTACCTC TGTCCCGGACACCGTGAATGACAGGAGCCAGTATGAAGGGAATATTACGCTGGACCATAACTCAACCCTGGATATCGGCA GCCGGTTCACCGGAGGGATTGAAGCTTATGACAGTGCCGTCAGTATCACCTCTCCGGACGTCCTGTTAACAGCCCCGGGT GCTTTTGCCGGCAGTTCACTGACAGTGCATGATGGCGGTCATCTTACAGCACTGAACGGTCTTTTCAGCGACGGGCATAT TCAGGCCGGTAAGAACAGCAAAATCACCCTGAGCGGTACACCGGTTAAAGATACGGCTAACCAGTATGCCCCTGCTGTAT ATCTGACGGACGGATATGACCTGACCGGCGATAACGCAACACTGGAAATTACCCGTGGAGCACATGCTTCCGGTGATATT CATGCCTCTGCGGCATCAACAGTTACCATCGGGTCTGACACGCCGGCAGAACTGGCTTCTGCGGAAACGACTGCATCGGC GTTTGCCGGCAGTCTTCTTGAGGGCTATAACGCAGCATTCAATGGTGCCATAACCGGTGGCAGGGCTGATGTCAGTATGC ATAATGCACTGTGGACTCTGGGTGGGGACTCCGCCATCCACACTCTTACCGTCAGAAACAGCCGTATCAGTTCTGAAGGA GACCGTACTTTCCGTACCCTGACGGTGAATAAACTGGATGCAACAGGCAGTGATTTTGTTCTGCGTACGGACCTGAAAAA TGCCGATAAAATTAATGTGACTGAAAAAGCCACGGGTTCAGATAACAGCCTGAACGTCAGCTTTATGAAGGATCCGGCTC AGGGACAGTCCCTGAATATTCCTCTGGTCACGGCACCGGCGGGAACTTCAGCAGAGATGTTTAAGGCCGGCACCCGGATG ATAGGTTTCAGTCGGGTGACTCCAACCCTGCATGTTGACACCAGTGGTGGCAATACGAAGTGGATACTGGACGGTTTTAA AGCGGAGGCTGATAAAGCCGCTGCCGCGAAGGCTGACAGTTTCATGAATGCCGGGTATAAAAACTTCATGACGGAAGTCA 
ACAATCTGAACAAACGTATGGGTGACCTGCGTGACACAAACGGTGATGCCGGAGCCTGGGCGCGCATCATGAGTGGTGCC GGTTCTGCAGACGGTGGTTACAGTGATAATTACACCCATGTTCAGGTCGGCTTTGACAAAAAACATGAACTGGACGGAGT GGACCTGTTTACCGGTGTCACGATGACCTATACCGACAGCAGTGCAGACAGCCATGCGTTCAGCGGCAAGACGAAATCGG TGGGGGGCGGTCTGTATGCTTCAGCATTGTTTGAGTCCGGTGCCTATATCGATTTGATTGGTAAATATATTCACCATGAC AATGATTACACAGGTAACTTTGCCGGTCTGGGAACGAAACACTACAACACCCATTCCTGGTATGCCGGTGCTGAAACGGG TTACCGCTATCACCTGACAGAGGAAACGTTCATTGAGCCGCAGGCTGAACTGGTTTACGGCGCCGTGTCCGGGAAAACAT TCCGCTGGAAAGACGGTGATATGGACCTGAGCATGAAGAACAGGGACTTCAGTCCGCTGATTGGAAGAACAGGGATTGAA CTGGGCAAGACCTTCAGTGGTAAGGACTGGAGTGTGACGGCCCGTGCCGGGACCAGCTGGCAGTTTGACCTACTGAATAA TGGTGAGACGGTACTGCGTGATGCGTCCGGGGAGAAACGGATAAAAGGCGAGAAAGACAGCCGGATGCTGTTTAATGTTG GTATGAATGCGCAGATAAAGGACAATATGCGCTTTGGTCTGGAGTTTGAGAAGTCAGCCTTTGGTAAATATAACGTGGAT AATGCGGTAAACGCAAATTTCCGGTATATGTTCTGA Protein sequence : MNKVYSLKYCPVTGGLIVVSELASRVIKKTCRRLTHILLAGIPAVYLYYPQISQAGIVRSDIAYQIYRDFAENKGLFVPG ATDIPVYDKDGKLVGRLDKAPMADFSSVSSNGVATLVSPQYIVSVKHNGGYQSVSFGNGKNTYSLVDRNNHSSVDFHAPR LNKLVTEVIPSAITSEGTKANAYKDTERYTAFYRVGSGTQYTKDKDGNLVKVAGGYAFKTGGTTGVPLISDATIVSNPGQ TYNPVNGPLPDYGAPGDSGSPLFAYDEQQKKWVIVAVLRAYAGINGATNWWNVIPTDYLNQVMQDDFDAPVDFVSGLPPL NWTYDKTSGTGTLSQGSKNWTMHGQKDNDLNAGKNLVFSGQNGAIVLKDSVTQGAGYLEFKDSYTVSAESGKTWTGAGII TDKGTNVTWKVNGVAGDNLHKLGEGTLTINGTGVNPGGLKTGDGTVVLNQQADTAGNVQAFSSVNLASGRPTVVLGDARQ VNPDNISWGYRGGKLDLNGNAVTFTRLQAADYGAVITNNAQQKSRLLLDLKAQDTNVSVPIGSISPFGGTGTPGNLYSMI LNGQTRFYILKSASYGNTLWGNSLNDPAQWEFVGTDKNKAVQTVKDRILAGRAKQPVIFHGQLTGNMDVTIPQLPGGRKV ILDGSVNLPEGTLSEDSGTLIFQGHPVIHASVSGSAPVSLNQKDWENRQFIMKTLSLKDADFHLSRNASLNSDIKSDNSH ITLGSDRVFVDKNDGTGNYVILEEGTSVPDTVNDRSQYEGNITLDHNSTLDIGSRFTGGIEAYDSAVSITSPDVLLTAPG AFAGSSLTVHDGGHLTALNGLFSDGHIQAGKNSKITLSGTPVKDTANQYAPAVYLTDGYDLTGDNATLEITRGAHASGDI HASAASTVTIGSDTPAELASAETTASAFAGSLLEGYNAAFNGAITGGRADVSMHNALWTLGGDSAIHTLTVRNSRISSEG DRTFRTLTVNKLDATGSDFVLRTDLKNADKINVTEKATGSDNSLNVSFMKDPAQGQSLNIPLVTAPAGTSAEMFKAGTRM IGFSRVTPTLHVDTSGGNTKWILDGFKAEADKAAAAKADSFMNAGYKNFMTEVNNLNKRMGDLRDTNGDAGAWARIMSGA 
GSADGGYSDNYTHVQVGFDKKHELDGVDLFTGVTMTYTDSSADSHAFSGKTKSVGGGLYASALFESGAYIDLIGKYIHHD NDYTGNFAGLGTKHYNTHSWYAGAETGYRYHLTEETFIEPQAELVYGAVSGKTFRWKDGDMDLSMKNRDFSPLIGRTGIE LGKTFSGKDWSVTARAGTSWQFDLLNNGETVLRDASGEKRIKGEKDSRMLFNVGMNAQIKDNMRFGLEFEKSAFGKYNVD NAVNANFRYMF |
Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity |
pic | NP_838464.1 | serine protease precursor | Virulence | SHI-1 | Protein | 0.0 | 97 |
pic | NP_708747.3 | serine protease | Not tested | SHI-1 | Protein | 0.0 | 97 |
she | AAB58244.1 | mucinase | Virulence | SHI-1 | Protein | 0.0 | 97 |
pic | AAK00464.1 | Pic | Virulence | SHI-1 | Protein | 0.0 | 97 |
unnamed | CAC39286.1 | hypothetical protein | Not tested | LPA | Protein | 0.0 | 53 |
unnamed | CAD66214.1 | putative hemoglobin protease | Not tested | PAI III 536 | Protein | 0.0 | 48 |
vat | YP_851472.1 | vacuolating autotransporter | Not tested | PAI III APEC-O1 | Protein | 0.0 | 48 |
vat | AAO21903.1 | vacuolating autotransporter toxin | Virulence | Not named | Protein | 0.0 | 48 |
Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity |
c0350 | NP_752289.1 | Pic serine protease | VFG0903 | Protein | 0.0 | 100 |
c0350 | NP_752289.1 | Pic serine protease | VFG0635 | Protein | 0.0 | 97 |
c0350 | NP_752289.1 | Pic serine protease | VFG0861 | Protein | 0.0 | 97 |
c0350 | NP_752289.1 | Pic serine protease | VFG0904 | Protein | 0.0 | 48 |
c0350 | NP_752289.1 | Pic serine protease | VFG1689 | Protein | 0.0 | 48 |
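The Position, Length, and protein-length fields at the top of this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical and not part of any database tooling, and it assumes 1-based inclusive coordinates and a single terminal stop codon (the DNA sequence above ends in TGA).

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(nt_len: int) -> int:
    """Number of residues encoded by a CDS that includes one stop codon."""
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return nt_len // 3 - 1


# Values taken from the record: Position 326209 - 330324 bp.
length_bp = gene_length(326209, 330324)
print(length_bp)                  # 4116, matching "Length : 4116 bp"
print(protein_length(length_bp))  # 1371, matching "Residues 1 to 1371 of 1371"
```

Under these assumptions the reported gene length (4116 bp) and protein length (1371 residues) agree with the coordinates.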