Name : papC (c3590)
Accession : NP_755465.1
PAI name : PAI I CFT073
PAI accession : NC_004431_P1
Strain : Escherichia coli 042
Virulence or Resistance : Virulence
Product : PapC protein
Function : -
Note : Residues 5 to 836 of 840 are 43.20% identical to residues 1 to 842 of 879 from EDL933 : z3600
Homologs in the searched genomes : 270 hits ( 270 protein-level )
Publication :
-Welch,R.A., Burland,V., Plunkett,G. III, Redford,P., Roesch,P., Rasko,D., Buckles,E.L., Liou,S.R., Boutin,A., Hackett,J., Stroud,D., Mayhew,G.F., Rose,D.J., Zhou,S., Schwartz,D.C., Perna,N.T., Mobley,H.L., Donnenberg,M.S. and Blattner,F.R., "Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli", Proc. Natl. Acad. Sci. U.S.A. 99 (26), 17020-17024 (2002) PUBMED 12471157.
-Welch,R.A., Burland,V., Plunkett,G. III, Redford,P., Roesch,P., Rasko,D., Buckles,E.L., Liou,S.R., Boutin,A., Hackett,J., Stroud,D., Mayhew,G.F., Rose,D.J., Zhou,S., Schwartz,D.C., Perna,N.T., Mobley,H.L., Donnenberg,M.S. and Blattner,F.R., "Direct Submission", Submitted (10-SEP-2004) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.
-Welch,R.A., Burland,V., Plunkett,G.D. III, Redford,P., Roesch,P., Rasko,D.A., Buckles,E.L., Liou,S.-R., Boutin,A., Hackett,J., Stroud,D., Mayhew,G.F., Rose,D.J., Zhou,S., Schwartz,D.C., Perna,N.T., Mobley,H.L.T., Donnenberg,M.S. and Blattner,F.R., "Direct Submission", Submitted (20-JUN-2002) Genetics Laboratory, University of Wisconsin - Madison, 445 Henry Mall, Madison, WI 53706, USA.
DNA sequence :
GTGATGCGTGGAATGAAAGACAGAATACCTTTTGCAGTCAACAATATTACCTGTGTGATATTGTTGTCTCTGTTTTGTAA
CGCAGCCAGTGCCGTTGAGTTTAATACAGATGTACTTGACGCGGCGGACAAGAAAAATATTGACTTCACCCGTTTTTCAG
AAGCCGGTTATGTTCTGCCGGGGCAATATCTTCTGGATGTGATTGTTAACGGGCAAAGTATTTCTCCCGCATCGTTACAG
ATTTCATTTGTTGAACCTCAGTCGTCAGGAGATAAGGCAGAAAAAAAATTGCCACAGGCCTGTCTGACATCAGATATGGT
CAGACTGATGGGGTTAACAGCAGAATCTCTGGATAAAGTTGTTTACTGGCATGATGGTCAGTGTGCGGATTTTCATGGGT
TGCCGGGAGTGGATATTCGTCCTGATACCGGAGCGGGCGTATTACGCATCAATATGCCGCAGGCCTGGCTTGAGTATTCT
GATGCCACCTGGCTGCCTCCCTCACGCTGGGACGACGGCATTCCCGGACTGATGCTGGATTATAACCTCAACGGGACGGT
TTCCCGTAATTATCAGGGAGGAGACTCTCATCAGTTCAGTTATAACGGGACTGTGGGGGGGAATCTGGGGCCCTGGCGCC
TGCGGGCTGACTATCAGGGAAGCCAGGAGCAGAGCCGCTACAACGGGGAAAAAACGACAAACAGAAATTTCACATGGAGT
CGCTTTTATCTGTTCCGTGCCATTCCACGATGGCGGGCAAACCTGACGCTGGGCGAGAATAATATCAACTCAGATATATT
CCGGTCATGGAGTTATACGGGAGCCAGCCTGGAAAGCGATGACCGGATGCTGCCACCCAGACTGCGAGGCTATGCACCGC
AGATTACCGGGATTGCGGAGACTAATGCCCGTGTTGTGGTGTCGCAGCAGGGACGGGTGCTGTACGACTCGATGGTCCCC
GCAGGGCCATTCAGTATTCAGGACCTGGACAGTTCAGTTCGCGGACGTCTTGATGTTGAGGTTATTGAACAGAACGGACG
GAAGAAAACCTTTCAGGTCGATACGGCCTCGGTTCCTTATCTGACGCGTCCGGGACAGGTCCGGTACAAACTTGTCTCCG
GTCGCTCCCGCGGATACGGGCATGAGACCGAAGGGCCTGTATTTGCAACCGGAGAGGCGTCCTGGGGGCTCAGTAACCAG
TGGTCGCTGTATGGCGGGGCTGTGCTTGCCGGTGATTATAATGCACTGGCAGCCGGTGCCGGCTGGGACCTGGGTGTGCC
GGGGACCCTTTCCGCTGATATCACGCAGTCAGTAGCCCGTATTGAGGGAGAGAGAACGTTTCAGGGAAAATCCTGGCGTC
TTAGCTACTCCAAACGGTTTGATAATGCGGATGCCGACATTACGTTCGCCGGGTATCGTTTCTCAGAGCGAAACTATATG
ACCATGGAGCAGTACCTGAACGCCCGCTACCGTAATGATTACAGCAGTCGGGAAAAAGAGATGTATACCGTTACGCTGAA
TAAAAACGTGGCGGACTGGAACACCTCTTTTAACCTGCAGTACAGCCGTCAGACATACTGGGACATACGGAAAACGGACT
ATTATACGGTGAGCGTCAACCGCTACTTTAATGTTTTCGGACTGCAGGGTGTGGCGGTTGGATTGTCAGCCTCAAGGTCT
AAATATCTGGGGCGTGATAACGATTCTGCTTACCTGCGTATATCCGTGCCGCTGGGGACGGGGACAGCGAGCTACAGTGG
CAGTATGAGTAATGACCGTTATGTGAATATGGCCGGCTACACTGACATGTTCAATGACGGTCTGGACAGCTACAGCCTGA
ACGCCGGCCTTAACAGTGGCGGTGGACTGACATCGCAACGTCAGATTAATGCCTATTACAGTCATCGTAGTCCGCTGGCA
AATTTGTCCGCGAATATTGCATCCCTGCAGAAAGGATATACGTCTTTCGGCGTCAGTGCTTCCGGTGGGGCAACAATTAC
CGGAAAAGGTGCGGCGTTACATGCAGGGGGAATGTCCGGTGGAACACGTCTTCTTGTTGACACGGATGGTGTGGGAGGTG
TACCGGTTGATGGCGGGCAGGTGGTGACAAATCGCTGGGGAACGGGCGTGGTGACTGACATCAGCAGTTATTACCGGAAT
ACAACCTCTGTTGACCTGAAGCGCTTACCGGATGATGTGGAAGCAACCCGTTCTGTTGTGGAATCGGCGCTGACAGAAGG
TGCCATTGGTTACCGGAAATTCAGCGTGCTTAAAGGGAAACGTCTGTTTGCAATACTGCGTCTTGCTGATGGCTCTCAGC
CCCCGTTTGGTGCCAGTGTAACCAGTGAAAAAGGCCGGGAGCTGGGCATGGTGGCCGACGAAGGCCTTGCCTGGCTGAGT
GGCGTGACGCCGGGGGAAACCCTGTCGGTAAACTGGGATGGAAAAATACAGTGTCAGGTAAATGTACCGGAGACAGCAAT
ATCTGACCAGCAGTTATTGCTTCCCTGTACGCCTCAGAAATAA
Protein sequence :
MMRGMKDRIPFAVNNITCVILLSLFCNAASAVEFNTDVLDAADKKNIDFTRFSEAGYVLPGQYLLDVIVNGQSISPASLQ
ISFVEPQSSGDKAEKKLPQACLTSDMVRLMGLTAESLDKVVYWHDGQCADFHGLPGVDIRPDTGAGVLRINMPQAWLEYS
DATWLPPSRWDDGIPGLMLDYNLNGTVSRNYQGGDSHQFSYNGTVGGNLGPWRLRADYQGSQEQSRYNGEKTTNRNFTWS
RFYLFRAIPRWRANLTLGENNINSDIFRSWSYTGASLESDDRMLPPRLRGYAPQITGIAETNARVVVSQQGRVLYDSMVP
AGPFSIQDLDSSVRGRLDVEVIEQNGRKKTFQVDTASVPYLTRPGQVRYKLVSGRSRGYGHETEGPVFATGEASWGLSNQ
WSLYGGAVLAGDYNALAAGAGWDLGVPGTLSADITQSVARIEGERTFQGKSWRLSYSKRFDNADADITFAGYRFSERNYM
TMEQYLNARYRNDYSSREKEMYTVTLNKNVADWNTSFNLQYSRQTYWDIRKTDYYTVSVNRYFNVFGLQGVAVGLSASRS
KYLGRDNDSAYLRISVPLGTGTASYSGSMSNDRYVNMAGYTDMFNDGLDSYSLNAGLNSGGGLTSQRQINAYYSHRSPLA
NLSANIASLQKGYTSFGVSASGGATITGKGAALHAGGMSGGTRLLVDTDGVGGVPVDGGQVVTNRWGTGVVTDISSYYRN
TTSVDLKRLPDDVEATRSVVESALTEGAIGYRKFSVLKGKRLFAILRLADGSQPPFGASVTSEKGRELGMVADEGLAWLS
GVTPGETLSVNWDGKIQCQVNVPETAISDQQLLLPCTPQK
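
The residue ranges quoted in the Note field can be turned into alignment coverage with a little arithmetic. The following is a minimal sketch, assuming the ranges are 1-based and inclusive (the record does not state this explicitly); the function name `coverage` is hypothetical and used only for illustration.

```python
def coverage(start: int, end: int, length: int) -> float:
    """Fraction of a sequence spanned by an inclusive, 1-based residue range."""
    return (end - start + 1) / length


# Values taken from the Note field of this record.
# Assumption: ranges are 1-based and inclusive.
query_cov = coverage(5, 836, 840)    # papC (c3590), 840 residues
subject_cov = coverage(1, 842, 879)  # EDL933 z3600, 879 residues

print(f"query coverage:   {query_cov:.2%}")
print(f"subject coverage: {subject_cov:.2%}")
```

Under that assumption the alignment spans 832 of 840 query residues and 842 of 879 subject residues, so the 43.20% identity applies to nearly the full length of both proteins.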