PAI Gene Information


Name : papCb
Accession : AAL67415.1
PAI name : PAI II CFT073
PAI accession : AF447814
Strain : Escherichia coli 042
Virulence or Resistance: Virulence
Product : truncated PapC
Function : -
Note : ORF83; similar to Escherichia coli PapC encoded by GenBank Accession Number X61239
Homologs in the searched genomes :   16 hits    ( 16 protein-level )  
Publication :
    -Rasko,D.A., Phillips,J.A., Li,X. and Mobley,H.L., "Identification of DNA sequences from a second pathogenicity island of uropathogenic Escherichia coli CFT073: probes specific for uropathogenic populations", J. Infect. Dis. 184 (8), 1041-1049 (2001) PUBMED 11574920.

    -Rasko,D.A., Phillips,J.A., Li,X. and Mobley,H.L.T., "Direct Submission", Submitted (14-NOV-2001) Dept of Microbiology and Immunology, University of Maryland School of Medicine, 655 W. Baltimore Street, Baltimore, MD 21201, USA.


DNA sequence :
ATGCGTGGAATGAAAGACAGAATACCTTTTGCAGTCAACAATATTACCTGTGTGATATTGTTGTCTCTGTTTTGTAACGC
AGCCAGTGCCGTTGAGTTTAATACAGATGTACTTGACGCGGCGGACAAGAAAAATATTGACTTCACCCGTTTTTCAGAAG
CCGGCTATGTTCTGCCGGGGCAATATCTTCTGGATGTGATTGTTAACGGGCAAAGTATTTCTCCCGCATCGTTACAGATT
TCATTTGTTGAACCTCAGTCGTCAGGAGATAAGGCAGAAAAAAAATTGCCGCAGGCCTGCCTGACATCAGATATGGTCAG
ACTGATGGGGTTAACAGCAGAATCTCTGGATAAAGTTGTTTACTGGCATGATGGTCAGTGTGCGGATTTTCATGGGTTGC
CGGGAGTGGATATTCGTCCTGATACCGGAGCGGGCGTATTACGCATCAATATGCCGCAGGCCTGGCTTGAGTATTCTGAT
GCCACCTGGCTGCCTCCCTCACGCTGGGACGACGGCATTCCCGGACTGATGCTGGATTATAACCTCAACGGGACGGTTTC
CCGTAATTATCAGGGAGGAGACTCTCATCAGTTCAGTTATAACGGGACTGTGGGGGGGAATCTGGGGCCCTGGCGCCTGC
GGGCTGACTATCAGGGAAGCCAGGAGCAGAGCCGCTACAACGGGGAAAAAACGACAAACAGAAATTTCACATGGAGTCGC
TTTTATCTGTTCCGTGCCATTCCACGATGGCGGGCAAACCTGACGCTGGGCGAGAATAATATCAACTCAGATATATTCCG
GTCATGGAGTTATACGGGAGCCAGCCTGGAAAGCGATGACCGGATGCTGCCACCCAGACTGCGAGGCTATGCACCGCAGA
TTACCGGGATTGCGGAGACTAATGCCCGTGTTGTGGTGTCGCAGCAGGGACGGGTGCTGTACGACTCGATGGTCCCCGCA
GGGCCATTCAGTATTCAGGACCTGGACAGTTCAGTTCGCGGACGTCTTGATGTTGAGGTTATTGAACAGAACGGACGGAA
GAAAACCTTTCAGGTCGATACGGCCTCGGTTCCTTATCTGACGCGTCCGGGACAGGTCCGGTACAAACTTGTCTCCGGTC
GCTCCCGCGGATACGGGCATGAGACCGAAGGGCCTGTATTTGCAACCGGAGAGGCGTCCTGGGGGCTCAGTAACCAGTGG
TCGCTGTATGGCGGGGCTGTGCTTGCCGGTGATTATAATGCACTGGCAGCCGGTGCCGGCTGGGACCTGGGTGTGCCGGG
GACCCTTTCCGCTGATATCACGCAGTCAGTAGCCCGTATTGAGGGAGAGAGAACGTTTCAGGGAAAATCCTGGCGTCTTA
GCTACTCCAAACGGTTTGATAATGCGGATGCCGACATTACGTTCGCCGGGTATCGTTTCTCAGAGCGAAACTATATGACC
ATGGAGCAGTACCTGAACGCCCGCTACCGTAATGATTACAGCAGTCGGGAAAAAGAGATGTATACCGTTACGCTGAATAA
AAACGTGGCGGACTGGAACACCTCTTTTAACCTGCAGTACAGCCGTCAGACATACTGGGACATACGGAAAACGGACTATT
ATACGGTGAGCGTCAACCGCTACTTTAATGTTTTCGGACTGCAGGGTGTGCGGTTGGATTGTCAGCCTCAAGGTCTAAAT
ATCTGGGGCGTGATAACGATTCTGCTTACCTGCGTATATCCGTGCCGCTGGGGACGGGGACAGCGAGCTACAGTGGCAGT
ATGA

Protein sequence :
MRGMKDRIPFAVNNITCVILLSLFCNAASAVEFNTDVLDAADKKNIDFTRFSEAGYVLPGQYLLDVIVNGQSISPASLQI
SFVEPQSSGDKAEKKLPQACLTSDMVRLMGLTAESLDKVVYWHDGQCADFHGLPGVDIRPDTGAGVLRINMPQAWLEYSD
ATWLPPSRWDDGIPGLMLDYNLNGTVSRNYQGGDSHQFSYNGTVGGNLGPWRLRADYQGSQEQSRYNGEKTTNRNFTWSR
FYLFRAIPRWRANLTLGENNINSDIFRSWSYTGASLESDDRMLPPRLRGYAPQITGIAETNARVVVSQQGRVLYDSMVPA
GPFSIQDLDSSVRGRLDVEVIEQNGRKKTFQVDTASVPYLTRPGQVRYKLVSGRSRGYGHETEGPVFATGEASWGLSNQW
SLYGGAVLAGDYNALAAGAGWDLGVPGTLSADITQSVARIEGERTFQGKSWRLSYSKRFDNADADITFAGYRFSERNYMT
MEQYLNARYRNDYSSREKEMYTVTLNKNVADWNTSFNLQYSRQTYWDIRKTDYYTVSVNRYFNVFGLQGVRLDCQPQGLN
IWGVITILLTCVYPCRWGRGQRATVAV