PAI Gene Information


Name : unnamed
Accession : AAL67386.1
PAI name : PAI II CFT073
PAI accession : AF447814
Strain : Escherichia coli CFT073
Virulence or Resistance : Not determined
Product : R6-like protein
Function : -
Note : ORF53; similar to Escherichia coli R6 encoded by GenBank Accession Number AF081285
Homologs in the searched genomes : 196 hits (194 protein-level, 2 DNA-level)
Publication :
    -Rasko,D.A., Phillips,J.A., Li,X. and Mobley,H.L., "Identification of DNA sequences from a second pathogenicity island of uropathogenic Escherichia coli CFT073: probes specific for uropathogenic populations", J. Infect. Dis. 184 (8), 1041-1049 (2001) PUBMED 11574920.

    -Rasko,D.A., Phillips,J.A., Li,X. and Mobley,H.L.T., "Direct Submission", Submitted (14-NOV-2001) Dept of Microbiology and Immunology, University of Maryland School of Medicine, 655 W. Baltimore Street, Baltimore, MD 21201, USA.


DNA sequence :
ATGACTCAGCGTAAGAAAGGTATAACTCAGCATATCTCGGCCATGAAGGCTGGTATCTCAGTCCGTTCTGGTCGTCGGAT
CGAAAAAGGAGAGTGGGCAAAAAACAGTGTTCGGCACTGGCGCACACGCAAAGATCCTCTGGAAGCTGTGTGGGACAGCA
TGCTTGTTCCTCTGTTGAAAGAGAGGCCGGCTCTGACACCAACAACTCTGCTGGAGATGCTACAGGATAAATATCCCGGC
CAGTACCCCAACAGCCTTCGAAGAACAATGCAACGGCGGGTCCGCGAATGGAAGCTACAGTATGGTGCAGAGCAGGAGGT
CATGTTCCGCCAGCGACATCAGCCCGGTCTGCGAGGTCTGTCGGACTTTACTGAACTGAAAGGTGTAGTTGTCACCATCG
CCGGTAAGTTGTTGGCGCATAAGTTGTATCACTTCCGTCTGGAATGGAGCCACTGGAGCTGGATGCGGGTTGTGCTGGGT
GGTGAGAGCTTCTCTGCTCTGGCTGAAGGTCTGCAGGAAGCCCTCGGACAACTGGGCGGAGTGCCGGTAGAACATAAAAC
GGACAGCCTGAGGGCAGCATGGAAACAACAGGGCGAAGATGGACGCCGCGAGCTGACTGAGCGTTATGCTGCTCTCTGTC
AGCACTACGGAATGCAGGGCGTACACAATAATGCCGGTCGGGGCCACGAAAATGGCTCGGTTGAAAGTGCCCACGGACAT
CTGAAAAGGCGTATCTGTCAGGCGCTGATACTGCGGGGCAGTAACGACTTCAGCACCATAGAAGAATATCAGGCCTTCAT
CACTCAGCAGGTTATGCGGCACAACCGTAACAATCAGGATCTGGTCAAGGAAGAACGTCTTCATCTGAAACCGCTGCCGC
TTCGTCGCAGTGCTGACTATGATGAGCTGACTGTGAGGGTTAGCCGCAGCAGTACCATCAATGTGAAGCACGTCGTCTAC
AGCGTACCTTCCCGGCTTGTAGGTCAACTGTTACGGGTCCGGTTATGGGACGATCGTCTGAGCTGTTACGTTGGCAGCAG
CGAGGTCATGAGCTGCCCACGTGTCAGACCAGAAAAAGGGAAGACGCGGGCCCGTCGTATCGACTTCCGACATGTGATCG
ACAGTCTGGCAAAAAAGCCCGGTGCGTTCTGCCATGCAACGCTGAGAAATGACATCCTGCCAGACGATGAATGGCGGAGG
CTGTGGCGTCGCTTATGTAATCATCTGGAACCCGACATGGCAGGCAGGCTGATGGTACATGCTCTGAAACTGGCTGCAGG
ATACGACGATATCTCAGTCGTGGCAAAAGGTATGGAGCAGATGCTGAATACCCCGGGAAACGTGGATCTGCACCGGCTGA
TGCGCTTCCTGGGTATAAAGGAAAAGGCGTTGCCGGTAGTCAATGTGAAACAGCATAACCTGAGCAGTTATGAGCAACTA
CTGCGTGGCAAGGGAGGTTCGCAGTGA

Protein sequence :
MTQRKKGITQHISAMKAGISVRSGRRIEKGEWAKNSVRHWRTRKDPLEAVWDSMLVPLLKERPALTPTTLLEMLQDKYPG
QYPNSLRRTMQRRVREWKLQYGAEQEVMFRQRHQPGLRGLSDFTELKGVVVTIAGKLLAHKLYHFRLEWSHWSWMRVVLG
GESFSALAEGLQEALGQLGGVPVEHKTDSLRAAWKQQGEDGRRELTERYAALCQHYGMQGVHNNAGRGHENGSVESAHGH
LKRRICQALILRGSNDFSTIEEYQAFITQQVMRHNRNNQDLVKEERLHLKPLPLRRSADYDELTVRVSRSSTINVKHVVY
SVPSRLVGQLLRVRLWDDRLSCYVGSSEVMSCPRVRPEKGKTRARRIDFRHVIDSLAKKPGAFCHATLRNDILPDDEWRR
LWRRLCNHLEPDMAGRLMVHALKLAAGYDDISVVAKGMEQMLNTPGNVDLHRLMRFLGIKEKALPVVNVKQHNLSSYEQL
LRGKGGSQ
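
The DNA and protein sequences above can be cross-checked by translating the coding sequence codon by codon. The following is a minimal sketch in Python, assuming the standard genetic code and a sequence that starts in frame at the first base. The names CODON_TABLE and translate are illustrative and are not part of this record or of any database tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped sequence
    block can be pasted in as-is. Ambiguous bases (e.g. N) are not handled
    and raise KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 48 bases of the DNA sequence in this record
print(translate("ATGACTCAGCGTAAGAAAGGTATAACTCAGCATATCTCGGCCATGAAG"))
# MTQRKKGITQHISAMK

# Last 27 bases, ending in the TGA stop codon
print(translate("CTGCGTGGCAAGGGAGGTTCGCAGTGA"))
# LRGKGGSQ
```

Passing the full 1467-base DNA sequence to translate should reproduce the 488-residue protein sequence listed above if the two fields are consistent.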