Name : unnamed
Accession : AAL67386.1
PAI name : PAI II CFT073
PAI accession : AF447814
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : R6-like protein
Function : -
Note : ORF53; similar to Escherichia coli R6 encoded by GenBank Accession Number AF081285
Homologs in the searched genomes : 196 hits ( 194 protein-level, 2 DNA-level )
Publication :
-Rasko,D.A., Phillips,J.A., Li,X. and Mobley,H.L., "Identification of DNA sequences from a second pathogenicity island of uropathogenic Escherichia coli CFT073: probes specific for uropathogenic populations", J. Infect. Dis. 184 (8), 1041-1049 (2001) PUBMED 11574920.
-Rasko,D.A., Phillips,J.A., Li,X. and Mobley,H.L.T., "Direct Submission", Submitted (14-NOV-2001) Dept of Microbiology and Immunology, University of Maryland School of Medicine, 655 W. Baltimore Street, Baltimore, MD 21201, USA.
DNA sequence : | |
ATGACTCAGCGTAAGAAAGGTATAACTCAGCATATCTCGGCCATGAAGGCTGGTATCTCAGTCCGTTCTGGTCGTCGGAT
CGAAAAAGGAGAGTGGGCAAAAAACAGTGTTCGGCACTGGCGCACACGCAAAGATCCTCTGGAAGCTGTGTGGGACAGCA
TGCTTGTTCCTCTGTTGAAAGAGAGGCCGGCTCTGACACCAACAACTCTGCTGGAGATGCTACAGGATAAATATCCCGGC
CAGTACCCCAACAGCCTTCGAAGAACAATGCAACGGCGGGTCCGCGAATGGAAGCTACAGTATGGTGCAGAGCAGGAGGT
CATGTTCCGCCAGCGACATCAGCCCGGTCTGCGAGGTCTGTCGGACTTTACTGAACTGAAAGGTGTAGTTGTCACCATCG
CCGGTAAGTTGTTGGCGCATAAGTTGTATCACTTCCGTCTGGAATGGAGCCACTGGAGCTGGATGCGGGTTGTGCTGGGT
GGTGAGAGCTTCTCTGCTCTGGCTGAAGGTCTGCAGGAAGCCCTCGGACAACTGGGCGGAGTGCCGGTAGAACATAAAAC
GGACAGCCTGAGGGCAGCATGGAAACAACAGGGCGAAGATGGACGCCGCGAGCTGACTGAGCGTTATGCTGCTCTCTGTC
AGCACTACGGAATGCAGGGCGTACACAATAATGCCGGTCGGGGCCACGAAAATGGCTCGGTTGAAAGTGCCCACGGACAT
CTGAAAAGGCGTATCTGTCAGGCGCTGATACTGCGGGGCAGTAACGACTTCAGCACCATAGAAGAATATCAGGCCTTCAT
CACTCAGCAGGTTATGCGGCACAACCGTAACAATCAGGATCTGGTCAAGGAAGAACGTCTTCATCTGAAACCGCTGCCGC
TTCGTCGCAGTGCTGACTATGATGAGCTGACTGTGAGGGTTAGCCGCAGCAGTACCATCAATGTGAAGCACGTCGTCTAC
AGCGTACCTTCCCGGCTTGTAGGTCAACTGTTACGGGTCCGGTTATGGGACGATCGTCTGAGCTGTTACGTTGGCAGCAG
CGAGGTCATGAGCTGCCCACGTGTCAGACCAGAAAAAGGGAAGACGCGGGCCCGTCGTATCGACTTCCGACATGTGATCG
ACAGTCTGGCAAAAAAGCCCGGTGCGTTCTGCCATGCAACGCTGAGAAATGACATCCTGCCAGACGATGAATGGCGGAGG
CTGTGGCGTCGCTTATGTAATCATCTGGAACCCGACATGGCAGGCAGGCTGATGGTACATGCTCTGAAACTGGCTGCAGG
ATACGACGATATCTCAGTCGTGGCAAAAGGTATGGAGCAGATGCTGAATACCCCGGGAAACGTGGATCTGCACCGGCTGA
TGCGCTTCCTGGGTATAAAGGAAAAGGCGTTGCCGGTAGTCAATGTGAAACAGCATAACCTGAGCAGTTATGAGCAACTA
CTGCGTGGCAAGGGAGGTTCGCAGTGA
|
Protein sequence : | |
MTQRKKGITQHISAMKAGISVRSGRRIEKGEWAKNSVRHWRTRKDPLEAVWDSMLVPLLKERPALTPTTLLEMLQDKYPG
QYPNSLRRTMQRRVREWKLQYGAEQEVMFRQRHQPGLRGLSDFTELKGVVVTIAGKLLAHKLYHFRLEWSHWSWMRVVLG
GESFSALAEGLQEALGQLGGVPVEHKTDSLRAAWKQQGEDGRRELTERYAALCQHYGMQGVHNNAGRGHENGSVESAHGH
LKRRICQALILRGSNDFSTIEEYQAFITQQVMRHNRNNQDLVKEERLHLKPLPLRRSADYDELTVRVSRSSTINVKHVVY
SVPSRLVGQLLRVRLWDDRLSCYVGSSEVMSCPRVRPEKGKTRARRIDFRHVIDSLAKKPGAFCHATLRNDILPDDEWRR
LWRRLCNHLEPDMAGRLMVHALKLAAGYDDISVVAKGMEQMLNTPGNVDLHRLMRFLGIKEKALPVVNVKQHNLSSYEQL
LRGKGGSQ
|
|