Name : unnamed
Accession : AAB61290.1
PAI name : PAI I CFT073
PAI accession : AF003741
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : unknown
Function : -
Note : first ORF in the pathogenicity island
Homologs in the searched genomes : 361 hits ( 361 protein-level )
Publication :
-Kao,J.-S., Stucker,D.M., Warren,J.W. and Mobley,H.L.T., "Pathogenicity Island Sequenced of Pyelonephritogenic Escherichia coli CFT073 Are Associated with Virulent Uropathogenic Strains", Infect. Immun. 65 (1997) In press.
-Kao,J.-S., Stucker,D.M., Warren,J.W. and Mobley,H.L.T., "Direct Submission", Submitted (12-MAY-1997) Medicine, University of Maryland at Baltimore, 10 S Pine St MSTF Rm 900, Baltimore, MD 21201, USA.
DNA sequence : | |
ATGCTGGCGGGGGTTGATGGCGTCGGCGGTATCCCGTTTGATAATTACCCCTTCGCCTACATGGTAAGTAACCTGGCGCT
GGCGATTATTTTGCTCGACGGCGGGATGCGTACTCAGGCCAGCTCCTTTTGTGTGGCGTTAGGACCGGCACTGTCGCTGG
CGACGCTGGGCGTGCTTATCACCTCTGGTTTAACCGGCATGATGGCGGGGTGGCTGTTTAATCTTGATTTGATTGAAGGA
TTATTAATCGGCGCTATCGTCGGCTCCACCGATGCTGCAGCGGTCTTTTTTTTGCTGGGTGGTAAGGGGCTTAACGAACG
TGTTGGTTCGACGCTGGAAATTGAATCCGGCAGTAATGATCCAATGGCGGTCTTTCTGACGATTACCCTGATTGCGATGA
TCCAGCAACATGAAAGCAGTGTCAGTTGGATGTTCGTGGTCGATATTCTGCAGCAATTTGGCCTCGGGATTGTCATCGGG
CTTGGCGGTGGTTATTTACTGCTGCAAATGATTAATCGCATCGCCCTGCCCGCCGGATTATATCCATTGCTGGCATTAAG
CGGCGGTATTTTAATTTTTGCTTTAACCACGGCGCTGGAAGGCAGCGGTATTCTGGCTGTTTATCTGTGCGGTTTTCTGC
TGGGTAATCGGCCGATTCGCAACCGCTACGGCATCCTGCAAAATTTCGACGGCCTCGCCTGGCTGGCGCAAATCGCCATG
TTCCTGGTGCTGGGGCTATTGGTTAACCCAAGCGATCTGCTGCCCATTGCCATTCCGGCGCTCATTTTGTCCGCATGGAT
GATTTTCTTCGCCCGTCCTCTTTCGGTATTTGCCGGATTGCTACCGTTCCGTGGCTTCAATCTGCGTGAGCGCGTGTTTA
TCAGCTGGGTAGGATTACGCGGCGCGGTGCCGATTATTCTGGCTGTGTTCCCGATGATGGCGGGGCTGGAGAATGCGCGA
CTGTTCTTTAATGTCGCCTTCTTTGTGGTCCTGGTTTCACTGCTATTGCAGGGAACATCACTCTCGTGGGCGGCGAAAAA
AGCCAAAGTGGTCGTTCCGCCAGTGGGACGTCCGGTGTCACGCGTTGGCCTAGATATTCATCCGGAAAATCCGTGGGAGC
AGTTTGTTTATCAATTGAGTGCCGATAAATGGTGCGTGGGCGCGGCACTGCGTGATTTGCATATGCCAAAAGAGACGCGT
ATTGCGGCACTGTTTCGTGATAACCAGTTGCTTCATCCCACCGGCAGCACCCGACTGCGCGAAGGCGATGTGTTGTGTGT
AATTGGTCGGGAACGCGATCTCCCGGCGCTCGGTAAACTGTTCAGCCAGTCGCCGCCGGTCGCGCTGGATCAACGCTTCT
TTGGTGACTTCATTCTCGAAGCCAGCGCCAAATATGCTGATGTGGCGCTGATATATGGTCTGGAAGACGGGCGAGAATAT
CGCGATAAGCAGCAAACGCTGGGTGAAATCGTCCAGCAGTTGTTAGGCGCAGCACCGGTTGTCGGTGACCAGGTAGAGTT
TGCCGGGATGATCTGGACGGTGGCCGAGAAAGAAGATAATGAAGTGTTGAAGATTGGTGTTCGGGTAGCGGAGGAAGAAG
CCGAATCTTAA
|
Protein sequence : | |
MLAGVDGVGGIPFDNYPFAYMVSNLALAIILLDGGMRTQASSFCVALGPALSLATLGVLITSGLTGMMAGWLFNLDLIEG
LLIGAIVGSTDAAAVFFLLGGKGLNERVGSTLEIESGSNDPMAVFLTITLIAMIQQHESSVSWMFVVDILQQFGLGIVIG
LGGGYLLLQMINRIALPAGLYPLLALSGGILIFALTTALEGSGILAVYLCGFLLGNRPIRNRYGILQNFDGLAWLAQIAM
FLVLGLLVNPSDLLPIAIPALILSAWMIFFARPLSVFAGLLPFRGFNLRERVFISWVGLRGAVPIILAVFPMMAGLENAR
LFFNVAFFVVLVSLLLQGTSLSWAAKKAKVVVPPVGRPVSRVGLDIHPENPWEQFVYQLSADKWCVGAALRDLHMPKETR
IAALFRDNQLLHPTGSTRLREGDVLCVIGRERDLPALGKLFSQSPPVALDQRFFGDFILEASAKYADVALIYGLEDGREY
RDKQQTLGEIVQQLLGAAPVVGDQVEFAGMIWTVAEKEDNEVLKIGVRVAEEEAES
|
|