PAI Gene Information


Name : unnamed
Accession : AAB61290.1
PAI name : PAI I CFT073
PAI accession : AF003741
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : unknown
Function : -
Note : first ORF in the pathogenicity island
Homologs in the searched genomes :   361 hits    ( 361 protein-level )  
Publication :
    -Kao,J.-S., Stucker,D.M., Warren,J.W. and Mobley,H.L.T., "Pathogenicity Island Sequenced of Pyelonephritogenic Escherichia coli CFT073 Are Associated with Virulent Uropathogenic Strains", Infect. Immun. 65 (1997) In press.

    -Kao,J.-S., Stucker,D.M., Warren,J.W. and Mobley,H.L.T., "Direct Submission", Submitted (12-MAY-1997) Medicine, University of Maryland at Baltimore, 10 S Pine St MSTF Rm 900, Baltimore, MD 21201, USA.


DNA sequence :
ATGCTGGCGGGGGTTGATGGCGTCGGCGGTATCCCGTTTGATAATTACCCCTTCGCCTACATGGTAAGTAACCTGGCGCT
GGCGATTATTTTGCTCGACGGCGGGATGCGTACTCAGGCCAGCTCCTTTTGTGTGGCGTTAGGACCGGCACTGTCGCTGG
CGACGCTGGGCGTGCTTATCACCTCTGGTTTAACCGGCATGATGGCGGGGTGGCTGTTTAATCTTGATTTGATTGAAGGA
TTATTAATCGGCGCTATCGTCGGCTCCACCGATGCTGCAGCGGTCTTTTTTTTGCTGGGTGGTAAGGGGCTTAACGAACG
TGTTGGTTCGACGCTGGAAATTGAATCCGGCAGTAATGATCCAATGGCGGTCTTTCTGACGATTACCCTGATTGCGATGA
TCCAGCAACATGAAAGCAGTGTCAGTTGGATGTTCGTGGTCGATATTCTGCAGCAATTTGGCCTCGGGATTGTCATCGGG
CTTGGCGGTGGTTATTTACTGCTGCAAATGATTAATCGCATCGCCCTGCCCGCCGGATTATATCCATTGCTGGCATTAAG
CGGCGGTATTTTAATTTTTGCTTTAACCACGGCGCTGGAAGGCAGCGGTATTCTGGCTGTTTATCTGTGCGGTTTTCTGC
TGGGTAATCGGCCGATTCGCAACCGCTACGGCATCCTGCAAAATTTCGACGGCCTCGCCTGGCTGGCGCAAATCGCCATG
TTCCTGGTGCTGGGGCTATTGGTTAACCCAAGCGATCTGCTGCCCATTGCCATTCCGGCGCTCATTTTGTCCGCATGGAT
GATTTTCTTCGCCCGTCCTCTTTCGGTATTTGCCGGATTGCTACCGTTCCGTGGCTTCAATCTGCGTGAGCGCGTGTTTA
TCAGCTGGGTAGGATTACGCGGCGCGGTGCCGATTATTCTGGCTGTGTTCCCGATGATGGCGGGGCTGGAGAATGCGCGA
CTGTTCTTTAATGTCGCCTTCTTTGTGGTCCTGGTTTCACTGCTATTGCAGGGAACATCACTCTCGTGGGCGGCGAAAAA
AGCCAAAGTGGTCGTTCCGCCAGTGGGACGTCCGGTGTCACGCGTTGGCCTAGATATTCATCCGGAAAATCCGTGGGAGC
AGTTTGTTTATCAATTGAGTGCCGATAAATGGTGCGTGGGCGCGGCACTGCGTGATTTGCATATGCCAAAAGAGACGCGT
ATTGCGGCACTGTTTCGTGATAACCAGTTGCTTCATCCCACCGGCAGCACCCGACTGCGCGAAGGCGATGTGTTGTGTGT
AATTGGTCGGGAACGCGATCTCCCGGCGCTCGGTAAACTGTTCAGCCAGTCGCCGCCGGTCGCGCTGGATCAACGCTTCT
TTGGTGACTTCATTCTCGAAGCCAGCGCCAAATATGCTGATGTGGCGCTGATATATGGTCTGGAAGACGGGCGAGAATAT
CGCGATAAGCAGCAAACGCTGGGTGAAATCGTCCAGCAGTTGTTAGGCGCAGCACCGGTTGTCGGTGACCAGGTAGAGTT
TGCCGGGATGATCTGGACGGTGGCCGAGAAAGAAGATAATGAAGTGTTGAAGATTGGTGTTCGGGTAGCGGAGGAAGAAG
CCGAATCTTAA

Protein sequence :
MLAGVDGVGGIPFDNYPFAYMVSNLALAIILLDGGMRTQASSFCVALGPALSLATLGVLITSGLTGMMAGWLFNLDLIEG
LLIGAIVGSTDAAAVFFLLGGKGLNERVGSTLEIESGSNDPMAVFLTITLIAMIQQHESSVSWMFVVDILQQFGLGIVIG
LGGGYLLLQMINRIALPAGLYPLLALSGGILIFALTTALEGSGILAVYLCGFLLGNRPIRNRYGILQNFDGLAWLAQIAM
FLVLGLLVNPSDLLPIAIPALILSAWMIFFARPLSVFAGLLPFRGFNLRERVFISWVGLRGAVPIILAVFPMMAGLENAR
LFFNVAFFVVLVSLLLQGTSLSWAAKKAKVVVPPVGRPVSRVGLDIHPENPWEQFVYQLSADKWCVGAALRDLHMPKETR
IAALFRDNQLLHPTGSTRLREGDVLCVIGRERDLPALGKLFSQSPPVALDQRFFGDFILEASAKYADVALIYGLEDGREY
RDKQQTLGEIVQQLLGAAPVVGDQVEFAGMIWTVAEKEDNEVLKIGVRVAEEEAES