PAI Gene Information


Name : unnamed
Accession : CAD33771.1
PAI name : PAI I 536
PAI accession : AJ488511
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : putative reverse transcriptase
Function : -
Note : ORF61
Homologs in the searched genomes :   158 hits    ( 157 protein-level,   1 DNA-level )  
Publication :
    -Dobrindt,U., "Direct Submission", Submitted (27-MAY-2002) Dobrindt U., Inst. f. Molekulare Infektionsbiologie, Universitaet Wuerzburg, Roentgenring 11, 97070 Wuerzburg, GERMANY.

    -Dobrindt,U., Blum-Oehler,G., Nagy,G., Schneider,G., Johann,A., Gottschalk,G. and Hacker,J., "Genetic structure and distribution of four pathogenicity islands (PAI I(536) to PAI IV(536)) of uropathogenic Escherichia coli strain 536", Infect. Immun. 70 (11), 6365-6372 (2002) PUBMED 12379716.


DNA sequence :
ATGCAACGAAAATCATTTGAAATACCAAAAGCACTGGTCTGGGCATCATATCTGGATGTGCGTCGCAACAAGGGAGCCCC
CGGATGCGACGGGCAGACTTTGAAAATGTTCGACCAGCAACGTGATGGCAATCTATACAAGATCTGGAACCGGCTATGTT
CGGGAACATGGTTTCCGCCACCAGTGCTGGAAAAACGGATCCCGAAGTCGAATGGCAAGGAGCGCATTCTGGGGATCCCA
ACAGTATCCGATCGAATTGCTCAGGGAGCGATAAAACTTTTCATGGAAGAAAAGCTTGACCCGATTTTCCATGCTGATTC
ATATGGTTACCGACCAGGCAAATCAGCTCATGACGCTCTGAAACAATGTGCCATCCGGTGCTGGCGTTATAGCTGGATAC
TGGAAGTGGACATCAGTGCGTTTTTCGACCATGTGAGACATGATCTGGTGCTCAAAGCACTGGAACACCACGGGATGCCT
AAATGGGCCATCCTGTACTGTCGACGCTGGATGGAAGCGCCAATGCAGAGTTGTGAAAATGGAGAATTAATAACCCGAAC
ACGAGGCACTCCGCAGGGCGGGGTCATTAGTCCGCTACTTGCTAACTTGTTCCTTCACTATGCATTTGATTTGTGGATGG
AAAGAGAATATCGGGGGGTACCGTTTGAGAGGTACGCTGATGATATTGTAGTACATTGTTCACGAATGAGTGATGCAACA
AGGCTGAAGAACAGGTTATCGGAGCGCTTCTCGGAAGTCGGGCTGGTCCTGAATGCAGGGAAAACGAATATCGCCTACAT
TGACACGTTTAAAAGGAGAAACGTCGCAACGAGTTTCACCTTTCTCGGATATGACTTCAAAGTGCGTACGCTGAAGAATT
TCAAAGGCGAACTGTACCGAAAATGCATGCCGGGTGCGTCAAATGCAGCAATGCGCAAAATAACAGAAACAATCAAAAAG
TGGCGTATACATCGCTCAACAGCTGAGAGTTTGCTGGATTTTGCGAGGCGCTACAATGCGATAGTGAGAGGCTGGATCGG
GTACTACGCAAAGTTCTGGTCCAGAAATTTCAACTATCGACTGTGGAGTGCAATGCAGTCACGTCTGCTCAAGTGGATGC
AGTCTAAATACAGACTTTCGAACCGGAAGGCTCAGCGAAAGCTGACGCTGGTAAGGAAAGAGTATCCGAAGCTATTTGTC
CACTGGTATTTACTACGTGCATCGAATGAGTGA

Protein sequence :
MQRKSFEIPKALVWASYLDVRRNKGAPGCDGQTLKMFDQQRDGNLYKIWNRLCSGTWFPPPVLEKRIPKSNGKERILGIP
TVSDRIAQGAIKLFMEEKLDPIFHADSYGYRPGKSAHDALKQCAIRCWRYSWILEVDISAFFDHVRHDLVLKALEHHGMP
KWAILYCRRWMEAPMQSCENGELITRTRGTPQGGVISPLLANLFLHYAFDLWMEREYRGVPFERYADDIVVHCSRMSDAT
RLKNRLSERFSEVGLVLNAGKTNIAYIDTFKRRNVATSFTFLGYDFKVRTLKNFKGELYRKCMPGASNAAMRKITETIKK
WRIHRSTAESLLDFARRYNAIVRGWIGYYAKFWSRNFNYRLWSAMQSRLLKWMQSKYRLSNRKAQRKLTLVRKEYPKLFV
HWYLLRASNE