PAI Gene Information


Name : eae (Z5110)
Accession : NP_290259.1
PAI name : LEE
PAI accession : NC_002655_P5
Strain : Escherichia coli O157:H7 str. EDL933
Virulence or Resistance: Virulence
Product : intimin adherence protein
Function : -
Note : -
Homologs in the searched genomes : 12 hits (12 protein-level)
Publication :
    -Perna,N.T., Plunkett,G. III, Burland,V., Mau,B., Glasner,J.D., Rose,D.J., Mayhew,G.F., Evans,P.S., Gregor,J., Kirkpatrick,H.A., Posfai,G., Hackett,J., Klink,S., Boutin,A., Shao,Y., Miller,L., Grotbeck,E.J., Davis,N.W., Lim,A., Dimalanta,E., Potamousis,K., "Genome sequence of enterohaemorrhagic Escherichia coli O157:H7", Nature 409 (6819), 529-533 (2001) PUBMED 11206551.

    -Perna,N.T., Plunkett,G. III, Burland,V., Mau,B., Glasner,J.D., Rose,D.J., Mayhew,G.F., Evans,P.S., Gregor,J., Kirkpatrick,H.A., Posfai,G., Hackett,J., Klink,S., Boutin,A., Shao,Y., Miller,L., Grotbeck,E.J., Davis,N.W., Lim,A., Dimalanta,E., Potamousis,K., "Direct Submission", Submitted (28-SEP-2001) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Perna,N.T., Plunkett,G. III, Burland,V., Mau,B., Glasner,J.D., Rose,D.J., Mayhew,G.F., Evans,P.S., Gregor,J., Kirkpatrick,H.A., Posfai,G., Hackett,J., Klink,S., Boutin,A., Shao,Y., Miller,L., Grotbeck,E.J., Davis,N.W., Lim,A., Dimalanta,E., Potamousis,K., "Direct Submission", Submitted (22-OCT-2000) Laboratory of Genetics, University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA.


DNA sequence :
ATGATTACTCATGGTTGTTATACCCGGACCCGGCACAAGCATAAGCTAAAAAAAACATTGATTATGCTTAGTGCTGGTTT
AGGATTGTTTTTTTATGTTAATCAGAATTCATTTGCAAATGGTGAAAATTATTTTAAATTGGGTTCGGATTCAAAACTGT
TAACTCATGATAGCTATCAGAATCGCCTTTTTTATACGTTGAAAACTGGTGAAACTGTTGCCGATCTTTCTAAATCGCAA
GATATTAATTTATCGACGATTTGGTCGTTGAATAAGCATTTATACAGTTCTGAAAGCGAAATGATGAAGGCCGCGCCTGG
TCAGCAGATCATTTTGCCACTCAAAAAACTTCCCTTTGAATACAGTGCACTACCACTTTTAGGTTCGGCACCTCTTGTTG
CTGCAGGTGGTGTTGCTGGTCACACGAATAAACTGACTAAAATGTCCCCGGACGTGACCAAAAGCAACATGACCGATGAC
AAGGCATTAAATTATGCGGCACAACAGGCGGCGAGTCTCGGTAGCCAGCTTCAGTCGCGATCTCTGAACGGCGATTACGC
GAAAGATACCGCTCTTGGTATCGCTGGTAACCAGGCTTCGTCACAGTTGCAGGCCTGGTTACAACATTATGGAACGGCAG
AGGTTAATCTGCAGAGTGGTAATAACTTTGACGGTAGTTCACTGGACTTCTTATTACCGTTCTATGATTCCGAAAAAATG
CTGGCATTTGGTCAGGTCGGAGCGCGTTACATTGACTCCCGCTTTACGGCAAATTTAGGTGCGGGTCAGCGTTTTTTCCT
TCCTGCAAACATGTTGGGCTATAACGTCTTCATTGATCAGGATTTTTCTGGTGATAATACCCGTTTAGGTATTGGTGGCG
AATACTGGCGAGACTATTTCAAAAGTAGCGTTAACGGCTATTTCCGCATGAGCGGCTGGCATGAGTCATACAATAAGAAA
GACTATGATGAGCGCCCAGCAAATGGCTTCGATATCCGTTTTAATGGCTATCTACCGTCATATCCGGCATTAGGCGCCAA
GCTGATATATGAGCAGTATTATGGTGATAATGTTGCTTTGTTTAATTCTGATAAGCTGCAGTCGAATCCTGGTGCGGCGA
CCGTTGGTGTAAACTATACTCCGATTCCTCTGGTGACGATGGGGATCGATTACCGTCATGGTACGGGTAATGAAAATGAT
CTCCTTTACTCAATGCAGTTCCGTTATCAGTTTGATAAATCGTGGTCTCAGCAAATTGAACCACAGTATGTTAACGAGTT
AAGAACATTATCAGGCAGCCGTTACGATCTGGTTCAGCGTAATAACAATATTATTCTGGAGTACAAGAAGCAGGATATTC
TTTCTCTGAATATTCCGCATGATATTAATGGTACTGAACACAGTACGCAGAAGATTCAGTTGATCGTTAAGAGCAAATAC
GGTCTGGATCGTATCGTCTGGGATGATAGTGCATTACGCAGTCAGGGCGGTCAGATTCAGCATAGCGGAAGCCAAAGCGC
ACAAGACTACCAGGCTATTTTGCCTGCTTATGTGCAAGGTGGCAGCAATATTTATAAAGTGACGGCTCGCGCCTATGACC
GTAATGGCAATAGCTCTAACAATGTACAGCTTACTATTACCGTTCTGTCGAATGGTCAAGTTGTCGACCAGGTTGGGGTA
ACGGACTTTACGGCGGATAAGACTTCGGCTAAAGCGGATAACGCCGATACCATTACTTATACCGCGACGGTGAAAAAGAA
TGGGGTAGCTCAGGCTAATGTCCCTGTTTCATTTAATATTGTTTCAGGAACTGCAACTCTTGGGGCAAATAGTGCCAAAA
CGGATGCTAACGGTAAGGCAACCGTAACGTTGAAGTCGAGTACGCCAGGACAGGTCGTCGTGTCTGCTAAAACCGCGGAG
ATGACTTCAGCACTTAATGCCAGTGCGGTTATATTTTTTGATCAAACCAAGGCCAGCATTACTGAGATTAAGGCTGATAA
GACAACTGCAGTAGCAAATGGTAAGGATGCTATTAAATATACTGTAAAAGTTATGAAAAACGGTCAGCCAGTTAATAATC
AATCCGTTACATTCTCAACAAACTTTGGGATGTTCAACGGTAAGTCTCAAACGCAAGCAACCACGGGAAATGATGGTCGT
GCGACGATAACACTAACTTCCAGTTCCGCCGGTAAAGCGACTGTTAGTGCGACAGTCAGTGATGGGGCTGAGGTTAAAGC
GACTGAGGTCACTTTTTTTGATGAACTGAAAATTGACAACAAGGTTGATATTATTGGTAACAATGTCAGAGGCGAGTTGC
CTAATATTTGGCTGCAATATGGTCAGTTTAAACTGAAAGCAAGCGGTGGTGATGGTACATATTCATGGTATTCAGAAAAT
ACCAGTATCGCGACTGTCGATGCATCAGGGAAAGTCACTTTGAATGGTAAAGGCAGTGTCGTAATTAAAGCCACATCTGG
TGATAAGCAAACAGTAAGTTACACTATAAAAGCACCGTCGTATATGATAAAAGTGGATAAGCAAGCCTATTATGCTGATG
CTATGTCCATTTGCAAAAATTTATTACCATCCACACAGACGGTATTGTCAGATATTTATGACTCATGGGGGGCTGCAAAT
AAATATAGCCATTATAGTTCTATGAACTCAATAACTGCTTGGATTAAACAGACATCTAGTGAGCAGCGTTCTGGAGTATC
AAGCACTTATAACCTAATAACACAAAACCCTCTTCCTGGGGTTAATGTTAATACTCCAAATGTCTATGCGGTTTGTGTAG
AATAA

Protein sequence :
MITHGCYTRTRHKHKLKKTLIMLSAGLGLFFYVNQNSFANGENYFKLGSDSKLLTHDSYQNRLFYTLKTGETVADLSKSQ
DINLSTIWSLNKHLYSSESEMMKAAPGQQIILPLKKLPFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDD
KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSEKM
LAFGQVGARYIDSRFTANLGAGQRFFLPANMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKK
DYDERPANGFDIRFNGYLPSYPALGAKLIYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNEND
LLYSMQFRYQFDKSWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTEHSTQKIQLIVKSKY
GLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNIYKVTARAYDRNGNSSNNVQLTITVLSNGQVVDQVGV
TDFTADKTSAKADNADTITYTATVKKNGVAQANVPVSFNIVSGTATLGANSAKTDANGKATVTLKSSTPGQVVVSAKTAE
MTSALNASAVIFFDQTKASITEIKADKTTAVANGKDAIKYTVKVMKNGQPVNNQSVTFSTNFGMFNGKSQTQATTGNDGR
ATITLTSSSAGKATVSATVSDGAEVKATEVTFFDELKIDNKVDIIGNNVRGELPNIWLQYGQFKLKASGGDGTYSWYSEN
TSIATVDASGKVTLNGKGSVVIKATSGDKQTVSYTIKAPSYMIKVDKQAYYADAMSICKNLLPSTQTVLSDIYDSWGAAN
KYSHYSSMNSITAWIKQTSSEQRSGVSSTYNLITQNPLPGVNVNTPNVYAVCVE
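
Consistency check (illustrative) :
The DNA sequence above is 2805 nt long (935 codons including the stop codon) and the protein sequence is 934 residues, so the two should correspond codon for codon. The following is a minimal Python sketch of how that could be checked. It assumes the standard genetic code for the nuclear-encoded positions used here, and the names CODON_TABLE and translate are hypothetical helpers written for this example, not part of any database tool. Only the first 80-nt line and the last nine bases of the record are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the record's CDS uses this table for every codon shown here.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first; trailing bases that do
    not form a complete codon are ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First line of the DNA sequence in this record (80 nt, 26 complete codons).
first_line = (
    "ATGATTACTCATGGTTGTTATACCCGGACCCGGCACAAGCATAAGCTAAAAAAAACATTG"
    "ATTATGCTTAGTGCTGGTTT"
)
# Last nine bases of the DNA sequence in this record, ending in the stop codon.
last_codons = "GTAGAATAA"

print(translate(first_line))
print(translate(last_codons))
```

Pasting the full DNA sequence into translate() and comparing the result with the protein sequence (after removing line breaks) extends the same check to the whole record.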