PAI Gene Information


Name : eae (ECO26_5280)
Accession : YP_003232162.1
PAI name : LEE
PAI accession : NC_013361_P1
Strain : Escherichia coli O26:H11 str. 11368
Virulence or Resistance : Not determined
Product : intimin
Function : -
Note : Integrative element ECO26_IE08
Homologs in the searched genomes : 12 hits (12 protein-level)
Publication :
    -Hattori,M., Toh,H., Oshima,K., Yamashita,A., Hayashi,T., Ogura,Y. and Ooka,T., "Direct Submission", Submitted (21-DEC-2008) Contact: Masahira Hattori, University of Tokyo, Graduate School of Frontier Sciences; 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8562, Japan. URL: http://www.cb.k.u-tokyo.ac.jp/hattorilab/

    -Ogura,Y., Ooka,T., Iguchi,A., Toh,H., Asadulghani,M., Oshima,K., Kodama,T., Abe,H., Nakayama,K., Kurokawa,K., Tobe,T., Hattori,M. and Hayashi,T., "Comparative genomics reveal the mechanism of the parallel evolution of O157 and non-O157 enterohemorrhagic Escherichia coli", Proc. Natl. Acad. Sci. U.S.A. 106 (42), 17939-17944 (2009) PUBMED 19815525.

    -Ogura,Y., Ooka,T., Iguchi,A., Toh,H., Asadulghani,M., Oshima,K., Kodama,T., Abe,H., Nakayama,K., Kurokawa,K., Tobe,T., Hattori,M. and Hayashi,T., "Direct Submission", Submitted (07-OCT-2009) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.


DNA sequence :
ATGATTACTCATGGTTTTTATGCCCGGACCCGGCACAAGCATAAGCTAAAAAAAACATTTATTATGCTTAGTGCTGGTTT
AGGATTGTTTTTTTATGTTAACCAGAATTCATTTGCAAATGGTGAAAATTATTTTAAATTGAGTTCAGATTCAAAACTGT
TAACTCAAAATGCCGCTCAGGATCGCCTTTTTTATACGTTAAAAACAGGTGAAACTGTTGCCAATATTTCTAAATCACAG
GGTATCAGTTTATCGGTAATTTGGTCACTGAATAAACATTTATACAGTTCCGAAAGCGAAATGATGAAGGCTGGACCTGG
TCAGCAGATCATTTTGCCACTCAAAAAACTGTCTGTTGAATATAGTGCCTTACCTGTCTTAGGTTCGGCACCTGTTGTTG
CTGCAGGTGGTGTCGCTGGTCATACGAATAAAATGACTAAAATGTCCCCGGACGCGACTAAAAGCAACACGACCGATGAC
AAGGCTCTAAATTATGCGGCACAACAGGCCGCGAGCCTTGGTAGCCAGCTCCAGTCGCGCTCACTGAACGGCGATTACGC
GAAAGATACCGCTCTTGGTATGGCCAGCAGCCAGGCTTCGTCACAGTTGCAGGCCTGGTTACAACATTATGGAACGGCAG
AGGTTAATCTGCAGAGTGGTAATAACTTTGACGGTAGTTCACTGGACTTCTTATTACCGTTCTATGATTCCGAAAACATG
CTGGCATTTGGTCAGGTCGGTGCGCGTTACATTGACTCCCGCTTTACGGCAAATTTAGGTGCTGGCCAGCGTTTTTTCCT
TCCTGAAAATATGTTGGGCTATAACGTCTTCATTGATCAGGATTTTTCTGGTGATAATACCCGTTTAGGTATTGGTGGCG
AATACTGGCGAGACTATTTCAAAAGTAGCGTTAACGGCTATTTCCGCATGAGCGGCTGGCATGAGTCATACAATAAGAAA
GACTATGATGAGCGCCCGGCAAATGGTTTTGATATCCGCTTTAATGGCTATTTACCATCATATCCGGCATTAGGCGCCAA
ACTGATGTACGAACAGTATTATGGTGATAATGTTGCTTTGTTTAATTCCGATAAGTTGCAGTCGAATCCTGGCGCGGCGA
CCGTTGGTGTAAACTACACTCCGATTCCTCTGGTGACGATGGGGATCGATTACCGTCATGGTACGGGTAATGAAAATGAT
CTCCTTTACTCAATGCAGTTCCGTTATCAGTTTGATAAACCGTGGTCTCAGCAAATCGAGCCACAGTATGTTAACGAGTT
AAGAACATTATCGGGCAGCCGTTACGATCTGGTTCAGCGTAATAACAATATTATTCTGGAGTACAAAAAGCAGGATATTC
TTTCTCTGAATATTCCGCATGATATTAATGGTACTGAACACAGTACGCAGAAGATTCAATTGATCGTTAAGAGCAAATAC
GGTCTGGATCGTATCGTCTGGGATGATAGCGCATTACGCAGTCAGGGCGGTCAGATTCAGCATGGCGGAAGCCAAAGCGC
ACAAGACTACCAGGCTATTTTGCCTGCTTATGTGCAAGGCGGCAGCAATATTTATAAAGTGACCGCTCGCGCCTATGACC
GAAATGGTAATAGTTCTAATAATGTACAGCTCACTATTACCGTTTTACCGAATGGGCAGGTTGTGGACCAGGTTGGGGTA
ACGGACTTTACGGCTGATAAAACATCGGCTAAAGCGGATGGCATAGAAGCTATTACCTATACCGCGACGGTTAAAAAGAA
TGGTGTAGCTCAGGCTAATGTCCCTGTAACATTTAGTATTGTATCCGGGACTGCAACTCTTGGGGCAAATAGTGCCAGAA
CGGATGGTAACGGTAAGGCGACCGTAACGCTGAAGTCGGCTACGCCAGGACAGGTCGTCGTGTCTGCTAAAACCGCGGAG
ATGACTTCGCCACTTAATGCCAGCGCGGTTATATTTGTTGATCAAACCAAGGCCAGTATTACTGAGATTAAGGCTGATAA
AACAACAGCGAAGGCAGATGGTTCTGATGCGATTACCTATACTGTCAGAGTGATGAAGGAGGGGGCACCCGTAGTAGATC
AGAAAGTGACCTTTTCTAAGGATTTTGGGACCCTGAATAAGACTGAAGCAACAACCGATCAGAATGGTTATGCTACTGTA
AAATTATCATCCAATACTCCTGGCAAGGCCATTGTTAGTGCAAAAGTGAGTGGAGTAGGTACAGAAGTTAAGGCTACTAC
CGTTGAGTTTTTTGCCCCGTTGAGTATTGATGGTGATAAAGTGACCGTAATTGGTACTGGTATCACGGGGGCTCTGCCAA
AGAACTGGTTACAGTATGGTCAGGTTAAGCTACAGGCAACAGGGGGCAATGGAAAATACACATGGAAATCCAGTAATACT
AAAATTGCTTCTGTTGATAACTCGGGAGTGATAACCTTAAATGAAAAAGGGAGTGCCACAATTACTGTAGTATCTGGTGA
TAATCAGAGTGCGACATACACAATTAATGCACCGGGTAGTATTGTAATTGCTGTGGATAAAAATACTCGAGTTACGTATT
TTGATGCCGAAAACAAATGTAAGACAAATAGCGCAAATTTAGCACAGTCAAAAGAACTATTGGCCAATATCTATTCAACA
TGGGGTGCTGCAAATAAATATCCTTACTATTCTGGTTCTAAATCATTGACTGCTTGGATTAAACAATCCTCTTCTGAACA
GTCATCAGGTGTATCAAGCACATATGATTTGGTTACGAAGAACCAGTTGATCAATGTTGGAGTAAACAATAAGAATGCTT
TTTCTGTTTGTGTAAAATAA

Protein sequence :
MITHGFYARTRHKHKLKKTFIMLSAGLGLFFYVNQNSFANGENYFKLSSDSKLLTQNAAQDRLFYTLKTGETVANISKSQ
GISLSVIWSLNKHLYSSESEMMKAGPGQQIILPLKKLSVEYSALPVLGSAPVVAAGGVAGHTNKMTKMSPDATKSNTTDD
KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGMASSQASSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSENM
LAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKK
DYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNEND
LLYSMQFRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTEHSTQKIQLIVKSKY
GLDRIVWDDSALRSQGGQIQHGGSQSAQDYQAILPAYVQGGSNIYKVTARAYDRNGNSSNNVQLTITVLPNGQVVDQVGV
TDFTADKTSAKADGIEAITYTATVKKNGVAQANVPVTFSIVSGTATLGANSARTDGNGKATVTLKSATPGQVVVSAKTAE
MTSPLNASAVIFVDQTKASITEIKADKTTAKADGSDAITYTVRVMKEGAPVVDQKVTFSKDFGTLNKTEATTDQNGYATV
KLSSNTPGKAIVSAKVSGVGTEVKATTVEFFAPLSIDGDKVTVIGTGITGALPKNWLQYGQVKLQATGGNGKYTWKSSNT
KIASVDNSGVITLNEKGSATITVVSGDNQSATYTINAPGSIVIAVDKNTRVTYFDAENKCKTNSANLAQSKELLANIYST
WGAANKYPYYSGSKSLTAWIKQSSSEQSSGVSSTYDLVTKNQLINVGVNNKNAFSVCVK
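
The DNA and protein sequences above can be cross-checked by translating the coding sequence in frame. The following is a minimal sketch, not part of the original record: it assumes the standard codon-to-amino-acid assignments (the record does not state a translation table), and the function name `translate` is a hypothetical helper chosen for illustration.

```python
# Compact standard codon table: codons are enumerated with bases in the
# order T, C, A, G at each of the three positions. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped sequence
    block from the record can be pasted in directly. Hypothetical helper;
    assumes only the letters A, C, G, T are present.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk the sequence codon by codon; a trailing partial codon is ignored.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 27 nucleotides of the DNA sequence in this record.
print(translate("ATGATTACTCATGGTTTTTATGCCCGG"))  # MITHGFYAR
```

Passing the full DNA sequence block to `translate` and comparing the result with the protein sequence block (with line breaks removed) gives a quick consistency check of the entry.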