Name : eae (ECO111_3743) Accession : YP_003236079.1 Strain : Escherichia coli 11128 Genome accession: NC_013364 Putative virulence/resistance : Virulence Product : theta intimin Function : - COG functional category : - COG ID : - EC number : - Position : 3728030 - 3730837 bp Length : 2808 bp Strand : - Note : Integrative element ECO111_IE05 DNA sequence : ATGATTACTCATGGTTTTTATGCCCGGACCCGGCACAAGCATAAGCTAAAAAAAACATTTATTATGCTTAGCGCTGGTTT AGGATTGTTTTTTTATGTTAACCAGAACTCATTTGCAAACGGTGAAAATTATTTTAAATTGAGTTCAGATTCAAAACTGT TAACTCAAAATGTTGCTCAGGATCGCCTTTTTTATACGTTGAAAACAGGTGAAACTGTTTCCAGTATTTCTAAATCACAA GGTATCAGTTTATCCGTAATTTGGTCACTGAATAAACATTTATACAGTTCCGAAAGCGAAATGCTGAAGGCTGCGCCTGG CCAGCAGATCATTTTGCCACTCAAAAAACTGTCTGTTGAATATGGTGCCTTACCTGTCTTAGGTTCGGCACCTGTTGTTG CTGCAGGTGGTGTCGCTGGGCATACAAATAAAATGACTAAAATGTCCCCGGACGCGACTCAAAGCAACATGACTGATGAC AAGGCTCTAAATTATACGGCACAACAGGCCGCGAGCCTTGGTAGCCAGCTTCAGTCGCGCTCTCTGCACGGCGATTACGC GAAAGATACCGCTCTTGGTATCGCGGGTAACCAGGCTTCGTCACAGTTGCAGGCCTGGTTACAACATTATGGAACGGCAG AGGTTAATCTGCAGAGTGGTAATAACTTTGACGGTAGTTCACTGGATTTCTTATTACCGTTCTATGATTCCGAAAAAATG CTGGCATTTGGTCAGGTCGGAGCGCGTTACATTGACTCCCGCTTTACGGCAAATTTAGGTGCGGGTCAGCGTTTTTTCCT TCCTGAAAACATGTTGGGCTATAACGTCTTCATTGATCAGGATTTTTCTGGTGATAATACCCGTTTAGGTATTGGTGGCG AATACTGGCGAGACTATTTCAAAAGTAGCGTTAACGGCTATTTCCGCATGAGCGGCTGGCATGAGTCATACAATAAGAAA GACTATGATGAGCGCCCAGCAAATGGCTTCGATATCCGCTTTAATGGCTATCTACCATCATACCCGGCATTAGGCGCCAA GCTGATGTATGAGCAGTATTATGGTGATAATGTTGCTTTGTTTAATTCCGATAAGCTGCAGTCGAATCCTGGTGCGGCGA CCGTTGGTGTAAACTATACTCCGATTCCTCTGGTGACGATGGGGATCGATTACCGTCATGGTACGGGTAATGAAAATGAT CTTCTTTACTCAATGCAGCTTCGTTATCAGTTTGATAAACCGTGGTCTCAGCAAATTGAGCCACAGTATGTTAACGAGTT AAGAACATTATCAGGCAGCCGTTACGATCTGGTTCAGCGTAATAACAATATTATTCTGGAGTACAAGAAGCAGGATATTC TTTCTCTGAATATTCCGCATGATATTAATGGTACTGAACACAGTACGCAGAAGATTCAATTGATCGTTAAGAGCAAATAC GGTCTGGATCGTATCGTCTGGGATGATAGTGCATTACGTAGCCAGGGCGGTCAGATTCAGCATAGCGGAAGCCAAAGCGC ACAAGATTACCAGGCTATTTTGCCGGCTTATGTGCAAGGCGGTAGCAATATTTATAAAGTGACGGCTCGTGCCTATGACC GTAATGGCAATAGCTCTAACAATGTACAGCTCACTATTACCGTTCTGTCGAATGGTCAGGTGGTCGACCAGGTTGGGGTA ACGGACTTTACGGCTGATAAGACTTCGGCTAAAGCGGATGGCACCGAGGCGATTACTTATACCGCGACGGTGAAAAAGAA TGGGGTAACTCAGGCTAATGTCCCTGTTTCATTTAATATTGTTTCAGGAACTGCAACTCTTGGGGCAAATAGTGCCACAA CGGATGCTAACGGTAAGGCAACTGTAACGTTGAAGTCGAGTACGCCAGGGCAGGTAGTCGTGTCTGCTAAAACCGCGGAG ATGACTTCAGCACTTAATGCCAGTGCGGTTATATTTGTTGAGCAAACCAAGGCCAGTATTACTGAGATTAAGGCTGATAA GACAACTGCAGTAGCAAATGGTAATGATGCTGTTACATACACTGTTAAAGTGATGAAAGAGGGTCAGCCAGTGCAGGGAC ACTCCGTTGCATTCACAACAAACTTTGGGATGTTCAACGGTAAGTCTCAGACGCAAAATGCGACCACGGGAAGTGATGGT CGTGCGACGATAACACTGACTTCCAGTTCCGCAGGTAAAGCGACTGTTAGTGCGACTGTTAGTGGTGGGAATGATGTTAA AGCACCTGAGGTTACATTTTTTGATGGACTGAAAATTGACAACAAGGTTGATATTCTTGGTAAGAACGTTACTGGTGACT TACCTAATATCTGGTTGCAATATGGTCAGTTTAAACTGAAGGTAAGCGGTGGTAATGGTACATATTCATGGCATTCAGAG AATACCAATATTGCGACTGTTGATGAATCAGGGAAAGTAACCTTGAAAGGAAAAGGTACTGCAGTAATTAATGTTACATC TGGTGATAAGCAAACAGTAAGCTACACTATTAAAGCTCCGAATTATATGATAAGAGTGGGTAATAAAGCCAGTTATGCAA ATGCTATGTCCTTTTGTGGAAATTTATTACCATCCTCACAGACGGTATTATCAAACGTTTATAATTCATGGGGGCCTGCA AACGGATATGACCATTATCGTTCTATGCAGTCAATAACAGCTTGGATTACACAAACTGAAGCTGATAAAATATCAGGAGT ATCAACTACTTATGACTTAATAACACAAAACCCTCATAAGGATGTTACGCTAAACGCTCCAAATGTCTATGCAGTTTGTG TAGAATAA Protein sequence : MITHGFYARTRHKHKLKKTFIMLSAGLGLFFYVNQNSFANGENYFKLSSDSKLLTQNVAQDRLFYTLKTGETVSSISKSQ GISLSVIWSLNKHLYSSESEMLKAAPGQQIILPLKKLSVEYGALPVLGSAPVVAAGGVAGHTNKMTKMSPDATQSNMTDD KALNYTAQQAASLGSQLQSRSLHGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSEKM LAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKK DYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNEND LLYSMQLRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTEHSTQKIQLIVKSKY GLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNIYKVTARAYDRNGNSSNNVQLTITVLSNGQVVDQVGV TDFTADKTSAKADGTEAITYTATVKKNGVTQANVPVSFNIVSGTATLGANSATTDANGKATVTLKSSTPGQVVVSAKTAE MTSALNASAVIFVEQTKASITEIKADKTTAVANGNDAVTYTVKVMKEGQPVQGHSVAFTTNFGMFNGKSQTQNATTGSDG RATITLTSSSAGKATVSATVSGGNDVKAPEVTFFDGLKIDNKVDILGKNVTGDLPNIWLQYGQFKLKVSGGNGTYSWHSE NTNIATVDESGKVTLKGKGTAVINVTSGDKQTVSYTIKAPNYMIRVGNKASYANAMSFCGNLLPSSQTVLSNVYNSWGPA NGYDHYRSMQSITAWITQTEADKISGVSTTYDLITQNPHKDVTLNAPNVYAVCVE |
Gene | GenBank Accn | Product | Virulance or Resistance | PAI or REI | Alignment Type | E-val | Identity |
eae | YP_003236079.1 | theta intimin | Not tested | LEE | Protein | 0.0 | 100 |
eae | NP_290259.1 | intimin adherence protein | Virulence | LEE | Protein | 0.0 | 90 |
ECs4559 | NP_312586.1 | gamma intimin | Virulence | LEE | Protein | 0.0 | 90 |
eaeA | AAC31504.1 | L0025 | Virulence | LEE | Protein | 0.0 | 90 |
unnamed | ACU09449.1 | gamma intimin | Virulence | LEE | Protein | 0.0 | 90 |
eae | AAL57551.1 | Eae | Virulence | LEE | Protein | 0.0 | 83 |
eae | CAC81871.1 | Intimin | Not tested | LEE II | Protein | 0.0 | 83 |
eae | YP_003232162.1 | intimin | Not tested | LEE | Protein | 0.0 | 83 |
eae | AAK26724.1 | intimin | Virulence | LEE | Protein | 0.0 | 83 |
eae | CAI43865.1 | intimin epsilon | Virulence | LEE | Protein | 0.0 | 83 |
eae | YP_003223466.1 | intimin epsilon | Not tested | LEE | Protein | 0.0 | 83 |
eae | AAC38392.1 | intimin | Virulence | LEE | Protein | 0.0 | 82 |
unnamed | AAL06378.1 | intimin | Virulence | LEE | Protein | 0.0 | 77 |
eae | AFO66294.1 | intimin-like protein | Not tested | SESS LEE | Protein | 0.0 | 48 |
eae | AFO66392.1 | intimin-like protein | Virulence | SESS LEE | Protein | 0.0 | 47 |
Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity |
eae | YP_003236079.1 | theta intimin | VFG0803 | Protein | 0.0 | 90 |
eae | YP_003236079.1 | theta intimin | VFG0739 | Protein | 0.0 | 82 |