Name : eae (ECO103_3609) Accession : YP_003223466.1 Strain : Escherichia coli 12009 Genome accession: NC_013353 Putative virulence/resistance : Virulence Product : intimin epsilon Function : - COG functional category : N : Cell motility COG ID : COG5492 EC number : - Position : 3686375 - 3689221 bp Length : 2847 bp Strand : - Note : Integrative element ECO103_IE03 DNA sequence : ATGATTACTCATGGTTTTTATACCCGGACCCGGCACAAGCATAAGCTAAAAAAAACATTTATTATGCTTAGTGCTGGTTT AGGATTGTTTTTTTATGTTAACCAGAATTCATTTGCAAATGGTGAAAATTATTTTAAATTGAGTTCAGATTCAAAACTGT TAACTCAAAATGCCGCTCAGGATCGCCTTTTTTATACGTTAAAAACAGGTGAAACTGTTGCCAATATTTCTAAATCACAG GGGATCAGTTTATCGGTAATTTGGTCACTGAATAAACATTTATACAGCTCCGAAAGCGAAATGATGAAGGCTGGGCCTGG TCAGCAGATCATTTTGCCACTCAAAAAACTGTCTGTTGAATATAGTGCCTTACCTGTCTTAGGTTCGGCACCTGTTGTTG CTGCAGGTGGTGTCGCTGGTCATACGAATAAAATGACTAAAATGTCCCCGGACGCGACTAAAAGCAACACGACCGATGAC AAGGCTCTAAATTATGCGGCACAACAGGCCGCGAGCCTTGGTAGCCAGCTCCAGTCGCGCTCACTGAACGGCGATTACGC GAAAGATACCGCTCTTGGTATGGCCAGCAGCCAGGCTTCGTCACAGTTGCAGGCCTGGTTACAACATTATGGAACGGCAG AGGTTAATCTGCAGAGTGGTAATAACTTTGACGGTAGTTCACTGGACTTCTTATTACCGTTCTATGATTCCGAAAACATG CTGGCATTTGGTCAGGTCGGGGCGCGTTACATTGACTCCCGCTTTACGGCAAATTTAGGTGCTGGCCAGCGTTTTTTCCT TCCTGAAAATATGTTGGGCTATAACGTCTTCATTGATCAGGATTTTTCTGGTGATAATACCCGTTTAGGTATTGGTGGCG AATACTGGCGAGACTATTTCAAAAGTAGCGTTAACGGCTATTTCCGCATGAGCGGCTGGCATGAGTCATACAATAAGAAA GACTATGATGAGCGCCCGGCAAATGGTTTTGATATCCGCTTTAATGGCTATTTACCATCATATCCGGCATTAGGCGCCAA ACTGATGTACGAACAGTATTATGGTGATAATGTTGCTTTGTTTAATTCCGATAAGTTGCAGTCGAATCCTGGCGCGGCGA CCGTTGGTGTAAACTACACTCCGATTCCTCTGGTGACGATGGGGATCGATTACCGTCATGGTACGGGTAATGAAAATGAT CTCCTTTACTCAATGCAGTTCCGTTATCAGTTTGATAAACCGTGGTCTCAGCAAATCGAGCCACAGTATGTTAACGAGTT AAGAACATTATCGGGCAGCCGTTACGATCTGGTTCAGCGTAATAACAATATTATTCTGGAGTACAAAAAGCAGGATATTC TTTCTCTGAATATTCCGCATGATATTAATGGTACTGAACACAGTACGCAGAAGATTCAATTGATCGTTAAGAGCAAATAT GGTCTGGATCGTATCGTCTGGGATGATAGTGCATTACGCAGTCAGGGCGGTCAGATTCAGCATAGCGGAAGCCAAAGCGC 
ACAAGACTACCAGGCTATTTTGCCTGCTTATGTGCAAGGTGGCAGCAATATTTATAAAGTGACCGCTCGCGCCTATGACC GAAATGGTAATAGTTCTAATAATGTACAGCTCACTATTACCGTTTTACCGAATGGGCAGGTTGTGGACCAGGTTGGGGTA ACGGACTTTACGGCTGATAAGACATCGGCTAAAGCGGATAACGTTGATACCATTACTTATACCGCGACGGTTAAAAAGAA TGGTGTAGCTCAGGCTAATGCCCCTGTAACATTTAGTATTGTATCCGGGACTGCAACTCTCGGGGCAAATAGTGCCAAAA CGGATGGTAACGGTAAGGCAACCGTAACGTTGAAGTCGGGTACGCCAGGGCAGGTCGTCGTGTCTGCTAAAACCGCGGAG ATGACTTCGCCACTTAATGCCAGTGCGGTTATATTTGTTGATCAAACCAAGGCCAGCATTACTGAGATTAAGGCTGATAA AACAACAGCGAAGGCAAATGGTTCTGATGCGATTACCTATATTGTTAAAGTAATGAAGAATAACCAACCAGAAGCAAACC ATTCTGTTACATTCTCAACGAACTTTGGTAATCTGGGGGGAAATTCTAATACCCAAATTGTGAAAACGGATAAAGATGGT AGGGCTACGGTAAAACTGACATCTGGCGTTGCAGGTAATGCTGTTGTTAGTGCAAAAGTCAGCGAAGTTAATACAGAGGT TAAGGCTCCTGAGGTAAAATTCTTCTCAGTTCTGAGCATTGATAGTAATGTGAGTATTATTGGAACCTCCGCTAATGGCG CTTTACCTAATATTTGGTTGCGATATGGTCAGTTTAAGCTGACAGCCAAAGGTGGCGATGGGAAATATCAATGGCGCTCT CAAGATCCAAGTGTTGCATCAGTTGATGCTTTAACTGGTCGAGTTACTTTGCTGAAGAAAGGAACAACAACAATTGAAGT TGTATCGGGTGATAACCAAACAGCAATGTATACAATTAATACACCTACAAAATTTATATCTGTGGAGACACAAAATAAAG TAGTCTATAGTGATGCTGAGGCAACATGTAGAATGAATAATGCACGCTTGCCGTCATCTACGAGTGAGCTAAAGGATGTG TATAATAAATGGGGCGCCGCCAATAGTTATGAAGGCTATAAAGGTAAAAAAACAATAACAGCATGGACACAGCAAACTGA GGATGATAAACAAAAAGGTTGGACTAGTACATTTGACATAGTTACAAAAAATGAAATCCCTAGTAATGGCAGTAATAGTA AAGTCCACGTGAATAAAGCTAACGCTTTTGCCGTCTGTGTAAGATGA Protein sequence : MITHGFYTRTRHKHKLKKTFIMLSAGLGLFFYVNQNSFANGENYFKLSSDSKLLTQNAAQDRLFYTLKTGETVANISKSQ GISLSVIWSLNKHLYSSESEMMKAGPGQQIILPLKKLSVEYSALPVLGSAPVVAAGGVAGHTNKMTKMSPDATKSNTTDD KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGMASSQASSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSENM LAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKK DYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNEND LLYSMQFRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTEHSTQKIQLIVKSKY GLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNIYKVTARAYDRNGNSSNNVQLTITVLPNGQVVDQVGV 
TDFTADKTSAKADNVDTITYTATVKKNGVAQANAPVTFSIVSGTATLGANSAKTDGNGKATVTLKSGTPGQVVVSAKTAE MTSPLNASAVIFVDQTKASITEIKADKTTAKANGSDAITYIVKVMKNNQPEANHSVTFSTNFGNLGGNSNTQIVKTDKDG RATVKLTSGVAGNAVVSAKVSEVNTEVKAPEVKFFSVLSIDSNVSIIGTSANGALPNIWLRYGQFKLTAKGGDGKYQWRS QDPSVASVDALTGRVTLLKKGTTTIEVVSGDNQTAMYTINTPTKFISVETQNKVVYSDAEATCRMNNARLPSSTSELKDV YNKWGAANSYEGYKGKKTITAWTQQTEDDKQKGWTSTFDIVTKNEIPSNGSNSKVHVNKANAFAVCVR |
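The Position, Length and sequence fields of the record above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the source database. It assumes that the coordinates are 1-based and inclusive, and that the DNA sequence includes the terminal stop codon (the listed sequence ends in TGA). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return cds_length_bp // 3 - 1


# Coordinates taken from the Position field of the record.
cds_length = span_length(3686375, 3689221)
print(cds_length)                           # 2847, matching the Length field
print(expected_protein_length(cds_length))  # 948
```

The second value can be compared against the number of residues in the listed protein sequence as a quick sanity check on the record.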
Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity |
eae | CAI43865.1 | intimin epsilon | Virulence | LEE | Protein | 0.0 | 100 |
eae | YP_003223466.1 | intimin epsilon | Not tested | LEE | Protein | 0.0 | 100 |
eae | CAC81871.1 | Intimin | Not tested | LEE II | Protein | 0.0 | 85 |
eae | YP_003232162.1 | intimin | Not tested | LEE | Protein | 0.0 | 85 |
eae | AAK26724.1 | intimin | Virulence | LEE | Protein | 0.0 | 85 |
eae | AAL57551.1 | Eae | Virulence | LEE | Protein | 0.0 | 85 |
eae | YP_003236079.1 | theta intimin | Not tested | LEE | Protein | 0.0 | 83 |
eaeA | AAC31504.1 | L0025 | Virulence | LEE | Protein | 0.0 | 82 |
unnamed | ACU09449.1 | gamma intimin | Virulence | LEE | Protein | 0.0 | 82 |
eae | NP_290259.1 | intimin adherence protein | Virulence | LEE | Protein | 0.0 | 82 |
ECs4559 | NP_312586.1 | gamma intimin | Virulence | LEE | Protein | 0.0 | 82 |
eae | AAC38392.1 | intimin | Virulence | LEE | Protein | 0.0 | 81 |
unnamed | AAL06378.1 | intimin | Virulence | LEE | Protein | 0.0 | 78 |
Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity |
eae | YP_003223466.1 | intimin epsilon | VFG0803 | Protein | 0.0 | 82 |
eae | YP_003223466.1 | intimin epsilon | VFG0739 | Protein | 0.0 | 81 |
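The hit tables above use a simple pipe-delimited layout with a header row, so they can be loaded and filtered with a few lines of code. The sketch below is illustrative only: `parse_pipe_table` and `filter_by_identity` are hypothetical helper names, and the code assumes that every row has the same number of cells as its header and that the Identity column holds a numeric value.

```python
def parse_pipe_table(lines):
    """Parse rows such as 'a | b | c |' into dicts keyed by the header row."""
    rows = [
        [cell.strip() for cell in line.rstrip().rstrip("|").split("|")]
        for line in lines
        if line.strip()
    ]
    header, body = rows[0], rows[1:]
    return [dict(zip(header, row)) for row in body]


def filter_by_identity(records, minimum, column="Identity"):
    """Keep records whose identity value is at least `minimum`."""
    return [r for r in records if float(r[column]) >= minimum]


# Rows copied from the second table of the record.
table = [
    "Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity |",
    "eae | YP_003223466.1 | intimin epsilon | VFG0803 | Protein | 0.0 | 82 |",
    "eae | YP_003223466.1 | intimin epsilon | VFG0739 | Protein | 0.0 | 81 |",
]

hits = filter_by_identity(parse_pipe_table(table), minimum=82)
for hit in hits:
    print(hit["ID of source DB"], hit["Identity"])  # VFG0803 82
```

All cells are kept as strings and only the identity value is converted when filtering, so accessions and E-values are preserved exactly as they appear in the table.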