Gene Information

Name : eae (ECO26_5280)
Accession : YP_003232162.1
Strain : Escherichia coli 11368
Genome accession: NC_013361
Putative virulence/resistance : Virulence
Product : intimin
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 5354665 - 5357484 bp
Length : 2820 bp
Strand : +
Note : Integrative element ECO26_IE08

DNA sequence :
ATGATTACTCATGGTTTTTATGCCCGGACCCGGCACAAGCATAAGCTAAAAAAAACATTTATTATGCTTAGTGCTGGTTT
AGGATTGTTTTTTTATGTTAACCAGAATTCATTTGCAAATGGTGAAAATTATTTTAAATTGAGTTCAGATTCAAAACTGT
TAACTCAAAATGCCGCTCAGGATCGCCTTTTTTATACGTTAAAAACAGGTGAAACTGTTGCCAATATTTCTAAATCACAG
GGTATCAGTTTATCGGTAATTTGGTCACTGAATAAACATTTATACAGTTCCGAAAGCGAAATGATGAAGGCTGGACCTGG
TCAGCAGATCATTTTGCCACTCAAAAAACTGTCTGTTGAATATAGTGCCTTACCTGTCTTAGGTTCGGCACCTGTTGTTG
CTGCAGGTGGTGTCGCTGGTCATACGAATAAAATGACTAAAATGTCCCCGGACGCGACTAAAAGCAACACGACCGATGAC
AAGGCTCTAAATTATGCGGCACAACAGGCCGCGAGCCTTGGTAGCCAGCTCCAGTCGCGCTCACTGAACGGCGATTACGC
GAAAGATACCGCTCTTGGTATGGCCAGCAGCCAGGCTTCGTCACAGTTGCAGGCCTGGTTACAACATTATGGAACGGCAG
AGGTTAATCTGCAGAGTGGTAATAACTTTGACGGTAGTTCACTGGACTTCTTATTACCGTTCTATGATTCCGAAAACATG
CTGGCATTTGGTCAGGTCGGTGCGCGTTACATTGACTCCCGCTTTACGGCAAATTTAGGTGCTGGCCAGCGTTTTTTCCT
TCCTGAAAATATGTTGGGCTATAACGTCTTCATTGATCAGGATTTTTCTGGTGATAATACCCGTTTAGGTATTGGTGGCG
AATACTGGCGAGACTATTTCAAAAGTAGCGTTAACGGCTATTTCCGCATGAGCGGCTGGCATGAGTCATACAATAAGAAA
GACTATGATGAGCGCCCGGCAAATGGTTTTGATATCCGCTTTAATGGCTATTTACCATCATATCCGGCATTAGGCGCCAA
ACTGATGTACGAACAGTATTATGGTGATAATGTTGCTTTGTTTAATTCCGATAAGTTGCAGTCGAATCCTGGCGCGGCGA
CCGTTGGTGTAAACTACACTCCGATTCCTCTGGTGACGATGGGGATCGATTACCGTCATGGTACGGGTAATGAAAATGAT
CTCCTTTACTCAATGCAGTTCCGTTATCAGTTTGATAAACCGTGGTCTCAGCAAATCGAGCCACAGTATGTTAACGAGTT
AAGAACATTATCGGGCAGCCGTTACGATCTGGTTCAGCGTAATAACAATATTATTCTGGAGTACAAAAAGCAGGATATTC
TTTCTCTGAATATTCCGCATGATATTAATGGTACTGAACACAGTACGCAGAAGATTCAATTGATCGTTAAGAGCAAATAC
GGTCTGGATCGTATCGTCTGGGATGATAGCGCATTACGCAGTCAGGGCGGTCAGATTCAGCATGGCGGAAGCCAAAGCGC
ACAAGACTACCAGGCTATTTTGCCTGCTTATGTGCAAGGCGGCAGCAATATTTATAAAGTGACCGCTCGCGCCTATGACC
GAAATGGTAATAGTTCTAATAATGTACAGCTCACTATTACCGTTTTACCGAATGGGCAGGTTGTGGACCAGGTTGGGGTA
ACGGACTTTACGGCTGATAAAACATCGGCTAAAGCGGATGGCATAGAAGCTATTACCTATACCGCGACGGTTAAAAAGAA
TGGTGTAGCTCAGGCTAATGTCCCTGTAACATTTAGTATTGTATCCGGGACTGCAACTCTTGGGGCAAATAGTGCCAGAA
CGGATGGTAACGGTAAGGCGACCGTAACGCTGAAGTCGGCTACGCCAGGACAGGTCGTCGTGTCTGCTAAAACCGCGGAG
ATGACTTCGCCACTTAATGCCAGCGCGGTTATATTTGTTGATCAAACCAAGGCCAGTATTACTGAGATTAAGGCTGATAA
AACAACAGCGAAGGCAGATGGTTCTGATGCGATTACCTATACTGTCAGAGTGATGAAGGAGGGGGCACCCGTAGTAGATC
AGAAAGTGACCTTTTCTAAGGATTTTGGGACCCTGAATAAGACTGAAGCAACAACCGATCAGAATGGTTATGCTACTGTA
AAATTATCATCCAATACTCCTGGCAAGGCCATTGTTAGTGCAAAAGTGAGTGGAGTAGGTACAGAAGTTAAGGCTACTAC
CGTTGAGTTTTTTGCCCCGTTGAGTATTGATGGTGATAAAGTGACCGTAATTGGTACTGGTATCACGGGGGCTCTGCCAA
AGAACTGGTTACAGTATGGTCAGGTTAAGCTACAGGCAACAGGGGGCAATGGAAAATACACATGGAAATCCAGTAATACT
AAAATTGCTTCTGTTGATAACTCGGGAGTGATAACCTTAAATGAAAAAGGGAGTGCCACAATTACTGTAGTATCTGGTGA
TAATCAGAGTGCGACATACACAATTAATGCACCGGGTAGTATTGTAATTGCTGTGGATAAAAATACTCGAGTTACGTATT
TTGATGCCGAAAACAAATGTAAGACAAATAGCGCAAATTTAGCACAGTCAAAAGAACTATTGGCCAATATCTATTCAACA
TGGGGTGCTGCAAATAAATATCCTTACTATTCTGGTTCTAAATCATTGACTGCTTGGATTAAACAATCCTCTTCTGAACA
GTCATCAGGTGTATCAAGCACATATGATTTGGTTACGAAGAACCAGTTGATCAATGTTGGAGTAAACAATAAGAATGCTT
TTTCTGTTTGTGTAAAATAA

Protein sequence :
MITHGFYARTRHKHKLKKTFIMLSAGLGLFFYVNQNSFANGENYFKLSSDSKLLTQNAAQDRLFYTLKTGETVANISKSQ
GISLSVIWSLNKHLYSSESEMMKAGPGQQIILPLKKLSVEYSALPVLGSAPVVAAGGVAGHTNKMTKMSPDATKSNTTDD
KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGMASSQASSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSENM
LAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKK
DYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNEND
LLYSMQFRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTEHSTQKIQLIVKSKY
GLDRIVWDDSALRSQGGQIQHGGSQSAQDYQAILPAYVQGGSNIYKVTARAYDRNGNSSNNVQLTITVLPNGQVVDQVGV
TDFTADKTSAKADGIEAITYTATVKKNGVAQANVPVTFSIVSGTATLGANSARTDGNGKATVTLKSATPGQVVVSAKTAE
MTSPLNASAVIFVDQTKASITEIKADKTTAKADGSDAITYTVRVMKEGAPVVDQKVTFSKDFGTLNKTEATTDQNGYATV
KLSSNTPGKAIVSAKVSGVGTEVKATTVEFFAPLSIDGDKVTVIGTGITGALPKNWLQYGQVKLQATGGNGKYTWKSSNT
KIASVDNSGVITLNEKGSATITVVSGDNQSATYTINAPGSIVIAVDKNTRVTYFDAENKCKTNSANLAQSKELLANIYST
WGAANKYPYYSGSKSLTAWIKQSSSEQSSGVSSTYDLVTKNQLINVGVNNKNAFSVCVK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
eae AAL57551.1 Eae Virulence LEE Protein 0.0 100
eae YP_003232162.1 intimin Not tested LEE Protein 0.0 100
eae AAK26724.1 intimin Virulence LEE Protein 0.0 100
eae CAC81871.1 Intimin Not tested LEE II Protein 0.0 99
unnamed AAL06378.1 intimin Virulence LEE Protein 0.0 87
eae YP_003223466.1 intimin epsilon Not tested LEE Protein 0.0 85
eae CAI43865.1 intimin epsilon Virulence LEE Protein 0.0 85
eaeA AAC31504.1 L0025 Virulence LEE Protein 0.0 83
unnamed ACU09449.1 gamma intimin Virulence LEE Protein 0.0 83
eae NP_290259.1 intimin adherence protein Virulence LEE Protein 0.0 83
ECs4559 NP_312586.1 gamma intimin Virulence LEE Protein 0.0 83
eae YP_003236079.1 theta intimin Not tested LEE Protein 0.0 83
eae AAC38392.1 intimin Virulence LEE Protein 0.0 83

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
eae YP_003232162.1 intimin VFG0803 Protein 0.0 83
eae YP_003232162.1 intimin VFG0739 Protein 0.0 83