Gene Information

Name : eae (ECO111_3743)
Accession : YP_003236079.1
Strain : Escherichia coli 11128
Genome accession : NC_013364
Putative virulence/resistance : Virulence
Product : theta intimin
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3728030 - 3730837 bp
Length : 2808 bp
Strand : -
Note : Integrative element ECO111_IE05
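
The Position and Length fields agree if the coordinates are read as inclusive: 3730837 - 3728030 + 1 = 2808 bp, which divides evenly into 936 codons. A minimal Python sketch of that check follows; the function name `feature_length` is hypothetical and not part of the source database.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given inclusive start/end coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(3728030, 3730837)

# A protein-coding gene should contain a whole number of codons.
codons, remainder = divmod(length_bp, 3)
print(length_bp, codons, remainder)  # 2808 936 0
```

The 936 codons comprise 935 amino acids plus one stop codon, consistent with the 935-residue protein sequence listed in this record.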

DNA sequence :
ATGATTACTCATGGTTTTTATGCCCGGACCCGGCACAAGCATAAGCTAAAAAAAACATTTATTATGCTTAGCGCTGGTTT
AGGATTGTTTTTTTATGTTAACCAGAACTCATTTGCAAACGGTGAAAATTATTTTAAATTGAGTTCAGATTCAAAACTGT
TAACTCAAAATGTTGCTCAGGATCGCCTTTTTTATACGTTGAAAACAGGTGAAACTGTTTCCAGTATTTCTAAATCACAA
GGTATCAGTTTATCCGTAATTTGGTCACTGAATAAACATTTATACAGTTCCGAAAGCGAAATGCTGAAGGCTGCGCCTGG
CCAGCAGATCATTTTGCCACTCAAAAAACTGTCTGTTGAATATGGTGCCTTACCTGTCTTAGGTTCGGCACCTGTTGTTG
CTGCAGGTGGTGTCGCTGGGCATACAAATAAAATGACTAAAATGTCCCCGGACGCGACTCAAAGCAACATGACTGATGAC
AAGGCTCTAAATTATACGGCACAACAGGCCGCGAGCCTTGGTAGCCAGCTTCAGTCGCGCTCTCTGCACGGCGATTACGC
GAAAGATACCGCTCTTGGTATCGCGGGTAACCAGGCTTCGTCACAGTTGCAGGCCTGGTTACAACATTATGGAACGGCAG
AGGTTAATCTGCAGAGTGGTAATAACTTTGACGGTAGTTCACTGGATTTCTTATTACCGTTCTATGATTCCGAAAAAATG
CTGGCATTTGGTCAGGTCGGAGCGCGTTACATTGACTCCCGCTTTACGGCAAATTTAGGTGCGGGTCAGCGTTTTTTCCT
TCCTGAAAACATGTTGGGCTATAACGTCTTCATTGATCAGGATTTTTCTGGTGATAATACCCGTTTAGGTATTGGTGGCG
AATACTGGCGAGACTATTTCAAAAGTAGCGTTAACGGCTATTTCCGCATGAGCGGCTGGCATGAGTCATACAATAAGAAA
GACTATGATGAGCGCCCAGCAAATGGCTTCGATATCCGCTTTAATGGCTATCTACCATCATACCCGGCATTAGGCGCCAA
GCTGATGTATGAGCAGTATTATGGTGATAATGTTGCTTTGTTTAATTCCGATAAGCTGCAGTCGAATCCTGGTGCGGCGA
CCGTTGGTGTAAACTATACTCCGATTCCTCTGGTGACGATGGGGATCGATTACCGTCATGGTACGGGTAATGAAAATGAT
CTTCTTTACTCAATGCAGCTTCGTTATCAGTTTGATAAACCGTGGTCTCAGCAAATTGAGCCACAGTATGTTAACGAGTT
AAGAACATTATCAGGCAGCCGTTACGATCTGGTTCAGCGTAATAACAATATTATTCTGGAGTACAAGAAGCAGGATATTC
TTTCTCTGAATATTCCGCATGATATTAATGGTACTGAACACAGTACGCAGAAGATTCAATTGATCGTTAAGAGCAAATAC
GGTCTGGATCGTATCGTCTGGGATGATAGTGCATTACGTAGCCAGGGCGGTCAGATTCAGCATAGCGGAAGCCAAAGCGC
ACAAGATTACCAGGCTATTTTGCCGGCTTATGTGCAAGGCGGTAGCAATATTTATAAAGTGACGGCTCGTGCCTATGACC
GTAATGGCAATAGCTCTAACAATGTACAGCTCACTATTACCGTTCTGTCGAATGGTCAGGTGGTCGACCAGGTTGGGGTA
ACGGACTTTACGGCTGATAAGACTTCGGCTAAAGCGGATGGCACCGAGGCGATTACTTATACCGCGACGGTGAAAAAGAA
TGGGGTAACTCAGGCTAATGTCCCTGTTTCATTTAATATTGTTTCAGGAACTGCAACTCTTGGGGCAAATAGTGCCACAA
CGGATGCTAACGGTAAGGCAACTGTAACGTTGAAGTCGAGTACGCCAGGGCAGGTAGTCGTGTCTGCTAAAACCGCGGAG
ATGACTTCAGCACTTAATGCCAGTGCGGTTATATTTGTTGAGCAAACCAAGGCCAGTATTACTGAGATTAAGGCTGATAA
GACAACTGCAGTAGCAAATGGTAATGATGCTGTTACATACACTGTTAAAGTGATGAAAGAGGGTCAGCCAGTGCAGGGAC
ACTCCGTTGCATTCACAACAAACTTTGGGATGTTCAACGGTAAGTCTCAGACGCAAAATGCGACCACGGGAAGTGATGGT
CGTGCGACGATAACACTGACTTCCAGTTCCGCAGGTAAAGCGACTGTTAGTGCGACTGTTAGTGGTGGGAATGATGTTAA
AGCACCTGAGGTTACATTTTTTGATGGACTGAAAATTGACAACAAGGTTGATATTCTTGGTAAGAACGTTACTGGTGACT
TACCTAATATCTGGTTGCAATATGGTCAGTTTAAACTGAAGGTAAGCGGTGGTAATGGTACATATTCATGGCATTCAGAG
AATACCAATATTGCGACTGTTGATGAATCAGGGAAAGTAACCTTGAAAGGAAAAGGTACTGCAGTAATTAATGTTACATC
TGGTGATAAGCAAACAGTAAGCTACACTATTAAAGCTCCGAATTATATGATAAGAGTGGGTAATAAAGCCAGTTATGCAA
ATGCTATGTCCTTTTGTGGAAATTTATTACCATCCTCACAGACGGTATTATCAAACGTTTATAATTCATGGGGGCCTGCA
AACGGATATGACCATTATCGTTCTATGCAGTCAATAACAGCTTGGATTACACAAACTGAAGCTGATAAAATATCAGGAGT
ATCAACTACTTATGACTTAATAACACAAAACCCTCATAAGGATGTTACGCTAAACGCTCCAAATGTCTATGCAGTTTGTG
TAGAATAA

Protein sequence :
MITHGFYARTRHKHKLKKTFIMLSAGLGLFFYVNQNSFANGENYFKLSSDSKLLTQNVAQDRLFYTLKTGETVSSISKSQ
GISLSVIWSLNKHLYSSESEMLKAAPGQQIILPLKKLSVEYGALPVLGSAPVVAAGGVAGHTNKMTKMSPDATQSNMTDD
KALNYTAQQAASLGSQLQSRSLHGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSEKM
LAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKK
DYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNEND
LLYSMQLRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTEHSTQKIQLIVKSKY
GLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNIYKVTARAYDRNGNSSNNVQLTITVLSNGQVVDQVGV
TDFTADKTSAKADGTEAITYTATVKKNGVTQANVPVSFNIVSGTATLGANSATTDANGKATVTLKSSTPGQVVVSAKTAE
MTSALNASAVIFVEQTKASITEIKADKTTAVANGNDAVTYTVKVMKEGQPVQGHSVAFTTNFGMFNGKSQTQNATTGSDG
RATITLTSSSAGKATVSATVSGGNDVKAPEVTFFDGLKIDNKVDILGKNVTGDLPNIWLQYGQFKLKVSGGNGTYSWHSE
NTNIATVDESGKVTLKGKGTAVINVTSGDKQTVSYTIKAPNYMIRVGNKASYANAMSFCGNLLPSSQTVLSNVYNSWGPA
NGYDHYRSMQSITAWITQTEADKISGVSTTYDLITQNPHKDVTLNAPNVYAVCVE
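
The protein sequence above appears to be the codon-by-codon translation of the DNA sequence as listed, which already starts with ATG and ends with TAA. The sketch below checks this on the first five and last three codons of the record. It assumes the standard genetic code, since the record does not state a translation table, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 15 nt and last 9 nt of the DNA sequence in this record.
print(translate("ATGATTACTCATGGT"))  # MITHG
print(translate("GTAGAATAA"))        # VE*
```

Both fragments match the start (MITHG) and end (VE) of the listed protein sequence.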

• Homologs from PAI DB

Gene     GenBank Accn    Product                    Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity (%)
eae      YP_003236079.1  theta intimin              Not tested               LEE         Protein         0.0    100
eae      NP_290259.1     intimin adherence protein  Virulence                LEE         Protein         0.0    90
ECs4559  NP_312586.1     gamma intimin              Virulence                LEE         Protein         0.0    90
eaeA     AAC31504.1      L0025                      Virulence                LEE         Protein         0.0    90
unnamed  ACU09449.1      gamma intimin              Virulence                LEE         Protein         0.0    90
eae      AAL57551.1      Eae                        Virulence                LEE         Protein         0.0    83
eae      CAC81871.1      Intimin                    Not tested               LEE II      Protein         0.0    83
eae      YP_003232162.1  intimin                    Not tested               LEE         Protein         0.0    83
eae      AAK26724.1      intimin                    Virulence                LEE         Protein         0.0    83
eae      CAI43865.1      intimin epsilon            Virulence                LEE         Protein         0.0    83
eae      YP_003223466.1  intimin epsilon            Not tested               LEE         Protein         0.0    83
eae      AAC38392.1      intimin                    Virulence                LEE         Protein         0.0    82
unnamed  AAL06378.1      intimin                    Virulence                LEE         Protein         0.0    77
eae      AFO66294.1      intimin-like protein       Not tested               SESS LEE    Protein         0.0    48
eae      AFO66392.1      intimin-like protein       Virulence                SESS LEE    Protein         0.0    47

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn    Product        ID of source DB  Alignment Type  E-val  Identity (%)
eae   YP_003236079.1  theta intimin  VFG0803          Protein         0.0    90
eae   YP_003236079.1  theta intimin  VFG0739          Protein         0.0    82