Gene Information

Name : eae (ECO103_3609)
Accession : YP_003223466.1
Strain : Escherichia coli 12009
Genome accession : NC_013353
Putative virulence/resistance : Virulence
Product : intimin epsilon
Function : -
COG functional category : N : Cell motility
COG ID : COG5492
EC number : -
Position : 3686375 - 3689221 bp
Length : 2847 bp
Strand : -
Note : Integrative element ECO103_IE03
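
Coordinate check : the Position and Length fields above agree if the coordinates are read as 1-based and inclusive. That reading is an assumption, not something the record states. The Python sketch below checks the arithmetic and derives the expected residue count. The function name `feature_length` is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the Position field of this record.
length_bp = feature_length(3686375, 3689221)

# A complete coding sequence is a whole number of codons;
# the final codon is the stop codon and encodes no residue.
codons, remainder = divmod(length_bp, 3)
residues = codons - 1

print(length_bp, codons, remainder, residues)
```

Under that assumption the computed length equals the listed 2847 bp, and the derived count of 948 residues agrees with the protein sequence given in this record.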

DNA sequence :
ATGATTACTCATGGTTTTTATACCCGGACCCGGCACAAGCATAAGCTAAAAAAAACATTTATTATGCTTAGTGCTGGTTT
AGGATTGTTTTTTTATGTTAACCAGAATTCATTTGCAAATGGTGAAAATTATTTTAAATTGAGTTCAGATTCAAAACTGT
TAACTCAAAATGCCGCTCAGGATCGCCTTTTTTATACGTTAAAAACAGGTGAAACTGTTGCCAATATTTCTAAATCACAG
GGGATCAGTTTATCGGTAATTTGGTCACTGAATAAACATTTATACAGCTCCGAAAGCGAAATGATGAAGGCTGGGCCTGG
TCAGCAGATCATTTTGCCACTCAAAAAACTGTCTGTTGAATATAGTGCCTTACCTGTCTTAGGTTCGGCACCTGTTGTTG
CTGCAGGTGGTGTCGCTGGTCATACGAATAAAATGACTAAAATGTCCCCGGACGCGACTAAAAGCAACACGACCGATGAC
AAGGCTCTAAATTATGCGGCACAACAGGCCGCGAGCCTTGGTAGCCAGCTCCAGTCGCGCTCACTGAACGGCGATTACGC
GAAAGATACCGCTCTTGGTATGGCCAGCAGCCAGGCTTCGTCACAGTTGCAGGCCTGGTTACAACATTATGGAACGGCAG
AGGTTAATCTGCAGAGTGGTAATAACTTTGACGGTAGTTCACTGGACTTCTTATTACCGTTCTATGATTCCGAAAACATG
CTGGCATTTGGTCAGGTCGGGGCGCGTTACATTGACTCCCGCTTTACGGCAAATTTAGGTGCTGGCCAGCGTTTTTTCCT
TCCTGAAAATATGTTGGGCTATAACGTCTTCATTGATCAGGATTTTTCTGGTGATAATACCCGTTTAGGTATTGGTGGCG
AATACTGGCGAGACTATTTCAAAAGTAGCGTTAACGGCTATTTCCGCATGAGCGGCTGGCATGAGTCATACAATAAGAAA
GACTATGATGAGCGCCCGGCAAATGGTTTTGATATCCGCTTTAATGGCTATTTACCATCATATCCGGCATTAGGCGCCAA
ACTGATGTACGAACAGTATTATGGTGATAATGTTGCTTTGTTTAATTCCGATAAGTTGCAGTCGAATCCTGGCGCGGCGA
CCGTTGGTGTAAACTACACTCCGATTCCTCTGGTGACGATGGGGATCGATTACCGTCATGGTACGGGTAATGAAAATGAT
CTCCTTTACTCAATGCAGTTCCGTTATCAGTTTGATAAACCGTGGTCTCAGCAAATCGAGCCACAGTATGTTAACGAGTT
AAGAACATTATCGGGCAGCCGTTACGATCTGGTTCAGCGTAATAACAATATTATTCTGGAGTACAAAAAGCAGGATATTC
TTTCTCTGAATATTCCGCATGATATTAATGGTACTGAACACAGTACGCAGAAGATTCAATTGATCGTTAAGAGCAAATAT
GGTCTGGATCGTATCGTCTGGGATGATAGTGCATTACGCAGTCAGGGCGGTCAGATTCAGCATAGCGGAAGCCAAAGCGC
ACAAGACTACCAGGCTATTTTGCCTGCTTATGTGCAAGGTGGCAGCAATATTTATAAAGTGACCGCTCGCGCCTATGACC
GAAATGGTAATAGTTCTAATAATGTACAGCTCACTATTACCGTTTTACCGAATGGGCAGGTTGTGGACCAGGTTGGGGTA
ACGGACTTTACGGCTGATAAGACATCGGCTAAAGCGGATAACGTTGATACCATTACTTATACCGCGACGGTTAAAAAGAA
TGGTGTAGCTCAGGCTAATGCCCCTGTAACATTTAGTATTGTATCCGGGACTGCAACTCTCGGGGCAAATAGTGCCAAAA
CGGATGGTAACGGTAAGGCAACCGTAACGTTGAAGTCGGGTACGCCAGGGCAGGTCGTCGTGTCTGCTAAAACCGCGGAG
ATGACTTCGCCACTTAATGCCAGTGCGGTTATATTTGTTGATCAAACCAAGGCCAGCATTACTGAGATTAAGGCTGATAA
AACAACAGCGAAGGCAAATGGTTCTGATGCGATTACCTATATTGTTAAAGTAATGAAGAATAACCAACCAGAAGCAAACC
ATTCTGTTACATTCTCAACGAACTTTGGTAATCTGGGGGGAAATTCTAATACCCAAATTGTGAAAACGGATAAAGATGGT
AGGGCTACGGTAAAACTGACATCTGGCGTTGCAGGTAATGCTGTTGTTAGTGCAAAAGTCAGCGAAGTTAATACAGAGGT
TAAGGCTCCTGAGGTAAAATTCTTCTCAGTTCTGAGCATTGATAGTAATGTGAGTATTATTGGAACCTCCGCTAATGGCG
CTTTACCTAATATTTGGTTGCGATATGGTCAGTTTAAGCTGACAGCCAAAGGTGGCGATGGGAAATATCAATGGCGCTCT
CAAGATCCAAGTGTTGCATCAGTTGATGCTTTAACTGGTCGAGTTACTTTGCTGAAGAAAGGAACAACAACAATTGAAGT
TGTATCGGGTGATAACCAAACAGCAATGTATACAATTAATACACCTACAAAATTTATATCTGTGGAGACACAAAATAAAG
TAGTCTATAGTGATGCTGAGGCAACATGTAGAATGAATAATGCACGCTTGCCGTCATCTACGAGTGAGCTAAAGGATGTG
TATAATAAATGGGGCGCCGCCAATAGTTATGAAGGCTATAAAGGTAAAAAAACAATAACAGCATGGACACAGCAAACTGA
GGATGATAAACAAAAAGGTTGGACTAGTACATTTGACATAGTTACAAAAAATGAAATCCCTAGTAATGGCAGTAATAGTA
AAGTCCACGTGAATAAAGCTAACGCTTTTGCCGTCTGTGTAAGATGA

Protein sequence :
MITHGFYTRTRHKHKLKKTFIMLSAGLGLFFYVNQNSFANGENYFKLSSDSKLLTQNAAQDRLFYTLKTGETVANISKSQ
GISLSVIWSLNKHLYSSESEMMKAGPGQQIILPLKKLSVEYSALPVLGSAPVVAAGGVAGHTNKMTKMSPDATKSNTTDD
KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGMASSQASSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSENM
LAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKK
DYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNEND
LLYSMQFRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTEHSTQKIQLIVKSKY
GLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNIYKVTARAYDRNGNSSNNVQLTITVLPNGQVVDQVGV
TDFTADKTSAKADNVDTITYTATVKKNGVAQANAPVTFSIVSGTATLGANSAKTDGNGKATVTLKSGTPGQVVVSAKTAE
MTSPLNASAVIFVDQTKASITEIKADKTTAKANGSDAITYIVKVMKNNQPEANHSVTFSTNFGNLGGNSNTQIVKTDKDG
RATVKLTSGVAGNAVVSAKVSEVNTEVKAPEVKFFSVLSIDSNVSIIGTSANGALPNIWLRYGQFKLTAKGGDGKYQWRS
QDPSVASVDALTGRVTLLKKGTTTIEVVSGDNQTAMYTINTPTKFISVETQNKVVYSDAEATCRMNNARLPSSTSELKDV
YNKWGAANSYEGYKGKKTITAWTQQTEDDKQKGWTSTFDIVTKNEIPSNGSNSKVHVNKANAFAVCVR
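
Translation check : the DNA sequence as listed starts with ATG and ends with TGA, so it appears to be given in coding orientation even though the Strand field is "-". The Python sketch below translates short excerpts with the standard codon assignments, which bacterial translation table 11 shares. It is a minimal illustration, not the method used to produce this record, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First five codons and last six codons of the DNA sequence in this record.
n_terminus = translate("ATGATTACTCATGGT")
c_terminus = translate("GCCGTCTGTGTAAGATGA")
print(n_terminus, c_terminus)
```

The two excerpts translate to MITHG and AVCVR, matching the start and end of the protein sequence above. Passing the full DNA sequence as one unbroken string should reproduce the whole protein under the same assumptions.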

• Homologs from PAI DB

Gene     GenBank Accn    Product                    Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity (%)
eae      CAI43865.1      intimin epsilon            Virulence                LEE         Protein         0.0    100
eae      YP_003223466.1  intimin epsilon            Not tested               LEE         Protein         0.0    100
eae      CAC81871.1      Intimin                    Not tested               LEE II      Protein         0.0    85
eae      YP_003232162.1  intimin                    Not tested               LEE         Protein         0.0    85
eae      AAK26724.1      intimin                    Virulence                LEE         Protein         0.0    85
eae      AAL57551.1      Eae                        Virulence                LEE         Protein         0.0    85
eae      YP_003236079.1  theta intimin              Not tested               LEE         Protein         0.0    83
eaeA     AAC31504.1      L0025                      Virulence                LEE         Protein         0.0    82
unnamed  ACU09449.1      gamma intimin              Virulence                LEE         Protein         0.0    82
eae      NP_290259.1     intimin adherence protein  Virulence                LEE         Protein         0.0    82
ECs4559  NP_312586.1     gamma intimin              Virulence                LEE         Protein         0.0    82
eae      AAC38392.1      intimin                    Virulence                LEE         Protein         0.0    81
unnamed  AAL06378.1      intimin                    Virulence                LEE         Protein         0.0    78

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn    Product          ID of source DB  Alignment Type  E-val  Identity (%)
eae   YP_003223466.1  intimin epsilon  VFG0803          Protein         0.0    82
eae   YP_003223466.1  intimin epsilon  VFG0739          Protein         0.0    81