Gene Information

Name : eaeA (E2348C_3939)
Accession : YP_002331401.1
Strain : Escherichia coli E2348/69
Genome accession: NC_011601
Putative virulence/resistance : Virulence
Product : intimin EaeA
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4110343 - 4113162 bp
Length : 2820 bp
Strand : -
Note : -

DNA sequence :
ATGATTACTCATGGTTTTTATGCCCGGACCCGGCACAAGCATAAGCTAAAAAAAACATTTATTATGCTTAGTGCTGGTTT
AGGATTGTTTTTTTATGTTAATCAGAATTCATTTGCAAATGGTGAAAATTATTTTAAATTGGGTTCGGATTCAAAACTGT
TAACTCATAATAGCTATCAGAATCGCCTTTTTTATACGTTGAAAACAGGTGAAACTGTTGCCGATCTTTCTAAATCGCAA
GATATTAATTTATCGACGATTTGGTCGTTGAATAAGCATTTATACAGTTCTGAAAGCGAAATGATGAAGGCCGCGCCTGG
TCAGCAGATCATTTTGCCACTCAAAAAACTTCCCTTTGAATACAGTGCCTTACCACTTTTAGGTTCGGCACCTCTTGTTG
CTGCAGGTGGTGTCGCTGGTCATACAAATAAACTGACTAAAATGTCCCCGGACGTGACCAAAAGCAACATGACCGATGAC
AAGGCATTAAATTATGCGGCACAACAGGCGGCGAGTCTCGGTAGCCAGCTTCAGTCGCGATCTCTGAACGGCGATTACGC
GAAAGATACCGCTCTTGGTATCGCTGGTAACCAGGCTTCGTCACAGTTGCAGGCCTGGTTACAACATTATGGAACGGCAG
AGGTTAATCTGCAGAGTGGTAATAACTTTGACGGTAGTTCACTGGACTTCTTATTACCGTTCTATGATTCCGAAAAAATG
CTGGCATTTGGTCAGGTCGGAGCGCGTTACATTGACTCCCGCTTTACGGCAAATTTAGGTGCGGGTCAGCGTTTTTTCCT
TCCTGAAAATATGTTGGGCTATAACGTCTTCATTGATCAGGATTTTTCTGGTGATAATACCCGTTTAGGTATTGGTGGCG
AATACTGGCGAGACTATTTCAAAAGTAGTGTTAACGGCTATTTCCGCATGAGCGGCTGGCATGAGTCATACAATAAGAAA
GACTATGATGAGCGCCCAGCAAATGGCTTCGATATCCGTTTTAATGGCTATCTGCCATCATACCCGGCATTAGGTGCCAA
GCTGATGTATGAGCAGTATTATGGTGATAATGTTGCTTTGTTTAATTCTGATAAGCTGCAGTCGAATCCTGGTGCGGCGA
CCGTTGGTGTAAACTATACTCCGATTCCTCTGGTGACGATGGGGATCGATTACCGTCATGGTACGGGTAATGAAAATGAT
CTCCTTTACTCAATGCAGTTCCGTTATCAGTTTGATAAACCGTGGTCTCAGCAAATTGAGCCACAATATGTTAACGAGTT
AAGAACATTATCAGGCAGCCGTTACGATCTGGTTCAGCGTAATAACAATATTATTCTGGAGTACAAAAAGCAGGATATTC
TTTCTCTGAATATTCCGCATGATATTAATGGTACTGAACGCAGTACGCAGAAGATTCAATTGATCGTTAAGAGCAAATAC
GGTCTGGATCGTATCGTCTGGGATGATAGTGCATTACGTAGCCAGGGCGGCCAGATTCAGCATAGCGGAAGCCAAAGCGC
ACAAGATTACCAGGCTATTTTGCCTGCTTATGTGCAAGGTGGTAGCAATGTTTATAAAGTGACGGCTCGCGCCTATGACC
GTAATGGCAATAGCTCTAACAATGTACTGCTTACTATTACCGTTCTGTCGAATGGTCAGGTGGTCGACCAGGTTGGGGTA
ACGGACTTTACGGCTGATAAGACTTCGGCTAAAGCGGATGGCACCGAAGCAATTACTTATACTGCGACGGTGAAAAAGAA
TGGGGTAGCTCAGGCTAATGTCCCTGTTTCATTTAATATTGTTTCAGGAACTGCAGTTTTAAGTGCCAATAGTGCCAATA
CCAATGGTAGCGGTAAGGCGACTGTAACCCTGAAATCGGATAAACCAGGCCAGGTCGTCGTGTCTGCTAAAACAGCAGAG
ATGACTTCAGCGCTTAATGCCAATGCAGTTATATTTGTTGATCAAACCAAGGCCAGCATTACTGAGATTAAGGCTGATAA
AACAACGGCAGTAGCAAATGGTCAGGATGCTATTACATACACTGTTAAAGTGATGAAGGGGGATAAGCCTGTATCTAATC
AGGAAGTGACCTTTACGACGACCTTAGGTAAGTTAAGTAATTCCACTGAAAAAACGGATACGAATGGCTATGCCAAAGTA
ACATTAACATCGACAACTCCAGGAAAATCACTCGTTAGTGCCCGTGTTAGCGATGTCGCCGTTGATGTCAAAGCACCTGA
AGTTGAATTTTTTACAACGCTTACAATTGATGACGGTAATATTGAAATTGTTGGAACCGGAGTTAAAGGGAAGTTACCCA
CTGTATGGTTGCAATATGGTCAAGTTAATCTGAAAGCCAGCGGAGGTAACGGAAAATATACATGGCGCTCAGCAAATCCA
GCAATTGCTTCGGTGGATGCTTCTTCTGGTCAGGTCACCTTAAAAGAGAAGGGAACTACAACTATTTCCGTTATCTCAAG
TGATAATCAAACTGCAACTTATACTATTGCAACACCTAATAGTCTGATTGTTCCTAATATGAGCAAGCGTGTGACCTATA
ATGATGCTGTGAATACATGTAAGAATTTTGGAGGAAAGTTGCCGTCTTCTCAGAATGAACTGGAAAATGTCTTTAAAGCA
TGGGGGGCTGCAAATAAATATGAATATTATAAGTCTAGTCAGACTATAATTTCATGGGTACAACAAACAGCTCAAGATGC
GAAGAGTGGTGTTGCAAGTACATACGATTTAGTTAAACAAAACCCTCTGAATAATATTAAGGCTAGTGAATCTAATGCTT
ATGCCACTTGTGTAAAATAA

Protein sequence :
MITHGFYARTRHKHKLKKTFIMLSAGLGLFFYVNQNSFANGENYFKLGSDSKLLTHNSYQNRLFYTLKTGETVADLSKSQ
DINLSTIWSLNKHLYSSESEMMKAAPGQQIILPLKKLPFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDD
KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSEKM
LAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKK
DYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNEND
LLYSMQFRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTERSTQKIQLIVKSKY
GLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGV
TDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAE
MTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKV
TLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANP
AIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKA
WGAANKYEYYKSSQTIISWVQQTAQDAKSGVASTYDLVKQNPLNNIKASESNAYATCVK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
eae AAC38392.1 intimin Virulence LEE Protein 0.0 99
eaeA AAC31504.1 L0025 Virulence LEE Protein 0.0 83
unnamed ACU09449.1 gamma intimin Virulence LEE Protein 0.0 83
eae NP_290259.1 intimin adherence protein Virulence LEE Protein 0.0 83
ECs4559 NP_312586.1 gamma intimin Virulence LEE Protein 0.0 83
eae AAK26724.1 intimin Virulence LEE Protein 0.0 83
eae YP_003232162.1 intimin Not tested LEE Protein 0.0 83
eae AAL57551.1 Eae Virulence LEE Protein 0.0 83
eae CAC81871.1 Intimin Not tested LEE II Protein 0.0 83
eae YP_003236079.1 theta intimin Not tested LEE Protein 0.0 81
eae CAI43865.1 intimin epsilon Virulence LEE Protein 0.0 81
eae YP_003223466.1 intimin epsilon Not tested LEE Protein 0.0 81
unnamed AAL06378.1 intimin Virulence LEE Protein 0.0 79
eae AFO66294.1 intimin-like protein Not tested SESS LEE Protein 0.0 59
eae AFO66392.1 intimin-like protein Virulence SESS LEE Protein 0.0 59

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
eaeA YP_002331401.1 intimin EaeA VFG0739 Protein 0.0 99
eaeA YP_002331401.1 intimin EaeA VFG0803 Protein 0.0 83