Name : ECH74115_5054 (ECH74115_5054) Accession : YP_002273158.1 Strain : Escherichia coli EC4115 Genome accession: NC_011353 Putative virulence/resistance : Virulence Product : intimin C-type lectin domain-containing protein Function : - COG functional category : N : Cell motility COG ID : COG5492 EC number : - Position : 4699562 - 4702366 bp Length : 2805 bp Strand : - Note : identified by match to protein family HMM PF01476; match to protein family HMM PF02368; match to protein family HMM PF02369; match to protein family HMM PF07979 DNA sequence : ATGATTACTCATGGTTGTTATACCCGGACCCGGCACAAGCATAAGCTAAAAAAAACATTGATTATGCTTAGTGCTGGTTT AGGATTGTTTTTTTATGTTAATCAGAATTCATTTGCAAATGGTGAAAATTATTTTAAATTGGGTTCGGATTCAAAACTGT TAACTCATGATAGCTATCAGAATCGCCTTTTTTATACGTTGAAAACTGGTGAAACTGTTGCCGATCTTTCTAAATCGCAA GATATTAATTTATCGACGATTTGGTCGTTGAATAAGCATTTATACAGTTCTGAAAGCGAAATGATGAAGGCCGCGCCTGG TCAGCAGATCATTTTGCCACTCAAAAAACTTCCCTTTGAATACAGTGCACTACCACTTTTAGGTTCGGCACCTCTTGTTG CTGCAGGTGGTGTTGCTGGTCACACGAATAAACTGACTAAAATGTCCCCGGACGTGACCAAAAGCAACATGACCGATGAC AAGGCATTAAATTATGCGGCACAACAGGCGGCGAGTCTCGGTAGCCAGCTTCAGTCGCGATCTCTGAACGGCGATTACGC GAAAGATACCGCTCTTGGTATCGCTGGTAACCAGGCTTCGTCACAGTTGCAGGCCTGGTTACAACATTATGGAACGGCAG AGGTTAATCTGCAGAGTGGTAATAACTTTGACGGTAGTTCACTGGACTTCTTATTACCGTTCTATGATTCCGAAAAAATG CTGGCATTTGGTCAGGTCGGAGCGCGTTACATTGACTCCCGCTTTACGGCAAATTTAGGTGCGGGTCAGCGTTTTTTCCT TCCTGCAAACATGTTGGGCTATAACGTCTTCATTGATCAGGATTTTTCTGGTGATAATACCCGTTTAGGTATTGGTGGCG AATACTGGCGAGACTATTTCAAAAGTAGCGTTAACGGCTATTTCCGCATGAGCGGCTGGCATGAGTCATACAATAAGAAA GACTATGATGAGCGCCCAGCAAATGGCTTCGATATCCGTTTTAATGGCTATCTACCGTCATATCCGGCATTAGGCGCCAA GCTGATATATGAGCAGTATTATGGTGATAATGTTGCTTTGTTTAATTCTGATAAGCTGCAGTCGAATCCTGGTGCGGCGA CCGTTGGTGTAAACTATACTCCGATTCCTCTGGTGACGATGGGGATCGATTACCGTCATGGTACGGGTAATGAAAATGAT CTCCTTTACTCAATGCAGTTCCGTTATCAGTTTGATAAATCGTGGTCTCAGCAAATTGAACCACAGTATGTTAACGAGTT AAGAACATTATCAGGCAGCCGTTACGATCTGGTTCAGCGTAATAACAATATTATTCTGGAGTACAAGAAGCAGGATATTC 
TTTCTCTGAATATTCCGCATGATATTAATGGTACTGAACACAGTACGCAGAAGATTCAGTTGATCGTTAAGAGCAAATAC GGTCTGGATCGTATCGTCTGGGATGATAGTGCATTACGCAGTCAGGGCGGTCAGATTCAGCATAGCGGAAGCCAAAGCGC ACAAGACTACCAGGCTATTTTGCCTGCTTATGTGCAAGGTGGCAGCAATATTTATAAAGTGACGGCTCGCGCCTATGACC GTAATGGCAATAGCTCTAACAATGTACAGCTTACTATTACCGTTCTGTCGAATGGTCAAGTTGTCGACCAGGTTGGGGTA ACGGACTTTACGGCGGATAAGACTTCGGCTAAAGCGGATAACGCCGATACCATTACTTATACCGCGACGGTGAAAAAGAA TGGGGTAGCTCAGGCTAATGTCCCTGTTTCATTTAATATTGTTTCAGGAACTGCAACTCTTGGGGCAAATAGTGCCAAAA CGGATGCTAACGGTAAGGCAACCGTAACGTTGAAGTCGAGTACGCCAGGACAGGTCGTCGTGTCTGCTAAAACCGCGGAG ATGACTTCAGCACTTAATGCCAGTGCGGTTATATTTTTTGATCAAACCAAGGCCAGCATTACTGAGATTAAGGCTGATAA GACAACTGCAGTAGCAAATGGTAAGGATGCTATTAAATATACTGTAAAAGTTATGAAAAACGGTCAGCCAGTTAATAATC AATCCGTTACATTCTCAACAAACTTTGGGATGTTCAACGGTAAGTCTCAAACGCAAGCAACCACGGGAAATGATGGTCGT GCGACGATAACACTAACTTCCAGTTCCGCCGGTAAAGCGACTGTTAGTGCGACAGTCAGTGATGGGGCTGAGGTTAAAGC GACTGAGGTCACTTTTTTTGATGAACTGAAAATTGACAACAAGGTTGATATTATTGGTAACAATGTCAGAGGCGAGTTGC CTAATATTTGGCTGCAATATGGTCAGTTTAAACTGAAAGCAAGCGGTGGTGATGGTACATATTCATGGTATTCAGAAAAT ACCAGTATCGCGACTGTCGATGCATCAGGGAAAGTCACTTTGAATGGTAAAGGCAGTGTCGTAATTAAAGCCACATCTGG TGATAAGCAAACAGTAAGTTACACTATAAAAGCACCGTCGTATATGATAAAAGTGGATAAGCAAGCCTATTATGCTGATG CTATGTCCATTTGCAAAAATTTATTACCATCCACACAGACGGTATTGTCAGATATTTATGACTCATGGGGGGCTGCAAAT AAATATAGCCATTATAGTTCTATGAACTCAATAACTGCTTGGATTAAACAGACATCTAGTGAGCAGCGTTCTGGAGTATC AAGCACTTATAACCTAATAACACAAAACCCTCTTCCTGGGGTTAATGTTAATACTCCAAATGTCTATGCGGTTTGTGTAG AATAA Protein sequence : MITHGCYTRTRHKHKLKKTLIMLSAGLGLFFYVNQNSFANGENYFKLGSDSKLLTHDSYQNRLFYTLKTGETVADLSKSQ DINLSTIWSLNKHLYSSESEMMKAAPGQQIILPLKKLPFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDD KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSEKM LAFGQVGARYIDSRFTANLGAGQRFFLPANMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKK DYDERPANGFDIRFNGYLPSYPALGAKLIYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNEND LLYSMQFRYQFDKSWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTEHSTQKIQLIVKSKY 
GLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNIYKVTARAYDRNGNSSNNVQLTITVLSNGQVVDQVGV TDFTADKTSAKADNADTITYTATVKKNGVAQANVPVSFNIVSGTATLGANSAKTDANGKATVTLKSSTPGQVVVSAKTAE MTSALNASAVIFFDQTKASITEIKADKTTAVANGKDAIKYTVKVMKNGQPVNNQSVTFSTNFGMFNGKSQTQATTGNDGR ATITLTSSSAGKATVSATVSDGAEVKATEVTFFDELKIDNKVDIIGNNVRGELPNIWLQYGQFKLKASGGDGTYSWYSEN TSIATVDASGKVTLNGKGSVVIKATSGDKQTVSYTIKAPSYMIKVDKQAYYADAMSICKNLLPSTQTVLSDIYDSWGAAN KYSHYSSMNSITAWIKQTSSEQRSGVSSTYNLITQNPLPGVNVNTPNVYAVCVE |
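The position, nucleotide length, and protein length given in this record can be cross-checked with simple arithmetic. The Python sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Coordinates taken from the record: 4699562 - 4702366 bp.
length_bp = gene_length(4699562, 4702366)
print(length_bp)                  # 2805
print(protein_length(length_bp))  # 934
```

Under these assumptions the computed 2805 bp agrees with the stated length, and 934 residues agrees with the protein sequence listed in the record.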
Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity |
eae | NP_290259.1 | intimin adherence protein | Virulence | LEE | Protein | 0.0 | 100 |
ECs4559 | NP_312586.1 | gamma intimin | Virulence | LEE | Protein | 0.0 | 100 |
eaeA | AAC31504.1 | L0025 | Virulence | LEE | Protein | 0.0 | 100 |
unnamed | ACU09449.1 | gamma intimin | Virulence | LEE | Protein | 0.0 | 100 |
eae | YP_003236079.1 | theta intimin | Not tested | LEE | Protein | 0.0 | 90 |
eae | AAK26724.1 | intimin | Virulence | LEE | Protein | 0.0 | 83 |
eae | AAL57551.1 | Eae | Virulence | LEE | Protein | 0.0 | 83 |
eae | CAC81871.1 | Intimin | Not tested | LEE II | Protein | 0.0 | 83 |
eae | YP_003232162.1 | intimin | Not tested | LEE | Protein | 0.0 | 83 |
eae | AAC38392.1 | intimin | Virulence | LEE | Protein | 0.0 | 83 |
eae | YP_003223466.1 | intimin epsilon | Not tested | LEE | Protein | 0.0 | 82 |
eae | CAI43865.1 | intimin epsilon | Virulence | LEE | Protein | 0.0 | 82 |
unnamed | AAL06378.1 | intimin | Virulence | LEE | Protein | 0.0 | 78 |
eae | AFO66392.1 | intimin-like protein | Virulence | SESS LEE | Protein | 0.0 | 60 |
eae | AFO66294.1 | intimin-like protein | Not tested | SESS LEE | Protein | 0.0 | 59 |
Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity |
ECH74115_5054 | YP_002273158.1 | intimin C-type lectin domain-containing protein | VFG0803 | Protein | 0.0 | 100 |
ECH74115_5054 | YP_002273158.1 | intimin C-type lectin domain-containing protein | VFG0739 | Protein | 0.0 | 83 |
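
The pipe-delimited hit tables can be loaded for filtering with a few lines of code. This is a minimal Python sketch, assuming each row sits on one line with `|` between fields and an optional trailing `|`. The helper `parse_rows` and the 90% identity cutoff are illustrative choices, not part of the source database.

```python
def parse_rows(lines):
    """Parse pipe-delimited lines into dicts keyed by the header row."""
    rows = [
        [cell.strip() for cell in line.strip().rstrip("|").split("|")]
        for line in lines
        if line.strip()
    ]
    header, body = rows[0], rows[1:]
    return [dict(zip(header, row)) for row in body]


# Header and rows copied from the second table above.
table = [
    "Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity |",
    "ECH74115_5054 | YP_002273158.1 | intimin C-type lectin domain-containing protein | VFG0803 | Protein | 0.0 | 100 |",
    "ECH74115_5054 | YP_002273158.1 | intimin C-type lectin domain-containing protein | VFG0739 | Protein | 0.0 | 83 |",
]

hits = parse_rows(table)
# Keep only hits at or above the (hypothetical) 90% identity cutoff.
best = [h["ID of source DB"] for h in hits if int(h["Identity"]) >= 90]
print(best)  # ['VFG0803']
```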