| Name : ECO103_3554 (ECO103_3554) Accession : YP_003223421.1
 PAI name :  LEE
 PAI accession : NC_013353_P1
 Strain : Escherichia coli 042
 Virulence or Resistance: Not determined
 Product : hypothetical protein
 Function : -
 Note : Integrative element ECO103_IE03; intact form of IS683 are not identified; predicted protein Orf3 in insertion sequence IS683
 Homologs in the searched genomes :    468 hits    ( 462 protein-level,   6 DNA-level )
 Publication :
 
-Hattori,M., Toh,H., Oshima,K., Yamashita,A., Hayashi,T., Ogura,Y. and Ooka,T., "Direct Submission", Submitted (21-DEC-2008) Contact:Masahira Hattori University of Tokyo, Graduate School of Frontier Sciences; 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8562, Japan URL :http://www.cb.k.u-tokyo.ac.jp/hattorilab/.
 -Ogura,Y., Ooka,T., Iguchi,A., Toh,H., Asadulghani,M., Oshima,K., Kodama,T., Abe,H., Nakayama,K., Kurokawa,K., Tobe,T., Hattori,M. and Hayashi,T., "Comparative genomics reveal the mechanism of the parallel evolution of O157 and non-O157 enterohemorrhagic Escherichia coli", Proc. Natl. Acad. Sci. U.S.A. 106 (42), 17939-17944 (2009) PUBMED 19815525.
 
 -Ogura,Y., Ooka,T., Iguchi,A., Toh,H., Asadulghani,M., Oshima,K., Kodama,T., Abe,H., Nakayama,K., Kurokawa,K., Tobe,T., Hattori,M. and Hayashi,T., "Direct Submission", Submitted (07-OCT-2009) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.
 
 
 
 
      | DNA sequence : |  |  | ATGAACAACACACTCCCCGACGACATCGAGCAACTGAAGGCCCTGCTGATCGCACAGCAGGCTGTTATCGTCCGTCTGTC
TGGTGAAATAACCGGCTATGCCCGCGAGATCAGCTCACTCAGAGCGCTGGTCGCTAAACTGCAGAGAATGTTGTTCGGTC
GCAGCAGCGAGAAAAGCCGCGAGAAGATAGAAAAGAAGATCGCACGGGCAGAAACGCGTATAACCGAGCTCCAGAACAGG
CTTGGTGAGGCGCAGTTGCAACTCACCTCAATGGCCGGAGAGACAGCGCCGAAAACATCAGACTCTCCCGTCCGCAAAGC
ACTTCCGGCAACACTTCCCCGTGACAGGCAGGTTATCTCCCCGGCAGAAACCGAATGCCCCGTCTGCAGCGGCAAACTGA
AACCGCTGGGAGAAAGCATCTCTGAACAACTGGATATCATCAACACCGCGTTCAGGGTAATCGAAACGGTTCGCCCAAAA
CGGGCCTGCAGCCGGTGCGACTGTATAGTTCAGGCTCCGCAGCCACCAAAACCCATCGAGCGCAGTTACGCCAGTCCGGC
TCTGCTGGCCCGCATAATCATGGCTAAGTTCGCCGAGCATCTGCCGCTGTACCGTCAGTCGGAAATCTATGCCCGCCAGG
GCGTGGAGCTGCACCGCAATACGATGGGGCGCTGGGTTGACATCATGGGAGAGCAGCTTCGCCCGCTGTATGATGAACTG
AAGCACTATGTGCTGATGCCGGGTAAAGTGCATGCCGATGACACGCCGGTAAATGTACTGGAGCCGGGTCAGGGTAAAAC
CCGTACCGGACGGCTGTGGGTCTATGTTCGTGACGATCGCAACGCCGGTTCGACCATGCCGGCAGCGGTGTGGTTCTCAT
ACTCTCCCGACCGCAAAGGCATCCACCCACAGCAACATCTGGCGGACTACAGAGGTATCCTGCAGGCCGATGCATATGCG
GGTTACAATGCTCTTTACGAAAGCGGTCAGGCAACCGAAGCGGCTTGTATGGCACATGCCCGACGCAAGATCCACGATGT
ACATGTCCGCCATCCAACGACAGTAACGGGAGAAGCGCTCCGTCGTATCGGGGAACTGTACGCTATCGAGGCTGAGATCC
GCGGCAGTCCGGCAGAAGAGCGACTGGCGGTCAGAAAAGCCAGAACGGTACCGCTAATGCAGTCGTTGTATGAGTGGCTC
CAGGGGCAGATGAACACGCTGTCGCGCCACTCGGATACAGCGAAAGCGTTCACCTATCTGCTGAAGCAATGGGACGCTCT
GAACGAATACTGCCGCAATGGCTGGGTGGAGATCGACAATAACCTGTGTGAAAACGCCCTCCGGGTAGTTGCACTGGGGC
GGCGTAACTACCGGACGTGGTTTCTGCCAGATAAAGGCAACGGAACAAAAGAATCATGGTTTTTCCGGAACCGGCCACAC
CATGAATGA
 
 |  | Protein sequence : |  |  | MNNTLPDDIEQLKALLIAQQAVIVRLSGEITGYAREISSLRALVAKLQRMLFGRSSEKSREKIEKKIARAETRITELQNR
LGEAQLQLTSMAGETAPKTSDSPVRKALPATLPRDRQVISPAETECPVCSGKLKPLGESISEQLDIINTAFRVIETVRPK
RACSRCDCIVQAPQPPKPIERSYASPALLARIIMAKFAEHLPLYRQSEIYARQGVELHRNTMGRWVDIMGEQLRPLYDEL
KHYVLMPGKVHADDTPVNVLEPGQGKTRTGRLWVYVRDDRNAGSTMPAAVWFSYSPDRKGIHPQQHLADYRGILQADAYA
GYNALYESGQATEAACMAHARRKIHDVHVRHPTTVTGEALRRIGELYAIEAEIRGSPAEERLAVRKARTVPLMQSLYEWL
QGQMNTLSRHSDTAKAFTYLLKQWDALNEYCRNGWVEIDNNLCENALRVVALGRRNYRTWFLPDKGNGTKESWFFRNRPH
HE
 
 |  |