 Name : c5167 (c5167)
 Accession : NP_757015.1
 PAI name :  PAI II CFT073
 PAI accession : NC_004431_P2
 Strain : Escherichia coli 042
 Virulence or Resistance: Not determined
 Product : transposase IS629
 Function : -
 Note : Escherichia coli O157:H7 ortholog: z3297
 Homologs in the searched genomes :    1174 hits    ( 1154 protein-level,   20 DNA-level )
 Publication :
 
-Welch,R.A., Burland,V., Plunkett,G. III, Redford,P., Roesch,P., Rasko,D., Buckles,E.L., Liou,S.R., Boutin,A., Hackett,J., Stroud,D., Mayhew,G.F., Rose,D.J., Zhou,S., Schwartz,D.C., Perna,N.T., Mobley,H.L., Donnenberg,M.S. and Blattner,F.R., "Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli", Proc. Natl. Acad. Sci. U.S.A. 99 (26), 17020-17024 (2002) PUBMED 12471157.
 -Welch,R.A., Burland,V., Plunkett,G. III, Redford,P., Roesch,P., Rasko,D., Buckles,E.L., Liou,S.R., Boutin,A., Hackett,J., Stroud,D., Mayhew,G.F., Rose,D.J., Zhou,S., Schwartz,D.C., Perna,N.T., Mobley,H.L., Donnenberg,M.S. and Blattner,F.R., "Direct Submission", Submitted (10-SEP-2004) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.
 
 -Welch,R.A., Burland,V., Plunkett,G.D. III, Redford,P., Roesch,P., Rasko,D.A., Buckles,E.L., Liou,S.-R., Boutin,A., Hackett,J., Stroud,D., Mayhew,G.F., Rose,D.J., Zhou,S., Schwartz,D.C., Perna,N.T., Mobley,H.L.T., Donnenberg,M.S. and Blattner,F.R., "Direct Submission", Submitted (20-JUN-2002) Genetics Laboratory, University of Wisconsin - Madison, 445 Henry Mall, Madison, WI 53706, USA.
 
 DNA sequence :
ATGATGCCACTGCTGGATAAGCTGCGTGAGCAGTACGGGGTCGGACCGCTATGCAGCGAACTGCATATTGCCCCGTCAAC
GTATTACCACTGTCAGCAACAGCGACATCATCCGGATAAACGCAGTGCCCGTGCGCAGCGCGATGACTGGCTGAAGAAAG
AGATACAGCGCGTATACGATGAAAATCACAAGGTATACGGTGTGCGTAAAGTCTGGCGTCAGTTGTTACGGGAAGGTATC
AGAGTGGCCAGATGCACTGTGGCACGTCTCATGGCGGTTATGGGACTTGCCGGTGTTCTCCGGGGTAAAAAGGTCCGTAC
GACCATCAGCCGGAAAGCCGTTGCCGCAGGCGACCGCGTAAACCGTCAGTTCGTGGCAGAACGACCTGACCAGCTGTGGG
TGGCTGATTTTACTTACGTCAGCACATGGCGGGGCTTCGTCTATGTGGCGTTCATCATTGATGTGTTTGCCGGATACATC
GTGGGGTGGCGGGTCTCATCGTCCATGGAAACGACATTCGTGCTGGATGCACTGGAGCAGGCGTTATGGGCCCGTCGACC
GTCCGGCACGGTCCATCACAGTGATAAAGGTTCTCAGTATGTATCGCTGGCCTACACACAGCGGCTTAAGGAAGCCGGAT
TACTGGCATCAACAGGAAGTAACAGGCGACTCGTATGA
 
 Protein sequence :
MMPLLDKLREQYGVGPLCSELHIAPSTYYHCQQQRHHPDKRSARAQRDDWLKKEIQRVYDENHKVYGVRKVWRQLLREGI
RVARCTVARLMAVMGLAGVLRGKKVRTTISRKAVAAGDRVNRQFVAERPDQLWVADFTYVSTWRGFVYVAFIIDVFAGYI
VGWRVSSSMETTFVLDALEQALWARRPSGTVHHSDKGSQYVSLAYTQRLKEAGLLASTGSNRRLV
 