PAI Gene Information


Name : EC042_4096 (EC042_4096)
Accession : YP_006098379.1
PAI name : Tn2411
PAI accession : NC_017626_R1
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : Putative acetyltransferase
Function : -
Note : similar to puromycin N-acetyltransferase protein in Streptomyces anulatus ATCC 12461, GenBank Accession Number M25346
Homologs in the searched genomes :   19 hits    ( 19 protein-level )  
Publication :
    -Aslett,M.A., "Direct Submission", Submitted (25-SEP-2009) Aslett M.A., Wellcome Trust Sanger Institute, Pathogen Sequencing Unit, Wellcome Trust Genome Campus, Hinxton, Cambridge, Cambridgeshire. CB10 1SA, UNITED KINGDOM.

    -Chaudhuri,R.R., Sebaihia,M., Hobman,J.L., Webber,M.A., Leyton,D.L., Goldberg,M.D., Cunningham,A.F., Scott-Tucker,A., Ferguson,P.R., Thomas,C.M., Frankel,G., Tang,C.M., Dudley,E.G., Roberts,I.S., Rasko,D.A., Pallen,M.J., Parkhill,J., Nataro,J.P., Thomson,N, "Complete genome sequence and comparative metabolic profiling of the prototypical enteroaggregative Escherichia coli strain 042", PLoS ONE 5 (1), E8801 (2010) PUBMED 20098708 REMARK Publication Status: Online-Only.

    -Chaudhuri,R.R., Sebaihia,M., Hobman,J.L., Webber,M.A., Leyton,D.L., Goldberg,M.D., Cunningham,A.F., Scott-Tucker,A., Ferguson,P.R., Thomas,C.M., Frankel,G., Tang,C.M., Dudley,E.G., Roberts,I.S., Rasko,D.A., Pallen,M.J., Parkhill,J., Nataro,J.P., Thomson,N, "Direct Submission", Submitted (11-APR-2012) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.


DNA sequence :
ATGGACAGCGAGGAGCCTCCGAACGTTCGGGTCGCCTGCTCGGGTGATATCGACGAGGTTGTGCGGCTGATGCACGACGC
TGCGGCGTGGATGTCCGCCAAGGGAACGCCCGCCTGGGACGTCGCGCGGATCGACCGGACATTCGCGGAGACCTTCGTCC
TGAGATCCGAGCTCCTAGTCGCGAGTTGCAGCGACGGCATCGTCGGCTGTTGCACCTTGTCGGCCGAGGATCCCGAGTTC
TGGCCCGACGCCCTCAAGGGGGAGGCCGCATATCTGCACAAGCTCGCGGTGCGACGGACACATGCGGGCCGGGGTGTCAG
CTCCGCGCTGATCGAGGCTTGCCGCCATGCCGCGCGAACGCAGGGGTGCGCCAAGCTGCGGCTCGACTGCCACCCGAACC
TGCGTGGCCTATACGAGCGGCTCGGATTCACCCACGTCGACACTTTCAATCCCGGCTGGGATCCAACCTTCATCGCAGAA
CGCCTAGAACTCGAAATCTAA

Protein sequence :
MDSEEPPNVRVACSGDIDEVVRLMHDAAAWMSAKGTPAWDVARIDRTFAETFVLRSELLVASCSDGIVGCCTLSAEDPEF
WPDALKGEAAYLHKLAVRRTHAGRGVSSALIEACRHAARTQGCAKLRLDCHPNLRGLYERLGFTHVDTFNPGWDPTFIAE
RLELEI