PAI Gene Information


Name : APECO1_1080 (APECO1_1080)
Accession : YP_853100.1
PAI name : PAI IV APEC-O1
PAI accession : NC_008563_P3
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : transposase
Function : -
Note : -
Homologs in the searched genomes :   28 hits    ( 20 protein-level,   8 DNA-level )  
Publication :
    -Johnson,T.J. and Nolan,L.K., "Direct Submission", Submitted (14-SEP-2006) Veterinary Microbiology and Preventive Medicine, Iowa State University, 1802 Elwood Drive, VMRI 2, Ames, IA 50011, USA.

    -Johnson,T.J., Kariyawasam,S., Wannemuehler,Y., Mangiamele,P., Johnson,S.J., Doetkott,C., Skyberg,J.A., Lynne,A.M., Johnson,J.R. and Nolan,L.K., "The genome sequence of avian pathogenic Escherichia coli strain O1:K1:H7 shares strong similarities with human extraintestinal pathogenic E. coli genomes", J. Bacteriol. 189 (8), 3228-3236 (2007) PUBMED 17293413 REMARK Erratum:[J Bacteriol. 2007 Jun;189(12):4554].

    -Johnson,T.J., Kariyawasam,S., Wannemuehler,Y., Mangiamele,P., Johnson,S.J., Doetkott,C., Skyberg,J.A., Lynne,A.M., Johnson,J.R. and Nolan,L.K., "Direct Submission", Submitted (08-NOV-2006) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.


DNA sequence :
GTGGGTCTCCGTTGCAGTGCTGATACTCTTCTTCGCAGGCTTATCAATACCCCGGAGACGAAACAGTCAGGCGCGCCTCA
TGTCGGTATTGATGAGTGGGCGTGGCATCGGGGCCACTGCTGCGGTATGTTAATAGTCAATCCTGATACTCACCGTCCCC
TCGTCCTGCTTCCCGGCCGTGATCAGCGTACGCTGGCGACCTGGTTCAGAAAATATCCGGAAATACAGGTTGTCTCGCGT
GATCGCAGTGGAGTCTATGCGACAGCAGCACGTGAAGGTGCACCTCAGGCCAGACAGGTGGCCGATCGATGGCACCTGCT
AAAAAATATTGGTGATGAGCCTGAACGAATGATGTACAGACATATGCCTCTGATACGTCTTGTTGTCAGAGAGTTATCAC
TGAAGAAATCACCTGAGCCAGAAATATCTGTGCCTGTAGCATCGCTCCGTCGTCTGGAACGCCTTAAACAGCACATCCGC
AAAAAACGGCATCAGCGTTGGACAGAGGTTATGGCCCTGCATAACAAGGGATGTAGTTTCAGGGAAATATCCCGTATTAC
AGGCCTGTCGCGAGTGACAGTCAGTCGCTGGGTGGGTTCAGGAACATTCCCTGAAATGTCAACCAGGCCTCCAAAGCGAG
GGCTTCTGGACCCATGGAGGGAGTGGTTAAAAGAGCAACGAGAATGTGGTAATTATAACTCCGGCCGGATATGGCGGGAA
ATGGTGGCCAGGGGGGTTACAGGCAGTGAAACCATCGTCAGGGATGCTGTTGCCAAATGGCATAAAGGCTGGATCCCACC
GGTTACTACTGCCGCAAGACTTCCTTCAGTGTCCCGGGTAAGCCGCTGGTTGATGCCCTGGAGAATAATCAGGGGTGAAG
AAAATTATGCTTTCCGATTTATTAGTCTGATGTGTGAAAAAGAACCGGAGTTGAAAATAGCGCAGCAACTGGTACTCGAG
TTCTACCGTATTCTGAAAACCTAA

Protein sequence :
MGLRCSADTLLRRLINTPETKQSGAPHVGIDEWAWHRGHCCGMLIVNPDTHRPLVLLPGRDQRTLATWFRKYPEIQVVSR
DRSGVYATAAREGAPQARQVADRWHLLKNIGDEPERMMYRHMPLIRLVVRELSLKKSPEPEISVPVASLRRLERLKQHIR
KKRHQRWTEVMALHNKGCSFREISRITGLSRVTVSRWVGSGTFPEMSTRPPKRGLLDPWREWLKEQRECGNYNSGRIWRE
MVARGVTGSETIVRDAVAKWHKGWIPPVTTAARLPSVSRVSRWLMPWRIIRGEENYAFRFISLMCEKEPELKIAQQLVLE
FYRILKT