PAI Gene Information


Name : APECO1_3534 (APECO1_3534)
Accession : YP_854214.1
PAI name : PAI I APEC-O1
PAI accession : NC_008563_P4
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : P4-like integrase
Function : -
Note : integrase core domain
Homologs in the searched genomes :   520 hits    ( 519 protein-level,   1 DNA-level )  
Publication :
    -Johnson,T.J. and Nolan,L.K., "Direct Submission", Submitted (14-SEP-2006) Veterinary Microbiology and Preventive Medicine, Iowa State University, 1802 Elwood Drive, VMRI 2, Ames, IA 50011, USA.

    -Johnson,T.J., Kariyawasam,S., Wannemuehler,Y., Mangiamele,P., Johnson,S.J., Doetkott,C., Skyberg,J.A., Lynne,A.M., Johnson,J.R. and Nolan,L.K., "The genome sequence of avian pathogenic Escherichia coli strain O1:K1:H7 shares strong similarities with human extraintestinal pathogenic E. coli genomes", J. Bacteriol. 189 (8), 3228-3236 (2007) PUBMED 17293413 REMARK Erratum:[J Bacteriol. 2007 Jun;189(12):4554].

    -Johnson,T.J., Kariyawasam,S., Wannemuehler,Y., Mangiamele,P., Johnson,S.J., Doetkott,C., Skyberg,J.A., Lynne,A.M., Johnson,J.R. and Nolan,L.K., "Direct Submission", Submitted (08-NOV-2006) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.


DNA sequence :
ATGGCACTGACTGACGCAAAAATCCGGGCTGCAAAGCCCACTGACAAGGCTTATAAACTCACTGACGGGGCCGGCATGTT
CCTGCTGGTTCATCCTAATGGTTCCCGTTACTGGCGTCTCCGTTATCGTATTCTGGGTAAGGAGAAAACCCTGGCACTTG
GTGTGTATCCGGAAGTTTCTCTCTCCGAAGCTCGTACAAAACGGGATGAGGCCCGAAAACTGATTTCAGAGGGGATTGAC
CCTTGCGAACAGAAAAGAGTTAAAAAAGTAGTCCCTGATTTACAGCTCTCTTTTGAACATATTGCACGACGCTGGCATGC
CAGTAATAAACAATGGGCACAATCACACAGCGATAAAGTACTCAAAAGCCTCGAGACACACGTTTTCCCCTTTATCGGCA
ACCGGGATATCACAACACTCAGTACCCCGGACCTGCTTATTCCTGTTCGTGCTGCAGAAGCAAAACAAATTTATGAAATC
GCCAGTCGTCTGCAGCAAAGAATATCTGCTGTAATGCGTTACGCCGTACAGTCTGGCATCATCAGATATAACCCGGCTCT
GGATATGGCTGGTGCATTGACCACGGTAAAACGCCAGCATCGCCCCGCTCTTGATCTTTCTCGCCTGCCTGAACTTTTGT
CGCGTATTAGCAGTTACAAGGGGCAACCTGTCACCCAGCTTGCCGTTATGCTGAATTTACTGGTTTTTATTCGTTCCAGT
GAACTCAGATACGCCCGGTGGTCTGAAATTGATATTGACAATGCCATGTGGACTATTCCAGCCGAACGCGAACCTCTGCC
CGGCGTAAAATTCTCACACCGGGGCTCCAAGATGCGAACACCACATCTTGTGCCACTCAGCAAACAGGCTGTAGCCATAC
TGACAGAACTTCAGACATGGGCTGGTGAAAATGGTCTGATATTTACGGGTGCACATGACCCGCGTAAACCAATCAGTGAA
AATACTGTAAATAAGGCCCTGAGGGTGATGGGGTATGACACAACCCAGGAAGTCTGTGGCCATGGATTCCGGGCGATGGC
GTGCAGTGCATTGATTGAATCAGGTTTGTGGTCCCGCGATGCTGTGGAACGTCAGATGAGCCATCAGGAGCGTAATGGTG
TACGTGCTGCGTATATCCATAAAGCAGAACATCTGGAAGAACGGCGACTGATGTTGCAATGGTGGGCCGATTTTCTGGAT
GCGAACCGGGAGAAGGGTATCAGTCCGTTTGAATATGCAAAGATTAACAATCCATTAAAATAG

Protein sequence :
MALTDAKIRAAKPTDKAYKLTDGAGMFLLVHPNGSRYWRLRYRILGKEKTLALGVYPEVSLSEARTKRDEARKLISEGID
PCEQKRVKKVVPDLQLSFEHIARRWHASNKQWAQSHSDKVLKSLETHVFPFIGNRDITTLSTPDLLIPVRAAEAKQIYEI
ASRLQQRISAVMRYAVQSGIIRYNPALDMAGALTTVKRQHRPALDLSRLPELLSRISSYKGQPVTQLAVMLNLLVFIRSS
ELRYARWSEIDIDNAMWTIPAEREPLPGVKFSHRGSKMRTPHLVPLSKQAVAILTELQTWAGENGLIFTGAHDPRKPISE
NTVNKALRVMGYDTTQEVCGHGFRAMACSALIESGLWSRDAVERQMSHQERNGVRAAYIHKAEHLEERRLMLQWWADFLD
ANREKGISPFEYAKINNPLK