PAI Gene Information


Name : ORF_55 (ORF_55)
Accession : AAZ04464.1
PAI name : PAI I APEC-O1
PAI accession : DQ095216
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : conserved hypothetical protein
Function : -
Note : similar to GenBank Accession Number AAG54658
Homologs in the searched genomes :   4 hits    ( 4 protein-level )  
Publication :
    -Kariyawasam,S., Johnson,T.J. and Nolan,L.K., "The pap Operon of Avian Pathogenic Escherichia coli Strain O1:K1 Is Located on a Novel Pathogenicity Island", Infect. Immun. 74 (1), 744-749 (2006) PUBMED 16369033.

    -Kariyawasam,S., Johnson,T.J. and Nolan,L.K., "Direct Submission", Submitted (15-JUN-2005) Veterinary Microbiology and Preventive Medicine, Iowa State University, 1802 Elwood Drive, VMRI 2, Ames, IA 50011, USA.


DNA sequence :
ATGTTTTCCAGATTATTTGTAAAAAAACATACTCCAGCTATGGTGGCAGGTAGTCTGATATTTCTGTTCATTATTCTGCT
GACATTTTTCTGGAAAAAAGCAGAGCTGGTGAATGACAGCCAGATCAGAGTTAATTTCGCACTTGGCTATATAGAAAACA
TTCTCAATCAGAACAACAGCATCAGCCAGGAAGCAGAACATTTGCTTCTGAATAACTGTAACGCAGATACTCAGCATGAA
TTAAGCAATCTGCTGCTGAAACGCCCACAACTTCGTGCACTGAGCCTGGCAAGACAAGAGGCTGTATTTTGTAGCACCCA
CCCCGGTTTACCTGTCGGTCCAGTTGCTGAAAAGGAACAGTGGCGGCACGATATGCTGATTCGTTTTCCTGAAGATACAG
GTACTCTGCCCTGGATCCTGTTAAGAACACCTTACAAAAACGGCACGGTCATTACCGCCACGGATTATTATTTTATTCAG
GACATCATCTCTGTGGTTCATGCGGTTCCTGCAATACGATTCCGTTTGGGAAACACGGTCCTGTCAGCCAGCGGAAAAAA
TGTCACTCTGCTCCCGGACGACAGCGGTATACAAAAAGAAAGTCATTCAAAAAAATATCCGTTCTCCCTGATTTATATCA
TCCCGGTGAAAATGCAACTTACCTACGCCTGGAAACAGGCCTGGTATATGATCCCTGTTGCCATCTTCGGTGGAATACTC
ACAGCCTTTCTGCTGTCCCGCCGACGTCCATCTTCCCCGCTCGATATGCTGAAAAATGCACTTGCTCACGGAGAGTTCAG
ACCTTATTTTCAGCCCATCATTTCCGCTAAAAATCACCAGCTAACCGGCTGCGAAGTACTGATTCGCTGGCATCACCCCG
TCAGCGGAATTATCCCACCAGATCAGTTTATCCCGCTGGCAGAGTCCACCGGATTAATTGCCCCGATAACGCAACAGTTG
ATGTCTCAGGTCGAACATAGCCTTAATTCTGTTGCGCATTTTTTACCGAATCAATTTCATATTGGGATCAATATTTCACC
AGCGCATTTTCTGTCTCCGGGACTGGAGGATAGCTGTATGAAGTTTCTATCCACCTTCCCGAAAGGGAAAGTGAAACTGG
TTCTGGAACTGACAGAACGAAACCAGCTATCCGTCACCGCAGAATCAAAAGCGTTATTTGCGAAATTACATCAACAGGGA
GTATTGTTTGCACTGGATGATTTTGGTACTGGCTATGCAACCCACAGCTACCTGCAGTCGTTTCCTGTCGATTACATCAA
ACTCGATAAAGGGTTTGTACAAATGGTCGGCGTGGATGAAATCTCCGGGCATATAGTAGAAAATGTTATTGAGCTGGCTC
GCCGGTTAGGTATTGATATCATTGCCGAAGTCATTGAAACCGAATCACAGGAATTGTTCATGACAGAGAAAGGAAGCTGC
TATCTGCAGGGATACCGCTATGCACCTCCACTACCCGCTGAACAGTTTATTGCTGAGTGGGTATACAACAGCGGAAACGG
CAAAAACCAGAGATAA

Protein sequence :
MFSRLFVKKHTPAMVAGSLIFLFIILLTFFWKKAELVNDSQIRVNFALGYIENILNQNNSISQEAEHLLLNNCNADTQHE
LSNLLLKRPQLRALSLARQEAVFCSTHPGLPVGPVAEKEQWRHDMLIRFPEDTGTLPWILLRTPYKNGTVITATDYYFIQ
DIISVVHAVPAIRFRLGNTVLSASGKNVTLLPDDSGIQKESHSKKYPFSLIYIIPVKMQLTYAWKQAWYMIPVAIFGGIL
TAFLLSRRRPSSPLDMLKNALAHGEFRPYFQPIISAKNHQLTGCEVLIRWHHPVSGIIPPDQFIPLAESTGLIAPITQQL
MSQVEHSLNSVAHFLPNQFHIGINISPAHFLSPGLEDSCMKFLSTFPKGKVKLVLELTERNQLSVTAESKALFAKLHQQG
VLFALDDFGTGYATHSYLQSFPVDYIKLDKGFVQMVGVDEISGHIVENVIELARRLGIDIIAEVIETESQELFMTEKGSC
YLQGYRYAPPLPAEQFIAEWVYNSGNGKNQR