PAI Gene Information


Name : ORF_3 (ORF_3)
Accession : AAZ04414.1
PAI name : PAI I APEC-O1
PAI accession : DQ095216
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : conserved hypothetical protein
Function : -
Note : similar to GenBank Accession Number CAD42019
Homologs in the searched genomes :   21 hits    ( 15 protein-level,   6 DNA-level )  
Publication :
    -Kariyawasam,S., Johnson,T.J. and Nolan,L.K., "The pap Operon of Avian Pathogenic Escherichia coli Strain O1:K1 Is Located on a Novel Pathogenicity Island", Infect. Immun. 74 (1), 744-749 (2006) PUBMED 16369033.

    -Kariyawasam,S., Johnson,T.J. and Nolan,L.K., "Direct Submission", Submitted (15-JUN-2005) Veterinary Microbiology and Preventive Medicine, Iowa State University, 1802 Elwood Drive, VMRI 2, Ames, IA 50011, USA.


DNA sequence :
ATGAATATCAGAAAACTGTTTTGTCCGGGCAATACACCCCGGATTTTATTGTTTTTATTCTTTTTTGTTGTTTCTGTCAT
AACCACAATTGCATGCGGATACACTGAGAAGAATGCCACCGGGAATGTGTTGCTGCTGTTCCTTCTTCTGCTCCTTGCAC
ACAGAAATACCCTCACATCCATTACAGCGCTGTTATTTCTGTTCTGTTGTGCACTGTATGCGCCTGCCGGTATGACGTAC
GGTAAAATCAACAACAGTTTTATTGTCGCGTTGTTGCAGACCACAACTGATGAGGCAGCGGAGTTTACCGGGATGATTCC
TGTTTATCATTTTCTGGTCAGTGCCGCGATTCAGGTATTCATGGTGATTTTCTGGCGAACACACCGCCGTGGTCACCGTA
ACTGGCTGGCACTGCTGCTGTTCGTATTATGCTCAGTAAACAGCTGGCCGTTGCGGATGGTTAAAGGAACTGTTGTGGGG
ACAACTGACACATTGCGTGAAATGCAGCGTTATAAACAACTGAATCAGCACGGGGCTGACAACTGGAAAATCCTGCCGGG
TGTACCGTTGTATGACACGATTGTTATCGTCACTGGTGAGAGTGTGCGCAGGGATTATATGTCGGTATATGGCTATCCCG
TGCCAACCACGCCGTGGCTGAATACAGCTCCCGGTTTATTTATTGACGGCTATACATCGGCAGCAGCCAGTACCGTACCT
TCCCTGAGCCGGACACTGATTTATGACTATGAGCAGAACCCTGACTCCGGCAACAACGTGGTGGCGCTGGCAGCAAAAGC
AGGATACAGCACATGGTGGATATCCAATCAGGGAAAACTGGGGGAGCATGATACACGCATCTCTGTTATTGCTTCTGATG
CGGAGCATACCGTTTTCCTCAAGAAAGGCAGCTTTGCTTCCCGTAAAACGGATGACATGTTGTTGTTACAGGAAACAGAA
CGTGCACTGGCGGATAAATCCTCTCCGAAGGTGATATTCCTGCACATGATGGGGTCTCATCCGAACCCGTGTGACCGGCT
TCACTCCTGGCCGAATCATTACCTGGAGCAGTATCCCCGAAAGGTTGCCTGTTACCTCGCCAGCATCAGTAAACTGGATA
ACTTTCTCGGTCAGCTTGATGGTATCCTTCGCCGGCATTCCCGTCATTTCGCCATGCTTTACTTTTCTGACCATGGTCTG
TCGGTCAGCGACAGCGCTAATCCTGTTCATCATGATGGTCATGTGCAGGGAGGCTACAGCGTTCCCCTGATTATTACCGC
CAGTGACATCACGTCTCATCAGTCCGTCAGCAGAAAAATCAGTGCCCGTCATTTCGCAGGTATTTTTCAGTGGCTGACCG
GCATTCGTACTGAAAATATACCGCCATTCAATCCGCTGACAGACGAAGATAATGAACCCGTTATGGTTTTTAACGGGGAG
AGGAATGTACCGGCAGACAGTCTGAAACCGCAGCCTCTGATTCTTCCGGACAGAAGATAA

Protein sequence :
MNIRKLFCPGNTPRILLFLFFFVVSVITTIACGYTEKNATGNVLLLFLLLLLAHRNTLTSITALLFLFCCALYAPAGMTY
GKINNSFIVALLQTTTDEAAEFTGMIPVYHFLVSAAIQVFMVIFWRTHRRGHRNWLALLLFVLCSVNSWPLRMVKGTVVG
TTDTLREMQRYKQLNQHGADNWKILPGVPLYDTIVIVTGESVRRDYMSVYGYPVPTTPWLNTAPGLFIDGYTSAAASTVP
SLSRTLIYDYEQNPDSGNNVVALAAKAGYSTWWISNQGKLGEHDTRISVIASDAEHTVFLKKGSFASRKTDDMLLLQETE
RALADKSSPKVIFLHMMGSHPNPCDRLHSWPNHYLEQYPRKVACYLASISKLDNFLGQLDGILRRHSRHFAMLYFSDHGL
SVSDSANPVHHDGHVQGGYSVPLIITASDITSHQSVSRKISARHFAGIFQWLTGIRTENIPPFNPLTDEDNEPVMVFNGE
RNVPADSLKPQPLILPDRR