PAI Gene Information


Name : unnamed
Accession : AAP70297.1
PAI name : HPI
PAI accession : AY233333
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : VC0180-like protein
Function : -
Note : ORF23; similar to Virbrio cholerae hypothetical protein VC0180
Homologs in the searched genomes :   5 hits    ( 5 protein-level )  
Publication :
    -Schubert,S., Dufke,S., Sorsa,J. and Heesemann,J., "A novel integrative and conjugative element (ICE) of Escherichia coli: the putative progenitor of the Yersinia high-pathogenicity island", Mol. Microbiol. 51 (3), 837-848 (2004) PUBMED 14731283.

    -Schubert,S., Dufke,S., Sorsa,J. and Heesemann,J., "Direct Submission", Submitted (11-FEB-2003) Department of Bacteriology, Max von Pettenkofer-Institut, Pettenkoferstr. 9a, Munchen 80336, Germany.


DNA sequence :
ATGAAGGACGGGCAGCTCCATCAAGTGATGACGGGATGCGGGTATCGCTATACGCGTGCCCGCAATCTTCCTGAAAAATC
GATTCTGCACAGCAGGGAGCGAGGTGCAGGATATTACACGAAAGAATACGCGACTGATGCCGGTAATTTTAATGTTGCAT
TGGTGATTCATCCCGACCCTTTCACAGAACTGCCGACAGCATTCATTATTGAGCAACCTGAACAGTTCAAAAGTTGCCTT
ATGCCCCATGTCGCCCTGGAAGGGTTCCTTTGCTATGTTGAGCAGATGGAAGCAGACTGGGATTCCAACGATCTGGAAGC
GACATATAAAGAAGTTGATGCACAGATACACCAGACCCTCATTGATTCGGTATCCGCTGCTACGCAAGGGGTAAATGATA
AGAGAGAACTGGAAGGGGAGTTTGCTGCATACTGGCGTCCCAGCGAGACACTCTTTCTTCTGTCAAACGCAAGTCGGGGG
ACCACACTTAAAACCTCTTTGGCAAAATTACTTAAGTCTGATGGAACTACCAGACAGGAGTACATAACGGTTGAGGAATC
ATCCCCGGAGGACTCGGAAGCCGTTATGACCAAGTGGCTCAAACAAAGGTATTTCCCCAGAACTTCGTTAAAAGAAATAC
CCATCAGCACGCATTACATTTCTGTAAATCCCAGCCGTCTAGCCGGGATGAAATGGCCACCGGCTTCATTCCGGGACTTA
CTGGAATGGCTTGAAAAATCAGACCACAATGCCAGAGATAGAGTCATTGAAAACATCAAAGCCGAAGGGAAGAAAAGGTA
CATTTTTCTCTTTGATGTTTTAAACCAGGACATCTTAGCTATCTATGTTGAGTTCAATGCGCAGTCTGTTGATTTCAGAA
GGTACCGGAAGTCAGCCAAAAACTCAACGGTCAAGCTTGCGGCGATGTTGGGTGGCAAAAGTGTTTGCACGGAATATCAA
AGGCTGGGTGTCATTCGTGCTGACATTGCGACGCTCCTTAGCCGGAATACACGACGGAAGGGGGCAGTAAGTCTTTCAAC
AAAGCGTATAGCGTTGATAGGTTGTGGAACAATTGGAGGGTATCTGGCCGAATTATTACTACGCAACGGAGCCGGATGTG
GTAAGGGTTATCTGCATCTTTATGATGATGATATCTACAAACCCAGTAATTTTGGCCGTCATACCCTATCGTCACATGAT
TTCGGGTGGCCTAAATCCCTTTCACTTGCGGCAAAGCTACAGGAGTCTGTGCATTTACAGACAAAGGTTGTAGGATTTAT
GGAACAATTCCGTATCAGCGCAGATGAAATGCAGAAGTACGACATCATTATTGATGCGACAGGACGCCCACCTGTGTCAA
AACGCATCGCTGCGGTGGTTAGACAGATTCCACTGGAGCAAAGACCATATATTATTCACGCGTTCAATGATGGCAACGGC
GGGCATCAAAAGTCTTTATTGATGATGGCCGAAGCTGTTACGGGTGTATGGTTTCTAACCCTGCAAAATACCATAAAGGA
ACCGATTCAAGGTTTAATGATCTTGATATATCAAGCGAAAAAAATAAAAGTTGTGGCAGCACTTACACCCTTTATGATGC
AGCAGTTAGTAGCATAA

Protein sequence :
MKDGQLHQVMTGCGYRYTRARNLPEKSILHSRERGAGYYTKEYATDAGNFNVALVIHPDPFTELPTAFIIEQPEQFKSCL
MPHVALEGFLCYVEQMEADWDSNDLEATYKEVDAQIHQTLIDSVSAATQGVNDKRELEGEFAAYWRPSETLFLLSNASRG
TTLKTSLAKLLKSDGTTRQEYITVEESSPEDSEAVMTKWLKQRYFPRTSLKEIPISTHYISVNPSRLAGMKWPPASFRDL
LEWLEKSDHNARDRVIENIKAEGKKRYIFLFDVLNQDILAIYVEFNAQSVDFRRYRKSAKNSTVKLAAMLGGKSVCTEYQ
RLGVIRADIATLLSRNTRRKGAVSLSTKRIALIGCGTIGGYLAELLLRNGAGCGKGYLHLYDDDIYKPSNFGRHTLSSHD
FGWPKSLSLAAKLQESVHLQTKVVGFMEQFRISADEMQKYDIIIDATGRPPVSKRIAAVVRQIPLEQRPYIIHAFNDGNG
GHQKSLLMMAEAVTGVWFLTLQNTIKEPIQGLMILIYQAKKIKVVAALTPFMMQQLVA