PAI Gene Information


Name : unnamed
Accession : CAC39286.1
PAI name : LPA
PAI accession : AJ278144
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : hypothetical protein
Function : -
Note : ORF7
Homologs in the searched genomes :   42 hits    ( 42 protein-level )  
Publication :
    -Schmidt,H., Zhang,W.L., Hemmrich,U., Jelacic,S., Brunder,W., Tarr,P.I., Dobrindt,U., Hacker,J. and Karch,H., "Identification and characterization of a novel genomic island integrated at selC in locus of enterocyte effacement-negative, Shiga toxin-producing Escherichia coli", Infect. Immun. 69 (11), 6863-6873 (2001) PUBMED 11598060.

    -Zhang,W.L., "Direct Submission", Submitted (22-MAY-2000) Zhang W.L., Institut fuer Hygiene und Mikrobiologie, Universitaet Wuerzburg, Josef-Schneider-Str. 2, D-97080, GERMANY.


DNA sequence :
ATGAATAAAATATACGCGCTTAAATATAGCTCCCTTACTGGTGGGCTTATAGCTGTGTCAGAATTAAGTAAGAAGGTCAC
AGGAAAAACCGGCAGAAGATTAATGACGGTTTCCCTGGTATTATCAGTGACTCTTTCTGCTTTACCGGGTAAAGCATCAA
CGGTTAGCGCAGAAATACCATATCAGACTTTTCGTGACTTTGCTGAAAATAAAGGTGTGTTTACCCCCGGAGTGACAGGA
ATTGAAATAAACGACAACAATGGAAATAAAGTTGGGGTTCTTGATGTTCCCATGCTTGATTTTTCCAGTCTTTCTCGTGA
TGGTCATACCACATTGATTCATCCTGGTTATGTTGTATCGGCTAAACATGGTGGTTTACAAAGTGTTTCATCAGCAACTT
TTGGTTATGACCAGATATATAAAATAGTTGATAATAACCTTGCTGGTATAGATTTTTCTGCCCCACGATTAAATAAGCTT
GTTACAGAAGTAATTCCCGCAGATATACAGGGAAAGGATAAATTTAATAATAACCGGTATACGGCTTTTTACCGTGCGGG
CGTTGGCTCTCAATATATCCGTTATGCAAATGGCACAGATAAACTACTGCAGGCTTACACTCCAGAAAAGGCTTATCTGA
CCGGCGGAACAGTGGGGAAACCTTATTATACTCACTATAACGGTATGAAGATGATTTCAGCAAACCCGGGAAATACCTTT
GATAAAAACCAGGGACCTCTTGCCAGTTATGGACAGAGTGGAGACAGTGGTTCGCCGTTATATGCCTGGGATAACATTGA
CAAAAAATGGGTATTAGCTGGAGTTACTCTGCATAATTATGGAGTAAAAGGTGCACGAAATGACTGGCTCCTGATACCTC
ATGACTTTATCAGTCAAAAATTACAGGATGACCTTAAACCAATTATTGTTGCTTCTCCGGAGGAGAATATCTTACGCTGG
GAATTTGATCGTTCGAGAGGTACAGGTACTCTCAGTCAGGGAGAGAAGATTTTTTCCATGACTGGTAGTGTAAACGGAAA
TGCAAATACCGGGAACAATCTTGTGTTCTCAGGTAATGAAGGGAAAATCGAACTGGTATCCAGTGTGGAGCAGGGAGCCG
GATATCTTCAGTTTGATAAAGACTACACTGTACTGACAAATAATAACAGTACATGGACTGGTGCCGGAATTATTGTCGGT
GACGAGGCAAATGTCAAATGGGGAGTTAATGGCATTGCCGGTGATAATCTGCATAAAGTTGGTTCCGGGACATTAACTGT
TAATGGTCATGGTGAGAATAAAGGTGGCCTTAAAGTTGGGGACGGCGTTGTTGTTCTTGAGCAACAGCCAGATGCAAACC
AGAAACAACAGGCGTTTAGCCATATCAATATAGCCAGTGGTCGGGCAACAGTTAAACTTAACGGGGCAAACCAGGTAGAT
GCAGATAATATCAGCTGGGGATATCGGGGTGGGAAACTGGATTTAAATGGGTATGATTTTACCTTTTCCCGTCTTCAGGC
TGCAGATTATGGTGCTGAAATCAGCAACGATAATCAGACAGATAAATCCATAGTCACACTTTCGTTATCTCCTCTGAAAG
CAGAAGAAATAAATGTGGTTGTTAATAATATAAATATAATGGGGGGGACAGGCAAACCAGGTGATCTGTATTATACGACC
TTTGACGGAAATTATTATCTGTTGAAAAGTAACCGATATGGCAGCGCTTTGTTTGGCGCGCTGAATAATCAGAGCGAATG
GCAAAGGCTGGGTAAGGATAAAGAAAAAGCAATTGGGTTATATACTCAGATGAAAATGCAGGAAAGCGCTCCTTTATCAT
ATATATATCATGGAAAAATAACCGGTAATACCAGTGTGGAAATCCCCAAACTGGCAGGCAATGATATTTTAACGCTTGAT
GGCTCTGTCAGTATATCAGGAGATATGTCAAAACAGGACGGTGCTCTTATCTTCCAGGGACACCCGGTTATTCATGCAGG
GCAAACTGTTTCTGCATCGCAGAGTGACTGGGAGAACAGGGAGTTCTCACTCAACAATCTGAATCTTAATAATGCGGACT
TCAGTCTGTCCCGTAATGCATTTATGAACGGGAATATCAGGGCCGTTAACCAGAGCACTGTTATTATCGGCGGAGATACA
GTCTTTACTGATAAAAATGACGGAACAGGTAATGATGTCATCAGTGTTGAAGGGAAATCTGCTGCCGCAGGAACATCCTC
CTATACAGGGCATATCACTCTGGAGCAAAAATCAGCACTGGATATCCGCGATAATTTTCGTGGCGGGGTTACGTCTGAAG
ACAGTCATATCAATGTTTCTTCATCTTCAGTCCTGTTCTCAGATGCATCGTCATTTATAAACAGCTCCCTGAATATTCAT
AAAGGAGGTGCGCTGACCGCTCAGGGAGGGCTGTTTACAAGTGGAAGCATTGATATTGGTGACGCTTCCCTTCTGCTTAC
CGGTACACCAGTGAATTCAGATGATGCTGCTTTTTTACCGACCATCAATATGGCTGATGGCGGATTTAAACTGATGTCTG
ATTCATCAGTACTGAAAGCCAGAGACCAGGCATCTGTTGTTGGTGATATTATTTCTGATAAACAGGCCACAATCAGCTTC
GAAACTGAATCAGGTAAAGAGGGCATGTTGTCTGAGAAGGCATCCCGGGGACTCGCGGTAGGATTACTGAGTGGTTTTAA
TACGGCATACCGCGGTGCAATTCATGCCCCGTCAGCATCTGCCACTATGAACAACACCTGGTGGCAACTGACAGGAGACT
CCTCACTTCGCTCGTTAAAAAATACCGGAAGCATGACATATTTTACAGGAAGTGCAGCGAATAAAGCATTCCATACACTG
ACGGTTGATGAGCTGACGACGAATGGCACTGCGTATGCCATGCGTACGGACCTGAAAAATGCGGATAAGCTGGTAGTAAA
CCAAAAGCTGTCAGGTAAGGACAATATTCTGCTGGTTGATTTTCTGAACAAACCCACCGGAGAAAAACTGGATATTGAAC
TGGTGAGTGCACCGGGGAACAGCAGTAAGGATGTTTTTAAAGGAAGTGAACAGGCAATAGGTTTCAGCAATGTCACACCT
GTTATTACAGCTAAAGACGCCGGAGATAAAACAACATGGAACCTGACCGGGTACAGGATGGCAGAAAATCCCGCCGCAAC
CCAAAGTGCCTCAGGCCTTGCATCTGTGGGGTACAAATCATTTTTGAGTGAGGTCAACAACCTGAATAAACGTATGGGTG
ACCTGCGTGACATCAATGGTGAAGCTGGCGCATGGGCACGTATCATGAGCGGAACCGGCTCTGCCGGTGGTGGTTTCAGT
GACAACCACACACATGTTCAGGTCGGTGTCGACAAAAAACATGAGCTGGACGGACTGGATTTGTTTACCGGCTTCACTGT
CACACACACTGACAGCAGTGCCTCTGCTGATGCTTTCAAAGGTAAAACAAAATCTGTGGGGGCCGGACTCTATGCTTCCG
CCATGTTTGATTCCGGTGCCTATATCGACCTGATTGGTAAGTATGTTCATCATGATAATGAGTACACCGCAACCTTTGCC
GGACTCGGAATCCGTGATTACAGTACGCATTCATGGTATGCCGGTGCTGAAGCAGGCTACCGCTGTCATGTCACTGAGGA
TACCTGGATTGAGCCACAGGCAGAACTGGTTTACGGTGCTGTATCCGGTAAACAGTTTGCATGGAAGGACCAGGGGATGC
ATCTGTCTATGAAGGACAGGGACTACAATCCGCTGATTGGTCGTACCGGTGTGGATGTGGGTAAATCCTTCTCAGGTAAG
GACTGGAAAGTGACAGCCCGTGCCGGTCTGGGCTACCAGTTCGACCTGCTGGCTAACGGCGAAACCGTATTGCGGGACGC
ATCAGGTGAAAAACGTATCAAAGGTGAAAAAGACAGCCGTATGCTGATGTCCGTTGGCCTGAATGCAGAAATCAGGGACA
ACGTCCGCTTTGGACTGGAGTTTGAGAAATCCGCCTTTGGTAAGTACAACGTTGATAATGCTGTCAACGCTAACTTCCGT
TACTCGTTCTGA

Protein sequence :
MNKIYALKYSSLTGGLIAVSELSKKVTGKTGRRLMTVSLVLSVTLSALPGKASTVSAEIPYQTFRDFAENKGVFTPGVTG
IEINDNNGNKVGVLDVPMLDFSSLSRDGHTTLIHPGYVVSAKHGGLQSVSSATFGYDQIYKIVDNNLAGIDFSAPRLNKL
VTEVIPADIQGKDKFNNNRYTAFYRAGVGSQYIRYANGTDKLLQAYTPEKAYLTGGTVGKPYYTHYNGMKMISANPGNTF
DKNQGPLASYGQSGDSGSPLYAWDNIDKKWVLAGVTLHNYGVKGARNDWLLIPHDFISQKLQDDLKPIIVASPEENILRW
EFDRSRGTGTLSQGEKIFSMTGSVNGNANTGNNLVFSGNEGKIELVSSVEQGAGYLQFDKDYTVLTNNNSTWTGAGIIVG
DEANVKWGVNGIAGDNLHKVGSGTLTVNGHGENKGGLKVGDGVVVLEQQPDANQKQQAFSHINIASGRATVKLNGANQVD
ADNISWGYRGGKLDLNGYDFTFSRLQAADYGAEISNDNQTDKSIVTLSLSPLKAEEINVVVNNINIMGGTGKPGDLYYTT
FDGNYYLLKSNRYGSALFGALNNQSEWQRLGKDKEKAIGLYTQMKMQESAPLSYIYHGKITGNTSVEIPKLAGNDILTLD
GSVSISGDMSKQDGALIFQGHPVIHAGQTVSASQSDWENREFSLNNLNLNNADFSLSRNAFMNGNIRAVNQSTVIIGGDT
VFTDKNDGTGNDVISVEGKSAAAGTSSYTGHITLEQKSALDIRDNFRGGVTSEDSHINVSSSSVLFSDASSFINSSLNIH
KGGALTAQGGLFTSGSIDIGDASLLLTGTPVNSDDAAFLPTINMADGGFKLMSDSSVLKARDQASVVGDIISDKQATISF
ETESGKEGMLSEKASRGLAVGLLSGFNTAYRGAIHAPSASATMNNTWWQLTGDSSLRSLKNTGSMTYFTGSAANKAFHTL
TVDELTTNGTAYAMRTDLKNADKLVVNQKLSGKDNILLVDFLNKPTGEKLDIELVSAPGNSSKDVFKGSEQAIGFSNVTP
VITAKDAGDKTTWNLTGYRMAENPAATQSASGLASVGYKSFLSEVNNLNKRMGDLRDINGEAGAWARIMSGTGSAGGGFS
DNHTHVQVGVDKKHELDGLDLFTGFTVTHTDSSASADAFKGKTKSVGAGLYASAMFDSGAYIDLIGKYVHHDNEYTATFA
GLGIRDYSTHSWYAGAEAGYRCHVTEDTWIEPQAELVYGAVSGKQFAWKDQGMHLSMKDRDYNPLIGRTGVDVGKSFSGK
DWKVTARAGLGYQFDLLANGETVLRDASGEKRIKGEKDSRMLMSVGLNAEIRDNVRFGLEFEKSAFGKYNVDNAVNANFR
YSF