PAI Gene Information


Name : S4
Accession : AAQ19127.1
PAI name : PAI I CL3
PAI accession : AY275838
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : putative adhesin/hemagglutinin/hemolysin
Function : -
Note : similar to Yersinia pestis CO92 putative hemolysin and putative adhesin encoded by GenBank Accession Number AJ414152 and to Yersinia pestis CO92 putative adhesin encoded by GenBank Accession Number AL590842
Homologs in the searched genomes :   26 hits    ( 26 protein-level )  
Publication :
    -Shen,S., Mascarenhas,M., Rahn,K., Kaper,J. and Karmali,M.A., "Direct Submission", Submitted (14-APR-2003) Laboratory for Foodborne Zoonoses, Health Canada, 110 Stone Road West, Guelph, Ontario N1G 3W4, Canada.

    -Shen,S., Mascarenhas,M., Rahn,K., Kaper,J.B. and Karmali,M.A., "Evidence for a hybrid genomic island in verocytotoxin-producing Escherichia coli CL3 (serotype O113:H21) containing segments of EDL933 (serotype O157:H7) O islands 122 and 48", Infect. Immun. 72 (3), 1496-1503 (2004) PUBMED 14977955 REMARK Erratum:[Infect Immun. 2004 Jul;72(7):4330. Karmal MA [corrected to Karmali MA]].


DNA sequence :
GTGTCTGACGGGCTGTCCGTGACGCTGGATGGTGCGCTGGATAATACGTCCGGCAGACTGCTTTCACAGAAAACGCTGTC
TGTGTCAGGCAGCGAACTGGTCTCTGATGATGGTCTGATTCAGTCCGGCAGTGACATGACGCTGGATGTACAGGACGGGG
TACTCAGTAACCGGAACACGAAAACACGGGGTGGTATCAGCAGCGCCGTACCGTACAGTGCGGGCTGTATGCTGAATAAC
CAGCAGGGATTTATAGTGGGTCAGAAGGATATGACCCTGAACGCCGGAACGCTCGATAACCGTCAGGGCGTGCTGGGGAG
TCAGGCGTCATTGCAGATATCGTCAGGTACGCTGATGAACCAGAAAGGCGCACTGAAGGCCGGGACGGATATGTTGCTGA
GTGGTGGTGATGTCAGTAATCAGGAAGGGACGCTTGCCGCCGGCAGAGACCTGAATGCGCACCTGAATGTGCTGGAAAAT
CAGCAGGGGACGGTGGTCAGCAACGGTAACAGCAGGCTTGATGTCACACGGTTTGATAATCAGGGCGGCAGACTGGTGGC
ACAGCAGTCACTGACCCTTTCCTCGACGGATATCATCAACGATGCCAGTGGCCTGATACAGAGTGGTGCATCGCTGAATC
TCCGTGCTGACACACTGAGTAACCGTAACAGTGGTGACCGGGGCGGGGTTATCAGTCAGGGCCCGATGACGCTGAATGCC
GGCACACTGGACAGCACTGCCGGTGTGCTTTTGTCGGGAGATGCTCTGTCCCTGACTGCAGGCGTGGTTAACAATACATC
CGGTCAGGTGGTGGCAAACGGTCTGCTCGGCTGGAACAGCCAGGCGCTGAACAATCAGTCCGGTCTTATTCAGGGGAGGG
GTATCAGCATTAATACCGCAGGCCAGACGCTGGATAACAGACGCGGTACGCTGAACAGCCTGCAGGAACTGACTGTCAGT
ACCGGAGCAATGGATAACCGGGGCGGGACCGTGGGGGCGAAAACGACAGCTGACCTGAGCACCACCTCGCTGGATAACCG
TGAGGGGGGCCGGCTGGTCAGTGAAGGTGAACTCCGTCTGCATACGGGAGGTCTGCAGAACAGCCACGGACAAATACAGT
CTGTGGGCGACATGTTGCTCAACAGTGTACGGGGCGTTGTGGACAACGTCTCTGGTCTGATTCGCAGTGGTTCCGCCATT
ACGCTGAACGCCCTGCAGTTTATCAATCGCCACACGCAGAATACCGGCCAGGGGCTGGAAGCGCAGACAATACACATCAC
GACGCAGGACCTTGATAACCAGGAGGGCAGTATTCTGGCAGACAGGGCGCTTACGGTTATGGCAGACAGGACGCTCAGTA
ATAATGACGGCGTCCTCTCTTCCGGCGCGACATTGTCTGTCAGTGGCCGTCAGCTGGCGTTTTCGAACCGTGACGGCGTG
GTTAAGGCCGGGCAGTCAGTATCTGTGGATGCCGGTCAGCTCGGTGGTGACGGGAAACTGCTTTCGCTGGGCGATATGAC
GCTGAAAAGTAACACAACGTTCAGCAACAGTGGTCAGACTATTGCGAACGGAAACCTTACGCTTTCCGTTAATGGCGATG
TCTCTAATACCGGCAGCCTGCTGGCAGGCAGCCGGCTTGACCTGAACAGCATCCGCCTTGAAAACACAGAGAAAGGTGAA
ATCAGTGCCGGGCAGACCTGGCTTAACGTGACGGACACACTGCTGAACCGTGGGCTGATTGACGGGAAATATACGCGTCT
TCAGGCGAACACCCTGACGAATTCCGGTACCGGGCGAATCTACGGCGATGCTGTTGGTGTCAGCGCTGCCACATTTAATA
ATCTGGAAGAAAATGGCGTGGCGGCGACGCTGGCCGGGCGTGAGCGTGTGGACTTAGGGGTACAGACCCTGAATAACCGG
ACGCACAGCCTGATTTACAGTGCAGGTGATATGCATACTGGTGGCATGCTGGATGCGAATGGTGCCGCCACGGGGAAAGC
CGGTGTACTGAATAATCACAGTGCCACTATTGAAGCCGCCGGATATCTGGTACTGAGTGCCGGACAAATCAATAACGTCA
ATGACCATTTCACCACGGAGCGCGTGGTGGTTTCCACGGAAAAGGTGACTGAGTATCAGCTCAGTGGCTCAGATAAACGC
TGGAGCGCCGGTGAACCGGGCGTTTATGTGGATAACGATTCCAGTAACTCACTGAAAAAACTGCATACACCGGAAGGCGC
CAGAGATAAATTCACGCAGTACGATTACACCCGTACCGTGGAAGAAACCCGGGTAAAAGAATCCGACCCGGGAAAAATTC
TGTCCGGTGCCGGCATGACGATTGTGGCAGACAAACTGCTTAATGATAAGAGTCAGGTTGTTGCAGGCGGCCTGCTGACC
ATTCCGTCAGGAAGCGTGGAAAATGTCAGTGTCAGTGGCGAGCGGCATGTTACGGACTCAGGAACATCCACGTATTATTA
CCGTATCAGGAAAAAAGGAAAGGATAAGCAGGGGGAGAAAACCTCTCAGTACACTCCGCCCACTGTAATTCAGACCATCA
CGCTGAAGCCGGGAGAGCTCACCAGCCACGGACAGGTTCAGGGCAGCCATGTCACGCTTTCTCCCCTGAAGCCACAGGGA
ACGGATGTACAGACAGGACTGACCGGAAATGTGGATGCGACAGTAGCCGGTACTGACCGGATACCGCTTCGCCCGGTGGT
GTCTGCCGGTGAACCGGTCATTCTGTTGCCGGGGCAGCAGTTTGAAGTCAGCGCACCTCAGGGCAGTATCCATGTTGCCG
GGCCGGATACCCGTCTGCCGGACAGCAGCCTGTTTAAAACAAATCCGGCAGTGAATGTGCCATATCTGGTTGAAACGGAC
CCGCGTTTTACGAACCAGAAGACGTGGCTGGGCAGTGATTACATGCAGAAGGCGTTTTCTCAGAACGGGGATAACATGCT
CAAACGCCTGGGGGATGGTTTTTACGAACAGCGTCTGATTCGTGAGCAGGTTGTCGCTCTGACCGGGCAACGGTACCTGG
ACGGATACAGTAATGATGAAGAGCAGTTTAAGGCGTTAATGGATGCGGGTATTGCATTTGGTAAGCAATACAACCTGACA
CCAGGTGTGGCGCTGACTGCTGAGCAGATGGCGTTACTGACCGGTGATATTGTCTGGCTGGTTAACACCACGGTGACACT
GCCGGACGGCAGCACGCAGACGGTTCAGGTACCTCAGGTGTATGCCCGTGTAAAACCCGGTGATGTGAACAGCGCCGGTG
CGCTGATTGCCGGCAGGGACATGGTGATGAAGCTGGATGGTGACCTGTTTAACAGCGGTAAACTGGCCGGGAAGCAGACC
GTGCAGCTCAGTGCAGAAAACATTCACAACCAGGCCGGTACCATTCAGGGAGCAAATGTCAGCCTGACGGCCCGGACGGA
TATTAACAGTACCGGCGGACTGTTGCAGGCTACGGACAGCCTGCTGGCGATGGCCGGACGGGATATCAGTCTGACCACCA
CCACCCGTACGGCACAGCGCGACGCCGGGCAGAACCATTTTGAACGTACCAGTATCGACAGTGTGGCCGGCGTGTATGTG
CAGAACGACCAGGGAAGACTGGTGCTGCAGGCCGGACGGGATATGAACCTGACGGCTGCAACGGTGGTGAATCAGGGTAA
GGACAGCCTGACGCAACTCAGTGCGGGCAGGGATATGACGCTTTCCACAGTGACCACATCCGCGCAGGACAATATTACCT
GGGACAAAAATAATCGTCTGTCGCAGGGCGTCACACAGAGTACGGGCAGCACGCTTGCCGGTAACGGTGATGTCACGCTG
ACCGCCGGACGGGACATGACGTCACAGGCGGCATCGCTGTCAGCTCAGAAAGGCCTGGCCCTGATGGCGGGTCATGACGT
GACGCTGACCGGTGCACAGAATACCCGCTCGCTGGATGAATACCATAAAGTGACCGGCAGCAGCGGGATGTTATCGAAAA
CCACCACTACCACGCATGATGTCACTGACCGCCGGACCATGACAGGCAGTGAACTGAACGGGGATACGGTCAGTATCAGC
GCCGGACATAATCTGAACGCCACGGGCAGCAGTGTGGCCGGGGATAACCGTGTCTCCCTGGCTGCCGGAAATAACCTGAA
TATCGGTACGCTGACAGAAAGCAACCGTGAAACACACCTGAAACAGGAGAAAAAATCCGGGCTGATGAGCTCCGGAGGTG
TGGGATTCAGCATGGGCAGCCAGAGTCTGAAAGTCACGGACACCGCAACAGACACCACACAGAAAGGCAGCACGGTGGGG
AGTGTTCACGGTGATGTTTCCCTTCAGGCAGGTAACCGGTTGACTGTCAACAGCTCTGACCTGATTGCCGGCAGGGACAT
GGCACTCAGTGGTAAGGAAGTCAGCATTACCGCCGCCACAGACCAGCATGTACAGACACATACGGTGGAGCAGAAAACCT
CCGGTCTGACGCTTGCCCTGTCCGGCACAGCGGGCAGCGCCCTGAATACGACGGTGGAGACCGTACAGGCGGCGAAATCT
GCCGGAAACAGCCGGCTTGAAGCCCTGCAGGGGGTGAAGGCCGCGCTCTCGGGGGCTCAGGCCGTGCAGGCTGGACGACT
GGCAGACGCGCAGGGCGCCGATGCCGGAAATAATAACACGGTGGGCATCAGCCTCTCTTACGGCAGCCAGTCTTCGAAAT
CAGAACAGCAGTCAGAGCAGACGGTGGCGAAGGGCAGTACACTGACCGCAGGTAATAACCTCAGTATTCAGGCCACCGGC
TCCGGCGTGAAGGGCGTGGATGGTGACCTGACGATACAGGGCAGTCAGATAAAGGCCGGGAACAACATTCTGCTGCAGGC
AAACCGTGACGTGAATCTGGTATCAGCAGAAAACACTTCAAAGCTGGAAGGAAAGAACACATCCAGTGGTGGCAGCGTGG
GTGTCGGTGTTGGTGTGGGCTCCGGAGGCTGGGGTATCAGTGTTTCAGCCAGCGCAAACCAGGGTAAGGGCAGTGAAAAA
GGCAACGGTACCACCCATACAGAAACCACGGTGGATGCGGGAAACCGGCTGACCATTATCAGCGGGCGGGACACCACGCT
TACCGGCGCGCAGGCTGGCGGTGAAACGGTGAAAGTGGATGCCGGTCGCCATCTGACACTGACCAGTGAGCAGGACAGTG
ACCGTTATGACAGCAAGCAGCAGAATGCCAGTGCCGGAGGCAGCTTCACTTTCGGGTCCATGAGCGGTTCGGCGAGTGTG
AACCTCAGCCGGGATAAAATGCACAGTAACTACGACAGTGTACAGGAGCAGACGGGGATTTTTGCCGGCAGGGGCGGGTT
TGATGTCACCACCGGACAGCATACGCAACTGAACGGTGCGGTGATTGCGTCCACGGCAACGGCAGATAAAAACAGGCTGG
ATACCGGCACACTGGGCTTCAGTGACATTGAAAACCGGGCCGACTTTAAAACGGAACATCAGAGTGCAGGGCTCAGTACC
GGTGGCAGTGTTGCCGGAAATTTCCTGGGGAACATGGCAAACAATCTGCTGGTCGGTGCAAATCATGAAGGCCATGCGGA
CAGTACCACGCAGTCGGCAGTGTCTGCAGGGAATATCACCATCCGGGATACGCAGAGCCAGAAACAGGATGTGGCAGACC
TGAAGCGTGACGCAGCACATGCAAACCAGACGCTGTCCCCCATCTTTGACAGGGAAAAAGAGCAACAGCGACTGCAGCAG
GCGCAACTCATTGGTGAGATTGGTAACCAGGTGGCAGATATTGCCAGAACGGAAGGACAGATAGCCGGTGAGAAAGCGAA
GCGGGACCCGGCAGCACTGAATCAGGCCCGCGCAGAGCTTGAGGCAGCAGGAAAACCGTTCACGGAACAGGATGTGGCGC
AGCGGGCCTACAATAACGGTATGGCGGCCCCAGGTTTTGGAACGGGAGGCAAATACCAGCAGGCGATACAGGCCGCAACG
GCAGCAGTACAGGGGCTGGCCGGCGGTAATCTGAGTGCAGCCCTGGCAGGTGGTGCAGTACCGTATATTGCTGAAATTAT
CAAACAGACCACCCCGGACGGTGCGGGACGTGTGGCAACTCATGCAGTAGTGAATGCAGCCCTGTCTCTGGCACAGGGTA
AGAATGCTCTGGCTGGGGCCGCAGGCGCAGCCACCGGCGAGGTGGTGGGAATGATAGCCACACAGATGTACGGTAAGCCG
GTGAGTGAACTGAGCGAGACAGAGAAGCAGACGGTCTCGACACTGGCAACGGTTGCTGCCGGCCTGGCCGGTGGTCTGGT
GGGAGACTCGGGGGCTTCAGCGGTAGCCGGCGCACAGTCAGGCAAGACAACGGTTGAGAATAATGCGCTCAGCTTCGGTG
ATGGTTTTGAAAGCAATGCAGCTGCGGCTACATCATGGAACAAGTACGCGGTGGACAATAATCTGACACCGGAACAAACA
CAGGCTGGGCTGGATAAGATAGCGAAAGGTGATATGCCGGACAGTACCAATATCACGAAAGTGATCGTGGATGGATATCA
GGATGGTGTAATGATTGCCGGGGCATGGTATCTGGGGTCTGCTGCGTCAGCAGGTAAAGTTCTTGGTGGCGGGCTTCTTG
GACTGGCGGCCAATAGTGGTTATCAGATCTATGATTTGAACCAGCCGCAGAATGCGAATAAGTCGTGGGACTATCTGGGA
AGCGCGACCTCATTTACAACAGGTATGATGGCTCCCGATCGTGGTGTCCTTGCGAACACAGGTATCGCAATGGGTGGGGC
ATTCTTTACTGATGGTCCAAATACAGCTTCACTGGCCGGGGCTGGTATTGGTGCTGGTCTGGGTGGGGCATTTGGGAAGT
ATGCCCCCACAGCCGTGGGGAAAATCTTAGGGAATGATCCTGTTCCGGGCTTTATTTATGAGCTTGGCGGTGGTGCTGTC
TCTGAGTTTACCAATGGAATTATCAAGGATTTCAATAATCCTCAAGCTCCTGAAAAAAAGGAAAATAAATGA

Protein sequence :
MSDGLSVTLDGALDNTSGRLLSQKTLSVSGSELVSDDGLIQSGSDMTLDVQDGVLSNRNTKTRGGISSAVPYSAGCMLNN
QQGFIVGQKDMTLNAGTLDNRQGVLGSQASLQISSGTLMNQKGALKAGTDMLLSGGDVSNQEGTLAAGRDLNAHLNVLEN
QQGTVVSNGNSRLDVTRFDNQGGRLVAQQSLTLSSTDIINDASGLIQSGASLNLRADTLSNRNSGDRGGVISQGPMTLNA
GTLDSTAGVLLSGDALSLTAGVVNNTSGQVVANGLLGWNSQALNNQSGLIQGRGISINTAGQTLDNRRGTLNSLQELTVS
TGAMDNRGGTVGAKTTADLSTTSLDNREGGRLVSEGELRLHTGGLQNSHGQIQSVGDMLLNSVRGVVDNVSGLIRSGSAI
TLNALQFINRHTQNTGQGLEAQTIHITTQDLDNQEGSILADRALTVMADRTLSNNDGVLSSGATLSVSGRQLAFSNRDGV
VKAGQSVSVDAGQLGGDGKLLSLGDMTLKSNTTFSNSGQTIANGNLTLSVNGDVSNTGSLLAGSRLDLNSIRLENTEKGE
ISAGQTWLNVTDTLLNRGLIDGKYTRLQANTLTNSGTGRIYGDAVGVSAATFNNLEENGVAATLAGRERVDLGVQTLNNR
THSLIYSAGDMHTGGMLDANGAATGKAGVLNNHSATIEAAGYLVLSAGQINNVNDHFTTERVVVSTEKVTEYQLSGSDKR
WSAGEPGVYVDNDSSNSLKKLHTPEGARDKFTQYDYTRTVEETRVKESDPGKILSGAGMTIVADKLLNDKSQVVAGGLLT
IPSGSVENVSVSGERHVTDSGTSTYYYRIRKKGKDKQGEKTSQYTPPTVIQTITLKPGELTSHGQVQGSHVTLSPLKPQG
TDVQTGLTGNVDATVAGTDRIPLRPVVSAGEPVILLPGQQFEVSAPQGSIHVAGPDTRLPDSSLFKTNPAVNVPYLVETD
PRFTNQKTWLGSDYMQKAFSQNGDNMLKRLGDGFYEQRLIREQVVALTGQRYLDGYSNDEEQFKALMDAGIAFGKQYNLT
PGVALTAEQMALLTGDIVWLVNTTVTLPDGSTQTVQVPQVYARVKPGDVNSAGALIAGRDMVMKLDGDLFNSGKLAGKQT
VQLSAENIHNQAGTIQGANVSLTARTDINSTGGLLQATDSLLAMAGRDISLTTTTRTAQRDAGQNHFERTSIDSVAGVYV
QNDQGRLVLQAGRDMNLTAATVVNQGKDSLTQLSAGRDMTLSTVTTSAQDNITWDKNNRLSQGVTQSTGSTLAGNGDVTL
TAGRDMTSQAASLSAQKGLALMAGHDVTLTGAQNTRSLDEYHKVTGSSGMLSKTTTTTHDVTDRRTMTGSELNGDTVSIS
AGHNLNATGSSVAGDNRVSLAAGNNLNIGTLTESNRETHLKQEKKSGLMSSGGVGFSMGSQSLKVTDTATDTTQKGSTVG
SVHGDVSLQAGNRLTVNSSDLIAGRDMALSGKEVSITAATDQHVQTHTVEQKTSGLTLALSGTAGSALNTTVETVQAAKS
AGNSRLEALQGVKAALSGAQAVQAGRLADAQGADAGNNNTVGISLSYGSQSSKSEQQSEQTVAKGSTLTAGNNLSIQATG
SGVKGVDGDLTIQGSQIKAGNNILLQANRDVNLVSAENTSKLEGKNTSSGGSVGVGVGVGSGGWGISVSASANQGKGSEK
GNGTTHTETTVDAGNRLTIISGRDTTLTGAQAGGETVKVDAGRHLTLTSEQDSDRYDSKQQNASAGGSFTFGSMSGSASV
NLSRDKMHSNYDSVQEQTGIFAGRGGFDVTTGQHTQLNGAVIASTATADKNRLDTGTLGFSDIENRADFKTEHQSAGLST
GGSVAGNFLGNMANNLLVGANHEGHADSTTQSAVSAGNITIRDTQSQKQDVADLKRDAAHANQTLSPIFDREKEQQRLQQ
AQLIGEIGNQVADIARTEGQIAGEKAKRDPAALNQARAELEAAGKPFTEQDVAQRAYNNGMAAPGFGTGGKYQQAIQAAT
AAVQGLAGGNLSAALAGGAVPYIAEIIKQTTPDGAGRVATHAVVNAALSLAQGKNALAGAAGAATGEVVGMIATQMYGKP
VSELSETEKQTVSTLATVAAGLAGGLVGDSGASAVAGAQSGKTTVENNALSFGDGFESNAAAATSWNKYAVDNNLTPEQT
QAGLDKIAKGDMPDSTNITKVIVDGYQDGVMIAGAWYLGSAASAGKVLGGGLLGLAANSGYQIYDLNQPQNANKSWDYLG
SATSFTTGMMAPDRGVLANTGIAMGGAFFTDGPNTASLAGAGIGAGLGGAFGKYAPTAVGKILGNDPVPGFIYELGGGAV
SEFTNGIIKDFNNPQAPEKKENK