PAI Gene Information


Name : Z1211 (Z1211)
Accession : NP_286746.1
PAI name : TAI
PAI accession : NC_002655_P1
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : adhesin
Function : -
Note : -
Homologs in the searched genomes :   64 hits    ( 64 protein-level )  
Publication :
    -Perna,N.T., Plunkett,G. III, Burland,V., Mau,B., Glasner,J.D., Rose,D.J., Mayhew,G.F., Evans,P.S., Gregor,J., Kirkpatrick,H.A., Posfai,G., Hackett,J., Klink,S., Boutin,A., Shao,Y., Miller,L., Grotbeck,E.J., Davis,N.W., Lim,A., Dimalanta,E., Potamousis,K.,, "Genome sequence of enterohaemorrhagic Escherichia coli O157:H7", Nature 409 (6819), 529-533 (2001) PUBMED 11206551.

    -Perna,N.T., Plunkett,G. III, Burland,V., Mau,B., Glasner,J.D., Rose,D.J., Mayhew,G.F., Evans,P.S., Gregor,J., Kirkpatrick,H.A., Posfai,G., Hackett,J., Klink,S., Boutin,A., Shao,Y., Miller,L., Grotbeck,E.J., Davis,N.W., Lim,A., Dimalanta,E., Potamousis,K.,, "Direct Submission", Submitted (28-SEP-2001) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Perna,N.T., Plunkett,G. III, Burland,V., Mau,B., Glasner,J.D., Rose,D.J., Mayhew,G.F., Evans,P.S., Gregor,J., Kirkpatrick,H.A., Posfai,G., Hackett,J., Klink,S., Boutin,A., Shao,Y., Miller,L., Grotbeck,E.J., Davis,N.W., Lim,A., Dimalanta,E., Potamousis,K.,, "Direct Submission", Submitted (22-OCT-2000) Laboratory of Genetics, University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA.


DNA sequence :
ATGCAGAACGGAATTGCACACAACAGACTGATACTTCTCTGTCTGATGAAGGGGGAGAGCGTTTACGCCCGGGACGTATC
GCTGTGCCCGATAACTCTGTTTTGTACCTGCCGGTATCCACTTTTGTGGGTACCGGCTTTTTTATTCACCCTCTGTAAGG
AAAAGCTGATGAAACGACATCTGAACACCAGCTACAGGCTGGTATGGAATCACATTACGGGCACCCTGGTGGTGGCCTCC
GAACTGGCCCGCTCACGGGGAAAACGCGCCGGTGTGGCGGTTGCGCTGTCTCTTGCTGCTGTCACATCAGTCCCGGCACT
GGCTGCTGACAAGGTTGTACAGGCGGGAGAAACCGTGAACGATGGAACACTGACAAATCATGACAACCAGATTGTCTTCG
GTACGGCCAACGGAATGACCATCAGTACCGGGCTGGAACTGGGGCCGGACAGTGAAGAAAACACCGGTGGGCAATGGATA
CAGAATGGCGGGATAGCCGGAAACACCACTGTCACCACAAATGGTCGTCAGGTCGTGCTGGAGGGGGGAACAGCCAGTGA
TACGGTTATTCGTGACGGCGGGGGACAGAGCCTGAACGGACTGGCGGTGAACACCACACTGAATAACAGAGGCGAGCAGT
GGGTGCATGAGGGCGGGGTTGCCACCGGTACAATTATCAACCGCGACGGTTACCAGAGCGTTAAAAGTGGCGGGCTGGCA
ACAGGAACCATCATCAACACCGGCGCAGAAGGCGGCCCTGATTCTGACAACTCGTATACGGGTCAGAAGGTCCAGGGAAC
AGCAGAATCCACCACCATCAACAAAAATGGACGGCAGATTATCTTATTTTCCGGGCTAGCCCGTGACACTCTCATTTACG
CAGGTGGTGACCAGTCGGTACACGGAAGGGCCCTGAATACCACACTGAATGGCGGTTACCAATATGTGCACAGGGACGGA
CTTGCGCTGAACACGGTAATTAACGAGGGGGGCTGGCAGGTTGTTAAGGCAGGTGGCGCTGCCGGTAACACCACCATAAA
TCAGAACGGTGAACTGAGGGTACATGCCGGCGGGGAAGCCACTGCAGTCACCCAGAACACGGGCGGTGCACTGGTTACCA
GTACTGCTGCAACTGTCATCGGCACAAACCGTCTGGGGAATTTCACGGTGGAAAACGGTAAGGCTGACGGTGTTGTTCTG
GAATCCGGCGGTCGTCTGGATGTACTGGAGAGCCATTCAGCACAGAATACCCTAGTGGATGACGGCGGTACCCTGGCAGT
GTCTGCCGGCGGTAAGGCGACAAGTGTCACCATAACATCCGGTGGTGCCCTGATTGCAGACAGTGGTGCCACTGTTGAGG
GGACCAATGCCAGCGGTAAGTTCAGTATTGATGGCACATCCGGTCAGGCCAGCGGCCTGCTGCTGGAAAATGGCGGCAGC
TTTACGGTTAATGCCGGGGGACAGGCTGGCAACACCACTGTCGGACATCGTGGAACACTGACGCTGGCTGCCGGGGGAAG
TCTGAGTGGCAGAACACAGCTCAGTAAAGGCGCCAGTATGGTACTGAATGGTGATGTGGTCAGTACCGGCGATATTGTTA
ACGCAGGGGAGATTCGCTTTGATAATCAGACGACACCGAATGCCGCGCTGAGCCGTGCTGTTGCAAAAAGTAACTCCCCG
GTAACGTTCCATAAACTGACCACCACGAACCTCACCGGCCAGGGCGGCACCATCAATATGCGTGTTCGCCTTGATGGCAG
CAATGCCTCTGACCAGCTGGTGATTAATGGTGGTCAGGCAACCGGCAAAACCTGGCTTGCGTTTACAAATGTCGGAAACA
GCAACCTCGGGGTGGCAACCACCGGACAGGGTATCCGGGTTGTGGATGCACAGAATGGCGCCACCACAGAAGAAGGTGCG
TTTGCCCTGAGTCGCCCGCTTCAGGCCGGCGCCTTTAACTACACCCTGAACCGTGACAGCGATGAAGACTGGTACCTGCG
CAGTGAAAATGCTTATCGTGCTGAAGTCCCCCTGTATACATCCATGTTGACACAGGCAATGGACTATGACCGGATTCTGG
CAGGCTCCCGCAGCCATCAGACCGGTGTAAACGGTGAAAATAACAGCGTCCGTCTCAGCATTCAGGGCGGTCATCTCGGT
CACGATAACAACGGCGGTATTGCCCGTGGAGCCACGCCGGAAAGCAGCGGCAGCTATGGCTTCGTCCGTCTGGAGGGTGA
CCTGCTCAGAACAGAGGTTGCCGGTATGTCTCTGACGACAGGGGTGTATGGTGCTGCAGGCCATTCTTCCGTTGATGTTA
AGGATGATGACGGTTCCCGCGCCGGCACGGTCCGGGATGATGCCGGCAGTCTGGGCGGATACCTGAATCTGGTACACACA
TCCTCCGGCCTGTGGGCTGACATTGTGGCCCAGGGAACCCGTCACAGCATGAAAGCGTCATCGGACAATAACGACTTCCG
CGCCCGGGGCTGGGGCTGGCTGGGCTCACTGGAAACCGGTCTGCCCTTCAGTATCACTGACAATCTGATGCTGGAGCCAC
AACTGCAGTACACCTGGCAGGGACTCTCCCTGGATGACGGCCAGGATAACGCCGGTTATGTGAAGTTCGGGCATGGCAGT
GCACAACATGTGCGTGCCGGTTTCCGTCTGGGCAGCCACAACGATATGACCTTTGGTGAAGGCACCTCATCCCGTGACAC
CCTGCGCGACAGTGCAAAACACAGTGTGAGTGAACTGCCGGTGAACTGGTGGGTACAGCCTTCTGTTATCCGCACCTTCA
GCTCCCGGGGTGACATGAGCATGGGGACAGCCGCAGCCGGCAGTAACATGACGTTCTCACCGTCCCGGAATGGCACGTCA
CTGGACCTGCAGGCCGGACTGGAAGCCCGTATCCGGGAAAATATCACCCTGGGCGTTCAGGCCGGTTATGCCCACAGCGT
CAGCGGCAGCAGCGCTGAAGGCTATAACGGTCAGGCTACGCTGAATATGACTTTCTGA

Protein sequence :
MQNGIAHNRLILLCLMKGESVYARDVSLCPITLFCTCRYPLLWVPAFLFTLCKEKLMKRHLNTSYRLVWNHITGTLVVAS
ELARSRGKRAGVAVALSLAAVTSVPALAADKVVQAGETVNDGTLTNHDNQIVFGTANGMTISTGLELGPDSEENTGGQWI
QNGGIAGNTTVTTNGRQVVLEGGTASDTVIRDGGGQSLNGLAVNTTLNNRGEQWVHEGGVATGTIINRDGYQSVKSGGLA
TGTIINTGAEGGPDSDNSYTGQKVQGTAESTTINKNGRQIILFSGLARDTLIYAGGDQSVHGRALNTTLNGGYQYVHRDG
LALNTVINEGGWQVVKAGGAAGNTTINQNGELRVHAGGEATAVTQNTGGALVTSTAATVIGTNRLGNFTVENGKADGVVL
ESGGRLDVLESHSAQNTLVDDGGTLAVSAGGKATSVTITSGGALIADSGATVEGTNASGKFSIDGTSGQASGLLLENGGS
FTVNAGGQAGNTTVGHRGTLTLAAGGSLSGRTQLSKGASMVLNGDVVSTGDIVNAGEIRFDNQTTPNAALSRAVAKSNSP
VTFHKLTTTNLTGQGGTINMRVRLDGSNASDQLVINGGQATGKTWLAFTNVGNSNLGVATTGQGIRVVDAQNGATTEEGA
FALSRPLQAGAFNYTLNRDSDEDWYLRSENAYRAEVPLYTSMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLG
HDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNLVHT
SSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGS
AQHVRAGFRLGSHNDMTFGEGTSSRDTLRDSAKHSVSELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPSRNGTS
LDLQAGLEARIRENITLGVQAGYAHSVSGSSAEGYNGQATLNMTF