PAI Gene Information


Name : S3196 (S3196)
Accession : NP_838480.1
PAI name : SHI-1
PAI accession : NC_004741_P1
Strain : Shigella flexneri 2a str. 2457T
Virulence or Resistance : Not determined
Product : hypothetical protein
Function : -
Note : residues 1 to 507 of 838 are 85.99 pct identical to residues 3 to 509 of 512 from Escherichia coli K-12 : B2001
Homologs in the searched genomes : 22 hits (21 protein-level, 1 DNA-level)
Publication :
    -Wei,J., Goldberg,M.B., Burland,V., Venkatesan,M.M., Deng,W., Fournier,G., Mayhew,G.F., Plunkett,G. III, Rose,D.J., Darling,A., Mau,B., Perna,N.T., Payne,S.M., Runyen-Janecky,L.J., Zhou,S., Schwartz,D.C. and Blattner,F.R., "Complete genome sequence and comparative genomics of Shigella flexneri serotype 2a strain 2457T", Infect. Immun. 71 (5), 2775-2786 (2003) PUBMED 12704152 REMARK Erratum:[Infect Immun. 2003 Jul;71(7):4223].

    -Wei,J., Goldberg,M.B., Burland,V., Venkatesan,M.M., Deng,W., Fournier,G., Mayhew,G.F., Plunkett,G. III, Rose,D.J., Darling,A., Mau,B., Perna,N.T., Payne,S.M., Runyen-Janecky,L.J., Zhou,S., Schwartz,D.C. and Blattner,F.R., "Direct Submission", Submitted (23-APR-2003) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Wei,J., Goldberg,M.B., Burland,V., Venkatesan,M.M., Deng,W., Fournier,G., Mayhew,G.F., Plunkett,G. III, Rose,D.J., Darling,A., Mau,B., Perna,N.T., Payne,S.M., Runyen-Janecky,L.J., Zhou,S., Schwartz,D.C. and Blattner,F.R., "Direct Submission", Submitted (13-JUN-2002) Genetics Laboratory, University of Wisconsin - Madison, 445 Henry Mall, Madison, WI 53706, USA.


DNA sequence :
ATGTTACAGATAGTCGGCGCGCTGATTCTGCTGATCGCAGGATTTGCCATTCTTCGCCTTTTGTTCAGAGCATTAACCAG
CACAGCGTCTGCGCTGGCAGGGTTCATATTGCTGTGTCTGTTCGGTCCGGCTTTACTGGCTGGCTATATCACTGAACGCA
TAACCCGGTTATTCCATATTCGCTGGCTGGCAGGCGTATTTCTGACGATTGCCGGAATGATCATCAGCTTCATGTGGGGA
CTTGATGGTAAACATATCGCACTGGAGGCTCATACTTTTGACTCTGTAAAATTTATTCTGACCACCGCTCTCGCCGCTGG
TCTGCTGGCTCTTCCCGTGCAGATAAGAACCATTCAGCAGAACGGGCTCACACCTGAAGATATCAGCAAGGAAATTAACG
GGTATTACTGCTGTTTTTATACTGCTTTTTTCCTTATGGCGTGTTCTGCATACGCACCATTGATCGCATTGCAGTTCGAT
ATTTCACCCTCACTGATGTGGTGGGGCGGGTTGTTGTACTGGCTGGCTGCATTAGTGACGCTGCTATGGGCGGCCAGCCA
GATCCAGGCGCTGAAAAGACTGACCAGTGCCATCAGCCAGACACTGGAAGAACAACCGGTGCTCAACAGTAAATCGTGGC
TGAGCAGTTTGCAAAACGATTACAGCCTTCCTGAAACGCTGACGGAGCGCATCTGGCTGACACTCATTTCACAACGGATT
TCCCGGGGAGAACTGAGGGAATTTGAACTGGCAGACGGAAACTGGTTACTGAACAATGCCTGGTATGAAAGAAACATGGC
AGGGTTTAACGAACAGTTGAAAGAGAACCTGTCATTCACACCTGATGAACTGAAAACGCTCTTCCGGAACCGCCTGAATT
TATCACCGGAAGCGAATGACGATTTTCTCGATCGTTGCCTGGACGGCGGTGACTGGTACCCCTTTTCAGAAGGCCGCCGT
TTTGTATCATTCCACCACGTGGATGAGCTTCGTATCTGTGCCTCCTGCGGGCTGACAGAAGTACATCATGCCCCGGAAAA
TCATAAGCCGGCTCCGGAATGGTACTGCTCCTCTCTTTGTCACGAAACAGAAACACTGTGTCAGGACATTTATGAACGTT
CTTACACCGGTTTTATTTCCGATGCAACGGCGAATGGTCTGATTCTCATGAAACTGCCGGAAACCTGGAGTACAAATGAG
AAAATGTTTGCTTCCGGAGGGCAGGGACATGGGTTTGCCGCTGAACGGGGAAACCATATTGTCGACAGAGTCCGTCTGAA
AAACGCACGGATCCTCGGTGATAATAATGCCAGAAATGGAGCAGACAGACTGGTCAGCGGAACAGAAATCCAGACGAAAT
ATTGTTCAACTGCAGCCCGTAGCGTCGGTGCGGCATTCGACGGACAAAACGGACAGTATCGTTACATGGGAAATCATGGT
CCCATGCAACTGGAAGTCCCCCGTGATCAGTATGCCGGCGCTGTGGAAACCATGAAGAATAAGATCCGCGAAGGTAAAGT
ACCCGGTGTAACCGATCCCGCAGAAGCGTCCCGGCTGATTCGTCGGGGACATCTGACTTATACCCAGGCCCGTAATATCA
CCCGGTTCGGGACCATCGAATCGGTCACTTATGATATTGCCGAGGGGTCGGTTGTCAGTCTGGCGGCCGGAGGGATCAGT
TTTGCCCTGACGGCATCGGTCTTCTGGCTCAGCACCGGCGATCGCGATGCTGCCCTGCAGACAGCTGCTGTCCAGGCAGG
AAAAACCTTCACCCGCACACTGGCTGTCTACGTCACAACCCAGCAACTTCACCGGCTCACTGTTGTTCAGGGTATGCTGA
AGCATATTGATTTTTCGACGGCCAGCCCGACTGTCCGGCAGGCGCTTCAGAAGGGGACCGGTGCAGGAAATATCAGTGCC
CTGAACAAAGTGATGAAGGGTACGCTGGTGACATCTCTGGCACTGGTAGCTGTCACAACCAGCCCTGACATGATCAAAAT
GTTGCGGGGACGGATCTCCGGTGCGCAGTTCATCAGGAATCTTGCCGTGGCATCTTCCGGTGTGGCAGGTGGTGCTGTCG
GGTCAGTGGCGGGCGGGATATTGTTCAGTCCACTGGGACCATTTGGTGCACTGACAGGGCGTGTGGTTGGCGGTGTTCTG
GGGGGAATGATTGCCTCCGCTGTATCAGGAAAAATTGCCGGAGCGCTGGTTGAAGAAGATCGCGTCAAAATTCTGGCAAT
GATTCAGGAGCAGGTGACATGGCTTGCCGGCAGTTTCCTGCTGACCGGACATGAGATTGAAAATCTGAACGCGAATCTGG
CCCGTGTTATCGATCAGAATGCTCTGGAGATCATTTTCGCCGCCGGTATACAACAACGGGCCGCGACCAATATGTTAATC
AAACCACTGGTGGTCAGTATCATCAGGCAACGCCCCGTCATGGAATATGATGCATCCCATCTCGGCAAGATGGTTAACCG
GCTGGAAGAAGCATTCCCCCCGGAGCTCCCGGCATAA

Protein sequence :
MLQIVGALILLIAGFAILRLLFRALTSTASALAGFILLCLFGPALLAGYITERITRLFHIRWLAGVFLTIAGMIISFMWG
LDGKHIALEAHTFDSVKFILTTALAAGLLALPVQIRTIQQNGLTPEDISKEINGYYCCFYTAFFLMACSAYAPLIALQFD
ISPSLMWWGGLLYWLAALVTLLWAASQIQALKRLTSAISQTLEEQPVLNSKSWLSSLQNDYSLPETLTERIWLTLISQRI
SRGELREFELADGNWLLNNAWYERNMAGFNEQLKENLSFTPDELKTLFRNRLNLSPEANDDFLDRCLDGGDWYPFSEGRR
FVSFHHVDELRICASCGLTEVHHAPENHKPAPEWYCSSLCHETETLCQDIYERSYTGFISDATANGLILMKLPETWSTNE
KMFASGGQGHGFAAERGNHIVDRVRLKNARILGDNNARNGADRLVSGTEIQTKYCSTAARSVGAAFDGQNGQYRYMGNHG
PMQLEVPRDQYAGAVETMKNKIREGKVPGVTDPAEASRLIRRGHLTYTQARNITRFGTIESVTYDIAEGSVVSLAAGGIS
FALTASVFWLSTGDRDAALQTAAVQAGKTFTRTLAVYVTTQQLHRLTVVQGMLKHIDFSTASPTVRQALQKGTGAGNISA
LNKVMKGTLVTSLALVAVTTSPDMIKMLRGRISGAQFIRNLAVASSGVAGGAVGSVAGGILFSPLGPFGALTGRVVGGVL
GGMIASAVSGKIAGALVEEDRVKILAMIQEQVTWLAGSFLLTGHEIENLNANLARVIDQNALEIIFAAGIQQRAATNMLI
KPLVVSIIRQRPVMEYDASHLGKMVNRLEEAFPPELPA
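
Consistency check (illustrative) :
The DNA sequence above is 2517 nt and the protein sequence is 838 residues, which fits 838 codons plus one stop codon. The sketch below is one way to check this. It assumes the standard codon assignments, which bacterial translation table 11 shares for non-start codons. The names `translate` and `cds_length_matches` are hypothetical helpers and are not part of this record or of any database tool.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (assumed to apply here;
# bacterial table 11 uses the same assignments for non-start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; stop codons are returned as '*'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def cds_length_matches(nt_length: int, aa_length: int) -> bool:
    """True if the CDS length equals the protein length plus one stop codon."""
    return nt_length == 3 * (aa_length + 1)


if __name__ == "__main__":
    # First 30 nt and last 9 nt of the DNA sequence in this record.
    print(translate("ATGTTACAGATAGTCGGCGCGCTGATTCTG"))
    print(translate("CCGGCATAA"))
    print(cds_length_matches(2517, 838))
```

Joining the sequence lines into one string and passing it to `translate` gives the full product for comparison with the protein sequence above.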