Name : lpfC (S4048)
Accession : NP_839216.1
PAI name : SHI-2
PAI accession : NC_004741_P2
Strain : Shigella flexneri 2002017
Virulence or Resistance: Not determined
Product : long polar fimbriae
Function : -
Note : residues 1 to 840 of 840 are 99.40 pct identical to residues 1 to 840 of 840 from GenPept : >gb|AAL18163.1| (AY057066) LpfC [Escherichia coli]
Homologs in the searched genomes : 60 hits ( 60 protein-level )
Publication :
-Wei,J., Goldberg,M.B., Burland,V., Venkatesan,M.M., Deng,W., Fournier,G., Mayhew,G.F., Plunkett,G. III, Rose,D.J., Darling,A., Mau,B., Perna,N.T., Payne,S.M., Runyen-Janecky,L.J., Zhou,S., Schwartz,D.C. and Blattner,F.R., "Complete genome sequence and comparative genomics of Shigella flexneri serotype 2a strain 2457T", Infect. Immun. 71 (5), 2775-2786 (2003) PUBMED 12704152 REMARK Erratum:[Infect Immun. 2003 Jul;71(7):4223].
-Wei,J., Goldberg,M.B., Burland,V., Venkatesan,M.M., Deng,W., Fournier,G., Mayhew,G.F., Plunkett,G. III, Rose,D.J., Darling,A., Mau,B., Perna,N.T., Payne,S.M., Runyen-Janecky,L.J., Zhou,S., Schwartz,D.C. and Blattner,F.R., "Direct Submission", Submitted (23-APR-2003) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.
-Wei,J., Goldberg,M.B., Burland,V., Venkatesan,M.M., Deng,W., Fournier,G., Mayhew,G.F., Plunkett,G. III, Rose,D.J., Darling,A., Mau,B., Perna,N.T., Payne,S.M., Runyen-Janecky,L.J., Zhou,S., Schwartz,D.C. and Blattner,F.R., "Direct Submission", Submitted (13-JUN-2002) Genetics Laboratory, University of Wisconsin - Madison, 445 Henry Mall, Madison, WI 53706, USA.
DNA sequence : | |
ATGATGACGACCAGAATAGTGGTTGGCCTCACGGCAGGGACGTGTCTGATTTTCTCGCAAAACCTGATGGCCGAGGTCAG
TGTATTCAATCCGGCGCTTCTGGAAATCAACCATCAATCCGGAGTCGATATTCGCCAGTTTAATCGGGCAAACCTGATGC
CCCCAGGTGTTTATAGCGTTGATATTTTTATCAACGGTAAAATGTTTGAACGTCAGGATGTGACATTTGTTCAGGATAAT
CCAGATGCTGATCTGCACGCTTGCTTTATTGCCATTAAAAAAACACTGTCCTCCTTTGGCATAAAAGTTGATGCGCTCAA
ATCGTTCAATGATGTGGATGAGACGGTTTGCCTCGATCCTGCTCCACGTATTGAAGGCTCATCCTGGCAGTTTGACAGTG
ATAAATTGCAGCTGAATATATCCATTCACCAAATCTACATGGACGCGATGGCTTATGATTACATCAGCCCCACGCGTTGG
GATGAGGGGATTAATGCGCTCACCATCAACTACGATTTTTCTGGTTCACATACACTACGTTCAGATTATGGTTCACAAGA
GACAGATACCAGTTATCTCAATCTGCGCAATGGACTGAATATTGGACCGTGGCGGCTACGTAATTACAGTACTTTAAACA
CCAGCGATGGCCGTGCGGAATACAACTCCATTAGTACCTGGATACAGCGCGATATTGCCGCGTTAAGAAGCCAGATTATG
ATTGGTGATACGTGGACGGCGAGCGATATTTTCGACAGTACGCAAATTCGCGGCGCGCGTTTGTATACTGATAACGATAT
GCTACCCGCCAGCCAGAATGGCTTTGCTCCTGTGGTTCGTGGGATTGCAAAGTCCAACGCCACCGTCATCATTCGGCAGA
ATGGCTACGTGATTTATCAGTCAGCCGTTCCACAAGGTGCTTTTGAGATCACCGATCTCAACACCGCAAGTACAGGTGGC
GATTTGGACGTAACCATCAAAGAAGAAGACGGTAGCGAACAACGATTCACCCAACCTTATGCTTCATTGGCGATTCTTAA
ACGTGAAGGTCTGACAGATGTTGATGTCAGCGTGGGTGAATTGCGCGATGAAGACGGATTTACACCGGACGTCCTTCAGG
CGCAAATACTTCATGGTTTTTCCCACGGGATCACTTTATATGGAGGTATGCAGGCTGCTGAAAATTATGGTTCTGCAGCT
CTGGGTGTCGGTAAAGATCTTGGCGCTTTGGGCGCAATTTCTTTCGATGTGACACATGCTCGTGCGAATTTTAGCCATGA
TGATACAGAAACGGGTCAGTCATATCGCTTTCTCTATTCAAAACTATTTGACGACACAGACACTAGCTTGCGCCTGGTTG
GCTATCGTTACTCCACCGAGGGCTACTATACCCTCAATGAGTGGGCATCGCGGCGCAACAGCCCTGAAGACTTTTGGGAA
ACAGGTAACCGACGTAGTCGCGTGGAGGGAACGCTAACGCAGTCGTTGGGGAGAGATTATGGCAATTTATACCTGACATT
AAGCCGGCAACAATACTGGCATACCGATGATGTCGAACGATTAATGCAATTTGGCTACAGCAGTAGCTGGAAGCGTCTCT
CGTGGAACGTCTCCTGGAGTTATTCCAATACTGCCAGACAGGGGACGGGGAACAACCATGCCAGTGATAACACCAGTGAG
CAGATCTACATGCTCTCTTTATCTGTTCCTTTATCGGGCTGGTGGGGTAATAGTTACGCCACCTATTCTGTTTCGCAAAA
CGATAATTCCGGTAGCTCACATCAACTCGGACTCAGCGGTACGGCGCTGGAAAGAAATAACCTTTCATGGAATTTAATGC
AGTCCTATAACAGTCATGATGATGAGGTTGGCGGTAATATGTCCCTGACCTATGATGGCTCTTATGGCACGGTGAACGGC
AGCTATAACTACAGCCAAAATTCCCAGAGGCTGAATTATGGTATCAGAGGGGGAATTCTGGCACACAGCGAAGGGGTAAC
GTTAAGTCAGGAGTTAGGTGAAACTATTGCTCTTGTTAAAGCACCTGGGGCCGCCGGGTTAGAAATAGATAATATGCGCG
GTGCTGCGACGGACTGGCGCGGCTATACGGTCAAGACACAGCTAAACCCTTATGATGAAAATCGGGTAGCAATCAGCGAT
AACTATTTCTCGAAGTCGAATATAGAACTTGATAATACCGTCGTTACGATGGTTCCCACGCGTGGTGCAGTGGTTAAAGC
GGAGTTTGTGACTCATGTGGGTTATCGCGTTCTCTTCAGAGTGTTAAATGCAAATGGTAAACCGGTACCTTTTGGAGCCA
TTGCTGCGATACAAGATGCAAGTTTGGCAGATTCAGGAATTGTCGGTGACCGTGGCGAACTTTATCTTTCTGGTCTACCA
GAAAAAGGACAGGTTACGTTATCCTGGGGAGAAAACGCCTCAACAAAATGCATCTTCAATTATTCATTTTCGACACCAGA
AAGTGAGAGCGGATTAATTGAACAGGGTGTGACATGTCATTAA
|
Protein sequence : | |
MMTTRIVVGLTAGTCLIFSQNLMAEVSVFNPALLEINHQSGVDIRQFNRANLMPPGVYSVDIFINGKMFERQDVTFVQDN
PDADLHACFIAIKKTLSSFGIKVDALKSFNDVDETVCLDPAPRIEGSSWQFDSDKLQLNISIHQIYMDAMAYDYISPTRW
DEGINALTINYDFSGSHTLRSDYGSQETDTSYLNLRNGLNIGPWRLRNYSTLNTSDGRAEYNSISTWIQRDIAALRSQIM
IGDTWTASDIFDSTQIRGARLYTDNDMLPASQNGFAPVVRGIAKSNATVIIRQNGYVIYQSAVPQGAFEITDLNTASTGG
DLDVTIKEEDGSEQRFTQPYASLAILKREGLTDVDVSVGELRDEDGFTPDVLQAQILHGFSHGITLYGGMQAAENYGSAA
LGVGKDLGALGAISFDVTHARANFSHDDTETGQSYRFLYSKLFDDTDTSLRLVGYRYSTEGYYTLNEWASRRNSPEDFWE
TGNRRSRVEGTLTQSLGRDYGNLYLTLSRQQYWHTDDVERLMQFGYSSSWKRLSWNVSWSYSNTARQGTGNNHASDNTSE
QIYMLSLSVPLSGWWGNSYATYSVSQNDNSGSSHQLGLSGTALERNNLSWNLMQSYNSHDDEVGGNMSLTYDGSYGTVNG
SYNYSQNSQRLNYGIRGGILAHSEGVTLSQELGETIALVKAPGAAGLEIDNMRGAATDWRGYTVKTQLNPYDENRVAISD
NYFSKSNIELDNTVVTMVPTRGAVVKAEFVTHVGYRVLFRVLNANGKPVPFGAIAAIQDASLADSGIVGDRGELYLSGLP
EKGQVTLSWGENASTKCIFNYSFSTPESESGLIEQGVTCH
|
|