PAI Gene Information


Name : invF (STM2899)
Accession : NP_461820.1
PAI name :
PAI accession : NC_003197_P3
Strain :
Virulence or Resistance: Virulence
Product : invasion regulatory protein
Function : -
Note : 'invasion protein InvF (SW:INVF_SALTY); SPI-1 transcription factor; activated by HilA; requires SicA as a co-factor; controls sigD/sopB, sopE and sicAsipBCDA genes'
Homologs in the searched genomes :   42 hits    ( 42 protein-level )  
Publication :
    -McClelland,M., Sanderson,K.E., Spieth,J., Clifton,S.W., Latreille,P., Courtney,L., Porwollik,S., Ali,J., Dante,M., Du,F., Hou,S., Layman,D., Leonard,S., Nguyen,C., Scott,K., Holmes,A., Grewal,N., Mulvaney,E., Ryan,E., Sun,H., Florea,L., Miller,W., Stoneki, "Complete genome sequence of Salmonella enterica serovar Typhimurium LT2", Nature 413 (6858), 852-856 (2001) PUBMED 11677609.

    -McClelland,M., Sanderson,K.E., Spieth,J., Clifton,S.W., Latreille,P., Courtney,L., Porwollik,S., Ali,J., Dante,M., Du,F., Hou,S., Layman,D., Leonard,S., Nguyen,C., Scott,K., Holmes,A., Grewal,N., Mulvaney,E., Ryan,E., Sun,H., Florea,L., Miller,W., Stoneki, "Direct Submission", Submitted (10-SEP-2004) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -McClelland,M., Sanderson,K.E., Spieth,J., Clifton,S.W., Latreille,P., Courtney,L., Porwollik,S., Ali,J., Dante,M., Du,F., Hou,S., Layman,D., Leonard,S., Nguyen,C., Scott,K., Holmes,A., Grewal,N., Mulvaney,E., Ryan,E., Sun,H., Florea,L., Miller,W., Stoneki, "Direct Submission", Submitted (06-NOV-2001) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.


DNA sequence :
ATGTCATTTTCTGAAAGCCGACACAATGAAAATTGCCTGATTCAGGAAGGCGCGCTGCTTTTTTGCGAGCAGGCCGTTGT
CGCACCAGTATCAGGAGACCTGGTTTTTCGACCGTTAAAAATTGAAGTACTCAGCAAATTACTGGCATTTATCGATGGCG
CAGGATTAGTGGACACGACATATGCTGAATCCGATAAATGGGTTTTGCTGAGTCCTGAGTTTCGCGCTATTTGGCAAGAT
CGTAAACGCTGCGAGTACTGGTTTTTGCAGCAAATTATTACGCCTTCTCCGGCCTTCAATAAGGTACTGGCGCTGTTACG
AAAAAGCGAGAGTTACTGGTTGGTTGGCTATTTACTCGCTCAGTCAACCAGCGGCAACACGATGAGAATGCTGGGAGAAG
ACTATGGCGTTTCTTATACCCATTTTCGTCGTTTGTGCAGCAGAGCGTTGGGCGGAAAAGCGAAGAGTGAATTACGAAAC
TGGCGTATGGCGCAATCGCTGCTGAATAGTGTAGAAGGCCACGAGAACATCACCCAATTAGCCGTTAATCATGGTTACTC
ATCGCCTTCACATTTTTCTAGTGAGATCAAAGAGCTGATCGGCGTTTCGCCGCGGAAATTATCAAATATTATTCAATTGG
CAGACAAATGA

Protein sequence :
MSFSESRHNENCLIQEGALLFCEQAVVAPVSGDLVFRPLKIEVLSKLLAFIDGAGLVDTTYAESDKWVLLSPEFRAIWQD
RKRCEYWFLQQIITPSPAFNKVLALLRKSESYWLVGYLLAQSTSGNTMRMLGEDYGVSYTHFRRLCSRALGGKAKSELRN
WRMAQSLLNSVEGHENITQLAVNHGYSSPSHFSSEIKELIGVSPRKLSNIIQLADK