PAI Gene Information


Name : sat (ECUMN_3367)
Accession : YP_002414040.1
PAI name : Not named
PAI accession : NC_011751_P1
Strain : Escherichia coli UMN026
Virulence or Resistance : Not determined
Product : Serine protease
Function : -
Note : Evidence 2b : Function of strongly homologous gene; Product type e : enzyme
Homologs in the searched genomes : 29 hits (29 protein-level)
Publication :
    -Genoscope -,C.E.A., "Direct Submission", Submitted (14-DEC-2008) Genoscope - Centre National de Sequencage : BP 191 91006 EVRY cedex - FRANCE (E-mail : seqref@genoscope.cns.fr - Web : www.genoscope.cns.fr).

    -Touchon,M., Hoede,C., Tenaillon,O., Barbe,V., Baeriswyl,S., Bidet,P., Bingen,E., Bonacorsi,S., Bouchier,C., Bouvet,O., Calteau,A., Chiapello,H., Clermont,O., Cruveiller,S., Danchin,A., Diard,M., Dossat,C., Karoui,M.E., Frapy,E., Garry,L., Ghigo,J.M., Gill, "Organised genome dynamics in the Escherichia coli species results in highly diverse adaptive paths", PLoS Genet. 5 (1), E1000344 (2009) PUBMED 19165319.

    -Touchon,M., Hoede,C., Tenaillon,O., Barbe,V., Baeriswyl,S., Bidet,P., Bingen,E., Bonacorsi,S., Bouchier,C., Bouvet,O., Calteau,A., Chiapello,H., Clermont,O., Cruveiller,S., Danchin,A., Diard,M., Dossat,C., Karoui,M.E., Frapy,E., Garry,L., Ghigo,J.M., Gill, "Direct Submission", Submitted (18-DEC-2008) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.


DNA sequence :
TTGAGAGAATATATGAATAAAATATACTCCCTTAAATATAGTGCTGCCACTGGCGGACTCATTGCTGTTTCTGAATTAGC
GAAAAGAGTTTCTGGTAAAACAAACCGAAAACTTGTAGCAACAATGTTGTCTCTGGCTGTTGCCGGTACAGTAAATGCAG
CAAATATTGATATATCAAATGTATGGGCGAGAGACTATCTTGATCTTGCACAAAATAAAGGTATTTTCCAGCCCGGAGCA
ACAGACGTAACAATCACTTTAAAAAACGGAGATAAATTCTCTTTCCATAATCTCTCAATTCCGGATTTTTCTGGTGCAGC
AGCGAGTGGCGCAGCTACCGCAATAGGAGGTTCTTATAGTGTTACTGTTGCACATAACAAAAAGAACCCTCAGGCCGCAG
AAACCCAGGTTTACGCTCAGTCTTCTTACAGGGTTGTTGACAGAAGAAATTCCAATGATTTTGAGATTCAGAGGTTAAAT
AAATTTGTTGTGGAAACAGTAGGTGCCACCCCGGCAGAGACCAACCCTACAACATATTCTGATGCATTAGAACGCTACGG
TATAGTCACTTCTGACGGTTCAAAAAAAATCATAGGTTTTCGTGCTGGCTCTGGAGGAACATCATTTATTAATGGTGAAT
CCAAAATCTCAACAAATTCAGCATATAGCCATGATCTGTTAAGTGCTAGTCTATTTGAGGTCACCCAATGGGACTCATAC
GGCATGATGATTTATAAAAATGATAAAACATTTCGTAATCTTGAAATATTCGGAGACAGCGGCTCTGGAGCATACTTATA
TGATAACAAACTAGAAAAATGGGTATTAGTCGGAACAACCCATGGTATTGCCAGCGTTAATGGTGACCAACTGACATGGA
TAACAAAATACAATGATAAACTGGTTAGTGAGTTAAAAGATACCTATAGTCATAAAATAAATCTGAATGGCAATAATGTA
ACCATTAAAAACACAGATATAACATTACACCAAAACAATGCAGATACCACTGGTACTCAAGAAAAAATAACTAAAGACAA
AGATATTGTGTTCACAAATGGGGGAGATGTCCTGTTTAAGGATAATTTGGATTTTGGTAGCGGTGGTATTATCTTTGACG
AAGGCCATGAATATAACATAAACGGTCAGGGATTTACATTTAAAGGAGCAGGAATTGATATCGGAAAAGAAAGCATTGTA
AACTGGAATGCATTGTATTCCAGTGATGATGTTTTACACAAAATAGGCCCTGGTACTCTGAATGTTCAAAAAAAACAGGG
GGCAAATATAAAGATAGGTGAAGGAAATGTTATTCTTAATGAAGAAGGAACATTTAACAATATATACCTTGCAAGCGGAA
ATGGTAAGGTAATACTAAATAAAGATAATTCCCTTGGCAATGATCAATATGCGGGGATATTTTTTACTAAACGTGGTGGT
ACGCTAGATTTAAATGGACACAATCAGACTTTTACTAGAATTGCCGCCACTGACGATGGAACAACAATAACTAACTCAGA
TACAACGAAAGAAGCCGTTCTGGCAATCAATAACGAAGACTCCTACATATATCATGGGAACATAAATGGCAATATAAAAC
TAACACACAATATTAATTCTCAGGATAAGAAAACTAATGCAAAATTAATTCTGGATGGTAGTGTCAACACAAAAAATGAT
GTTGAAGTCAGTAATGCCAGTCTTACCATGCAAGGCCATGCAACAGAGCATGCAATATTCAGAAGCTCAGCGAATCATTG
CTCCCTGGTATTTCTTTGTGGAACGGACTGGGTCACCGTTTTGAAAGAAACAGAGAGTTCATATAATAAAAAGTTCAATT
CTGATTACAAAAGTAATAATCAGCAGACCTCATTTGATCAGCCTGACTGGAAAACCGGGGTGTTTAAATTTGATACATTA
CACCTGAACAATGCTGACTTTTCAATATCACGCAATGCCAATGTTGAAGGAAATATATCAGCAAATAAATCAGCTATCAC
AATCGGCGATAAAAATGTTTACATTGATAATCTTGCAGGGAAAAATATTACTAATAATGGTTTTGACTTCAAACAAACTA
TCAGTACTAATCTATCCATAGGAGAAACTAAATTTACAGGTGGCATCACTGCACATAACAGCCAAATAGCCATAGGTGAT
CAAGCTGTAGTTACACTTAATGGTGCAACCTTTCTGGATAATACTCCTATAAGTATAGATAAAGGAGCAAAAGTTATAGC
ACAAAATTCCATGTTCACAACAAAAGGTATTGATATCTCCGGTGAACTGACTATGATGGGAATCCCTGAACAGAATAGTA
AAACTGTAACGCCGGGTCTCCACTACGCTGCTGATGGATTCAGGCTGAGTGGTGGAAATGCAAATTTCATTGCCAGAAAT
ATGGCATCTGTCACCGGAAATATTTATGCTGATGATGCAGCAACCATTACTCTGGGACAGCCTGAAACTGAAACACCGAC
TATATCGTCTGCTTATCAGGCATGGGCAGAGACTCTTTTGTATGGCTTTGATACCGCTTATCGAGGCGCAATAACAGCCC
CCAAAGCTACAGTTAGCATGAATAATGCGATCTGGCATCTAAATAGCCAGTCATCAATTAATCGTCTAGAAACAAAAGAC
AGTATGGTGCGTTTTACTGGTGATAATGGGAAGTTTACAACCCTTACAGTGAACAACCTTACTATAGATGACAGTGCATT
TGTGCTGCGTGCAAATCTGGCCCAAGCAGATCAGCTTGTTGTCAATAAATCGTTGTCTGGTAAAAACAACCTTCTGTTAG
TCGACTTCATTGAGAAAAATGGAAACAGCAACGGACTGAATATCGATCTGGTCAGCGCACCAAAAGGAACTGCAGTAGAT
GTCTTTAAAGCTACGACTCGGAGTATTGGCTTCAGTGATGTAACACCGGTTATCGAGCAAAAGAACGATACAGACAAAGC
AACATGGACTCTGATCGGCTATAAATCTGTGGCCAACGCCGATGCGGCTAAAAAGGCAACATTACTGATGTCAGGCGGCT
ATAAAGCCTTCCTTGCTGAGGTCAACAACCTTAACAAACGTATGGGTGATCTGCGTGACATTAACGGTGAGTCCGGTGCA
TGGGCCCGAATCATTAGCGGAACCGGGTCTGCCGGCGGTGGATTCAGTGACAACTACACCCACGTTCAGGTCGGTGCGGA
TAACAAACATGAACTCGATGGCCTTGACCTCTTCACCGGGGTGACCATGACCTATACCGACAGCCATGCAGGCAGTGATG
CCTTCAGTGGTGAAACGAAGTCTGTGGGTGCCGGTCTCTATGCCTCTGCCATGTTTGAGTCCGGAGCATATATCGACCTC
ATCGGTAAGTACGTTCACCATGACAACGAGTATACCGCAACTTTCGCCGGCCTTGGCACCAGAGACTACAGCTCCCACTC
CTGGTATGCCGGTGCGGAAGTCGGTTACCGTTACCATGTAACTGACTCTGCATGGATTGAGCCGCAGGCGGAACTTGTTT
ACGGTGCTGTATCCGGGAAACAGTTCTCCTGGAAGGACCAGGGAATGAACCTCACCATGAAGGATAAGGACTTTAATCCG
CTGATTGGGCGTACCGGTGTTGATGTGGGTAAATCCTTCTCCGGTAAGGACTGGAAAGTCACAGCCCGCGCCGGCCTTGG
CTACCAGTTTGACCTGTTTGCCAACGGTGAAACCGTACTGCGTGATGCGTCCGGTGAGAAACGTATCAAAGGTGAAAAAG
ACGGTCGTATGCTCATGAATGTTGGTCTCAACGCCGAAATTCGCGATAATCTTCGCTTCGGTCTTGAGTTTGAGAAATCG
GCATTTGGTAAATACAACGTGGATAACGCGATCAACGCCAACTTCCGTTACTCTTTCTGA

Protein sequence :
MREYMNKIYSLKYSAATGGLIAVSELAKRVSGKTNRKLVATMLSLAVAGTVNAANIDISNVWARDYLDLAQNKGIFQPGA
TDVTITLKNGDKFSFHNLSIPDFSGAAASGAATAIGGSYSVTVAHNKKNPQAAETQVYAQSSYRVVDRRNSNDFEIQRLN
KFVVETVGATPAETNPTTYSDALERYGIVTSDGSKKIIGFRAGSGGTSFINGESKISTNSAYSHDLLSASLFEVTQWDSY
GMMIYKNDKTFRNLEIFGDSGSGAYLYDNKLEKWVLVGTTHGIASVNGDQLTWITKYNDKLVSELKDTYSHKINLNGNNV
TIKNTDITLHQNNADTTGTQEKITKDKDIVFTNGGDVLFKDNLDFGSGGIIFDEGHEYNINGQGFTFKGAGIDIGKESIV
NWNALYSSDDVLHKIGPGTLNVQKKQGANIKIGEGNVILNEEGTFNNIYLASGNGKVILNKDNSLGNDQYAGIFFTKRGG
TLDLNGHNQTFTRIAATDDGTTITNSDTTKEAVLAINNEDSYIYHGNINGNIKLTHNINSQDKKTNAKLILDGSVNTKND
VEVSNASLTMQGHATEHAIFRSSANHCSLVFLCGTDWVTVLKETESSYNKKFNSDYKSNNQQTSFDQPDWKTGVFKFDTL
HLNNADFSISRNANVEGNISANKSAITIGDKNVYIDNLAGKNITNNGFDFKQTISTNLSIGETKFTGGITAHNSQIAIGD
QAVVTLNGATFLDNTPISIDKGAKVIAQNSMFTTKGIDISGELTMMGIPEQNSKTVTPGLHYAADGFRLSGGNANFIARN
MASVTGNIYADDAATITLGQPETETPTISSAYQAWAETLLYGFDTAYRGAITAPKATVSMNNAIWHLNSQSSINRLETKD
SMVRFTGDNGKFTTLTVNNLTIDDSAFVLRANLAQADQLVVNKSLSGKNNLLLVDFIEKNGNSNGLNIDLVSAPKGTAVD
VFKATTRSIGFSDVTPVIEQKNDTDKATWTLIGYKSVANADAAKKATLLMSGGYKAFLAEVNNLNKRMGDLRDINGESGA
WARIISGTGSAGGGFSDNYTHVQVGADNKHELDGLDLFTGVTMTYTDSHAGSDAFSGETKSVGAGLYASAMFESGAYIDL
IGKYVHHDNEYTATFAGLGTRDYSSHSWYAGAEVGYRYHVTDSAWIEPQAELVYGAVSGKQFSWKDQGMNLTMKDKDFNP
LIGRTGVDVGKSFSGKDWKVTARAGLGYQFDLFANGETVLRDASGEKRIKGEKDGRMLMNVGLNAEIRDNLRFGLEFEKS
AFGKYNVDNAINANFRYSF
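
Consistency check (illustrative) :
The protein sequence above should correspond to a translation of the DNA sequence with the standard bacterial genetic code, in which the alternative start codon TTG is read as methionine at the first position. The following is a minimal sketch of that check, not part of the original record; the function name translate_cds and the set of accepted start codons (ATG, GTG, TTG) are assumptions chosen for illustration.

```python
# Minimal sketch: translate a coding sequence with the standard codon table.
# Assumption: the first codon, if it is ATG, GTG or TTG, is read as Met (M),
# as is usual for bacterial start codons. Names here are hypothetical.

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons of the DNA sequence in this record
print(translate_cds("TTGAGAGAATATATGAATAAA"))  # MREYMNK
```

Passing the full DNA sequence (line breaks included) to translate_cds and comparing the result with the protein sequence gives a quick way to confirm that the two fields of the record agree.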