Gene Information

Name : sat (c3619)
Accession : NP_755494.1
Strain : Escherichia coli CFT073
Genome accession: NC_004431
Putative virulence/resistance : Virulence
Product : secreted auto transporter toxin
Function : -
COG functional category : S : Function unknown
COG ID : COG4625
EC number : -
Position : 3456362 - 3460261 bp
Length : 3900 bp
Strand : -
Note : Residues 5 to 1299 of 1299 are 100.00 pct identical to residues 1 to 1295 of 1295 from GenPept.129 : >gb|AAG30168.1|AF289092_1 (AF289092) secreted autotransporter toxin [Escherichia coli]

DNA sequence :
TTGAGAGAATATATGAATAAAATATACTCCCTTAAATATAGTGCTGCCACTGGCGGACTCATTGCTGTTTCTGAATTAGC
GAAAAGAGTTTCTGGTAAAACAAACCGAAAACTTGTAGCAACAATGTTGTCTCTGGCTGTTGCCGGTACAGTAAATGCAG
CAAATATTGATATATCAAATGTATGGGCGAGAGACTATCTTGATCTTGCACAAAATAAAGGTATTTTCCAGCCCGGAGCA
ACAGACGTAACAATCACTTTAAAAAACGGAGATAAATTCTCTTTCCATAATCTCTCAATTCCGGATTTTTCTGGTGCAGC
AGCGAGTGGCGCAGCTACCGCAATAGGAGGTTCTTATAGTGTTACTGTTGCACATAACAAAAAGAACCCTCAGGCCGCAG
AAACCCAGGTTTACGCTCAGTCTTCTTACAGGGTTGTTGACAGAAGAAATTCCAATGATTTTGAGATTCAGAGGTTAAAT
AAATTTGTTGTGGAAACAGTAGGTGCCACCCCGGCAGAGACCAACCCTACAACATATTCTGATGCATTAGAACGCTACGG
TATAGTCACTTCTGACGGTTCAAAAAAAATCATAGGTTTTCGTGCTGGCTCTGGAGGAACATCATTTATTAATGGTGAAT
CCAAAATCTCAACAAATTCAGCATATAGCCATGATCTGTTAAGTGCTAGTCTATTTGAGGTCACCCAATGGGACTCATAC
GGCATGATGATTTATAAAAATGATAAAACATTTCGTAATCTTGAAATATTCGGAGACAGCGGCTCTGGAGCATACTTATA
TGATAACAAACTAGAAAAATGGGTATTAGTCGGAACAACCCATGGTATTGCCAGCGTTAATGGTGACCAACTGACATGGA
TAACAAAATACAATGATAAACTGGTTAGTGAGTTAAAAGATACCTATAGTCATAAAATAAATCTGAATGGCAATAATGTA
ACCATTAAAAACACAGATATAACATTACACCAAAACAATGCAGATACCACTGGTACTCAAGAAAAAATAACTAAAGACAA
AGATATTGTGTTCACAAATGGGGGAGATGTCCTGTTTAAGGATAATTTGGATTTTGGTAGCGGTGGTATTATCTTTGACG
AAGGCCATGAATATAACATAAACGGTCAGGGATTTACATTTAAAGGAGCAGGAATTGATATCGGAAAAGAAAGCATTGTA
AACTGGAATGCATTGTATTCCAGTGATGATGTTTTACACAAAATAGGCCCCGGTACTCTGAATGTTCAAAAAAAACAGGG
GGCAAATATAAAGATAGGTGAAGGAAATGTTATTCTTAATGAAGAAGGAACATTTAACAATATATACCTTGCAAGCGGAA
ATGGTAAGGTAATACTAAATAAAGATAATTCCCTTGGCAATGATCAATATGCGGGGATATTTTTTACTAAACGTGGTGGT
ACGCTAGATTTAAATGGACACAATCAGACTTTTACTAGAATTGCCGCCACTGACGATGGAACAACAATAACTAACTCAGA
TACAACGAAAGAAGCCGTTCTGGCAATCAATAACGAAGACTCCTACATATATCATGGGAACATAAATGGCAATATAAAAC
TAACGCACAATATTAATTCTCAGGATAAGAAAACTAATGCAAAATTAATTCTGGATGGTAGTGTCAACACAAAAAATGAT
GTTGAAGTCAGTAATGCCAGTCTTACCATGCAAGGCCATGCAACAGAGCATGCAATATTCAGAAGCTCAGCGAATCATTG
CTCCCTGGTATTTCTTTGTGGAACGGACTGGGTCACCGTTTTGAAAGAAACAGAGAGTTCATATAATAAAAAATTCAATT
CTGATTACAAAAGTAATAATCAGCAGACCTCATTTGATCAGCCTGACTGGAAAACCGGGGTGTTTAAATTTGATACATTA
CACCTGAACAATGCTGACTTTTCAATATCACGCAATGCCAATGTTGAAGGAAATATATCAGCAAATAAATCAGCTATCAC
AATCGGCGATAAAAATGTTTACATTGATAATCTTGCAGGGAAAAATATTACTAATAATGGTTTTGACTTCAAACAAACTA
TCAGTACTAATCTATCCATAGGAGAAACTAAATTTACAGGTGGCATCACTGCACATAACAGCCAAATAGCCATAGGTGAT
CAAGCTGTAGTTACACTTAATGGTGCAACCTTTCTGGATAATACTCCTATAAGTATAGATAAAGGAGCAAAAGTTATAGC
ACAAAATTCCATGTTCACAACAAAAGGTATTGATATCTCCGGTGAACTGACTATGATGGGAATCCCTGAACAGAATAGTA
AAACTGTAACGCCGGGTCTCCACTACGCTGCTGATGGATTCAGGCTGAGTGGTGGAAATGCAAATTTCATTGCCAGAAAT
ATGGCATCTGTCACCGGAAATATTTATGCTGATGATGCAGCAACCATTACTCTGGGACAGCCTGAAACTGAAACACCGAC
TATATCGTCTGCTTATCAGGCATGGGCAGAGACTCTTTTGTATGGCTTTGATACCGCTTATCGAGGCGCAATAACAGCCC
CCAAAGCTACAGTTAGCATGAATAATGCGATCTGGCATCTAAATAGCCAGTCATCAATTAATCGTCTAGAAACAAAAGAC
AGTATGGTGCGTTTTACTGGTGATAATGGGAAGTTTACAACCCTTACAGTGAACAACCTTACTATAGATGACAGTGCATT
TGTGCTGCGTGCAAATCTGGCCCAAGCAGATCAGCTTGTTGTCAATAAATCGTTGTCTGGTAAAAACAACCTTCTGTTAG
TCGACTTCATTGAGAAAAATGGAAACAGCAACGGACTGAATATCGATCTGGTCAGCGCACCAAAAGGAACTGCAGTAGAT
GTCTTTAAAGCTACGACTCGGAGTATTGGCTTCAGTGATGTAACACCGGTTATCGAGCAAAAGAACGATACAGACAAAGC
AACATGGACTCTGATCGGCTATAAATCTGTGGCCAACGCCGATGCGGCTAAAAAGGCAACATTACTGATGTCAGGCGGCT
ATAAAGCCTTCCTTGCTGAGGTCAACAACCTTAACAAACGTATGGGTGATCTGCGTGACATTAACGGTGAGTCCGGTGCA
TGGGCCCGAATCATTAGCGGAACCGGGTCTGCCGGCGGTGGATTCAGTGACAACTACACCCACGTTCAGGTCGGTGCGGA
TAACAAACATGAACTCGATGGCCTTGACCTCTTCACCGGGGTGACCATGACCTATACCGACAGCCATGCAGGCAGTGATG
CCTTCAGTGGTGAAACGAAGTCTGTGGGTGCCGGTCTCTATGCCTCTGCCATGTTTGAGTCCGGAGCATATATCGACCTC
ATCGGTAAGTACGTTCACCATGACAACGAGTATACCGCAACTTTCGCCGGCCTTGGCACCAGAGACTACAGCTCCCACTC
CTGGTATGCCGGTGCGGAAGTCGGTTACCGTTACCATGTAACTGACTCTGCATGGATTGAGCCGCAGGCGGAACTTGTTT
ACGGTGCTGTATCCGGGAAACAGTTCTCCTGGAAGGACCAGGGAATGAACCTCACCATGAAGGATAAGGACTTTAATCCG
CTGATTGGGCGTACCGGTGTTGATGTGGGTAAATCCTTCTCCGGTAAGGACTGGAAAGTCACAGCCCGCGCCGGCCTTGG
CTACCAGTTTGACCTGTTTGCCAACGGTGAAACCGTACTGCGTGATGCGTCCGGTGAGAAACGTATCAAAGGTGAAAAAG
ACGGTCGTATGCTCATGAATGTTGGTCTCAACGCCGAAATTCGCGATAATCTTCGCTTCGGTCTTGAGTTTGAGAAATCG
GCATTTGGTAAATACAACGTGGATAACGCGATCAACGCCAACTTCCGTTACTCTTTCTGA

Protein sequence :
MREYMNKIYSLKYSAATGGLIAVSELAKRVSGKTNRKLVATMLSLAVAGTVNAANIDISNVWARDYLDLAQNKGIFQPGA
TDVTITLKNGDKFSFHNLSIPDFSGAAASGAATAIGGSYSVTVAHNKKNPQAAETQVYAQSSYRVVDRRNSNDFEIQRLN
KFVVETVGATPAETNPTTYSDALERYGIVTSDGSKKIIGFRAGSGGTSFINGESKISTNSAYSHDLLSASLFEVTQWDSY
GMMIYKNDKTFRNLEIFGDSGSGAYLYDNKLEKWVLVGTTHGIASVNGDQLTWITKYNDKLVSELKDTYSHKINLNGNNV
TIKNTDITLHQNNADTTGTQEKITKDKDIVFTNGGDVLFKDNLDFGSGGIIFDEGHEYNINGQGFTFKGAGIDIGKESIV
NWNALYSSDDVLHKIGPGTLNVQKKQGANIKIGEGNVILNEEGTFNNIYLASGNGKVILNKDNSLGNDQYAGIFFTKRGG
TLDLNGHNQTFTRIAATDDGTTITNSDTTKEAVLAINNEDSYIYHGNINGNIKLTHNINSQDKKTNAKLILDGSVNTKND
VEVSNASLTMQGHATEHAIFRSSANHCSLVFLCGTDWVTVLKETESSYNKKFNSDYKSNNQQTSFDQPDWKTGVFKFDTL
HLNNADFSISRNANVEGNISANKSAITIGDKNVYIDNLAGKNITNNGFDFKQTISTNLSIGETKFTGGITAHNSQIAIGD
QAVVTLNGATFLDNTPISIDKGAKVIAQNSMFTTKGIDISGELTMMGIPEQNSKTVTPGLHYAADGFRLSGGNANFIARN
MASVTGNIYADDAATITLGQPETETPTISSAYQAWAETLLYGFDTAYRGAITAPKATVSMNNAIWHLNSQSSINRLETKD
SMVRFTGDNGKFTTLTVNNLTIDDSAFVLRANLAQADQLVVNKSLSGKNNLLLVDFIEKNGNSNGLNIDLVSAPKGTAVD
VFKATTRSIGFSDVTPVIEQKNDTDKATWTLIGYKSVANADAAKKATLLMSGGYKAFLAEVNNLNKRMGDLRDINGESGA
WARIISGTGSAGGGFSDNYTHVQVGADNKHELDGLDLFTGVTMTYTDSHAGSDAFSGETKSVGAGLYASAMFESGAYIDL
IGKYVHHDNEYTATFAGLGTRDYSSHSWYAGAEVGYRYHVTDSAWIEPQAELVYGAVSGKQFSWKDQGMNLTMKDKDFNP
LIGRTGVDVGKSFSGKDWKVTARAGLGYQFDLFANGETVLRDASGEKRIKGEKDGRMLMNVGLNAEIRDNLRFGLEFEKS
AFGKYNVDNAINANFRYSF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
sat YP_002414040.1 Serine protease Not tested Not named Protein 0.0 100
sigA NP_838462.1 serine protease Virulence SHI-1 Protein 0.0 55
sigA NP_708742.1 serine protease Virulence SHI-1 Protein 0.0 55
sigA AAF67320.1 exported serine protease SigA Virulence SHI-1 Protein 0.0 55
espC AAG37043.1 enterotoxin EspC Virulence espC PAI Protein 0.0 52

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
sat NP_755494.1 secreted auto transporter toxin VFG0902 Protein 0.0 100
sat NP_755494.1 secreted auto transporter toxin VFG0862 Protein 0.0 64
sat NP_755494.1 secreted auto transporter toxin VFG0844 Protein 0.0 56
sat NP_755494.1 secreted auto transporter toxin VFG0630 Protein 0.0 55
sat NP_755494.1 secreted auto transporter toxin VFG0772 Protein 0.0 52