Gene Information

Name : sat (i14_3313)
Accession : YP_006155394.1
Strain : Escherichia coli clone D i14
Genome accession: NC_017652
Putative virulence/resistance : Virulence
Product : secreted auto transpoter toxin
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3319336 - 3323235 bp
Length : 3900 bp
Strand : -
Note : -

DNA sequence :
TTGAGAGAATATATGAATAAAATATACTCCCTTAAATATAGTGCTGCCACTGGCGGACTCATTGCTGTTTCTGAATTAGC
GAAAAGAGTTTCTGGTAAAACAAACCGAAAACTTGTAGCAACAATGTTGTCTCTGGCTGTTGCCGGTACAGTAAATGCAG
CAAATATTGATATATCAAATGTATGGGCGAGAGACTATCTTGATCTTGCACAAAATAAAGGTATTTTCCAGCCCGGAGCA
ACAGACGTAACAATCACTTTAAAAAACGGAGATAAATTCTCTTTCCATAATCTCTCAATTCCGGATTTTTCTGGTGCAGC
AGCGAGTGGCGCAGCTACCGCAATAGGAGGTTCTTATAGTGTTACTGTTGCACATAACAAAAAGAACCCTCAGGCCGCAG
AAACTCAGGTTTACGCTCAGTCTTCTTACAAGGTTGTTGACAGAAGAAATTCCAATGATTTTGAGATTCAGAGGTTAAAT
AAATTTGTTGTGGAAACAGTAGGTGCCACCCCGGCAGAGACCAACCCTACAACATATTCTGATGCATTAGAACGCTACGG
TATAGTCACTTCTGACGGTTCAAAAAAAATCATAGGTTTTCGTGCTGGCTCTGGAGGAACATCATTTATTAATGGTGAAT
CCAAAATCTCAACAAATTCAGCATATAGCCATGATCTGTTAAGTGCTAGTCTATTTGAGGTCACCCAATGGGACTCATAC
GGCATGATGATTTATAAAAATGATAAAACATTTCGTAATCTTGAAATATTCGGAGACAGCGGCTCTGGAGCATACTTATA
TGATAACAAACTAGAAAAATGGGTATTAGTCGGAACAACCCATGGTATTGCCAGCGTTAATGGTGACCAACTGACATGGA
TAACAAAATACAATGATAAACTGGTTAGTGAGTTAAAAGATACCTATAGTCATAAAATAAATCTGAATGGCAATAATGTA
ACCATAAAAAACACAGATATAACATTACACCAAAACAATGCAGATACCACTGGTACTCAAGAAAAAATAACTAAAGACAA
AGATATTGTGTTCACAAATGGGGGAAATGTCCTGTTTAAGGATAATTTGGATTTTGGTAGCGGTGGTATTATCTTTGACG
AAGGCCATGAATATAACATAAACGGTCAGGGATTTACATTTAAAGGAGCAGGAATTGATATCGGAAAAGAAAGCATTGTA
AACTGGAATGCATTGTATTCCAGTGATGATGTTTTACACAAAATAGGCCCCGGTACTCTGAATGTTCAAAAAAAACAGGG
GGCAAATATAAAGATAGGTGAAGGAAATGTTATTCTTAATGAAGAAGGAACATTTAACAATATATACCTTGCAAGCGGAA
ATGGTAAGGTAATACTAAATAAAGATAATTCCCTTGGCAATGATCAATATGCGGGGATATTTTTTACTAAACGTGGTGGC
ACGCTAGATTTAAATGGACACAATCAGACTTTTACTAGAATTGCCGCCACTGACGATGGAACAACAATAACTAACTCAGA
TACAACGAAAGAAGCCGTTCTGGCAATCAATAACGAAGACTCCTACATATATCATGGGAACATAAATGGCAATATAAAAC
TAACGCACAATATTAATTCTCAGGATAAGAAAACTAATGCAAAATTAATTCTGGATGGTAGTGTCAACACAAAAAATGAT
GTTGAAGTCAGTAATGCCAGTCTTACCATGCAAGGCCATGCAACAGAGCATGCAATATTCAGAAGCACAGCGAATCATTG
CTCCCTGGTATTTCTTTGTGGAACGGACTGGGTCACCGTTTTGAAAGAAACAGAGAGTTCATATAATAAAAAGTTCAATT
CTGATCACAAAAGTAATAATCAGCAGACCTCATTTGATCAGCCAGACTGGAAAACCGGGGTGTTTAAATTTGATACATTA
CACCTGAACAATGCTGACTTTTCAATATCACGCAATGCCAATGTTGAAGGAAATATATCAGCAAATAAATCAGCTATCAC
AATCGGCGATAAAAATGCTTACATTGATAATCTTGCAGGGAAAAATATTACTAATAATGGTTTTGACTTCAAACAAACTA
TCAGTACTAATCTATCCATAGGAGAAACTAAATTTACAGGTGGCATCACTGCACATAACAGCCAAATAGCCATAGGTGAT
CAAGCTGTAGTTACACTTAATGGTGCAACCTTTCTGAATAATACTCCTATAAGTATAGATAAAGGAGCAAAAGTTATAGC
ACAAAATTCCATGTTCACAACAAAAGGTATTGATATCTCCGGTGAGCTGACTATGATGGGAATCCCTGAACAGAATAGTA
AAACTGTAACGCCGGGTCTCCACTACGCTGCTGATGGATTCAGGCTGAGTGGTGGAAATGCAAATTTCATTGCCAGAAAT
ATGGCATCTGTCACCGGAAATATTTATGCTGATGATGCAGCAACCATTACTCTGGGACAGCCTGAAACTGAAACACCGAC
TATATCGTCTGCTTATCAGGCATGGGCAGAGACTCTTTTGTATGGCTTTGATACCGCCTATCGAGGCGCAATAACAGCCC
CCAAAGCTACAGTTAGCATGAATAATGCGATCTGGCATCTAAATAGCCAGTCATCAATTAATCGTCTAGAAACAAAAGAC
AGTATGGTGCGTTTTACTGGTGATAATGGGAAGTTTACAACCCTTACAGTGGACAACCTTACTATAGATGACAGTGCATT
TGTGCTGCGTGCAAATCTGGCCCAAGCAGATCAGCTTGTTGTCAATAAATCGTTGTCTGGTAAAAACAACCTTCTGTTAG
TCGACTTCATTGAGAAAAATGGAAACAGCAACGGACTGAATATCGATCTGGTCAGCGCACCAAAAGGAACTGCAGTAGAT
GTCTTTAAAGCTACGACTCGGAGTATTGGCTTCAGTGATGTAACACCGGTTATCGAGCAAAAGAACGATACAGACAAAGC
AACATGGACTCTGATCGGCTATAAATCTGTGGCCAACGCCGATGCGGCTAAAAAGGCAACATTACTGATGTCAGGCGGCT
ATAAAGCCTTCCTTGCTGAGGTCAACAACCTTAACAAACGTATGGGTGATCTGCGTGACATTAACGGTGAGTCCGGTGCA
TGGGCCCGAATCATGAGCGGAACCGGGTCTGCCGGCGGTGGATTCAGTGACAACTACACCCACGTTCAGGTCGGTGCGGA
TAACAAACATGAACTCGATGGCCTTGACCTCTTCACCGGGGTGACCATGACCTATACCGACAGCCATGCAGGCAGTGATG
CCTTCAGTGGTGAAACGAAGTCTGTGGGTGCCGGTCTCTATGCCTCTGCCATGTTTGAGTCCGGAGCATATATCGACCTC
ATCGGTAAGTACGTTCACCATGACAACGAGTATACCGCAACTTTCGCCGGCCTTGGCACCAGAGACTACAGCTCCCACTC
CTGGTATGCCGGTGCGGAAGTCGGTTACCGTTACCATGTAACTGACTCTGCATGGATTGAGCCGCAGGCGGAACTTGTTT
ACGGTGCTGTATCCGGGAAACAGTTCTCCTGGAAGGACCAGGGAATGAACCTCACCATGAAGGATAAGGACTTTAATCCG
CTGATTGGGCGTACCGGTGTTGATGTGGGTAAATCCTTCTCGGGTAAGGACTGGAAAGTCACAGCCCGCGCCGGCCTTGG
CTACCAGTTTGACCTGTTTGCCAACGGTGAAACCGTACTGCGTGATGCGTCCGGTGAGAAACGTATCAAAGGTGAAAAAG
ACGGTCGTATGCTCATGAATGTTGGTCTCAACGCTGAAATTCGCGATAATCTTCGCTTCGGTCTTGAGTTTGAGAAATCG
GCATTTGGTAAATACAACGTGGATAACGCGATCAACGCCAACTTCCGTTACTCTTTCTGA

Protein sequence :
MREYMNKIYSLKYSAATGGLIAVSELAKRVSGKTNRKLVATMLSLAVAGTVNAANIDISNVWARDYLDLAQNKGIFQPGA
TDVTITLKNGDKFSFHNLSIPDFSGAAASGAATAIGGSYSVTVAHNKKNPQAAETQVYAQSSYKVVDRRNSNDFEIQRLN
KFVVETVGATPAETNPTTYSDALERYGIVTSDGSKKIIGFRAGSGGTSFINGESKISTNSAYSHDLLSASLFEVTQWDSY
GMMIYKNDKTFRNLEIFGDSGSGAYLYDNKLEKWVLVGTTHGIASVNGDQLTWITKYNDKLVSELKDTYSHKINLNGNNV
TIKNTDITLHQNNADTTGTQEKITKDKDIVFTNGGNVLFKDNLDFGSGGIIFDEGHEYNINGQGFTFKGAGIDIGKESIV
NWNALYSSDDVLHKIGPGTLNVQKKQGANIKIGEGNVILNEEGTFNNIYLASGNGKVILNKDNSLGNDQYAGIFFTKRGG
TLDLNGHNQTFTRIAATDDGTTITNSDTTKEAVLAINNEDSYIYHGNINGNIKLTHNINSQDKKTNAKLILDGSVNTKND
VEVSNASLTMQGHATEHAIFRSTANHCSLVFLCGTDWVTVLKETESSYNKKFNSDHKSNNQQTSFDQPDWKTGVFKFDTL
HLNNADFSISRNANVEGNISANKSAITIGDKNAYIDNLAGKNITNNGFDFKQTISTNLSIGETKFTGGITAHNSQIAIGD
QAVVTLNGATFLNNTPISIDKGAKVIAQNSMFTTKGIDISGELTMMGIPEQNSKTVTPGLHYAADGFRLSGGNANFIARN
MASVTGNIYADDAATITLGQPETETPTISSAYQAWAETLLYGFDTAYRGAITAPKATVSMNNAIWHLNSQSSINRLETKD
SMVRFTGDNGKFTTLTVDNLTIDDSAFVLRANLAQADQLVVNKSLSGKNNLLLVDFIEKNGNSNGLNIDLVSAPKGTAVD
VFKATTRSIGFSDVTPVIEQKNDTDKATWTLIGYKSVANADAAKKATLLMSGGYKAFLAEVNNLNKRMGDLRDINGESGA
WARIMSGTGSAGGGFSDNYTHVQVGADNKHELDGLDLFTGVTMTYTDSHAGSDAFSGETKSVGAGLYASAMFESGAYIDL
IGKYVHHDNEYTATFAGLGTRDYSSHSWYAGAEVGYRYHVTDSAWIEPQAELVYGAVSGKQFSWKDQGMNLTMKDKDFNP
LIGRTGVDVGKSFSGKDWKVTARAGLGYQFDLFANGETVLRDASGEKRIKGEKDGRMLMNVGLNAEIRDNLRFGLEFEKS
AFGKYNVDNAINANFRYSF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
sat YP_002414040.1 Serine protease Not tested Not named Protein 0.0 99
sigA NP_838462.1 serine protease Virulence SHI-1 Protein 0.0 55
sigA NP_708742.1 serine protease Virulence SHI-1 Protein 0.0 55
sigA AAF67320.1 exported serine protease SigA Virulence SHI-1 Protein 0.0 55
espC AAG37043.1 enterotoxin EspC Virulence espC PAI Protein 0.0 52

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
sat YP_006155394.1 secreted auto transpoter toxin VFG0902 Protein 0.0 99
sat YP_006155394.1 secreted auto transpoter toxin VFG0862 Protein 0.0 64
sat YP_006155394.1 secreted auto transpoter toxin VFG0844 Protein 0.0 56
sat YP_006155394.1 secreted auto transpoter toxin VFG0630 Protein 0.0 55
sat YP_006155394.1 secreted auto transpoter toxin VFG0772 Protein 0.0 52