Gene Information

Name : ECABU_c32840 (ECABU_c32840)
Accession : YP_006107283.1
Strain : Escherichia coli ABU 83972
Genome accession: NC_017631
Putative virulence/resistance : Virulence
Product : putative secreted autotransporter toxin sat
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3351987 - 3355874 bp
Length : 3888 bp
Strand : +
Note : -

DNA sequence :
ATGAATAAAATATACTCCCTTAAATATAGTGCTGCCACTGGCGGACTCATTGCTGTTTCTGAATTAGCGAAAAGAGTTTC
TGGTAAAACAAACCGAAAACTTGTAGCAACAATGTTGTCTCTGGCTGTTGCCGGTACAGTAAATGCAGCAAATATTGATA
TATCAAATGTATGGGCGAGAGACTATCTTGATCTTGCACAAAATAAAGGTATTTTCCAGCCCGGAGCAACAGACGTAACA
ATCACTTTAAAAAACGGAGATAAATTCTCTTTCCATAATCTCTCAATTCCGGATTTTTCTGGTGCAGCAGCGAGTGGCGC
AGCTACCGCAATAGGAGGTTCTTATAGTGTTACTGTTGCACATAACAAAAAGAACCCTCAGGCCGCAGAAACTCAGGTTT
ACGCTCAGTCTTCTTACAAGGTTGTTGACAGAAGAAATTCCAATGATTTTGAGATTCAGAGGTTAAATAAATTTGTTGTG
GAAACAGTAGGTGCCACCCCGGCAGAGACCAACCCTACAACATATTCTGATGCATTAGAACGCTACGGTATAGTCACTTC
TGACGGTTCAAAAAAAATCATAGGTTTTCGTGCTGGCTCTGGAGGAACATCATTTATTAATGGTGAATCCAAAATCTCAA
CAAATTCAGCATATAGCCATGATCTGTTAAGTGCTAGTCTATTTGAGGTCACCCAATGGGACTCATACGGCATGATGATT
TATAAAAATGATAAAACATTTCGTAATCTTGAAATATTCGGAGACAGCGGCTCTGGAGCATACTTATATGATAACAAACT
AGAAAAATGGGTATTAGTCGGAACAACCCATGGTATTGCCAGCGTTAATGGTGACCAACTGACATGGATAACAAAATACA
ATGATAAACTGGTTAGTGAGTTAAAAGATACCTATAGTCATAAAATAAATCTGAATGGCAATAATGTAACCATAAAAAAC
ACAGATATAACATTACACCAAAACAATGCAGATACCACTGGTACTCAAGAAAAAATAACTAAAGACAAAGATATTGTGTT
CACAAATGGGGGAAATGTCCTGTTTAAGGATAATTTGGATTTTGGTAGCGGTGGTATTATCTTTGACGAAGGCCATGAAT
ATAACATAAACGGTCAGGGATTTACATTTAAAGGAGCAGGAATTGATATCGGAAAAGAAAGCATTGTAAACTGGAATGCA
TTGTATTCCAGTGATGATGTTTTACACAAAATAGGCCCCGGTACTCTGAATGTTCAAAAAAAACAGGGGGCAAATATAAA
GATAGGTGAAGGAAATGTTATTCTTAATGAAGAAGGAACATTTAACAATATATACCTTGCAAGCGGAAATGGTAAGGTAA
TACTAAATAAAGATAATTCCCTTGGCAATGATCAATATGCGGGGATATTTTTTACTAAACGTGGTGGCACGCTAGATTTA
AATGGACACAATCAGACTTTTACTAGAATTGCCGCCACTGACGATGGAACAACAATAACTAACTCAGATACAACGAAAGA
AGCCGTTCTGGCAATCAATAACGAAGACTCCTACATATATCATGGGAACATAAATGGCAATATAAAACTAACGCACAATA
TTAATTCTCAGGATAAGAAAACTAATGCAAAATTAATTCTGGATGGTAGTGTCAACACAAAAAATGATGTTGAAGTCAGT
AATGCCAGTCTTACCATGCAAGGCCATGCAACAGAGCATGCAATATTCAGAAGCACAGCGAATCATTGCTCCCTGGTATT
TCTTTGTGGAACGGACTGGGTCACCGTTTTGAAAGAAACAGAGAGTTCATATAATAAAAAGTTCAATTCTGATCACAAAA
GTAATAATCAGCAGACCTCATTTGATCAGCCAGACTGGAAAACCGGGGTGTTTAAATTTGATACATTACACCTGAACAAT
GCTGACTTTTCAATATCACGCAATGCCAATGTTGAAGGAAATATATCAGCAAATAAATCAGCTATCACAATCGGCGATAA
AAATGCTTACATTGATAATCTTGCAGGGAAAAATATTACTAATAATGGTTTTGACTTCAAACAAACTATCAGTACTAATC
TATCCATAGGAGAAACTAAATTTACAGGTGGCATCACTGCACATAACAGCCAAATAGCCATAGGTGATCAAGCTGTAGTT
ACACTTAATGGTGCAACCTTTCTGAATAATACTCCTATAAGTATAGATAAAGGAGCAAAAGTTATAGCACAAAATTCCAT
GTTCACAACAAAAGGTATTGATATCTCCGGTGAGCTGACTATGATGGGAATCCCTGAACAGAATAGTAAAACTGTAACGC
CGGGTCTCCACTACGCTGCTGATGGATTCAGGCTGAGTGGTGGAAATGCAAATTTCATTGCCAGAAATATGGCATCTGTC
ACCGGAAATATTTATGCTGATGATGCAGCAACCATTACTCTGGGACAGCCTGAAACTGAAACACCGACTATATCGTCTGC
TTATCAGGCATGGGCAGAGACTCTTTTGTATGGCTTTGATACCGCCTATCGAGGCGCAATAACAGCCCCCAAAGCTACAG
TTAGCATGAATAATGCGATCTGGCATCTAAATAGCCAGTCATCAATTAATCGTCTAGAAACAAAAGACAGTATGGTGCGT
TTTACTGGTGATAATGGGAAGTTTACAACCCTTACAGTGGACAACCTTACTATAGATGACAGTGCATTTGTGCTGCGTGC
AAATCTGGCCCAAGCAGATCAGCTTGTTGTCAATAAATCGTTGTCTGGTAAAAACAACCTTCTGTTAGTCGACTTCATTG
AGAAAAATGGAAACAGCAACGGACTGAATATCGATCTGGTCAGCGCACCAAAAGGAACTGCAGTAGATGTCTTTAAAGCT
ACGACTCGGAGTATTGGCTTCAGTGATGTAACACCGGTTATCGAGCAAAAGAACGATACAGACAAAGCAACATGGACTCT
GATCGGCTATAAATCTGTGGCCAACGCCGATGCGGCTAAAAAGGCAACATTACTGATGTCAGGCGGCTATAAAGCCTTCC
TTGCTGAGGTCAACAACCTTAACAAACGTATGGGTGATCTGCGTGACATTAACGGTGAGTCCGGTGCATGGGCCCGAATC
ATGAGCGGAACCGGGTCTGCCGGCGGTGGATTCAGTGACAACTACACCCACGTTCAGGTCGGTGCGGATAACAAACATGA
ACTCGATGGCCTTGACCTCTTCACCGGGGTGACCATGACCTATACCGACAGCCATGCAGGCAGTGATGCCTTCAGTGGTG
AAACGAAGTCTGTGGGTGCCGGTCTCTATGCCTCTGCCATGTTTGAGTCCGGAGCATATATCGACCTCATCGGTAAGTAC
GTTCACCATGACAACGAGTATACCGCAACTTTCGCCGGCCTTGGCACCAGAGACTACAGCTCCCACTCCTGGTATGCCGG
TGCGGAAGTCGGTTACCGTTACCATGTAACTGACTCTGCATGGATTGAGCCGCAGGCGGAACTTGTTTACGGTGCTGTAT
CCGGGAAACAGTTCTCCTGGAAGGACCAGGGAATGAACCTCACCATGAAGGATAAGGACTTTAATCCGCTGATTGGGCGT
ACCGGTGTTGATGTGGGTAAATCCTTCTCGGGTAAGGACTGGAAAGTCACAGCCCGCGCCGGCCTTGGCTACCAGTTTGA
CCTGTTTGCCAACGGTGAAACCGTACTGCGTGATGCGTCCGGTGAGAAACGTATCAAAGGTGAAAAAGACGGTCGTATGC
TCATGAATGTTGGTCTCAACGCTGAAATTCGCGATAATCTTCGCTTCGGTCTTGAGTTTGAGAAATCGGCATTTGGTAAA
TACAACGTGGATAACGCGATCAACGCCAACTTCCGTTACTCTTTCTGA

Protein sequence :
MNKIYSLKYSAATGGLIAVSELAKRVSGKTNRKLVATMLSLAVAGTVNAANIDISNVWARDYLDLAQNKGIFQPGATDVT
ITLKNGDKFSFHNLSIPDFSGAAASGAATAIGGSYSVTVAHNKKNPQAAETQVYAQSSYKVVDRRNSNDFEIQRLNKFVV
ETVGATPAETNPTTYSDALERYGIVTSDGSKKIIGFRAGSGGTSFINGESKISTNSAYSHDLLSASLFEVTQWDSYGMMI
YKNDKTFRNLEIFGDSGSGAYLYDNKLEKWVLVGTTHGIASVNGDQLTWITKYNDKLVSELKDTYSHKINLNGNNVTIKN
TDITLHQNNADTTGTQEKITKDKDIVFTNGGNVLFKDNLDFGSGGIIFDEGHEYNINGQGFTFKGAGIDIGKESIVNWNA
LYSSDDVLHKIGPGTLNVQKKQGANIKIGEGNVILNEEGTFNNIYLASGNGKVILNKDNSLGNDQYAGIFFTKRGGTLDL
NGHNQTFTRIAATDDGTTITNSDTTKEAVLAINNEDSYIYHGNINGNIKLTHNINSQDKKTNAKLILDGSVNTKNDVEVS
NASLTMQGHATEHAIFRSTANHCSLVFLCGTDWVTVLKETESSYNKKFNSDHKSNNQQTSFDQPDWKTGVFKFDTLHLNN
ADFSISRNANVEGNISANKSAITIGDKNAYIDNLAGKNITNNGFDFKQTISTNLSIGETKFTGGITAHNSQIAIGDQAVV
TLNGATFLNNTPISIDKGAKVIAQNSMFTTKGIDISGELTMMGIPEQNSKTVTPGLHYAADGFRLSGGNANFIARNMASV
TGNIYADDAATITLGQPETETPTISSAYQAWAETLLYGFDTAYRGAITAPKATVSMNNAIWHLNSQSSINRLETKDSMVR
FTGDNGKFTTLTVDNLTIDDSAFVLRANLAQADQLVVNKSLSGKNNLLLVDFIEKNGNSNGLNIDLVSAPKGTAVDVFKA
TTRSIGFSDVTPVIEQKNDTDKATWTLIGYKSVANADAAKKATLLMSGGYKAFLAEVNNLNKRMGDLRDINGESGAWARI
MSGTGSAGGGFSDNYTHVQVGADNKHELDGLDLFTGVTMTYTDSHAGSDAFSGETKSVGAGLYASAMFESGAYIDLIGKY
VHHDNEYTATFAGLGTRDYSSHSWYAGAEVGYRYHVTDSAWIEPQAELVYGAVSGKQFSWKDQGMNLTMKDKDFNPLIGR
TGVDVGKSFSGKDWKVTARAGLGYQFDLFANGETVLRDASGEKRIKGEKDGRMLMNVGLNAEIRDNLRFGLEFEKSAFGK
YNVDNAINANFRYSF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
sat YP_002414040.1 Serine protease Not tested Not named Protein 0.0 99
sigA NP_838462.1 serine protease Virulence SHI-1 Protein 0.0 55
sigA NP_708742.1 serine protease Virulence SHI-1 Protein 0.0 55
sigA AAF67320.1 exported serine protease SigA Virulence SHI-1 Protein 0.0 55
espC AAG37043.1 enterotoxin EspC Virulence espC PAI Protein 0.0 52

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
ECABU_c32840 YP_006107283.1 putative secreted autotransporter toxin sat VFG0902 Protein 0.0 99
ECABU_c32840 YP_006107283.1 putative secreted autotransporter toxin sat VFG0862 Protein 0.0 64
ECABU_c32840 YP_006107283.1 putative secreted autotransporter toxin sat VFG0844 Protein 0.0 56
ECABU_c32840 YP_006107283.1 putative secreted autotransporter toxin sat VFG0630 Protein 0.0 55
ECABU_c32840 YP_006107283.1 putative secreted autotransporter toxin sat VFG0772 Protein 0.0 52