Gene Information

Name : espC (E2348C_2915)
Accession : YP_002330403.1
Strain : Escherichia coli E2348/69
Genome accession : NC_011601
Putative virulence/resistance : Virulence
Product : serine protease
Function : -
COG functional category : S : Function unknown
COG ID : COG4625
EC number : -
Position : 3005494 - 3009411 bp
Length : 3918 bp
Strand : -
Note : -
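
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state the convention, so this is an assumption), and the variable names are illustrative only.

```python
# Coordinates copied from the Position field; assumed 1-based and inclusive.
start, end = 3005494, 3009411

length_bp = end - start + 1               # inclusive span on the genome
codons, remainder = divmod(length_bp, 3)  # split the span into codons

print(length_bp)   # 3918, matching the Length field
print(codons)      # 1306 codons: 1305 residues plus one stop codon
print(remainder)   # 0, so the span is a whole number of codons
```

Because Strand is "-", the gene lies on the reverse strand of NC_011601; the DNA sequence in this record is shown in the coding orientation (it begins with ATG and ends with TGA).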

DNA sequence :
ATGAATAAAATATACGCATTAAAATATTGTCACGCGACAGGGGGGCTGATTGCTGTATCCGAACTGGCCTCCAGAGTTAT
GAAGAAAGCCGCTCGCGGCAGCCTTTTAGCATTATTTAATCTATCATTGTATGGTGCTTTTTTAAGCGCATCTCAGGCTG
CTCAACTAAATATTGATAATGTATGGGCTAGAGATTATTTAGACCTCGCACAAAATAAGGGGGTGTTTAAAGCTGGTGCG
ACCAATGTTTCAATTCAACTCAAGAATGGCCAGACGTTTAATTTTCCAAATGTTCCAATTCCTGATTTCTCGCCGGCCTC
AAATAAAGGCGCTACTACATCTATAGGTGGAGCTTATAGTGTCACAGCAACCCATAACGGAACAACTCATCATGCAATAA
GCACCCAAAACTGGGGACAAAGCTCATATAAATATATAGACCGGATGACGAATGGAGATTTTGCTGTAACACGACTTGAT
AAGTTTGTTGTTGAAACAACAGGGGTAAAAAATTCAGTAGATTTTTCTCTCAATAGTCATGATGCTCTTGAACGTTATGG
TGTGGAGATCAATGGTGAGAAAAAAATCATTGGTTTCAGGGTTGGGGCTGGGACGACTTATACCGTTCAAAATGGTAATA
CATATAGTACAGGACAGGTATACAATCCTCTTTTGTTAAGCGCTTCAATGTTTCAGTTAAACTGGGATAACAAAAGACCA
TATAATAACACGACACCTTTTTATAATGAAACTACCGGTGGAGACAGTGGTTCCGGTTTCTATCTGTATGATAACGTAAA
AAAAGAATGGGTTATGCTTGGTACTTTATTTGGAATAGCATCCAGTGGTGCAGATGTTTGGTCTATTCTGAATCAGTATG
ATGAAAATACAGTTAATGGTTTAAAAAACAAATTTACTCAAAAAGTCCAGTTAAACAATAATACAATGTCGCTTAATAGT
GACAGTTTTACGTTAGCTGGTAATAATACAGCAGTGGAAAAAAATAATAATAACTATAAAGATCTAAGTTTTAGTGGTGG
TGGAAGTATTAATTTCGACAATGACGTAAACATTGGCTCTGGTGGTCTCATTTTTGATGCAGGGCATCATTATACTGTCA
CTGGTAATAATAAAACATTCAAGGGTGCCGGGCTGGATATTGGTGACAATACTACAGTCGACTGGAATGTGAAAGGGGTT
GTCGGTGATAACCTGCATAAAATTGGTGCAGGTACATTGAATGTTAATGTTTCTCAAGGTAATAATCTTAAAACGGGGGA
TGGTCTTGTCGTATTAAATAGCGCTAATGCATTTGATAATATTTATATGGCCAGTGGTCATGGTGTTGTAAAAATTAATC
ATAGTGCAGCGCTTAACCAGAACAATGACTATAGAGGTATTTTCTTTACTGAAAATGGTGGTACTCTGGATTTAAATGGT
TATGACCAGAGTTTTAATAAAATTGCAGCGACAGATATAGGAGCACTCATAACAAATAGTGCAGTGCAGAAAGCAGTTCT
TTCTGTTAATAATCAGTCAAACTATATGTATCATGGTTCTGTTTCAGGTAATACAGAGATAAACCACCAGTTTGATACCC
AAAAAAATAATAGTCGCCTGATTCTGGACGGTAATGTCGATATTACAAATGACATTAACATTAAGAATAGCCAGCTCACC
ATGCAGGGACATGCTACATCTCATGCTGTTTTTAGAGAGGGTGGGGTTACCTGCATGCTGCCAGGAGTTATTTGTGAAAA
GGATTATGTTTCAGGCATACAGCAACAGGAAAACTCAGCCAATAAAAATAATAATACAGATTATAAGACCAATAATCAGG
TATCATCATTTGAGCAACCTGACTGGGAAAATCGTCTGTTTAAGTTTAAGACATTGAATCTGATAAATTCAGATTTTATC
GTTGGCCGTAATGCTATTGTTGTTGGTGATATTTCTGCCAATAATTCCACTCTGTCTTTAAGTGGAAAAGATACAAAAGT
ACATATTGATATGTATGACGGCAAAAACATCACGGGAGATGGCTTCGGTTTTCGGCAGGATATTAAAGATGGTGTATCTG
TTTCTCCTGAGAGCAGCAGTTATTTTGGAAATGTTACGCTGAATAATCACTCATTACTGGATATTGGTAATAAATTTACC
GGTGGTATCGAGGCTTATGACAGCTCCGTGAGTGTGACCTCACAGAATGCTGTTTTTGATCGTGTTGGCAGCTTTGTCAA
CAGCAGCCTGACCCTCGAAAAAGGAGCAAAACTAACGGCTCAGGGCGGTATTTTCAGCACCGGGGCTGTGGACGTAAAAG
AAAATGCCTCCCTGATCCTGACGGGGACACCTTCTGCACAGAAACAGGAGTATTACTCCCCTGTGATTTCTACAACGGAA
GGGATTAACCTCGGAGATAAGGCCAGCCTTTCTGTTAAAAACATGGGCTATCTGAGTTCGGATATTCATGCAGGAACCAC
GGCGGCAACCATTAATCTGGGAGACGGTGATGCTGAGACGGATTCTCCGTTATTCAGCTCCCTGATGAAGGGATATAACG
CGGTTCTGAGTGGCAACATTACGGGTGAGCAGAGTACGGTAAATATGAACAATGCTCTGTGGTACTCTGACGGAAACTCA
ACGATCGGAACGCTGAAGAGTACGGGGGGACGAGTTGAACTGGGGGGCGGGAAAGACTTTGCCACCCTGCGGGTAAAAGA
GCTTAACGCAAATAACGCCACATTCCTGATGCATACCAACAACAGTCAGGCTGACCAGCTGAATGTCACGAATAAACTGT
TGGGCAGTAATAATACCGTCCTGGTCGACTTTTTAAACAAGCCAGCCAGTGAAATGAACGTGACGTTAATTACCGCACCG
AAAGGGAGTGACGAGAAAACGTTCACTGCAGGAACGCAGCAGATTGGTTTCAGTAATGTCACGCCGGTAATCAGCACAGA
AAAAACGGATGATGCCACAAAATGGATGCTGACAGGGTATCAGACCGTCTCTGATGCCGGTGCCTCGAAAACCGCAACGG
ACTTTATGGCGTCAGGTTATAAATCCTTCCTGACAGAGGTCAATAATCTGAACAAGCGTATGGGTGACCTGCGGGATACT
CAGGGGGATGCCGGCGTCTGGGCGCGCATCATGAACGGTACCGGTTCGGCAGATGGTGGTTACAGCGATAACTACACTCA
CGTTCAGATTGGTGCCGACAGAAAGCATGAGCTGGACGGTGTGGATTTGTTCACGGGTGCATTACTGACCTATACAGACA
GCAATGCAAGCAGCCACGCCTTCAGTGGTAAAACCAAATCCGTGGGGGGAGGGTTGTACGCTTCAGCACTCTTTGATTCC
GGGGCTTATTTTGACCTGATTGGTAAATATCTCCATCACGACAATCAGTACACGGCGAGTTTTGCGTCTCTTGGTACAAA
AGACTACAGCTCTCATTCCTGGTATGCCGGTGCAGAGGTCGGGTATCGTTACCACCTGTCGGAAGAGTCCTGGGTGGAGC
CACAGATGGAGCTGGTTTACGGTTCTGTGTCAGGAAAATCTTTTAGCTGGGAAGACCGGGGAATGGCCCTGAGCATGAAA
GACAAGGATTATAACCCACTGATTGGCCGTACCGGTGTTGACGTGGGAAGAACCTTCTCCGGAGACGACTGGAAAATTAC
CGCGCGAGCCGGGCTGGGTTACCAGTTCGACCTGCTGGCGAACGGAGAAACGGTTCTGCGGGATGCATCCGGAGAGAAAC
GTTTTGAAGGTGAAAAGGACAGCAGAATGCTGATGAATGTGGGGATGAATGCGGAAATTAAGGATAATATGCGTTTTGGC
TTGGAGCTGGAAAAATCGGCGTTCGGGAAATATAACGTGGACAATGCGATAAACGCTAACTTCCGTTATTCTTTCTGA

Protein sequence :
MNKIYALKYCHATGGLIAVSELASRVMKKAARGSLLALFNLSLYGAFLSASQAAQLNIDNVWARDYLDLAQNKGVFKAGA
TNVSIQLKNGQTFNFPNVPIPDFSPASNKGATTSIGGAYSVTATHNGTTHHAISTQNWGQSSYKYIDRMTNGDFAVTRLD
KFVVETTGVKNSVDFSLNSHDALERYGVEINGEKKIIGFRVGAGTTYTVQNGNTYSTGQVYNPLLLSASMFQLNWDNKRP
YNNTTPFYNETTGGDSGSGFYLYDNVKKEWVMLGTLFGIASSGADVWSILNQYDENTVNGLKNKFTQKVQLNNNTMSLNS
DSFTLAGNNTAVEKNNNNYKDLSFSGGGSINFDNDVNIGSGGLIFDAGHHYTVTGNNKTFKGAGLDIGDNTTVDWNVKGV
VGDNLHKIGAGTLNVNVSQGNNLKTGDGLVVLNSANAFDNIYMASGHGVVKINHSAALNQNNDYRGIFFTENGGTLDLNG
YDQSFNKIAATDIGALITNSAVQKAVLSVNNQSNYMYHGSVSGNTEINHQFDTQKNNSRLILDGNVDITNDINIKNSQLT
MQGHATSHAVFREGGVTCMLPGVICEKDYVSGIQQQENSANKNNNTDYKTNNQVSSFEQPDWENRLFKFKTLNLINSDFI
VGRNAIVVGDISANNSTLSLSGKDTKVHIDMYDGKNITGDGFGFRQDIKDGVSVSPESSSYFGNVTLNNHSLLDIGNKFT
GGIEAYDSSVSVTSQNAVFDRVGSFVNSSLTLEKGAKLTAQGGIFSTGAVDVKENASLILTGTPSAQKQEYYSPVISTTE
GINLGDKASLSVKNMGYLSSDIHAGTTAATINLGDGDAETDSPLFSSLMKGYNAVLSGNITGEQSTVNMNNALWYSDGNS
TIGTLKSTGGRVELGGGKDFATLRVKELNANNATFLMHTNNSQADQLNVTNKLLGSNNTVLVDFLNKPASEMNVTLITAP
KGSDEKTFTAGTQQIGFSNVTPVISTEKTDDATKWMLTGYQTVSDAGASKTATDFMASGYKSFLTEVNNLNKRMGDLRDT
QGDAGVWARIMNGTGSADGGYSDNYTHVQIGADRKHELDGVDLFTGALLTYTDSNASSHAFSGKTKSVGGGLYASALFDS
GAYFDLIGKYLHHDNQYTASFASLGTKDYSSHSWYAGAEVGYRYHLSEESWVEPQMELVYGSVSGKSFSWEDRGMALSMK
DKDYNPLIGRTGVDVGRTFSGDDWKITARAGLGYQFDLLANGETVLRDASGEKRFEGEKDSRMLMNVGMNAEIKDNMRFG
LELEKSAFGKYNVDNAINANFRYSF
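
As a quick consistency check between the DNA and protein sequences above, the first and last codons can be translated with a few lines of Python. This is a minimal sketch that assumes the standard codon assignments; the `translate` helper and the variable names are hypothetical and not part of the database record.

```python
from itertools import product

# Standard genetic code with codons enumerated in TCAG order (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' marks a stop)."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 36 and last 9 nucleotides of the DNA sequence listed above.
head = "ATGAATAAAATATACGCATTAAAATATTGTCACGCG"
tail = "TCTTTCTGA"

print(translate(head))  # MNKIYALKYCHA
print(translate(tail))  # SF*
```

The output agrees with the start (MNKIYALKYCHA) and end (SF) of the protein sequence, with the final TGA read as the stop codon.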

• Homologs from PAI DB

Gene  GenBank Accn    Product                        Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity (%)
espC  AAG37043.1      enterotoxin EspC               Virulence                espC PAI    Protein         0.0    99
sigA  NP_838462.1     serine protease                Virulence                SHI-1       Protein         0.0    54
sigA  NP_708742.1     serine protease                Virulence                SHI-1       Protein         0.0    54
sigA  AAF67320.1      exported serine protease SigA  Virulence                SHI-1       Protein         0.0    54
sat   YP_002414040.1  Serine protease                Not tested               Not named   Protein         0.0    52

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn    Product          ID of source DB  Alignment Type  E-val  Identity (%)
espC  YP_002330403.1  serine protease  VFG0772          Protein         0.0    99
espC  YP_002330403.1  serine protease  VFG0630          Protein         0.0    54
espC  YP_002330403.1  serine protease  VFG0844          Protein         0.0    53
espC  YP_002330403.1  serine protease  VFG0862          Protein         0.0    53
espC  YP_002330403.1  serine protease  VFG0902          Protein         0.0    52