Gene Information

Name : espC (ECNA114_4310)
Accession : YP_006141430.1
Strain : Escherichia coli NA114
Genome accession: NC_017644
Putative virulence/resistance : Virulence
Product : Per-activated serine protease autotransporter enterotoxin EspC
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4454260 - 4458024 bp
Length : 3765 bp
Strand : +
Note : -

DNA sequence :
ATGGACACCCGAAATTTCTGGATCCGTGATTATCTTGATCTGGCACAGAATAAAGGTGCATTTCAGCCCGGAGCATACGG
GGTAAAAACGCCATTAAAAAATGGGGGAGAATTCAGTTTTCCTGAAGTAACAATCCCTGATTTTTCTCCTGTATCCGCTA
AAGGTGCAACAACTGCTATTGGTAATGCCTACAGTGTTACAGCAAGTCATAATGGCACTATTCACCATGCCATTAAAACT
CAGACATGGGGACAGTCAGATTATCATTATGTTGATCGTGTGACCAAAGGTGACTTTGCGGTCCAGCGTCTGGATAAGTT
TGTTGTTGAAACAGCGGGTGCAACAGAGCATGCTGATTTCAACTTATCAGCAGCAGAAGCACTAGAGCGTTATGGTATTG
AATTCAATGGAAAGAAACAAATAATCGGTTTCCGCGTTGGAGCTGGTGCAACCGGTGTTACATCTTATGGGGTGGGACAG
ACATACAATCCATTATTACGTAGTGCCTCCATGTTTCAGTTAAACTGGAACAATATGTATGCAAGTAATAATACCGGTGG
GTTTTATAATGAAGTGACAGGAGGTGACAGCGGTTCCGGATTTTACTTGTATGATAATCAGAAAAAAAAATGGGTCATTT
TGGGGACACTGACAGGTAAAGTATTCTCCAGTAAGGATACCTGGGCCTTTTTTGCTCGATATGATCAGAACACTGTCGAT
ATCCTGAAAAATACTTTTACCCAAGAGGTGAACCTCAATGGTCAGAAAATGACAGTTAATAATAAAAATATTGCCATTAA
CGATAAAATAACTGCTATTGAACTGACCAAGAGTAATAAAAATAAAGATTTGAAATTCCATGGTGGCGGGAGCATTGAGC
TCACCGATAACCTGAACTCAGGAACCGGAGGATTGATTTTTGATGAGGGACAACATTATTCGGTTATTGGGAAAGATAAA
GCCTATAAAGGGGCGGGTGTTGAGATCGGAAAAGATACGGTTGTCGACTGGTCGGTAAAAGGGGAGGCAAACGATAACCT
GCACAAAACAGGGGCCGGGACACTGAATGTCAATGTGGCCCAGGGGAATAACCTGAAAACAGGTGACGGTACCGTTTTTC
TTAATGCAGAAAAGGCTTTCAATGCTATCTATGTTGCCAGTGGCCGTGGAACGGTCAAACTGGGGCAGGCCGATGCGCTG
GATAAAAATAGTGATTACAGAGGTATTTATTTTACTAGTCGTGGGGGGACTCTGGATTTAAACGGGTTCAGCCAGTCGTT
CAAGAAGATCGCGGCAACTGATGTTGGTACCATTATCACCAATACTTCTGATAAAACAGCGACCCTTTCCCTACAAAACC
CCTCCCGCTATGTCTATCACGGTAGTATCACGGGAAATACGAATATCGAACACACTGGGACACAGAAAAGTGCTGACAGC
AGTCTGATTATTGATGGAAACATTAATACACGCAATGACATTACTGTGCGGAATTCCCAGCTCAGACTTCAGGGGCATGC
CACATCACATGCAATATTCCGCGAGGGGCCTCGGCACTGCTATGTCCCCGGAGTTCTTTGTGACAAAGATTATGTTACTG
ATTTTGCCAGACTGGAAAGTGAGGCAAATAAGAAAAATAACAGTGCCTATAAAACAAATAATCAGGTGGCTTCTTTTGAC
CAACCTGACTGGGAAACCCGGCATTTCCGATTTAAGACTCTGAATCTGGAAAACTCAGAATTCACAACTGCACGTAACTC
AGTTGTTGAGGGTGATATTGTCGCATCGAATTCAACGCTGAAACTGGGGGGCGACGTTCCGGTGTTCATTGATATGTATG
ATGGCATCAATATCACCGGTAATGGTTTTGGCTTCCGCCAGGACGTTCGTGAAGGACGCTCAGCAGATGATGGCAGTAGC
AGCTATACAGGCAAAATTACACTGCAAAAAGGCTCCACGCTGGACATCAACAACCGGTTCATTGGCGGTATTGAAGCCCA
TGACAGTAAGGTAAACGTCACCTCACCGGATGCCCTGCTGCAGAACAGTGGTGTTTTCGTGAATTCCACCCTTTCTGTCC
GTGACGGCGGTCATCTGACGGCACAAAAAGGGCTCTACAGTAACGGCCGGGTTCAGATTGGAAAGAAAGGTACGCTTTCC
CTGAGCGGCACGCCGGAAAATGGCGCTGATAATACCTGGATGCCCGTACTGACATACATGACAGAAGGCTATGATTTAAC
CGGTGATAACGCTACGCTGAACATCAGCCAGCAGGCGCATGTTTCCGGGGATGTTCATGCAACCAGTTCATCCAGCATTC
GTATTGGCTCCGAAAACCCTGGCTCAGTTTCCTCTTCTGTCTCCCCTGTTCTGGCTGCCGGGTTGTTCAGCGGATATAAC
GCGGCGTACTACGGTGCCATCACCGGCGGTAAGGGGAACGTCAGTATGAATAATGGCCTGTGGCAGCTGACCGGAGATTC
CGACATCAACAGTCTGACGACCCGTAACAGCCGGGTACAGTCTGAAGAAAACGGTGCCTTCCGTACCCTGACGGTTAAGA
CACTTGATGCCACGGGCAGTGATTTTGTCCTGCGCACTGATCTGAAGGACGCTGATAAAATCAGTATTACGGAGAAAGCC
AGCGGTTCAGACAACACCCTGAATGTCAGCTTTATGAAGAACCCGTCTCCGGGACAGTCCCTGAATATCCCGCTGGTCAG
TGCACCGGCCGGAACATCAGGGGATATCTTTAAGGCCGGCACCCGGGTGACAGGCTTCAGTCGTGTGACGCCGACGCTGC
GTGTTGACACCACTGGCGGCAGTACGAAGTGGATTCTGGATGGTTTCAGGACGGAAGCTGATAAAGCGGCTGCAGCGAAG
GCGAACAGTTTCATGAATGCCGGCTACAAAAGCTTTATGACGGAAGTAAACAATCTGAACAAACGTATGGGGGAGCTGCG
TGACACGAACGGTGATGCCGGTGCCTGGGCCCGTATTATGAACGGCGCCGGTTCAGCCGATGGCGGATACAGTGATAACT
ACACCCACGTTCAGGTCGGTTTTGACAAAAAACATGCGCTGGACGGTGTTGACCTGTTCACCGGTGTCACGATGACCTAT
ACCGACAGCAGTGCAGACAGTGATGCGTTCAGCGGGAAGACAAAATCCGTGGGGGGAGGTCTGTATGCTTCAGCACTGTT
TAACTCCGGTGCCTACATTGATTTGATTGGTAAATATATTCACCATGACAATGACTACACAGGTAACTTTGCCGGTCTGG
GGACGAAGCACTACGGAACCCACTCCTGGTATGCCGGAGCGGAAACGGGTTACCGTTATCACCTGACGGAAGACACATTT
ATTGAGCCTCAGGCCGAACTGGTTTACGGCGCAGTGTCCGGGAAAACATTCCGCTGGAAAGACGGTGAGATGGACCTGAG
TATGAAGAACAAGGATTTCAGCCCGTTGATTGGCAGAACAGGGATTGAGCTGGGCAAAACCTTCAGGGGTAAGGACTGGA
GTGTGACAGCCCGTGCCGGAACCAGCTGGCAGTTTGACCTGCTGAATAATGGCGAGACAGTTCTTCGTGATGCATCCGGA
GAGAAACGGATCAAAGGGGAGAAAGACAGCAGGATGTTGTTCAATGTTGGCATGAACGCACAGATAAAGGACAACATGCG
CTTTGGTCTGGAGTTTGAGAAATCCGCCTTTGGTAAATACAACGTGGATAACGCAATAAACGCGAATTTCCGGTATATGT
TCTGA

Protein sequence :
MDTRNFWIRDYLDLAQNKGAFQPGAYGVKTPLKNGGEFSFPEVTIPDFSPVSAKGATTAIGNAYSVTASHNGTIHHAIKT
QTWGQSDYHYVDRVTKGDFAVQRLDKFVVETAGATEHADFNLSAAEALERYGIEFNGKKQIIGFRVGAGATGVTSYGVGQ
TYNPLLRSASMFQLNWNNMYASNNTGGFYNEVTGGDSGSGFYLYDNQKKKWVILGTLTGKVFSSKDTWAFFARYDQNTVD
ILKNTFTQEVNLNGQKMTVNNKNIAINDKITAIELTKSNKNKDLKFHGGGSIELTDNLNSGTGGLIFDEGQHYSVIGKDK
AYKGAGVEIGKDTVVDWSVKGEANDNLHKTGAGTLNVNVAQGNNLKTGDGTVFLNAEKAFNAIYVASGRGTVKLGQADAL
DKNSDYRGIYFTSRGGTLDLNGFSQSFKKIAATDVGTIITNTSDKTATLSLQNPSRYVYHGSITGNTNIEHTGTQKSADS
SLIIDGNINTRNDITVRNSQLRLQGHATSHAIFREGPRHCYVPGVLCDKDYVTDFARLESEANKKNNSAYKTNNQVASFD
QPDWETRHFRFKTLNLENSEFTTARNSVVEGDIVASNSTLKLGGDVPVFIDMYDGINITGNGFGFRQDVREGRSADDGSS
SYTGKITLQKGSTLDINNRFIGGIEAHDSKVNVTSPDALLQNSGVFVNSTLSVRDGGHLTAQKGLYSNGRVQIGKKGTLS
LSGTPENGADNTWMPVLTYMTEGYDLTGDNATLNISQQAHVSGDVHATSSSSIRIGSENPGSVSSSVSPVLAAGLFSGYN
AAYYGAITGGKGNVSMNNGLWQLTGDSDINSLTTRNSRVQSEENGAFRTLTVKTLDATGSDFVLRTDLKDADKISITEKA
SGSDNTLNVSFMKNPSPGQSLNIPLVSAPAGTSGDIFKAGTRVTGFSRVTPTLRVDTTGGSTKWILDGFRTEADKAAAAK
ANSFMNAGYKSFMTEVNNLNKRMGELRDTNGDAGAWARIMNGAGSADGGYSDNYTHVQVGFDKKHALDGVDLFTGVTMTY
TDSSADSDAFSGKTKSVGGGLYASALFNSGAYIDLIGKYIHHDNDYTGNFAGLGTKHYGTHSWYAGAETGYRYHLTEDTF
IEPQAELVYGAVSGKTFRWKDGEMDLSMKNKDFSPLIGRTGIELGKTFRGKDWSVTARAGTSWQFDLLNNGETVLRDASG
EKRIKGEKDSRMLFNVGMNAQIKDNMRFGLEFEKSAFGKYNVDNAINANFRYMF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
espC AAG37043.1 enterotoxin EspC Virulence espC PAI Protein 0.0 60
sigA AAF67320.1 exported serine protease SigA Virulence SHI-1 Protein 0.0 52
sigA NP_838462.1 serine protease Virulence SHI-1 Protein 0.0 52
sigA NP_708742.1 serine protease Virulence SHI-1 Protein 0.0 52
sat YP_002414040.1 Serine protease Not tested Not named Protein 0.0 51

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
espC YP_006141430.1 Per-activated serine protease autotransporter enterotoxin EspC VFG0772 Protein 0.0 60
espC YP_006141430.1 Per-activated serine protease autotransporter enterotoxin EspC VFG0630 Protein 0.0 52
espC YP_006141430.1 Per-activated serine protease autotransporter enterotoxin EspC VFG0902 Protein 0.0 51
espC YP_006141430.1 Per-activated serine protease autotransporter enterotoxin EspC VFG0844 Protein 0.0 51
espC YP_006141430.1 Per-activated serine protease autotransporter enterotoxin EspC VFG0862 Protein 0.0 51