Gene Information

Name : espP (pO157p78)
Accession : NP_052685.1
Strain :
Genome accession: NC_002128
Putative virulence/resistance : Virulence
Product : EspP
Function : -
COG functional category : S : Function unknown
COG ID : COG4625
EC number : -
Position : 80757 - 84659 bp
Length : 3903 bp
Strand : +
Note : extracellular serine protease
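
The Position and Length fields above are consistent with 1-based, inclusive coordinates. A minimal sketch of that arithmetic follows. The helper name feature_length is hypothetical, and the protein-length line assumes the final codon is a stop codon:

```python
# Coordinates copied from the Position field of this record.
# Assumption: they are 1-based and inclusive at both ends.
start, end = 80757, 84659


def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


length_bp = feature_length(start, end)

# A coding sequence should contain a whole number of codons.
remainder = length_bp % 3
codons = length_bp // 3

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(length_bp, remainder, protein_length)  # 3903 0 1300
```

The result matches the Length field (3903 bp) and the 1300 residues listed under Protein sequence.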

DNA sequence :
ATGAATAAAATATACTCTCTTAAATACAGCCATATTACAGGAGGGTTAATCGCTGTTTCTGAATTATCCGGCAGAGTATC
ATCAAGAGCAACTGGTAAGAAAAAACACAAACGCATACTTGCATTATGTTTTTTAGGCTTATTACAATCCTCATATTCTT
TTGCGTCACAGATGGATATTTCAAATTTCTACATCCGTGACTATATGGATTTTGCACAGAACAAGGGCATATTTCAGGCT
GGCGCAACAAATATTGAAATAGTGAAGAAAGATGGCTCCACCCTGAAACTACCGGAAGTACCATTTCCTGACTTCTCACC
GGTTGCAAACAAAGGGTCAACCACATCTATTGGTGGTGCATACAGTATCACAGCCACACACAATACGAAAAACCACCACT
CAGTTGCGACGCAAAACTGGGGGAACAGCACGTACAAACAAACTGACTGGAATACTTCACATCCTGATTTTGCAGTATCC
CGACTTGACAAGTTTGTTGTTGAGACCCGAGGTGCGACTGAAGGCGCAGATATTTCGTTATCAAAACAGCAGGCACTTGA
ACGTTACGGGGTTAATTATAAAGGAGAAAAGAAACTTATCGCATTCAGAGCCGGCTCTGGTGTGGTATCCGTTAAAAAAA
ATGGACGCATAACTCCATTTAATGAGGTTTCTTATAAGCCAGAAATGTTAAATGGCTCTTTCGTTCACATTGATGACTGG
AGTGGATGGCTGATATTAACCAACAACCAGTTTGATGAGTTTAATAACATTGCCTCTCAGGGTGACAGCGGTTCAGCACT
GTTCGTCTATGATAACCAAAAGAAAAAGTGGGTTGTCGCTGGAACTGTCTGGGGGATTTATAATTACGCCAATGGCAAAA
ACCACGCAGCATACAGTAAATGGAACCAGACAACCATTGACAACCTGAAGAACAAGTATTCTTACAACGTGGATATGTCA
GGGGCTCAGGTTGCAACCATTGAAAATGGAAAACTGACAGGCACTGGCTCAGACACCACCGATATAAAAAATAAGGACTT
AATATTTACTGGCGGTGGAGATATCCTCCTGAAATCCTCTTTTGATAATGGTGCTGGCGGTCTTGTCTTTAATGATAAAA
AGACCTATCGAGTAAACGGGGATGATTTCACCTTTAAAGGTGCCGGTGTTGATACAAGAAACGGCAGCACCGTTGAGTGG
AATATCCGGTATGATAATAAAGACAACCTTCACAAAATTGGTGATGGCACATTAGATGTCCGAAAAACCCAGAACACCAA
CCTGAAAACAGGTGAGGGTCTTGTCATTCTTGGAGCTGAAAAAACATTCAATAATATCTACATAACCAGTGGTGATGGAA
CTGTCCGACTGAATGCAGAAAATGCACTGTCTGGCGGTGAATACAACGGTATTTTCTTTGCGAAAAATGGCGGAACTCTT
GACCTGAACGGATATAATCAGTCTTTCAATAAAATTGCTGCAACTGATTCAGGTGCTGTAATAACCAATACGTCAACCAA
AAAATCCATTTTATCCCTGAATAATACTGCTGACTATATCTATCACGGTAACATAAACGGGAATCTGGACGTACTTCAGC
ATCATGAGACGAAAAAAGAGAACCGTCGTCTTATTCTTGATGGGGGCGTGGACACAACAAATGATATAAGCCTGCGTAAT
ACACAACTGTCCATGCAGGGACATGCCACTGAACATGCCATTTATCGGGATGGAGCTTTCTCTTGTTCACTACCAGCTCC
TATGCGCTTTTTGTGTGGCAGTGATTATGTTGCAGGAATGCAAAATACAGAAGCTGATGCTGTAAAACAAAACGGAAATG
CCTATAAAACCAACAATGCTGTCTCTGATTTATCGCAGCCAGACTGGGAAACCGGAACATTCAGATTTGGAACGCTACAT
CTTGAAAATTCCGATTTTTCTGTTGGTCGTAATGCAAATGTAATCGGGGACATTCAGGCCAGTAAATCAAACATTACTAT
TGGTGACACTACAGCATATATTGATTTGCATGCTGGTAAAAATATTACCGGTGATGGTTTTGGCTTCCGCCAGAATATTG
TGCGTGGAAACTCACAAGGAGAAACGCTGTTTACAGGAGGGATCACAGCAGAAGACAGCACTATCGTTATTAAAGATAAA
GCAAAAGCATTATTTTCAAATTATGTATACCTGCTGAACACAAAAGCAACCATAGAGAACGGTGCTGATGTGACAACTCA
AAGTGGTATGTTCTCCACGAGCGATATCAGCATCTCTGGTAATCTGTCCATGACAGGCAATCCCGACAAAGACAATAAAT
TCGAGCCCTCAATATATCTGAATGATGCTTCTTATCTACTGACTGACGACTCCGCCAGACTCGTTGCCAAAAATAAAGCA
TCTGTGGTGGGAGATATACACTCCACTAAAAGTGCATCCATCATGTTTGGTCATGATGAAAGCGACCTCTCGCAGTTGTC
TGACAGAACCTCAAAAGGGCTTGCACTTGGTCTTTTAGGTGGCTTTGATGTCTCATATCGCGGTTCAGTCAATGCCCCGT
CAGCATCTGCCACTATGAACAACACCTGGTGGCAACTAACCGGAGATTCTGCGCTGAAAACACTGAAAAGTACAAACAGC
ATGGTCTATTTCACTGACAGCGCAAACAATAAGAAATTCCATACGCTGACGGTCGATGAGCTGGCAACCAGCAACAGCGC
CTATGCGATGCGTACAAACCTTTCTGAATCAGACAAACTGGAGGTCAAAAAACACTTGTCTGGTGAGAACAATATTTTAC
TCGTTGATTTCCTTCAGAAACCAACGCCTGAAAAACAACTGAATATTGAACTGGTAAGCGCGCCAAAAGACACCAATGAA
AATGTCTTTAAAGCCAGTAAACAAACCATTGGTTTCAGTGATGTAACGCCGGTCATTACAACCAGGGAAACCGATGACAA
AATAACATGGTCACTGACAGGCTATAACACGGTAGCAAACAAGGAAGCAACCCGGAATGCCGCCGCCCTGTTCTCTGTTG
ACTATAAAGCGTTTCTGAACGAGGTCAACAACCTGAACAAACGTATGGGTGACCTGCGTGATATCAACGGCGAAGCCGGT
GCATGGGCACGCATCATGAGCGGTACCGGCTCTGCCAGTGGTGGTTTCAGTGACAACTACACGCACGTTCAGGTCGGGGT
CGACAAAAAACACGAGCTGGACGGACTGGATTTGTTTACCGGTTTCACTGTCACACACACTGACAGCAGTGCCTCCGCCG
ATGTTTTCAGTGGTAAAACGAAGTCTGTGGGGGCTGGCCTGTATGCTTCCGCCATGTTTGATTCCGGTGCCTATATCGAC
CTGATTGGCAAGTATGTTCACCATGATAATGAGTACACTGCAACCTTTGCCGGACTCGGAACCCGTGATTACAGCACGCA
TTCATGGTATGCCGGTGCAGAAGCGGGCTACCGCTATCATGTCACTGAGGATGCCTGGATTGAGCCACAGGCTGAGCTGG
TTTACGGTTCTGTATCCGGTAAACAGTTTGCATGGAAGGACCAGGGAATGCATCTGTCCATGAAGGACAAGGACTACAAT
CCGCTGATTGGCCGAACGGGTGTGGATGTGGGTAAATCCTTCTCTGGTAAGGACTGGAAAGTGACAGCCCGTGCCGGTCT
GGGCTACCAGTTCGACCTGCTGGCTAACGGCGAAACCGTATTGCGGGATGCATCTGGTGAAAAACGCATCAAAGGTGAAA
AGGACAGCCGTATGCTGATGTCCGTTGGCCTGAATGCAGAAATCAGGGATAACGTCCGCTTTGGACTGGAGTTTGAGAAA
TCCGCCTTTGGTAAGTACAACGTTGATAATGCTGTCAACGCTAATTTCCGTTACTCGTTCTGA

Protein sequence :
MNKIYSLKYSHITGGLIAVSELSGRVSSRATGKKKHKRILALCFLGLLQSSYSFASQMDISNFYIRDYMDFAQNKGIFQA
GATNIEIVKKDGSTLKLPEVPFPDFSPVANKGSTTSIGGAYSITATHNTKNHHSVATQNWGNSTYKQTDWNTSHPDFAVS
RLDKFVVETRGATEGADISLSKQQALERYGVNYKGEKKLIAFRAGSGVVSVKKNGRITPFNEVSYKPEMLNGSFVHIDDW
SGWLILTNNQFDEFNNIASQGDSGSALFVYDNQKKKWVVAGTVWGIYNYANGKNHAAYSKWNQTTIDNLKNKYSYNVDMS
GAQVATIENGKLTGTGSDTTDIKNKDLIFTGGGDILLKSSFDNGAGGLVFNDKKTYRVNGDDFTFKGAGVDTRNGSTVEW
NIRYDNKDNLHKIGDGTLDVRKTQNTNLKTGEGLVILGAEKTFNNIYITSGDGTVRLNAENALSGGEYNGIFFAKNGGTL
DLNGYNQSFNKIAATDSGAVITNTSTKKSILSLNNTADYIYHGNINGNLDVLQHHETKKENRRLILDGGVDTTNDISLRN
TQLSMQGHATEHAIYRDGAFSCSLPAPMRFLCGSDYVAGMQNTEADAVKQNGNAYKTNNAVSDLSQPDWETGTFRFGTLH
LENSDFSVGRNANVIGDIQASKSNITIGDTTAYIDLHAGKNITGDGFGFRQNIVRGNSQGETLFTGGITAEDSTIVIKDK
AKALFSNYVYLLNTKATIENGADVTTQSGMFSTSDISISGNLSMTGNPDKDNKFEPSIYLNDASYLLTDDSARLVAKNKA
SVVGDIHSTKSASIMFGHDESDLSQLSDRTSKGLALGLLGGFDVSYRGSVNAPSASATMNNTWWQLTGDSALKTLKSTNS
MVYFTDSANNKKFHTLTVDELATSNSAYAMRTNLSESDKLEVKKHLSGENNILLVDFLQKPTPEKQLNIELVSAPKDTNE
NVFKASKQTIGFSDVTPVITTRETDDKITWSLTGYNTVANKEATRNAAALFSVDYKAFLNEVNNLNKRMGDLRDINGEAG
AWARIMSGTGSASGGFSDNYTHVQVGVDKKHELDGLDLFTGFTVTHTDSSASADVFSGKTKSVGAGLYASAMFDSGAYID
LIGKYVHHDNEYTATFAGLGTRDYSTHSWYAGAEAGYRYHVTEDAWIEPQAELVYGSVSGKQFAWKDQGMHLSMKDKDYN
PLIGRTGVDVGKSFSGKDWKVTARAGLGYQFDLLANGETVLRDASGEKRIKGEKDSRMLMSVGLNAEIRDNVRFGLEFEK
SAFGKYNVDNAVNANFRYSF
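
The protein sequence above can be reproduced from the DNA sequence by translating it codon by codon. A minimal sketch, assuming the standard codon assignments (which the bacterial translation table shares for elongation codons). The function name translate is hypothetical, and only the first 30 and last 15 nucleotides of the record are used here:

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 15 nt of the DNA sequence in this record.
head = "ATGAATAAAATATACTCTCTTAAATACAGC"
tail = "CGTTACTCGTTCTGA"

print(translate(head))  # MNKIYSLKYS
print(translate(tail))  # RYSF*
```

Both fragments agree with the start (MNKIYSLKYS) and end (RYSF) of the listed protein. The final TGA codon is a stop and does not appear in the protein sequence.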

• Homologs from PAI DB

Gene     GenBank Accn    Product                        Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity (%)
sigA     AAF67320.1      exported serine protease SigA  Virulence                SHI-1       Protein         0.0    56
sigA     NP_838462.1     serine protease                Virulence                SHI-1       Protein         0.0    56
sigA     NP_708742.1     serine protease                Virulence                SHI-1       Protein         0.0    56
sat      YP_002414040.1  Serine protease                Not tested               Not named   Protein         0.0    56
espC     AAG37043.1      enterotoxin EspC               Virulence                espC PAI    Protein         0.0    53
unnamed  CAC39286.1      hypothetical protein           Not tested               LPA         Protein         0.0    48
pic      NP_838464.1     serine protease precursor      Virulence                SHI-1       Protein         0.0    43

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn  Product  ID of source DB  Alignment Type  E-val  Identity (%)
espP  NP_052685.1   EspP     VFG0844          Protein         0.0    100
espP  NP_052685.1   EspP     VFG0862          Protein         0.0    58
espP  NP_052685.1   EspP     VFG0630          Protein         0.0    56
espP  NP_052685.1   EspP     VFG0902          Protein         0.0    56
espP  NP_052685.1   EspP     VFG0772          Protein         0.0    53