Gene Information

Name : espP (L7020)
Accession : YP_325580.1
Strain :
Genome accession: NC_007414
Putative virulence/resistance : Virulence
Product : putative exoprotein-precursor
Function : -
COG functional category : S : Function unknown
COG ID : COG4625
EC number : -
Position : 11242 - 15144 bp
Length : 3903 bp
Strand : +
Note : 100 pct identical and equal length to a a putative exoprotein-precursor ECDNAPLAS accession X97542

DNA sequence :
ATGAATAAAATATACTCTCTTAAATACAGCCATATTACAGGAGGGTTAATCGCTGTTTCTGAATTATCCGGCAGAGTATC
ATCAAGAGCAACTGGTAAGAAAAAACACAAACGCATACTTGCATTATGTTTTTTAGGCTTATTACAATCCTCATATTCTT
TTGCGTCACAGATGGATATTTCAAATTTCTACATCCGTGACTATATGGATTTTGCACAGAACAAGGGCATATTTCAGGCT
GGCGCAACAAATATTGAAATAGTGAAGAAAGATGGCTCCACCCTGAAACTACCGGAAGTACCATTTCCTGACTTCTCACC
GGTTGCAAACAAAGGGTCAACCACATCTATTGGTGGTGCATACAGTATCACAGCCACACACAATACGAAAAACCACCACT
CAGTTGCGACGCAAAACTGGGGGAACAGCACGTACAAACAAACTGACTGGAATACTTCACATCCTGATTTTGCAGTATCC
CGACTTGACAAGTTTGTTGTTGAGACCCGAGGTGCGACTGAAGGCGCAGATATTTCGTTATCAAAACAGCAGGCACTTGA
ACGTTACGGGGTTAATTATAAAGGAGAAAAGAAACTTATCGCATTCAGAGCCGGCTCTGGTGTGGTATCCGTTAAAAAAA
ATGGACGCATAACTCCATTTAATGAGGTTTCTTATAAGCCAGAAATGTTAAATGGCTCTTTCGTTCACATTGATGACTGG
AGTGGATGGCTGATATTAACCAACAACCAGTTTGATGAGTTTAATAACATTGCCTCTCAGGGTGACAGCGGTTCAGCACT
GTTCGTCTATGATAACCAAAAGAAAAAGTGGGTTGTCGCTGGAACTGTCTGGGGGATTTATAATTACGCCAATGGCAAAA
ACCACGCAGCATACAGTAAATGGAACCAGACAACCATTGACAACCTGAAGAACAAGTATTCTTACAACGTGGATATGTCA
GGGGCTCAGGTTGCAACCATTGAAAATGGAAAACTGACAGGCACTGGCTCAGACACCACCGATATAAAAAATAAGGACTT
AATATTTACTGGCGGTGGAGATATCCTCCTGAAATCCTCTTTTGATAATGGTGCTGGCGGTCTTGTCTTTAATGATAAAA
AGACCTATCGAGTAAACGGGGATGATTTCACCTTTAAAGGTGCCGGTGTTGATACAAGAAACGGCAGCACCGTTGAGTGG
AATATCCGGTATGATAATAAAGACAACCTTCACAAAATTGGTGATGGCACATTAGATGTCCGAAAAACCCAGAACACCAA
CCTGAAAACAGGTGAGGGTCTTGTCATTCTTGGAGCTGAAAAAACATTCAATAATATCTACATAACCAGTGGTGATGGAA
CTGTCCGACTGAATGCAGAAAATGCACTGTCTGGCGGTGAATACAACGGTATTTTCTTTGCGAAAAATGGCGGAACTCTT
GACCTGAACGGATATAATCAGTCTTTCAATAAAATTGCTGCAACTGATTCAGGTGCTGTAATAACCAATACGTCAACCAA
AAAATCCATTTTATCCCTGAATAATACTGCTGACTATATCTATCACGGTAACATAAACGGGAATCTGGACGTACTTCAGC
ATCATGAGACGAAAAAAGAGAACCGTCGTCTTATTCTTGATGGGGGCGTGGACACAACAAATGATATAAGCCTGCGTAAT
ACACAACTGTCCATGCAGGGACATGCCACTGAACATGCCATTTATCGGGATGGAGCTTTCTCTTGTTCACTACCAGCTCC
TATGCGCTTTTTGTGTGGCAGTGATTATGTTGCAGGAATGCAAAATACAGAAGCTGATGCTGTAAAACAAAACGGAAATG
CCTATAAAACCAACAATGCTGTCTCTGATTTATCGCAGCCAGACTGGGAAACCGGAACATTCAGATTTGGAACGCTACAT
CTTGAAAATTCCGATTTTTCTGTTGGTCGTAATGCAAATGTAATCGGGGACATTCAGGCCAGTAAATCAAACATTACTAT
TGGTGACACTACAGCATATATTGATTTGCATGCTGGTAAAAATATTACCGGTGATGGTTTTGGCTTCCGCCAGAATATTG
TGCGTGGAAACTCACAAGGAGAAACGCTGTTTACAGGAGGGATCACAGCAGAAGACAGCACTATCGTTATTAAAGATAAA
GCAAAAGCATTATTTTCAAATTATGTATACCTGCTGAACACAAAAGCAACCATAGAGAACGGTGCTGATGTGACAACTCA
AAGTGGTATGTTCTCCACGAGCGATATCAGCATCTCTGGTAATCTGTCCATGACAGGCAATCCCGACAAAGACAATAAAT
TCGAGCCCTCAATATATCTGAATGATGCTTCTTATCTACTGACTGACGACTCCGCCAGACTCGTTGCCAAAAATAAAGCA
TCTGTGGTGGGAGATATACACTCCACTAAAAGTGCATCCATCATGTTTGGTCATGATGAAAGCGACCTCTCGCAGTTGTC
TGACAGAACCTCAAAAGGGCTTGCACTTGGTCTTTTAGGTGGCTTTGATGTCTCATATCGCGGTTCAGTCAATGCCCCGT
CAGCATCTGCCACTATGAACAACACCTGGTGGCAACTAACCGGAGATTCTGCGCTGAAAACACTGAAAAGTACAAACAGC
ATGGTCTATTTCACTGACAGCGCAAACAATAAGAAATTCCATACGCTGACGGTCGATGAGCTGGCAACCAGCAACAGCGC
CTATGCGATGCGTACAAACCTTTCTGAATCAGACAAACTGGAGGTCAAAAAACACTTGTCTGGTGAGAACAATATTTTAC
TCGTTGATTTCCTTCAGAAACCAACGCCTGAAAAACAACTGAATATTGAACTGGTAAGCGCGCCAAAAGACACCAATGAA
AATGTCTTTAAAGCCAGTAAACAAACCATTGGTTTCAGTGATGTAACGCCGGTCATTACAACCAGGGAAACCGATGACAA
AATAACATGGTCACTGACAGGCTATAACACGGTAGCAAACAAGGAAGCAACCCGGAATGCCGCCGCCCTGTTCTCTGTTG
ACTATAAAGCGTTTCTGAACGAGGTCAACAACCTGAACAAACGTATGGGTGACCTGCGTGATATCAACGGCGAAGCCGGT
GCATGGGCACGCATCATGAGCGGTACCGGCTCTGCCAGTGGTGGTTTCAGTGACAACTACACGCACGTTCAGGTCGGGGT
CGACAAAAAACACGAGCTGGACGGACTGGATTTGTTTACCGGTTTCACTGTCACACACACTGACAGCAGTGCCTCCGCCG
ATGTTTTCAGTGGTAAAACGAAGTCTGTGGGGGCTGGCCTGTATGCTTCCGCCATGTTTGATTCCGGTGCCTATATCGAC
CTGATTGGCAAGTATGTTCACCATGATAATGAGTACACTGCAACCTTTGCCGGACTCGGAACCCGTGATTACAGCACGCA
TTCATGGTATGCCGGTGCAGAAGCGGGCTACCGCTATCATGTCACTGAGGATGCCTGGATTGAGCCACAGGCTGAGCTGG
TTTACGGTTCTGTATCCGGTAAACAGTTTGCATGGAAGGACCAGGGAATGCATCTGTCCATGAAGGACAAGGACTACAAT
CCGCTGATTGGCCGAACGGGTGTGGATGTGGGTAAATCCTTCTCTGGTAAGGACTGGAAAGTGACAGCCCGTGCCGGTCT
GGGCTACCAGTTCGACCTGCTGGCTAACGGCGAAACCGTATTGCGGGATGCATCTGGTGAAAAACGCATCAAAGGTGAAA
AGGACAGCCGTATGCTGATGTCCGTTGGCCTGAATGCAGAAATCAGGGATAACGTCCGCTTTGGACTGGAGTTTGAGAAA
TCCGCCTTTGGTAAGTACAACGTTGATAATGCTGTCAACGCTAATTTCCGTTACTCGTTCTGA

Protein sequence :
MNKIYSLKYSHITGGLIAVSELSGRVSSRATGKKKHKRILALCFLGLLQSSYSFASQMDISNFYIRDYMDFAQNKGIFQA
GATNIEIVKKDGSTLKLPEVPFPDFSPVANKGSTTSIGGAYSITATHNTKNHHSVATQNWGNSTYKQTDWNTSHPDFAVS
RLDKFVVETRGATEGADISLSKQQALERYGVNYKGEKKLIAFRAGSGVVSVKKNGRITPFNEVSYKPEMLNGSFVHIDDW
SGWLILTNNQFDEFNNIASQGDSGSALFVYDNQKKKWVVAGTVWGIYNYANGKNHAAYSKWNQTTIDNLKNKYSYNVDMS
GAQVATIENGKLTGTGSDTTDIKNKDLIFTGGGDILLKSSFDNGAGGLVFNDKKTYRVNGDDFTFKGAGVDTRNGSTVEW
NIRYDNKDNLHKIGDGTLDVRKTQNTNLKTGEGLVILGAEKTFNNIYITSGDGTVRLNAENALSGGEYNGIFFAKNGGTL
DLNGYNQSFNKIAATDSGAVITNTSTKKSILSLNNTADYIYHGNINGNLDVLQHHETKKENRRLILDGGVDTTNDISLRN
TQLSMQGHATEHAIYRDGAFSCSLPAPMRFLCGSDYVAGMQNTEADAVKQNGNAYKTNNAVSDLSQPDWETGTFRFGTLH
LENSDFSVGRNANVIGDIQASKSNITIGDTTAYIDLHAGKNITGDGFGFRQNIVRGNSQGETLFTGGITAEDSTIVIKDK
AKALFSNYVYLLNTKATIENGADVTTQSGMFSTSDISISGNLSMTGNPDKDNKFEPSIYLNDASYLLTDDSARLVAKNKA
SVVGDIHSTKSASIMFGHDESDLSQLSDRTSKGLALGLLGGFDVSYRGSVNAPSASATMNNTWWQLTGDSALKTLKSTNS
MVYFTDSANNKKFHTLTVDELATSNSAYAMRTNLSESDKLEVKKHLSGENNILLVDFLQKPTPEKQLNIELVSAPKDTNE
NVFKASKQTIGFSDVTPVITTRETDDKITWSLTGYNTVANKEATRNAAALFSVDYKAFLNEVNNLNKRMGDLRDINGEAG
AWARIMSGTGSASGGFSDNYTHVQVGVDKKHELDGLDLFTGFTVTHTDSSASADVFSGKTKSVGAGLYASAMFDSGAYID
LIGKYVHHDNEYTATFAGLGTRDYSTHSWYAGAEAGYRYHVTEDAWIEPQAELVYGSVSGKQFAWKDQGMHLSMKDKDYN
PLIGRTGVDVGKSFSGKDWKVTARAGLGYQFDLLANGETVLRDASGEKRIKGEKDSRMLMSVGLNAEIRDNVRFGLEFEK
SAFGKYNVDNAVNANFRYSF

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
sigA NP_838462.1 serine protease Virulence SHI-1 Protein 0.0 56
sigA NP_708742.1 serine protease Virulence SHI-1 Protein 0.0 56
sigA AAF67320.1 exported serine protease SigA Virulence SHI-1 Protein 0.0 56
sat YP_002414040.1 Serine protease Not tested Not named Protein 0.0 56
espC AAG37043.1 enterotoxin EspC Virulence espC PAI Protein 0.0 53
unnamed CAC39286.1 hypothetical protein Not tested LPA Protein 0.0 48
pic NP_838464.1 serine protease precurser Virulence SHI-1 Protein 0.0 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
espP YP_325580.1 putative exoprotein-precursor VFG0844 Protein 0.0 100
espP YP_325580.1 putative exoprotein-precursor VFG0862 Protein 0.0 58
espP YP_325580.1 putative exoprotein-precursor VFG0630 Protein 0.0 56
espP YP_325580.1 putative exoprotein-precursor VFG0902 Protein 0.0 56
espP YP_325580.1 putative exoprotein-precursor VFG0772 Protein 0.0 53