Gene Information

Name : gspE (ECOK1_3744)
Accession : YP_006102849.1
Strain : Escherichia coli IHE3034
Genome accession: NC_017628
Putative virulence/resistance : Virulence
Product : general secretory pathway protein E
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3829217 - 3830698 bp
Length : 1482 bp
Strand : +
Note : identified by match to protein family HMM PF00437; match to protein family HMM TIGR02533

DNA sequence :
ATGAGAATTCACTCACCGTACCCCGCCAGTTGGGCGCTGGCACAACGAATTGGTTATCTCTATTCAGAGGGCGAGATTAT
TTATCTCGCCGATACGCCATTCGAACGGTTACTCGATATTCAACGTCAGGTTGGCCAGTGCCAGACCATGACCAGCTTGT
CACAGGCTGATTTTGAAGCCCGGCTGGAAGCGGTGTTCCATCAGAATACCGGTGAGTCGCAACAGATTGCGCAAGATATC
GATCAATCCGTCGATCTTCTCTCGCTTTCGGAAGAGATGCCCGCAAATGAAGATCTCCTGAATGAAGATTCAGCGGCACC
GGTTATCCGCTTGATCAATGCTATTTTGAGTGAGGCCATCAAAGAAACCGCCTCCGATATCCACATTGAAACCTATGAAA
AAACAATGTCGATCCGTTTTCGCATCGACGGCGTTTTGCGGACAATTTTACAGCCAAACAAAAAACTGGCGGCACTGCTT
ATCTCCCGAATTAAGGTCATGGCTCGTCTTGATATCGCCGAAAAACGTATTCCACAGGATGGACGTATTAGTTTGCGTAT
CGGGCGACGTAACATAGATGTCCGCGTATCCACACTGCCGTCCATCTATGGTGAACGCGCCGTGCTCCGTCTGCTGGATA
AAAACAGCCTCCAGCTTTCATTGAACAACCTGGGGATGACGGCAGCGGATAAACAGGATTTAGAAAATCTCATCCAGCTT
CCACACGGTATTATCCTGGTGACCGGACCGACAGGCTCCGGTAAAAGTACCACGCTCTACGCCATTCTTTCGGCGCTGAA
TACTCCCGGCCGCAATATTCTGACGGTAGAAGATCCCGTGGAATATGAACTGGAAGGCATAGGACAAACGCAGGTGAATA
CCCGTGTGGATATGTCTTTCGCTCGCGGCCTGCGCGCCATACTTCGCCAGGACCCGGATGTCGTCATGGTGGGGGAAATT
CGTGATACAGAAACCGCGCAAATTGCGGTTCAGGCCTCGCTCACCGGCCATCTGGTACTCTCAACACTCCACACTAACAG
TGCATCAGGCGCAGTGACCCGGCTCCGCGACATGGGCGTCGAATCATTCCTGCTTTCGTCTTCACTGGCAGGGATTATCG
CGCAACGTCTGGTTCGTCGTCTGTGTCCGCAATGCCGACAATTCACGCCTGTATCGCCGCAACAAGCGCAGATGTTTAAA
CATCATCAGCTCGCGGTGACGACAATTGGCACTCCCGTAGGCTGCCCTCATTGCCATCAATCCGGCTACCAGGGGCGCAT
GGCGATCCACGAAATGATGGTGGTGACGCCGGAATTGCGGGCCGCTATTCATGAAAATGTGGATGAACAAGCACTGGAGC
GACTGGTCCGGCAACAACACAATGCCTTAATCAAAAATGGCCTGCAAAAAGTGATACGCGGTGACACCTCCTGGGATGAG
GTTATGCGCGTCGCCAGTGCCACGCTGGAGAACGAAGCATGA

Protein sequence :
MRIHSPYPASWALAQRIGYLYSEGEIIYLADTPFERLLDIQRQVGQCQTMTSLSQADFEARLEAVFHQNTGESQQIAQDI
DQSVDLLSLSEEMPANEDLLNEDSAAPVIRLINAILSEAIKETASDIHIETYEKTMSIRFRIDGVLRTILQPNKKLAALL
ISRIKVMARLDIAEKRIPQDGRISLRIGRRNIDVRVSTLPSIYGERAVLRLLDKNSLQLSLNNLGMTAADKQDLENLIQL
PHGIILVTGPTGSGKSTTLYAILSALNTPGRNILTVEDPVEYELEGIGQTQVNTRVDMSFARGLRAILRQDPDVVMVGEI
RDTETAQIAVQASLTGHLVLSTLHTNSASGAVTRLRDMGVESFLLSSSLAGIIAQRLVRRLCPQCRQFTPVSPQQAQMFK
HHQLAVTTIGTPVGCPHCHQSGYQGRMAIHEMMVVTPELRAAIHENVDEQALERLVRQQHNALIKNGLQKVIRGDTSWDE
VMRVASATLENEA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
gspE CAE85233.1 GspE, hypothetical type II secretion protein Not tested PAI V 536 Protein 1e-134 60
gspE YP_854406.1 type II secretion protein GspE Not tested PAI I APEC-O1 Protein 2e-134 60

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
gspE YP_006102849.1 general secretory pathway protein E VFG2048 Protein 1e-134 60
gspE YP_006102849.1 general secretory pathway protein E VFG0182 Protein 6e-128 58
gspE YP_006102849.1 general secretory pathway protein E VFG1876 Protein 3e-122 53
gspE YP_006102849.1 general secretory pathway protein E VFG0112 Protein 2e-85 47
gspE YP_006102849.1 general secretory pathway protein E VFG2426 Protein 7e-77 46
gspE YP_006102849.1 general secretory pathway protein E VFG1880 Protein 9e-81 41