Gene Information

Name : gspE1 (EcHS_A3520)
Accession : YP_001460118.1
Strain : Escherichia coli HS
Genome accession: NC_009800
Putative virulence/resistance : Virulence
Product : general secretory pathway protein E
Function : -
COG functional category : N : Cell motility
COG ID : COG2804
EC number : -
Position : 3502682 - 3504163 bp
Length : 1482 bp
Strand : +
Note : identified by similarity to SP:P45759; match to protein family HMM PF00437; match to protein family HMM TIGR02533
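
The Position and Length fields above can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive, as is usual for GenBank-style annotations; the helper name feature_length is hypothetical and not part of any database API.

```python
# Minimal consistency check, assuming 1-based inclusive coordinates.
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature spanning start..end, both ends included."""
    return end - start + 1

start, end = 3502682, 3504163        # Position field of this record
length_bp = feature_length(start, end)
codons = length_bp // 3              # includes the terminal stop codon

# Prints length in bp, remainder mod 3, and residues excluding the stop codon.
print(length_bp, length_bp % 3, codons - 1)   # 1482 0 493
```

A length of 1482 bp is a whole number of codons (494), which leaves 493 residues once the stop codon is excluded, matching the protein sequence listed in this record.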

DNA sequence :
ATGAGAATTCACTCACCGTACCCCGCCAGTTGGGCGCTGGCACAACGAATTGGTTATCTCTATTCAGAGGGCGAGATTAT
TTATCTCGCCGATACGCCATTCGAGCGGTTACTCGATATTCAACGTCAGGTTGGTCAGTGCCAGACCATGACCAGTTTGT
CACAGGCTGATTTCGAAGCCCGGCTGGAAGCGGTGTTCCATCAGAATACCGGTGAGTCGCAACAGATTGCGCAGGATATC
GATCAATCCGTCGATCTTCTCTCGCTTTCGGAAGAGATGCCCGCAAATGAAGATCTCCTGAATGAAGATTCAGCGGCACC
GGTTATCCGCTTGATCAATGCGATTTTGAGTGAGGCCATCAAAGAAACCGCCTCTGATATCCACATTGAAACCTATGAAA
AAACAATGTCGATCCGTTTTCGCATCGACGGCGTTTTGCGGACAATTTTACAGCCAAACAAAAAACTGGCGGCACTGCTT
ATCTCCCGAATTAAGGTCATGGCTCGTCTTGATATCGCCGAAAAACGTATTCCACAGGATGGAAGAATTAGTTTGCGTAT
CGGGCGACGTAACATAGATGTCCGCGTATCCACACTGCCGTCCATCTATGGTGAACGCGCCGTACTCCGCCTGCTGGATA
AAAACAGCCTCCAGCTTTCATTGAACAACCTGGGGATGACGGCAGCGGATAAGCAGGATTTAGAAAATCTCATTCAGCTT
CCGCACGGTATTATCCTGGTGACAGGGCCGACAGGCTCCGGTAAAAGCACCACGCTCTACGCCATCCTTTCGGCGCTGAA
TACTCCCGGCCGCAATATTCTGACGGTAGAAGATCCCGTGGAATATGAGCTGGAAGGCATTGGGCAAACGCAGGTGAATA
CCCGTGTGGATATGTCTTTCGCTCGCGGCCTGCGCGCCATACTTCGCCAGGACCCGGATGTCGTCATGGTGGGGGAAATT
CGTGATACAGAAACCGCGCAGATTGCGGTTCAGGCCTCGCTCACCGGCCATCTGGTACTCTCAACACTCCACACTAACAG
TGCATCAGGCGCAGTGACCCGGCTCCGCGACATGGGCGTCGAATCATTCCTGCTTTCGTCTTCCCTGGCAGGGATTATCG
CGCAACGTCTGGTTCGTCGCCTGTGTCCGCAATGCCGACAATTCACGCCCGTATCACCCCAACAAGCGCAGATGTTTAAA
TATCATCAGCTCGCGGTGACAACAATTGGCACTCCCGTAGGCTGCCCTCATTGCCATCAATCCGGCTATCAGGGGCGCAT
GGCGATCCACGAAATGATGGTGGTGACGCCGGAATTACGGGCCGCTATTCATGAAAATGTGGATGAACAAGCACTGGAGC
GACTAGTCCGGCAACAACACAAGGCCTTAATCAAAAATGGCCTGCAAAAAGTGATAAGCGGTGACACCTCCTGGGATGAG
GTTATGCGCGTCGCCAGTGCCACGCTGGAGAGCGAAGCATGA

Protein sequence :
MRIHSPYPASWALAQRIGYLYSEGEIIYLADTPFERLLDIQRQVGQCQTMTSLSQADFEARLEAVFHQNTGESQQIAQDI
DQSVDLLSLSEEMPANEDLLNEDSAAPVIRLINAILSEAIKETASDIHIETYEKTMSIRFRIDGVLRTILQPNKKLAALL
ISRIKVMARLDIAEKRIPQDGRISLRIGRRNIDVRVSTLPSIYGERAVLRLLDKNSLQLSLNNLGMTAADKQDLENLIQL
PHGIILVTGPTGSGKSTTLYAILSALNTPGRNILTVEDPVEYELEGIGQTQVNTRVDMSFARGLRAILRQDPDVVMVGEI
RDTETAQIAVQASLTGHLVLSTLHTNSASGAVTRLRDMGVESFLLSSSLAGIIAQRLVRRLCPQCRQFTPVSPQQAQMFK
YHQLAVTTIGTPVGCPHCHQSGYQGRMAIHEMMVVTPELRAAIHENVDEQALERLVRQQHKALIKNGLQKVISGDTSWDE
VMRVASATLESEA
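
The protein sequence is the conceptual translation of the DNA sequence on the + strand. The sketch below shows one way to reproduce it, assuming the standard codon assignments (bacterial translation table 11 uses the same codon-to-residue mapping); CODON_TABLE and translate are illustrative names, and only the first 18 and last 9 nucleotides of the record are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":            # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)

# First 18 nt and last 9 nt of the DNA sequence in this record.
print(translate("ATGAGAATTCACTCACCG"))  # MRIHSP
print(translate("GAAGCATGA"))           # EA
```

Both fragments agree with the start (MRIHSP) and end (EA) of the listed protein sequence; passing the full 1482 bp sequence with line breaks removed should yield the whole 493-residue product.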

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
gspE | YP_854406.1 | type II secretion protein GspE | Not tested | PAI I APEC-O1 | Protein | 6e-135 | 60
gspE | CAE85233.1 | GspE, hypothetical type II secretion protein | Not tested | PAI V 536 | Protein | 4e-135 | 60

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-value | Identity (%)
gspE1 | YP_001460118.1 | general secretory pathway protein E | VFG2048 | Protein | 3e-135 | 60
gspE1 | YP_001460118.1 | general secretory pathway protein E | VFG0182 | Protein | 6e-128 | 58
gspE1 | YP_001460118.1 | general secretory pathway protein E | VFG1876 | Protein | 3e-122 | 53
gspE1 | YP_001460118.1 | general secretory pathway protein E | VFG0112 | Protein | 2e-84 | 47
gspE1 | YP_001460118.1 | general secretory pathway protein E | VFG2426 | Protein | 9e-78 | 46
gspE1 | YP_001460118.1 | general secretory pathway protein E | VFG1880 | Protein | 2e-80 | 41