Gene Information

Name : gspE (ECOK1_3380)
Accession : YP_006102495.1
Strain : Escherichia coli IHE3034
Genome accession: NC_017628
Putative virulence/resistance : Virulence
Product : general secretory pathway protein E
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3465225 - 3466718 bp
Length : 1494 bp
Strand : -
Note : identified by match to protein family HMM PF00437; match to protein family HMM TIGR02533

DNA sequence :
ATGGTGCCTGTAGCACAGGAAACCACCGCTAACACCGTGCGTCTGCCCTACAGTTTCAGCCGTCGGTTTAGCCTGGTGGC
ATGGTGCGAAGCGTCGCTGGAGATCCTCCATGTTCATCCGTTGTCGCTCTCTGTTTTGCAGGAGCTACAACGGGGGCTGA
ACGCGCCCTTTACGCTGCGGCAAATCGACGAGGCCGAATTTGAACAGCGGCTGAATGCGGTCTGGCAGCGGGACTCTTCC
GAAGCTCGCCAGCTGATGGAAGATCTCGGTTCCGCCGAGGACTTTTTTACCCTCGCTGAAGAACTGCCGGAAACGGAAGA
TCTGCTGGAAAGTGACGACGATGCGCCGATCATCAAACTGATCAACGCCATGCTGGCAGAGGCAATCAAAGAAGGCGCTT
CGGATATCCACATTGAGACGTTTGAAAAGAGTCTGGTGATCCGTTTTCGTGTTGACGGCACATTACATGAAATGTTGCGC
CCCGGTCGTAAACTGGCCTCGCTGCTGGTCTCGCGTATCAAGGTGATGGCGCGACTGGATATTGCCGAAAAGCGCGTACC
GCAGGATGGCCGTATTGCGTTGCTGCTGGGCGGTCGGGCGATTGACGTCCGTGTATCTACCATGCCTTCCGCCTGGGGAG
AACGGGTGGTGCTGCGACTGCTGGACAAAAACCAGGCTCGCCTGACGCTGGAGCGTCTGGGTTTAAGTCTCGAACTGACT
GCGCAGTTGCGCCAGCTGTTACACAAACCGCACGGCATTTTTCTGGTGACGGGGCCGACCGGTTCCGGCAAAAGCACCAC
GCTGTACGCTGGATTGCAGGAGCTGAACAACCACTCGCGTAACATTCTCACGGTTGAAGACCCTATCGAATACATGATTG
AAGGGATCGGTCAGACGCAGGTTAACACCCGCGTCGGCATGACATTCGCCCGTGGCCTGCGCGCAATTTTGCGTCAGGAC
CCGGATGTGGTGATGGTCGGTGAAATCCGCGATACCGAAACCGCAGAAATCGCTGTTCAGGCTTCACTGACCGGACACCT
GGTACTTTCCACGCTGCATACCAACACAGCGGTGGGGGCGATCACACGTTTGCAGGATATGGGCGTGGAGCCTTTCCTGC
TCTCTTCCAGTCTGACGGGCGTGATGGCGCAGCGACTGGTCCGCACGCTGTGCTCCGACTGCCGTCAGGCCGCGCCTGCC
ACCGACGAAGAAAAACGTCTTATGGGGATCTCCGATACGCATGCCGTCACGCTGTACCATCCGCAGGGCTGCCCCGCCTG
TAATCACAAAGGTTTTCGCGGACGTACTGCCATCCATGAACTGATTGTGGTGGACGCCACATTGCGTGATTTGATCCACC
GTCAGGCCGGGGAACTGGAGCTGGAACGTTATGTCCGGCAACACTCTGCGGGTATCCGCAGTAACGGCATTGAGAAAGTG
CTCGCCGGAGAAACCTCTCTCGATGAAGTTCTGCGGGTAACCATGGAGGCGTAA

Protein sequence :
MVPVAQETTANTVRLPYSFSRRFSLVAWCEASLEILHVHPLSLSVLQELQRGLNAPFTLRQIDEAEFEQRLNAVWQRDSS
EARQLMEDLGSAEDFFTLAEELPETEDLLESDDDAPIIKLINAMLAEAIKEGASDIHIETFEKSLVIRFRVDGTLHEMLR
PGRKLASLLVSRIKVMARLDIAEKRVPQDGRIALLLGGRAIDVRVSTMPSAWGERVVLRLLDKNQARLTLERLGLSLELT
AQLRQLLHKPHGIFLVTGPTGSGKSTTLYAGLQELNNHSRNILTVEDPIEYMIEGIGQTQVNTRVGMTFARGLRAILRQD
PDVVMVGEIRDTETAEIAVQASLTGHLVLSTLHTNTAVGAITRLQDMGVEPFLLSSSLTGVMAQRLVRTLCSDCRQAAPA
TDEEKRLMGISDTHAVTLYHPQGCPACNHKGFRGRTAIHELIVVDATLRDLIHRQAGELELERYVRQHSAGIRSNGIEKV
LAGETSLDEVLRVTMEA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
gspE YP_854406.1 type II secretion protein GspE Not tested PAI I APEC-O1 Protein 0.0 100
gspE CAE85233.1 GspE, hypothetical type II secretion protein Not tested PAI V 536 Protein 0.0 100

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
gspE YP_006102495.1 general secretory pathway protein E VFG2048 Protein 0.0 97
gspE YP_006102495.1 general secretory pathway protein E VFG0182 Protein 5e-135 62
gspE YP_006102495.1 general secretory pathway protein E VFG1876 Protein 3e-130 56
gspE YP_006102495.1 general secretory pathway protein E VFG2426 Protein 1e-74 49
gspE YP_006102495.1 general secretory pathway protein E VFG1880 Protein 4e-77 44