Gene Information

Name : gspF (ECOK1_3379)
Accession : YP_006102494.1
Strain : Escherichia coli IHE3034
Genome accession: NC_017628
Putative virulence/resistance : Virulence
Product : general secretion pathway protein F
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3464002 - 3465225 bp
Length : 1224 bp
Strand : -
Note : identified by match to protein family HMM PF00482; match to protein family HMM TIGR02120

DNA sequence :
ATGGCACTGTTTTACTATCAGGCGCTGGAGCGTAATGGTCGCAAAACCAAAGGCATGATTGAGGCGGATTCCGCGCGTCA
TGCCCGTCAATTGTTACGCGGTAAAGACCTCATTCCCGTGCACATTGAAGCCCGGATGAATGCATCGGCAGGGGGATTGT
TGCAGCGTCGGCGGCACGCACATCGTCGCGTGGCGACGGCAGATCTGGCGCTGTTCACTCGTCAACTGGCAACGCTGGTG
CAGGCAGCAATGCCGCTGGAAACCTGCTTACAGGCGGTCAGTGAGCAAAGCGAAAAACTGCATGTAAAAAGCCTCGGAAT
GGCGCTGCGCAGCCGGATTCAGGAAGGTTATACCCTGTCGGACAGCCTGCGCGAACATCCCCGCGTCTTTGACTCCCTGT
TTTGTTCGATGGTCGCTGCCGGAGAAAAATCCGGACATCTCGACGTGGTGCTCAATCGCCTGGCGGATTACACCGAACAG
CGGCAGCGTCTGAAATCACGCCTGCTGCAGGCCATGCTCTATCCGCTGGTTCTGCTGGTGGTGGCAACGGGCGTAGTCAC
TATTTTGCTGACGGCAGTGGTGCCGAAAATTATCGAACAGTTTGATCATCTCGGACACGCGCTACCCGCCTCCACCCGAA
TGCTCATCGCTATGAGCGACGCGTTACAGGCCAGCGGCGTGTACTGGCTGGCGGGTTTGCTGGGGCTTCTGGTGCTGGGG
CAACGGTTACTCAAAAATCCTGCGATGCGCCTGCGCTGGGATAAAACCTTGCTGCGCCTGCCCGTGACGGGGCGTGTTGC
GCGCGGACTGAATACGGCGCGTTTTTCCCGCACGTTAAGCATCCTCACCGCCAGCAGTGTTCCGCTGTTGGAAGGCATTC
AGACCGCCGCCGCCGTGTCGGCAAATCGTTATGTCGAGCAACAACTGCTGCTGGCGGCAGATCGCGTCCGCGAAGGAAGC
AGCCTGCGCGCCGCGCTGGCGGATCTGCGCCTGTTTCCACCGATGATGCTGTACATGATCGCCTCCGGCGAACAGAGCGG
CGAGCTGGAAACCATGCTTGAACAGGCCGCGGTCAACCAGGAACGGGAATTTGATACACAGGTGGGGCTGGCGTTAGGGC
TGTTTGAGCCGGCGCTGGTGGTGATGATGGCGGGTGTGGTGCTGTTTATCGTCATCGCCATCCTCGAACCGATGCTGCAA
CTGAACAATATGGTTGGAATGTAA

Protein sequence :
MALFYYQALERNGRKTKGMIEADSARHARQLLRGKDLIPVHIEARMNASAGGLLQRRRHAHRRVATADLALFTRQLATLV
QAAMPLETCLQAVSEQSEKLHVKSLGMALRSRIQEGYTLSDSLREHPRVFDSLFCSMVAAGEKSGHLDVVLNRLADYTEQ
RQRLKSRLLQAMLYPLVLLVVATGVVTILLTAVVPKIIEQFDHLGHALPASTRMLIAMSDALQASGVYWLAGLLGLLVLG
QRLLKNPAMRLRWDKTLLRLPVTGRVARGLNTARFSRTLSILTASSVPLLEGIQTAAAVSANRYVEQQLLLAADRVREGS
SLRAALADLRLFPPMMLYMIASGEQSGELETMLEQAAVNQEREFDTQVGLALGLFEPALVVMMAGVVLFIVIAILEPMLQ
LNNMVGM

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
gspF YP_854405.1 type II secretion protein GspF Not tested PAI I APEC-O1 Protein 2e-158 100
gspF CAE85232.1 GspF, hypothetical type II secretion protein Not tested PAI V 536 Protein 2e-156 99

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
gspF YP_006102494.1 general secretion pathway protein F VFG2049 Protein 4e-150 97
gspF YP_006102494.1 general secretion pathway protein F VFG0181 Protein 1e-70 47