Gene Information

Name : gspF1 (EcHS_A3521)
Accession : YP_001460119.1
Strain : Escherichia coli HS
Genome accession: NC_009800
Putative virulence/resistance : Virulence
Product : general secretion pathway protein F
Function : -
COG functional category : N : Cell motility
COG ID : COG1459
EC number : -
Position : 3504160 - 3505356 bp
Length : 1197 bp
Strand : +
Note : identified by match to protein family HMM PF00482; match to protein family HMM TIGR02120

DNA sequence :
ATGAATTATCGCTATCGCGCCATGACCCAGGATGGTCAAAAATTGCAAGGGATCATTGATGCTAACGATGAACGTCAGGC
ACGACTGCGGCTGCGTGAAGAAGGGCTTTTCCTGCTGGATATTCGCCCCCAAAAAAGTTCGGGAGTAAAAACACGTCGCC
CGAGGATCAGCCATAGTGAACTGACGCTTTTCACCCGGCAGTTGGCAACCTTAAGCGCAGCGGCATTACCCCTGGAAGAG
AGCCTTGCCGTAATCGGTCAACAAAGCAGTAATAAACGACTGGGTGACGTGTTAAATCAGGTACGCAGCGCCATCCTTGA
AGGGCATCCCCTTTCCGATGCATTACAGCATTTTCCCACGCTTTTCGATTCGCTCTATCGTACCCTGGTAAAAGCGGGCG
AAAAGAGCGGGCTGCTGGCCCCGGTGTTGGAAAAGCTGGCTGATTACAATGAAAACCGGCAGAAAATCCGCAGCAAGCTC
ATTCAGTCACTGATCTACCCCTGTATGCTCACTACGGTGGCGATTGGGGTCGTGATTATTCTCCTCACTGCTGTCGTGCC
CAAAATTACCGAACAGTTCGTGCATATGAAGCAGCAACTGCCGCTGAGTACACGCATTCTTTTAGGTCTGAGCGACACGT
TGCAACGTACCGGCCCGACATTATTAGCGACAGTGTTTATTGTCGCTGTAGGTTTCTGGCTCTGGTTAAAACGCGGCAAT
AACCGCCACCGTTTTCATGCCATGTTGCTGCGCGTTGCGCTCATCGGCCCGCTGATTTGCGCCATTAACAGCGCACGCTA
TCTCCGCACTTTAAGTATTTTGCAATCCAGCGGCGTCCCTCTGCTGGATGGGATGAATTTGTCCACCGAAAGCCTCAACA
ACCTCGAAATTCGCCAGCGTCTGGCAAATGCGGCAGAGAACGTTCGCCAGGGTAACAGCATTCATCTTTCGCTGGAACAA
ACCGCAATTTTCCCGCCGATGATGCTCTACATGGTGGCCTCTGGCGAAAAAAGCGGGCAGCTCGGCACATTAATGGTCAG
AGCCGCAGATAACCAGGAGACACTCCAACAAAATCGGATCGCCTTAACGCTCTCCATCTTCGAGCCAGCACTCATTATTA
CGATGGCACTGATCGTCCTGTTTATTGTCGTGTCGGTACTCCAACCTCTTCTTCAACTTAACTCAATGATTAATTAA

Protein sequence :
MNYRYRAMTQDGQKLQGIIDANDERQARLRLREEGLFLLDIRPQKSSGVKTRRPRISHSELTLFTRQLATLSAAALPLEE
SLAVIGQQSSNKRLGDVLNQVRSAILEGHPLSDALQHFPTLFDSLYRTLVKAGEKSGLLAPVLEKLADYNENRQKIRSKL
IQSLIYPCMLTTVAIGVVIILLTAVVPKITEQFVHMKQQLPLSTRILLGLSDTLQRTGPTLLATVFIVAVGFWLWLKRGN
NRHRFHAMLLRVALIGPLICAINSARYLRTLSILQSSGVPLLDGMNLSTESLNNLEIRQRLANAAENVRQGNSIHLSLEQ
TAIFPPMMLYMVASGEKSGQLGTLMVRAADNQETLQQNRIALTLSIFEPALIITMALIVLFIVVSVLQPLLQLNSMIN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
gspF CAE85232.1 GspF, hypothetical type II secretion protein Not tested PAI V 536 Protein 4e-63 45
gspF YP_854405.1 type II secretion protein GspF Not tested PAI I APEC-O1 Protein 6e-64 45

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
gspF1 YP_001460119.1 general secretion pathway protein F VFG2049 Protein 2e-59 42
gspF1 YP_001460119.1 general secretion pathway protein F VFG0181 Protein 4e-66 41