Gene Information

Name : gspD (ECOK1_3743)
Accession : YP_006102848.1
Strain : Escherichia coli IHE3034
Genome accession: NC_017628
Putative virulence/resistance : Virulence
Product : general secretion pathway protein D
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3827243 - 3829207 bp
Length : 1965 bp
Strand : +
Note : identified by match to protein family HMM PF00263; match to protein family HMM PF03958; match to protein family HMM TIGR02517

DNA sequence :
ATGGACAGCGCAATGAAGGGACTCAATAAAATCACCTGCTGCTTGCTGGCAGCACTACTCATGCCTTGTGCAGGACACGC
TGAGAACGAACAATACGGCGCTAACTTCAATAACGCCGATATCCGCCAGTTCGTGGAAATAGTGGGTCAGCATCTTGGCA
AAACGATCCTGATCGACCCTTCGGTACAGGGAACCATTTCCGTACGCAGTAATGATACGTTTAGCCAACAGGAGTACTAC
CAGTTCTTTTTAAGTATTCTTGATCTTTACGGTTATTCCGTGATCACACTGGACAATGGTTTTCTGAAAGTGGTGCGCTC
AGCTAATGTAAAAACATCGCCAGGGATGATTGCTGACAGTTCTCGTCCAGGCGTAGGTGATGAGTTGGTCACCCGAATCG
TACCGCTTGAGAACGTTCCTGCTCGTGACCTGGCTCCCCTGCTCCGCCAGATGATGGATGCGGGTAGCGTCGGTAATGTT
GTGCATTATGAACCCTCCAACGTTCTTATTCTGACCGGTCGTGCCTCCACCATTAATAAACTGATTGAAGTCATAAAGCG
CGTTGATGTCATCGGCACAGAGAAGCAGCAAATTATTCATCTGGAATATGCATCAGCGGAAGATCTCGCCGAGATTCTTA
ATCAATTAATCAGCGAAAGCCACGGTAAAAGCCAGATGCCTGCCCTCCTTTCCGCGAAGATTGTGGCGGATAAGCGAACC
AACTCTCTTATCATCAGCGGGCCGGAAAAAGCACGCCAGCGCATAACTTCATTACTGAAAAGCCTTGATGTAGAAGAGAG
CGAGGAAGGAAATACCCGGGTCTATTACCTGAAATATGCTAAAGCCACGAATCTGGTGGAAGTGCTAACCGGTGTTTCCG
AAAAGCTGAAAGATGAAAAAGGGAATTCGCGTAAGCCTTCTTCTACTTCTGCGATGGATAATGTGGCCATTACCGCCGAT
GAACAGACCAACTCGTTGGTCATTACCGCTGACCAGTCCGTCCAGGAGAAACTCGCCACGGTAATTGCGCGTCTGGATAT
TCGCCGTGCACAGGTGCTGGTTGAGGCAATCATCGTCGAAGTTCAGGATGGAAATGGACTCAACCTCGGCGTGCAATGGG
CGAATAAAAACGTTGGCGCACAGCAATTTACCAATACGGGATTACCGGTTTTTAACGCTGCGCAAGGTGTAGCTGATTAT
AAAAAGAATGGCGGGATCACCAGCGCGAATCCTGCCTGGGATATGTTTAGCGCCTACAATGGCATGGCTGCAGGCTTCTT
CAATGGCGACTGGGGCGTATTGTTGACTGCGCTGGCCAGTAACAATAAAAATGACATCCTCGCCACCCCGAGCATCGTAA
CGCTGGATAATAAACTCGCGTCCTTCAACGTTGGTCAGGATGTGCCGGTGCTATCCGGCTCTCAGACCACTTCTGGGGAT
AACGTCTTTAATACTGTCGAACGCAAAACGGTGGGCACAAAACTCAAAGTCACTCCGCAGGTCAATGAAGGCGACGCGGT
ACTGCTCGAAATAGAGCAGGAGGTTTCCAGCGTTGACTCTTCGTCTAATTCGACGCTCGGCCCGACATTTAACACCCGTA
CTATTCAAAACGCCGTGCTGGTTAAAACCGGTGAAACGGTGGTCCTGGGCGGATTGCTGGATGATTTTTCTAAAGAGCAA
GTGTCAAAGGTTCCTCTGCTTGGCGATATTCCTTTAGTGGGACAACTCTTCCGCTATACCTCGACCGAGCGCGCTAAACG
CAACCTGATGGTATTTATCCGTCCGACGATTATCCGTGACGATGATGTTTATCGCTCACTGTCAAAAGAGAAATACACCC
GCTACCGCCAGGAGCAACAGCTGCGAATCGACGGGAAATCAAAAGCGCTGATTGGCTCGGAAGATTTGCCGGTGCTGGAT
GAGAACACGTTCAACAGTCACGCTCCTGCGCCATCGTCACGGTGA

Protein sequence :
MDSAMKGLNKITCCLLAALLMPCAGHAENEQYGANFNNADIRQFVEIVGQHLGKTILIDPSVQGTISVRSNDTFSQQEYY
QFFLSILDLYGYSVITLDNGFLKVVRSANVKTSPGMIADSSRPGVGDELVTRIVPLENVPARDLAPLLRQMMDAGSVGNV
VHYEPSNVLILTGRASTINKLIEVIKRVDVIGTEKQQIIHLEYASAEDLAEILNQLISESHGKSQMPALLSAKIVADKRT
NSLIISGPEKARQRITSLLKSLDVEESEEGNTRVYYLKYAKATNLVEVLTGVSEKLKDEKGNSRKPSSTSAMDNVAITAD
EQTNSLVITADQSVQEKLATVIARLDIRRAQVLVEAIIVEVQDGNGLNLGVQWANKNVGAQQFTNTGLPVFNAAQGVADY
KKNGGITSANPAWDMFSAYNGMAAGFFNGDWGVLLTALASNNKNDILATPSIVTLDNKLASFNVGQDVPVLSGSQTTSGD
NVFNTVERKTVGTKLKVTPQVNEGDAVLLEIEQEVSSVDSSSNSTLGPTFNTRTIQNAVLVKTGETVVLGGLLDDFSKEQ
VSKVPLLGDIPLVGQLFRYTSTERAKRNLMVFIRPTIIRDDDVYRSLSKEKYTRYRQEQQLRIDGKSKALIGSEDLPVLD
ENTFNSHAPAPSSR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
gspD CAE85234.1 GspD, hypothetical type II secretion protein Not tested PAI V 536 Protein 6e-124 45
gspD YP_854407.1 type II secretion protein GspD Not tested PAI I APEC-O1 Protein 4e-131 44

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
gspD YP_006102848.1 general secretion pathway protein D VFG2047 Protein 9e-122 45