Gene Information

Name : gspD1 (EcHS_A3519)
Accession : YP_001460117.1
Strain : Escherichia coli HS
Genome accession : NC_009800
Putative virulence/resistance : Virulence
Product : general secretion pathway protein D
Function : -
COG functional category : N : Cell motility
COG ID : COG1450
EC number : -
Position : 3500708 - 3502672 bp
Length : 1965 bp
Strand : +
Note : identified by match to protein family HMM PF00263; match to protein family HMM PF03958; match to protein family HMM TIGR02517
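
The Position and Length fields above agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic, using a hypothetical helper named `feature_length`:

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the Position field of this record.
start, end = 3500708, 3502672
length_bp = feature_length(start, end)

# A complete coding sequence spans a whole number of codons,
# counting the terminal stop codon.
codons, remainder = divmod(length_bp, 3)

print(length_bp)          # 1965
print(codons, remainder)  # 655 0
```

Under that reading, 655 codons correspond to 654 amino acids plus one stop codon, which matches the length of the protein sequence listed in this record.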

DNA sequence :
ATGGACTGCGTCATGAAAGGACTCAATAAAATCACCTGCTGCTTGCTGGCAGCACTACTCATGCCTTGTGCAGGACACGC
TGAGAACGAACAATACGGCGCGAACTTCAATAACGCCGATATCCGCCAGTTCGTGGAAATAGTGGGTCAGCATCTTGGCA
AAACGATCCTGATCGACCCTTCGGTACAGGGAACCATTTCCGTACGCAGTAATGATACGTTTAGCCAACAGGAGTACTAC
CAGTTCTTTTTAAGTATTCTTGATCTTTACGGTTATTCCGTGATCACGCTGGACAATGGTTTTCTGAGAGTGGTTCGCTC
AGCTAATGTAAAAACATCGCCAGGGATGATTGCTGACAGTTCTCGTCCAGGCGTAGGTGATGAGTTGGTCACCCGAATCG
TACCGCTTGAGAACGTTCCTGCTCGTGACCTGGCCCCCCTGCTCCGCCAGATGATGGATGCGGGTAGCGTCGGTAATGTT
GTGCATTATGAACCCTCCAACGTTCTTATTCTGACCGGTCGTGCCTCCACCATTAATAAACTGATTGAAGTCATAAAGCG
CGTTGATGTCATCGGCACAGAGAAGCAGCAAATTATTCATCTGGAATATGCGTCAGCGGAAGATCTCGCCGAGATTCTTA
ATCAATTAATCAGCGAAAGCCACGGTAAAAGCCAGATGCCAGCCCTCCTCTCCGCGAGGATTGTGGCGGATAAGCGAACC
AACTCTCTTATCATCAGTGGACCGGAAAAAGCACGCCAGCGCATCACTTCATTACTGAAAAGCCTTGATGTCGAAGAGAG
CGAGGAAGGAAATACCCGGGTTTATTACCTGAAATATGCTAAAGCCACGAATCTGGTGGAAGTGCTAACCGGTGTTTCCG
AAAAGCTGAAAGATGAAAAAGGGAATGCGCGTAAGCCCTCCTCTTCTGGCGCGATGGATAACGTCGCCATTACCGCCGAT
GAACAGACTAACTCTCTGGTCATTACCGCTGACCAGTCCGTCCAGGAAAAACTCGCCACGGTAATTGCGCGTCTGGACAT
TCGCCGTGCACAGGTGCTGGTTGAGGCAATCATCGTTGAAGTTCAGGATGGAAATGGACTAAACCTCGGCGTGCAATGGG
CGAATAAAAACGTTGGCGCACAGCAATTTACCAATACCGGATTACCGATTTTTAACGCTGCGCAAGGTGTGGCTGATTAT
AAAAAGAATGGTGGGATCACCAGCGCGAATCCTGCCTGGGATATGTTTAGCGCCTACAATGGCATGGCCGCAGGCTTCTT
CAATGGCGACTGGGGAGTACTGCTTACCGCGCTGGCCAGTAACAATAAAAATGACATCCTCGCCACCCCAAGCATCGTAA
CGCTGGATAATAAACTCGCGTCCTTCAACGTGGGGCAGGATGTGCCGGTGCTATCCGGGTCACAGACCACTTCAGGGGAT
AACGTCTTTAATACCGTCGAACGCAAAACGGTGGGGACAAAACTCAAAGTTACTCCGCAGGTCAATGAAGGCGACGCGGT
GTTGCTCGAAATAGAGCAGGAAGTCTCCAGCGTTGACTCTTCCTCTAACTCGACGCTCGGCCCGACGTTTAATACCCGTA
CTATTCAAAACGCCGTGCTGGTCAAAACCGGTGAAACGGTGGTCCTGGGCGGATTGCTGGATGATTTTTCTAAAGAGCAA
GTGTCAAAGGTTCCTCTGCTTGGCGATATTCCTTTAGTGGGGCAACTCTTCCGCTATACCTCCACCGAGCGCGCTAAACG
CAACCTGATGGTATTTATCCGTCCGACGATTATCCGTGACGATGATGTTTATCGCTCACTGTCAAAAGAGAAATACACCC
GTTACCTTCAGGAGCAACAACAGCGGATCGACGGGAAATCAAAAGCGCTGGTTGGCTCGGAAGATTTGCCGGTGCTGGAT
GAAAACACGTTCAACAGTCACGCCCCTGCGCCATCGTCACGGTGA

Protein sequence :
MDCVMKGLNKITCCLLAALLMPCAGHAENEQYGANFNNADIRQFVEIVGQHLGKTILIDPSVQGTISVRSNDTFSQQEYY
QFFLSILDLYGYSVITLDNGFLRVVRSANVKTSPGMIADSSRPGVGDELVTRIVPLENVPARDLAPLLRQMMDAGSVGNV
VHYEPSNVLILTGRASTINKLIEVIKRVDVIGTEKQQIIHLEYASAEDLAEILNQLISESHGKSQMPALLSARIVADKRT
NSLIISGPEKARQRITSLLKSLDVEESEEGNTRVYYLKYAKATNLVEVLTGVSEKLKDEKGNARKPSSSGAMDNVAITAD
EQTNSLVITADQSVQEKLATVIARLDIRRAQVLVEAIIVEVQDGNGLNLGVQWANKNVGAQQFTNTGLPIFNAAQGVADY
KKNGGITSANPAWDMFSAYNGMAAGFFNGDWGVLLTALASNNKNDILATPSIVTLDNKLASFNVGQDVPVLSGSQTTSGD
NVFNTVERKTVGTKLKVTPQVNEGDAVLLEIEQEVSSVDSSSNSTLGPTFNTRTIQNAVLVKTGETVVLGGLLDDFSKEQ
VSKVPLLGDIPLVGQLFRYTSTERAKRNLMVFIRPTIIRDDDVYRSLSKEKYTRYLQEQQQRIDGKSKALVGSEDLPVLD
ENTFNSHAPAPSSR
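
The protein sequence can be checked against the DNA sequence by translating codons in frame. The sketch below is illustrative only: it assumes the standard genetic code, and the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool. It translates the first 30 and last 45 nucleotides of the DNA sequence above. The last 45 nucleotides are in frame because the preceding 1920 nucleotides form 640 whole codons.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGGACTGCGTCATGAAAGGACTCAATAAA"
last_45 = "GAAAACACGTTCAACAGTCACGCCCCTGCGCCATCGTCACGGTGA"

print(translate(first_30))  # MDCVMKGLNK
print(translate(last_45))   # ENTFNSHAPAPSSR
```

Both outputs match the start and end of the protein sequence listed above. The same function can be applied to the full 1965 bp sequence after joining its lines.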

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
gspD | CAE85234.1 | GspD, hypothetical type II secretion protein | Not tested | PAI V 536 | Protein | 5e-124 | 45
gspD | YP_854407.1 | type II secretion protein GspD | Not tested | PAI I APEC-O1 | Protein | 5e-131 | 45

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
gspD1 | YP_001460117.1 | general secretion pathway protein D | VFG2047 | Protein | 1e-121 | 46