Gene Information

Name : gspC (ECOK1_3382)
Accession : YP_006102497.1
Strain : Escherichia coli IHE3034
Genome accession: NC_017628
Putative virulence/resistance : Virulence
Product : general secretion pathway protein C
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3468808 - 3469767 bp
Length : 960 bp
Strand : -
Note : identified by similarity to SP:P31699; match to protein family HMM TIGR01713

DNA sequence :
TTGGCGCGGGTTGTTTTTCGTGACGCACGAATTTATCTCATTCAATGGCTGACTAAAATTCGTCACATTCTTAGCCAGAT
ACAATCTCTTAATACAGACAAAGAGCATCTGCGAAAAATGGTGCGCGGGATGTTCTGGCTCATGCTGCTTATTATTTCTG
CAAAAATGGCGTATTCACTCTGGCGCTATTTCTCCTTTTCTGCGGAATATACGGCGGTTTCCTCATCGGTGAATAAACCG
CTCCGTGCGGATGCAAAACCGTTCGATAAAAATGACGTGCAATTAGTCAGCCAACAAAACTGGTTTGGCAAATATCAACC
CGTCGCCGCACCGGTAAAACAACCTGAATCTGCGCCTGTGGCAGAAACGCGTCTTAATGTGGTGCTGCGTGGGATCGCCT
TTGGTGCCAGACCCGGCGTGGTGATTGAAGAAGGCGGCAAACAGCAGGTCTATTTGCAGGGTGAACGGCTTGGCTCTCAC
AACGCAGTGATTGAGGAAATCAACCGCGACCATGTGATGCTGCGCTATCAGGGGAAAATGGAACGTCTGAGTCTGGCAGA
GGAGGAACGCCCCCCTGTGGCGGTGACCAGCAAAAAAGCCGCCAGCGACGAAGCAAAGCAAGCTGTTGCTGAACCTGTCG
TCAGTGCGCCAGTGGAGATCCCGGCTGCCGTGCGTCAGGCACTGGCGAAAGATCCGCAGAAAATTTTTAACTATATCCAG
CTTACGCCTGTGCGTAAGGAGGGAATTGTCGGTTATGCAGTGAAGCCGGGGGCAGATCGTTCTCTGTTCGATGCCAGCGG
TTTCAGGGAAGGCGATATCGCCATTGCGCTAAATCAGCAGGATTTCACTGATCCACGAGCAATGATTGCTCTGATGCGGC
AGTTACCTTCAATGGATTCCATTCAACTTACGGTTTTACGCAAGGGTGCGCGCTACGACATTTCCATCGCACTGCGCTAA

Protein sequence :
MARVVFRDARIYLIQWLTKIRHILSQIQSLNTDKEHLRKMVRGMFWLMLLIISAKMAYSLWRYFSFSAEYTAVSSSVNKP
LRADAKPFDKNDVQLVSQQNWFGKYQPVAAPVKQPESAPVAETRLNVVLRGIAFGARPGVVIEEGGKQQVYLQGERLGSH
NAVIEEINRDHVMLRYQGKMERLSLAEEERPPVAVTSKKAASDEAKQAVAEPVVSAPVEIPAAVRQALAKDPQKIFNYIQ
LTPVRKEGIVGYAVKPGADRSLFDASGFREGDIAIALNQQDFTDPRAMIALMRQLPSMDSIQLTVLRKGARYDISIALR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
gspC CAE85235.1 GspC, hypothetical type II secretion protein Not tested PAI V 536 Protein 4e-113 94

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
gspC YP_006102497.1 general secretion pathway protein C VFG2046 Protein 4e-110 92