Gene Information

Name : yfcU (ECED1_2801)
Accession : YP_002398709.1
Strain : Escherichia coli ED1a
Genome accession: NC_011745
Putative virulence/resistance : Virulence
Product : putative export usher protein
Function : -
COG functional category : N : Cell motility
COG ID : COG3188
EC number : -
Position : 2738962 - 2741616 bp
Length : 2655 bp
Strand : -
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; Product type pf : putative factor

DNA sequence :
ATGTGTATGCCTAATCACTCAAATTTTCGGCTGCGGGGAATCGCCTGCTATATTGCGCTGGCAATCTCTGGTGGGTCAGT
CAATGCATGGGCTGATGATTCAATTCAATTTGACCCCCGTTTCCTTGAGTTAAAGGGCGATACGAAAATTGATCTCGGTA
AGTTTTCAAAAAAAGGGTATGTCGACGCGGGTAAATATAATTTACGTGTATTTATAAATAAACACCCCCTTTCTGATGAA
TACGACATTAACTGGTACGTCTCTGAAAACGATCCAACAAAAAACTATGCCTGCCTGACACCTGAGTTAGTGGCGGCGCT
GGGGCTGAAAGAAGGGATAGCAAAAAGCCTGCAGTGGACGCACAACGATGAATGCCTTAAACCGGGGCAATTAGACGGGA
TGGAAGTCGAGAATGATTTAAGCCAGTCGGCGTTGCTGCTGACAGTGCCACAGGCTTACCTCGAATATACCAGCAGCGAC
TGGGACCCTCCCTCACGTTGGGACGACGGTATTCCTGGCCTGATTGCCGACTACAGCCTCAATGCGCAAACTCGCCACCA
GGAGCAGGGGGGCGAGGACTCACATGATATCAGCGGCAACGGTACCGTTGGGGCGAACCTGGGGGCATGGCGTTTCCGTG
CAGACTGGCAAAGTGATTATCAGCACACCCGCAGCAACGATGACGACGATGACAGCAGTAACAGCACAACGAGTAAACAC
TGGGACTGGAGTCGTTATTACGCCTGGCGAGCCTTGCCCTCTTTAAAAGCGAAGCTGTCGCTGGGGGAAGATTATCTCAA
TTCCGATATTTTCGACGGCTTTAACTATATCGGCAGCAGCGTCAGCACCGATGACCAGATGCTGCCGCCCAACCTGCGCG
GCTATGCACCGGATGTTTCCGGCGTGGCGCACAGCAGTGCAAAAGTCACCATTAGCCAGATGGGCCGGGTACTTTACGAA
ACCCAGGTTCCAGCCGGGCCATTCCGCATTCAGGATATCGGCGACTCCGTCTCCGGCACACTGCACGTCCGCGTTGAAGA
ACAGAATGGTCAGGTGCAGGAATATGACGTTACGACCGCATCTATGCCATTCCTCACGCGCCAGGGGCAGGTGCGTTACA
AAGTGATGATGGGGCGTCCGGAAGACTGGAACCACAAGACCGAAGGCGGCTTTTTCTCCGGCGGAGAAGCGTCATGGGGG
GTGGCAGATGGCTGGTCGCTCTACGGTGGTGCGCTGGCAGATAAACACTATCAGTCGGCGGCGATGGGGGTGGGGCGCGA
CCTCGCACAGTTTGGCGCGCTGGCGTTCGATGTGACTCACTCGCACGTCAACCTGGATCATGACAGCGCATATGGCAAAG
GAAAACTGGACGGCAACTCCTTTCGCGTGAGCTATGCCAAAGACTTTGACGAACTCAACAGCCGCGTCACCTTTGCAGGC
TACCGTTTTTCTGAAAAGAACTTCATGACCATGAGCGAGTATCTGGACGCGAACCAGTCAGACATGGCGCGGACCGGTAA
CGACAAAGAGATGTATACGATCACCTATAACCAGAACTTTGCCGCTGCGGGTGTCTCGATCTATCTCAACTACTCCCATC
GTACTTACTGGGATCGCCCGGAACAGACAAACTATAACCTGATGTTTTCCCACTATTTTAATATGGGGAGCATTCGCAAC
ATGAGCATCTCGGTGACCGGCTATCGCTACGAATATGACGATAATGCGGATAAGGGGATGTACCTCAGTATGAGCATTCC
GTGGAGCGACAGCAGCACCGTGACCTACAACGGTTCCTACGGCAGCGGGTCGGACAGCAGCCAGGTCGGTTACTTTAAGC
GCGTCGATGACGCAACGCACTACCAGGTTAACGTTGGCACCAGCGAGCAGCACGGCAGCGCGGATGGTTATCTGAGCCAC
GACGGTTCGCTGGCGAAGGTTGATCTCAGCGCCAACTACCATGAAGGGGAATACCGTTCGGCGGGGATCGCCTTACAGGG
CGGGGCAACGCTGACCGCGCATGGTGGGGCGCTGCACCGTACCCAGAACATGGGCGGCACGCGCCTGCTGATTGACGCCG
ACGGGATTGCCAACGTGCCGGTCGAAAGCAACGGCGCGCCGGTGTACACCAATATGTTTGGCAAGGCGGTGGTCGCCGAT
ATTAACAACTACTATCGCAATCAGGCGTATATCGATTTAAACAACCTGCCGGAAGATGCGGAAGCCACCCAGTCGGTGGT
GCAGGCGACTCTGACGGAAGGCGCAATCGGCTATCGTAAATTCAAAGTGATCAGCGGGCAAAAAGCGATGGCGGTACTGC
GGTTACGTGACGGTAGCTACCCGCCGTTTGGCGCGGAAGTGAAAAACGACGAGCAGCAGCAGGTTGGCATTGTGGATGAT
GAAGGCAATGTCTATCTGGCGGGGGTCAATGCGGACGAGCATATGATGGTGTTCTGGGAAGGCAGCGCACAATGCGAGAT
CGTATTGCCGAAGCCGCTACCTGCCGATTTGTTCAGCGGCCTGTTGTTGCCGTGTGAACAAAAGGGAACGGCAGCCCCTG
ATTCTTCAGCGCCAGAAATTAAGCCTGTTATTCAGGGCCAGACGCGGCAAGTCACACCAACGGAAGCGCCGAGGTCAATT
TCAGCAACTCAATAA

Protein sequence :
MCMPNHSNFRLRGIACYIALAISGGSVNAWADDSIQFDPRFLELKGDTKIDLGKFSKKGYVDAGKYNLRVFINKHPLSDE
YDINWYVSENDPTKNYACLTPELVAALGLKEGIAKSLQWTHNDECLKPGQLDGMEVENDLSQSALLLTVPQAYLEYTSSD
WDPPSRWDDGIPGLIADYSLNAQTRHQEQGGEDSHDISGNGTVGANLGAWRFRADWQSDYQHTRSNDDDDDSSNSTTSKH
WDWSRYYAWRALPSLKAKLSLGEDYLNSDIFDGFNYIGSSVSTDDQMLPPNLRGYAPDVSGVAHSSAKVTISQMGRVLYE
TQVPAGPFRIQDIGDSVSGTLHVRVEEQNGQVQEYDVTTASMPFLTRQGQVRYKVMMGRPEDWNHKTEGGFFSGGEASWG
VADGWSLYGGALADKHYQSAAMGVGRDLAQFGALAFDVTHSHVNLDHDSAYGKGKLDGNSFRVSYAKDFDELNSRVTFAG
YRFSEKNFMTMSEYLDANQSDMARTGNDKEMYTITYNQNFAAAGVSIYLNYSHRTYWDRPEQTNYNLMFSHYFNMGSIRN
MSISVTGYRYEYDDNADKGMYLSMSIPWSDSSTVTYNGSYGSGSDSSQVGYFKRVDDATHYQVNVGTSEQHGSADGYLSH
DGSLAKVDLSANYHEGEYRSAGIALQGGATLTAHGGALHRTQNMGGTRLLIDADGIANVPVESNGAPVYTNMFGKAVVAD
INNYYRNQAYIDLNNLPEDAEATQSVVQATLTEGAIGYRKFKVISGQKAMAVLRLRDGSYPPFGAEVKNDEQQQVGIVDD
EGNVYLAGVNADEHMMVFWEGSAQCEIVLPKPLPADLFSGLLLPCEQKGTAAPDSSAPEIKPVIQGQTRQVTPTEAPRSI
SATQ

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
pixC CAE85159.1 PixC protein Not tested PAI V 536 Protein 1e-145 43
prfC CAD42029.1 PrfC protein Virulence PAI II 536 Protein 5e-139 42
papC AAZ04424.1 outer membrane usher protein Virulence PAI I APEC-O1 Protein 6e-138 42
papC YP_854254.1 outer membrane usher protein PapC Virulence PAI I APEC-O1 Protein 7e-138 42
papC NP_755465.1 PapC protein Virulence PAI I CFT073 Protein 6e-139 42
papC YP_002414015.1 Outer membrane usher protein PapC Not tested Not named Protein 5e-139 42
papC_2 NP_757034.1 PapC protein Virulence PAI II CFT073 Protein 5e-139 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
yfcU YP_002398709.1 putative export usher protein VFG1548 Protein 3e-139 42
yfcU YP_002398709.1 putative export usher protein VFG0884 Protein 3e-139 42
yfcU YP_002398709.1 putative export usher protein VFG0895 Protein 2e-139 42