Gene Information

Name : yfcU (ECS88_2485)
Accession : YP_002392163.1
Strain : Escherichia coli S88
Genome accession : NC_011742
Putative virulence/resistance : Virulence
Product : export usher protein
Function : -
COG functional category : N : Cell motility
COG ID : COG3188
EC number : -
Position : 2480848 - 2483502 bp
Length : 2655 bp
Strand : -
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; Product type pf : factor
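
The Position and Length fields agree under 1-based, inclusive coordinates: 2483502 - 2480848 + 1 = 2655 bp. That is 885 codons, or 884 residues plus one stop codon. A minimal sketch of this check follows; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Position and Length fields of this record.
length_bp = gene_length(2480848, 2483502)
print(length_bp, protein_length(length_bp))
```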

DNA sequence :
ATGTGTATGCCTAATCACTCAAATTTTCGGCTGCGGGGAATCGCCTGCTATATTGCGCTGGCAATCTCTGGTGGATCAGT
CAATGCATGGGCTGATGATTCAATTCAATTTGACCCCCGTTTCCTTGAGTTAAAGGGCGATACGAAAATTGATCTCGGTA
AGTTTTCAAAAAAAGGGTATGTCGACGCGGGTAAATATAATTTACGTGTATTTATAAATAAACAACCCCTTTCTGATGAA
TACGACATTAACTGGTACGTCTCTGAAAACGATCCAACAAAAAACTATGCCTGCCTGACACCTGAGTTAGTGGCGGCGCT
GGGGCTGAAAGAAGGGATAGCAAAAAGCCTGCAGTGGACGCACAACGATGAATGCCTTAAACCAGGTCAATTAGACGGGA
TGGAAGTCGAGAATGATTTAAGCCAGTCGGCGTTGCTGCTGACAGTGCCACAGGCTTACCTCGAATATACCAGCAGCGAC
TGGGACCCTCCCTCACGTTGGGACGACGGTATTCCTGGCCTGATTGCCGACTACAGCCTCAATGCGCAAACTCGCCACCA
GGAGCAGGGGGGCGAGGACTCACATGATATCAGCGGCAACGGTACCGTTGGGGCGAACCTGGGGGCATGGCGTTTCCGTG
CAGACTGGCAAAGTGATTATCAGCACACCCGCAGCAACGATGACGACGATGACAGCAGTAACAGTACAACGAGCAAAAAC
TGGGACTGGAGCCGTTATTACGCCTGGCGGGCCTTACCCTCTTTAAAAGCGAAGCTGTCGCTGGGGGAAGATTATCTCAA
TTCCGATATTTTCGACGGCTTTAACTATATCGGCAGCAGCGTCAGCACCGATGACCAGATGCTGCCGCCCAACCTGCGCG
GCTATGCGCCGGATGTCTCCGGCGTGGCGCACAGCAGTGCAAAAGTCACCATTAGCCAGATGGGCCGGGTACTTTACGAA
ACCCAGGTTCCCGCCGGGCCATTCCGCATTCAGGATATCGGCGACTCCGTCTCCGGCACACTGCACGTCCGCGTTGAAGA
ACAGAATGGTCAGGTGCAGGAATATGACGTTACCACCGCATCTATGCCATTCCTCACGCGCCAGGGGCAGGTGCGTTACA
AAGTGATGATGGGGCGTCCGGAAGACTGGAACCACAAGACCGAAGGCGGCTTTTTCTCCGGCGGAGAAGCGTCATGGGGG
GTGGCAGATGGCTGGTCGCTCTACGGTGGTGCGCTGGCAGATAAACACTATCAGTCGGCGGCGATGGGGGTGGGGCGCGA
CCTCGCACAGTTTGGCGCGCTGGCGTTCGATGTGACTCACTCGCACGTCAACCTGGATCATGACAGCGCATACGGCAAAG
GAAAACTGGACGGCAACTCCTTTCGCGTGAGCTATGCCAAAGACTTTGACGAACTCAACAGCCGCGTCACCTTTGCAGGC
TACCGTTTTTCTGAAAAGAACTTCATGACCATGAGCGAGTATCTGGACGCGAACCAGTCGGACATGGCGCGGACCGGTAA
CGACAAAGAGATGTATACGATCACCTATAACCAGAACTTTGCCGCTGCGGGTGTTTCGATCTATCTCAACTACTCCCATC
GTACTTACTGGGATCGCCCGGAACAGACAAACTATAACCTGATGTTTTCCCACTATTTTAATATGGGGAGCATTCGCAAC
ATGAGCATCTCGGTGACCGGCTATCGCTACGAATATGACGATAATGCGGATAAGGGGATGTACCTCAGTATGAGCATTCC
GTGGAGCGACAGCAGCACCGTGACCTACAACGGTTCCTACGGCAGCGGATCGGACAGCAGCCAGGTCGGTTACTTTAAGC
GCGTCGATGACGCAACGCACTACCAGGTTAACGTTGGCACCAGCGAGCAGCACGGCAGCGTGGATGGTTATCTGAGCCAC
GACGGTTCGCTGGCGAAGGTTGATCTCAGCGCCAACTACCATGAAGGGGAATACCGTTCGGCGGGGATCGCCTTACAGGG
CGGGGCAACGCTGACCGCGCATGGTGGGGCCCTGCACCGTACCCAGAACATGGGCGGCACGCGCCTGCTGATTGACGCCG
ACGGGATTGCCAACGTGCCGGTCGAAAGCAACGGCGCGCCGGTGTACACCAATATGTTTGGCAAGGCGGTGGTCGCCGAT
ATTAACAACTACTATCGCAATCAGGCGTATATCGATTTAAACAACTTGCCGGAAGATGCGGAAGCCACCCAGTCGGTGGT
GCAGGCAACCCTGACGGAAGGCGCAATCGGCTATCGCAAATTCAAAGTGATCAGCGGGCAAAAAGCGATGGCGGTACTGA
GGTTGCGTGACGGTAGCTACCCGCCGTTTGGCGCGGAAGTGAAAAATGACGAGCAGCAGCAGGTTGGCATTGTGGATGAT
GAAGGCAATGTCTATCTGGCGGGGGTCAATGCGGGCGAGCATATGATGGTGTTCTGGGAAGGCAGCGCACAATGCGAGAT
CGTATTGCCGAAGCCGCTACCTGCCGATTTGTTCAGCGGCCTGTTGTTGCCGTGTGAACAAAAGGGAACGGCAGCCCCTG
ATTCTTCAGCGCCAGAAATTAAGCCTGTTATTCAGGACCAGACGCGGCAAGTCACACCAACGGAAGCGCCGACGTCAATT
TCAGCAACTCAATAA

Protein sequence :
MCMPNHSNFRLRGIACYIALAISGGSVNAWADDSIQFDPRFLELKGDTKIDLGKFSKKGYVDAGKYNLRVFINKQPLSDE
YDINWYVSENDPTKNYACLTPELVAALGLKEGIAKSLQWTHNDECLKPGQLDGMEVENDLSQSALLLTVPQAYLEYTSSD
WDPPSRWDDGIPGLIADYSLNAQTRHQEQGGEDSHDISGNGTVGANLGAWRFRADWQSDYQHTRSNDDDDDSSNSTTSKN
WDWSRYYAWRALPSLKAKLSLGEDYLNSDIFDGFNYIGSSVSTDDQMLPPNLRGYAPDVSGVAHSSAKVTISQMGRVLYE
TQVPAGPFRIQDIGDSVSGTLHVRVEEQNGQVQEYDVTTASMPFLTRQGQVRYKVMMGRPEDWNHKTEGGFFSGGEASWG
VADGWSLYGGALADKHYQSAAMGVGRDLAQFGALAFDVTHSHVNLDHDSAYGKGKLDGNSFRVSYAKDFDELNSRVTFAG
YRFSEKNFMTMSEYLDANQSDMARTGNDKEMYTITYNQNFAAAGVSIYLNYSHRTYWDRPEQTNYNLMFSHYFNMGSIRN
MSISVTGYRYEYDDNADKGMYLSMSIPWSDSSTVTYNGSYGSGSDSSQVGYFKRVDDATHYQVNVGTSEQHGSVDGYLSH
DGSLAKVDLSANYHEGEYRSAGIALQGGATLTAHGGALHRTQNMGGTRLLIDADGIANVPVESNGAPVYTNMFGKAVVAD
INNYYRNQAYIDLNNLPEDAEATQSVVQATLTEGAIGYRKFKVISGQKAMAVLRLRDGSYPPFGAEVKNDEQQQVGIVDD
EGNVYLAGVNAGEHMMVFWEGSAQCEIVLPKPLPADLFSGLLLPCEQKGTAAPDSSAPEIKPVIQDQTRQVTPTEAPTSI
SATQ
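
The DNA sequence above is listed in coding orientation (it starts with ATG and ends with the stop codon TAA) even though the gene lies on the minus strand, so it can be translated directly. The sketch below assumes the standard codon assignments, which bacterial translation table 11 shares, and uses only the first 30 and last 15 nucleotides of the record as input. `translate` is an illustrative helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard genetic code).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTGTATGCCTAATCACTCAAATTTTCGG"))  # first 30 nt of the record
print(translate("TCAGCAACTCAATAA"))                 # last 15 nt of the record
```

The two fragments translate to MCMPNHSNFR and SATQ, matching the first ten and last four residues of the protein sequence above.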

• Homologs from PAI DB

| Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%) |
|---|---|---|---|---|---|---|---|
| pixC | CAE85159.1 | PixC protein | Not tested | PAI V 536 | Protein | 2e-144 | 43 |
| prfC | CAD42029.1 | PrfC protein | Virulence | PAI II 536 | Protein | 4e-138 | 42 |
| papC | AAZ04424.1 | outer membrane usher protein | Virulence | PAI I APEC-O1 | Protein | 4e-137 | 42 |
| papC | YP_854254.1 | outer membrane usher protein PapC | Virulence | PAI I APEC-O1 | Protein | 6e-137 | 42 |
| papC | NP_755465.1 | PapC protein | Virulence | PAI I CFT073 | Protein | 5e-138 | 42 |
| papC_2 | NP_757034.1 | PapC protein | Virulence | PAI II CFT073 | Protein | 6e-138 | 42 |
| papC | YP_002414015.1 | Outer membrane usher protein PapC | Not tested | Not named | Protein | 5e-138 | 42 |

• Homologs from VFDB (virulence genes)

| Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-value | Identity (%) |
|---|---|---|---|---|---|---|
| yfcU | YP_002392163.1 | export usher protein | VFG1548 | Protein | 3e-138 | 42 |
| yfcU | YP_002392163.1 | export usher protein | VFG0884 | Protein | 2e-138 | 42 |
| yfcU | YP_002392163.1 | export usher protein | VFG0895 | Protein | 2e-138 | 42 |