Gene Information

Name : yfcU (ECIAI39_2490)
Accession : YP_002408445.1
Strain : Escherichia coli IAI39
Genome accession: NC_011750
Putative virulence/resistance : Virulence
Product : putative export usher protein
Function : -
COG functional category : N : Cell motility
COG ID : COG3188
EC number : -
Position : 2574511 - 2577168 bp
Length : 2658 bp
Strand : -
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; Product type pf : putative factor

DNA sequence :
ATGTGTATGCCTAATCACTCAAATTTTCGGCTGCGGGGAATCGCCTGCTATATTGCGCTGGCAATCTCTGGTGGATCAGT
CAATGCATGGGCTGATGATTCCATTCAATTTGACCCCCGTTTCCTTGAGTTAAAGGGTGACACGAAAATTGATCTCGGTA
AGTTTTCAAAAAAAGGGTATGTCGACGCGGGTAAATATAATTTACGTGTATTTATAAATAAACAACCCCTTTCTGATGAA
TACGACATTAACTGGTACGTCTCTGAAAATGATCCAACAAAAACTTATGCCTGCCTGACACCTGAGTTAGTGGCGGCGCT
GGGGCTGAAAGAAGGGATAGCAAAAAGCCTGCAGTGGACGCACAACGATGAATGCCTTAAACCGGGGCAATTAGATGGGA
TGGAAGTCGAGAATGATTTAAGCCAGTCGGCGTTGCTGCTGACAGTGCCACAGGCTTATCTCGAATATACCAGCAGCGAC
TGGGACCCACCCTCACGCTGGGACGACGGTATTCCTGGCCTGATTGCCGACTACAGCCTCAATGCGCAAACCCGTCATCA
GGAGCAGGGTGGCGAGGACTCACATGATATCAGCGGCAACGGTACCGTTGGGGCGAATCTGGGGGCATGGCGTTTCCGCG
CAGACTGGCAAAGTGATTATCAGCACACCCGCAGCAACGACGACGAAGACGATGACAGCAGTAACAGTACAACGAGCAAA
AACTGGGACTGGAGCCGTTATTACGCCTGGCGGGCCTTACCCTCTTTAAAAGCCAAGCTGTCGCTGGGTGAAGATTATCT
CAACTCCGATATTTTCGACGGCTTTAACTATATTGGCAGCAGCGTCAGCACCGATGACCAGATGCTGCCGCCCAACCTGC
GCGGCTATGCGCCGGATGTCTCCGGCGTGGCGCACAGCAGTGCAAAAGTCACTATTAGCCAGATGGGCCGGGTACTTTAC
GAAACCCAGGTTCCCGCCGGGCCATTCCGCATTCAGGATATCGGCGACTCCGTCTCCGGCACACTGCACGTCCGCGTTGA
AGAACAGAATGGTCAGGTGCAGGAATATGACGTTACGACCGCATCTATGCCATTCCTCACGCGCCAGGGGCAGGTGCGTT
ACAAAGTGATGATGGGGCGTCCGGAAGACTGGAACCACAAGACCGAAGGCGGCTTTTTCTCCGGCGGAGAAGCGTCATGG
GGGGTGGCAGATGGCTGGTCGCTCTACGGTGGCGCGCTGGCAGATGAACACTACCAGTCGGCGGCGATGGGGGTGGGGCG
CGACCTCGCACAGTTTGGCGCGCTGGCGTTCGATGTGACTCACTCGCACGTCAACCTGGATCATGACAGCGCATACGGCA
AAGGAAAACTGGACGGCAACTCCTTTCGCGTGAGCTATGCCAAAGACTTTGACGAACTTAACAGCCGCGTCACCTTTGCA
GGCTACCGTTTTTCTGAAAAGAACTTCATGACCATGAGCGAGTATCTGGACGCGAACCAGTCGGACATGGCGCGGACCGG
TAACGACAAAGAGATGTATACGATCACCTATAACCAGAACTTTGCCGCTGCGGGTGTCTCGATCTATCTCAACTACTCCC
ATCGTACTTACTGGGATCGCCCGGAACAGACAAACTATAACCTGATGTTTTCCCACTATTTTAATATGGGGAGTATTCGC
AACATGAGCATCTCGGTGACCGGCTATCGCTACGAATATGACGATAACGCGGATAAGGGGATGTACCTCAGCATGAGCAT
TCCGTGGAGCGACAGCAGCACCGTGACCTACAACGGCTCCTACGGCAGTGGGTCGGACAGCAGCCAGGTCGGTTACTTTA
AGCGCGTTGATGACGCAACGCACTACCAGGTTAACGTCGGTACCAGCGAACAGCACGGCAGCGTGGATGGTTATCTGAGT
CACGACGGCTCGCTGGCGAAGGTTGATCTCAGCGCCAACTACCATGAGGGGGAATACCGCTCGGCGGGGATCGCCTTACA
GGGCGGGGCAACGCTGACCGCGCATGGTGGGGCGCTGCATCGCACTCAAAGCATGGGCGGTACGCGCTTGCTGATTGACG
CCGACGGGATTGCCAATGTGCCGGTCGAAAGCAACGGCGCGCCGGTGTACACCAATATGTTTGGCAAGGCGGTGGTCGCC
GATATTAACAACTACTATCGCAATCAGGCGTATATCGATTTAAACAACCTGCCAGAAGATGCGGAAGCCACCCAGTCGGT
GGTACAGGCAACCCTGACGGAAGGTGCAATTGGCTATCGCAAATTCAAAGTGATCAGCGGGCAAAAAGCGATGGCGGTAC
TGCGGTTGCGTGACGGCAGCTACCCGCCGTTTGGCGCGGAAGTGAAAAACGACGAGCAGCAGCAGGTTGGCATTGTGGAT
GATGAAGGCAATGTCTATCTGGCGGGAGTCAATGCGGGTGAGCATATGACGGTGTTCTGGGAAGGCAGCGCACAATGCGA
GATCGTATTGCCGAAGCCGCTACCTGCCGATCTGTTCAGCGGCCTGTTGTTGCCGTGTGAACAAAAGGGAACGGCAGCCC
CTGATTCTTCAGCGCCAGAAATTAAGCCTGTTATTCAGGACCAGACGCGGCAAGTCACACCAACGGAAGCGCCGACGTCA
ATTTCAGCTACTCAATAA

Protein sequence :
MCMPNHSNFRLRGIACYIALAISGGSVNAWADDSIQFDPRFLELKGDTKIDLGKFSKKGYVDAGKYNLRVFINKQPLSDE
YDINWYVSENDPTKTYACLTPELVAALGLKEGIAKSLQWTHNDECLKPGQLDGMEVENDLSQSALLLTVPQAYLEYTSSD
WDPPSRWDDGIPGLIADYSLNAQTRHQEQGGEDSHDISGNGTVGANLGAWRFRADWQSDYQHTRSNDDEDDDSSNSTTSK
NWDWSRYYAWRALPSLKAKLSLGEDYLNSDIFDGFNYIGSSVSTDDQMLPPNLRGYAPDVSGVAHSSAKVTISQMGRVLY
ETQVPAGPFRIQDIGDSVSGTLHVRVEEQNGQVQEYDVTTASMPFLTRQGQVRYKVMMGRPEDWNHKTEGGFFSGGEASW
GVADGWSLYGGALADEHYQSAAMGVGRDLAQFGALAFDVTHSHVNLDHDSAYGKGKLDGNSFRVSYAKDFDELNSRVTFA
GYRFSEKNFMTMSEYLDANQSDMARTGNDKEMYTITYNQNFAAAGVSIYLNYSHRTYWDRPEQTNYNLMFSHYFNMGSIR
NMSISVTGYRYEYDDNADKGMYLSMSIPWSDSSTVTYNGSYGSGSDSSQVGYFKRVDDATHYQVNVGTSEQHGSVDGYLS
HDGSLAKVDLSANYHEGEYRSAGIALQGGATLTAHGGALHRTQSMGGTRLLIDADGIANVPVESNGAPVYTNMFGKAVVA
DINNYYRNQAYIDLNNLPEDAEATQSVVQATLTEGAIGYRKFKVISGQKAMAVLRLRDGSYPPFGAEVKNDEQQQVGIVD
DEGNVYLAGVNAGEHMTVFWEGSAQCEIVLPKPLPADLFSGLLLPCEQKGTAAPDSSAPEIKPVIQDQTRQVTPTEAPTS
ISATQ

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
pixC CAE85159.1 PixC protein Not tested PAI V 536 Protein 5e-145 43
prfC CAD42029.1 PrfC protein Virulence PAI II 536 Protein 2e-138 42
papC YP_854254.1 outer membrane usher protein PapC Virulence PAI I APEC-O1 Protein 2e-137 42
papC AAZ04424.1 outer membrane usher protein Virulence PAI I APEC-O1 Protein 2e-137 42
papC NP_755465.1 PapC protein Virulence PAI I CFT073 Protein 3e-138 42
papC YP_002414015.1 Outer membrane usher protein PapC Not tested Not named Protein 3e-138 42
papC_2 NP_757034.1 PapC protein Virulence PAI II CFT073 Protein 3e-138 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
yfcU YP_002408445.1 putative export usher protein VFG1548 Protein 1e-138 42
yfcU YP_002408445.1 putative export usher protein VFG0884 Protein 1e-138 42
yfcU YP_002408445.1 putative export usher protein VFG0895 Protein 1e-138 42