Gene Information

Name : EC55989_2582 (EC55989_2582)
Accession : YP_002403604.1
Strain : Escherichia coli 55989
Genome accession: NC_011748
Putative virulence/resistance : Virulence
Product : outer membrane usher protein yfcU precursor
Function : -
COG functional category : N : Cell motility
COG ID : COG3188
EC number : -
Position : 2651339 - 2653981 bp
Length : 2643 bp
Strand : -
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; Product type pt : transporter

DNA sequence :
ATGCCTGACCATTCTCTTTTTCGATTACGGATTCTTCCGTGGTGCATTGCGCTGGCAATGTCAGGGAGTTATAGCAGTGT
CTGGGCTGAAGACGACATTCAGTTTGATTCCCGTTTTCTGGAATTAAAAGGCGACACAAAAATTGATCTGAAGCGTTTTT
CCAGCCAGGGATATGTTGAGCCCGGAAAATACAATTTACAGGTTCAACTAAATAAACAGCCATTGGCGGAAGAGTACGAT
ATTTACTGGTATGCTGGTGAAGATGACGCGAGCAAAAGCTATGCTTGTCTGACACCGGAACTGGTAGCGCAGTTTGGTTT
AAAAGAAGACGTGGCGAACAATCTGCAATGGAGCCACGATGCTAAATGCCTGAAATCCGGTCAACTGGAAGGCATGGAAA
TTAAGGCTGATTTAAGCCAGTCCGCATTAGTCATTTCACTGCCACAGGCTTACCTCGAATATACTTATCCCGACTGGGAT
CCGCCTTCACGTTGGGATGACGGCATCTCCGGGATCGTCGCGGACTACAGCATCAACGCACAAACCCGGCACGAAGAAAA
TGGCGGTGATGATAGTAACGAGATCAGCGGCAACGGGACGGTCGGGGTTAACCTGGGGCCGTGGCGTATGCGTGCTGACT
GGCAGACTAACTATCAACATACTCGCAGTAATGATGACGATGAATTCAGCGGCGATGAAACTCAAAAAAAATGGGAGTGG
AGTCGCTACTATGCCTGGCGGGCGTTACCATCATTAAAAGCCAAACTGGCGCTGGGCGAGGATTACCTCAGATCCGATAT
TTTTGATGGTTTTAACTATGTTGGTGGCAGTGTCAGTACTGACGATCAAATGTTGCCTCCCAATCTGCGCGGCTACGCGC
CAGACATTTCCGGCGTGGCACACACCACAGCAAAAGTGACCGTCAGCCAGATGGGGCGTGTGATTTACGAAACGCAGGTT
CCGGCTGGACCGTTTCGTATTCAGGATCTTGGTGATTCTGTCTCCGGTACGTTGCATATTCGCATTGAAGAACAGAACGG
CCAGGTGCAGGAATATGACATCAGCACCGCCTCGATGCCATACCTCACTCGCCCAGGTCAGGTTCGCTATAAGATCATGA
TGGGCCGTCCGCAAGAGTGGGGACACCATGTCGAGGGTGAATTTTTTTCTGGTGCTGAAGCTTCCTGGGGGATCGCTAAC
GGCTGGTCGTTATATGGCGGCGCACTGGGAGATGAAAACTATCAGTCTGCGGCGCTTGGCGTCGGTCGCGATTTGTCTAC
ATTCGGCGCGGTCGCGTTTGATGTTACTCACTCGCACACCAAACTGGATAAAGACACCGCTTATGGCAAAGGTTCGCTGG
ACGGTAACTCCTTCCGTGTGAGTTATTCCAAAGACTTTGACCAGCTCAACAGCCGCGTTACTTTTGCTGGATATCGCTTC
TCGGAAGAGAACTTTATGACCATGAGTGAGTATCTGGATGCCAGTGACAGCGGAATGGTACGCACGGGCAACGACAAAGA
GATGTACACCGCCACTTATAACCAGAACTTCCGCGATGCGGGTGTTTCGGTTTATCTCAACTATACCCGCCATACCTACT
GGGATCGCGAGGAGCAGACAAACTACAACATCATGCTCTCGCACTATTTCAATATGGGCAGTATTCGTAATGTCAGCATC
TCGATGACTGGCTACCGTTACGAGTATGACAACCAGGCCGACAAAGGCATGTACATTTCGCTCAGTATGCCGTGGGGCGA
CAACAGTACCGTTAGCTATAACGGTAACTATGGCAGTGGGACGGACAGCAGTCAGGTCGGTTATTTCAGCCGTGTCGATG
ACGCGACTCACTATCAGTTGAACGTCGGCACCAGTGACAAACACACCAGCGTTGACGGCTATTACAGCCATGATGGTTCG
CTGGCGCAGGTTGACCTCAGTGCGAACTACCATGAAGGGCAATACACCTCTGCGGGCTTGTCGTTACAGGGCGGCGCAAC
GCTTACTGCCCACGGTGGCGCACTTCACCGTACCCAGAATATGGGCGGGACACGCTTGTTGATTGATGCCGATGGCGTTG
CCGATGTTCCGGTGGAAGGTAACGGGGCTGCTGTTTATACCAATATGTTTGGTAAAGCCGTCGTTTCTGACGTCAATAAC
TATTACCGCAATCAGGCGTATATCGACCTCAACAGATTGCCTGAAAACGCTGAAGCAACCCAGTCGGTGGTGCAAGCCAC
GCTAACTGAAGGTGCCATTGGCTACCGCAAATTTGCCGTCATCAGTGGTCAAAAAGCGATGGCGGTGCTGCGTTTACAAG
ACGGCAGCCATCCTCCGTTTGGCGCAGAAGTAAAAAATGATAACGAGCAGACAGTGGGCCTTGTCGATGATGACGGCAAT
GTTTATCTGGCTGGGGTGAAACCTGGCGAACACATGAGTGTGTTCTGGAGTGGTGTTGCGCATTGCGATATCAACCTGCC
GGACCCGCTGCCTGCCGATCTGTTTAACGGCTTGTTACTGCCATGCCAGCATAAAGGCAACGTCGCGCCGGTTGTTCCTG
ATGACATAAAGCCTGTCATTCAGGAGCAGACGCAACAGGTGACACCCACCAATCCCCCTGTTTCTGTTTCAGCTAACCAA
TAA

Protein sequence :
MPDHSLFRLRILPWCIALAMSGSYSSVWAEDDIQFDSRFLELKGDTKIDLKRFSSQGYVEPGKYNLQVQLNKQPLAEEYD
IYWYAGEDDASKSYACLTPELVAQFGLKEDVANNLQWSHDAKCLKSGQLEGMEIKADLSQSALVISLPQAYLEYTYPDWD
PPSRWDDGISGIVADYSINAQTRHEENGGDDSNEISGNGTVGVNLGPWRMRADWQTNYQHTRSNDDDEFSGDETQKKWEW
SRYYAWRALPSLKAKLALGEDYLRSDIFDGFNYVGGSVSTDDQMLPPNLRGYAPDISGVAHTTAKVTVSQMGRVIYETQV
PAGPFRIQDLGDSVSGTLHIRIEEQNGQVQEYDISTASMPYLTRPGQVRYKIMMGRPQEWGHHVEGEFFSGAEASWGIAN
GWSLYGGALGDENYQSAALGVGRDLSTFGAVAFDVTHSHTKLDKDTAYGKGSLDGNSFRVSYSKDFDQLNSRVTFAGYRF
SEENFMTMSEYLDASDSGMVRTGNDKEMYTATYNQNFRDAGVSVYLNYTRHTYWDREEQTNYNIMLSHYFNMGSIRNVSI
SMTGYRYEYDNQADKGMYISLSMPWGDNSTVSYNGNYGSGTDSSQVGYFSRVDDATHYQLNVGTSDKHTSVDGYYSHDGS
LAQVDLSANYHEGQYTSAGLSLQGGATLTAHGGALHRTQNMGGTRLLIDADGVADVPVEGNGAAVYTNMFGKAVVSDVNN
YYRNQAYIDLNRLPENAEATQSVVQATLTEGAIGYRKFAVISGQKAMAVLRLQDGSHPPFGAEVKNDNEQTVGLVDDDGN
VYLAGVKPGEHMSVFWSGVAHCDINLPDPLPADLFNGLLLPCQHKGNVAPVVPDDIKPVIQEQTQQVTPTNPPVSVSANQ

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
prfC CAD42029.1 PrfC protein Virulence PAI II 536 Protein 9e-152 43
papC NP_755465.1 PapC protein Virulence PAI I CFT073 Protein 1e-151 43
papC YP_002414015.1 Outer membrane usher protein PapC Not tested Not named Protein 1e-151 43
papC_2 NP_757034.1 PapC protein Virulence PAI II CFT073 Protein 1e-151 43
papC YP_854254.1 outer membrane usher protein PapC Virulence PAI I APEC-O1 Protein 1e-150 42
papC AAZ04424.1 outer membrane usher protein Virulence PAI I APEC-O1 Protein 1e-150 42
pixC CAE85159.1 PixC protein Not tested PAI V 536 Protein 8e-157 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
EC55989_2582 YP_002403604.1 outer membrane usher protein yfcU precursor VFG1548 Protein 6e-152 43
EC55989_2582 YP_002403604.1 outer membrane usher protein yfcU precursor VFG0884 Protein 5e-152 43
EC55989_2582 YP_002403604.1 outer membrane usher protein yfcU precursor VFG0895 Protein 4e-152 43