Gene Information

Name : EcSMS35_2496 (EcSMS35_2496)
Accession : YP_001744540.1
Strain : Escherichia coli SMS-3-5
Genome accession: NC_010498
Putative virulence/resistance : Virulence
Product : fimbrial usher protein
Function : -
COG functional category : N : Cell motility
COG ID : COG3188
EC number : -
Position : 2541162 - 2543810 bp
Length : 2649 bp
Strand : -
Note : yfcU; identified by match to protein family HMM PF00577

DNA sequence :
ATGCCTAATCACTCAAATTTTCGGCTGCGGGGAATCGCCTGCTATATTGCGCTGGCAATCTCAGGTGGATCAGTCAATGC
ATGGGCTGATGATTCCATTCAATTTGACCCCCGTTTCCTTGAGTTAAAGGGCGACACGAAAATTGATCTCGGTAAGTTTT
CAAAAAAAGGGTATGTCGACGCGGGAAAATATAATTTACGTGTATTTATAAATAAACAACCCCTTTCTGATGAATACGAC
ATTAACTGGTATGTTTCTGAAAACGATCCAACAAAAACGTATGCCTGCCTGACACCTGAGTTAGTGGCGGCGCTGGGGCT
GAAAGAAGGGATAGCAAAAAGCCTGCAGTGGACGCACAACGATGAATGCCTTAAACCGGGGCAATTAGATGGGATGGAAG
TCGAGAATGATTTAAGCCAGTCGGCGTTGCTGCTGACAGTGCCACAGGCTTATCTCGAATATACCAGCAGCGACTGGGAC
CCACCCTCACGCTGGGACGACGGTATTCCTGGCCTGATTGCCGACTACAGCCTCAATGCGCAAACCCGCCACCAGGAGCA
GGGTGGCGAGGACTCACATGATATCAGCGGCAACGGTACCGTTGGGGCGAACCTGGGGGCATGGCGTTTCCGCGCAGACT
GGCAAAGTGATTATCAGCACACCCGCAGCAACGATGACGACGATGACAGCAGTAACAGTACAACGAGCAAAAACTGGGAC
TGGAGCCGTTATTACGCCTGGCGGGCCTTACCCTCTTTAAAAGCGAAGCTGTCGCTGGGGGAAGATTATCTCAATTCCGA
TATTTTCGACGGCTTTAACTATATTGGCAGCAGCGTCAGCACCGATGACCAGATGCTGCCGCCCAACCTGCGCGGCTATG
CGCCGGATGTCTCCGGCGTGGCGCACAGCAGTGCAAAAGTCACCATTAGCCAGATGGGCCGGGTACTTTACGAAACCCAG
GTTCCCGCCGGGCCATTCCGCATTCAGGATATCGGCGACTCCGTCTCCGGCACACTGCACGTCCGCGTTGAAGAACAGAA
TGGTCAGGTGCAGGAATATGACGTTACGACCGCATCTATGCCATTCCTCACGCGCCAGGGGCAGGTGCGTTACAAAGTGA
TGATGGGGCGTCCGGAAGACTGGAACCACAAGACCGAAGGCGGCTTTTTCTCCGGCGGAGAAGCGTCATGGGGGGTGGCA
GATGGCTGGTCGCTCTACGGTGGCGCGCTGGCAGATGAACACTACCAGTCGGCGGCGATGGGGGTGGGGCGCGACCTCGC
ACAGTTTGGCGCGCTGGCGTTCGATGTGACTCACTCGCACGTCAACCTGGATCATGACAGCGCATACGGCAAAGGAAAAC
TGGACGGCAACTCCTTTCGCGTGAGCTATGCCAAAGACTTTGACGAACTCAACAGCCGCGTCACCTTTGCAGGCTACCGT
TTTTCTGAAAAGAACTTCATGACCATGAGCGAGTATCTGGACGCGAACCAGTCGGACATGGCGCGGACCGGTAACGACAA
AGAGATGTATACGATCACCTATAACCAGAACTTTGCCGCTGCGGGTGTCTCGATCTATCTCAACTACTCCCATCGTACTT
ACTGGGATCGCCCGGAACAGACAAACTATAACCTGATGTTTTCCCACTATTTTAATATGGGGAGCATTCGCAACGTGAGC
ATCTCGGTGACCGGCTATCGCTACGAATATGACGATAACGCGGATAAGGGGATGTACCTCAGCATGAGCATTCCGTGGAG
CGACAGCAGCACCGTGACCTACAACGGTTCCTACGGCAGCGGGTCGGACAGCAGCCAGGTCGGTTACTTTAAGCGCGTTG
ATGACGCAACGCACTACCAGGTTAACGTCGGTACCAGCGAACAGCACGGCAGCGTGGATGGTTATCTGAGTCACGACGGC
TCGCTGGCGAAGGTTGATCTCAGCGCCAACTACCATGAGGGGGAATACCGCTCGGCGGGGATCGCCTTACAGGGCGGGGC
AACGCTGACCGCGCATGGTGGGGCGCTGCATCGCACTCAAAGCATGGGCGGTACGCGCCTGCTGATTGACGCCGACGGGA
TTGCCAATGTGCCGGTCGAAAGCAACGGCGCGCCGGTGTACACCAATATGTTTGGCAAGGCGGTGGTCGCCGATATTAAC
AACTACTATCGCAATCAGGCGTATATCGATTTAAACAACCTGCCAGAAGATGCGGAAGCCACCCAGTCGGTGGTACAGGC
AACCCTGACGGAAGGTGCAATTGGCTATCGCAAATTCAAAGTGATCAGCGGGCAAAAAGCGATGGCGGTACTGCGGTTGC
GTGACGGCAGCTACCCGCCGTTTGGCGCGGAAGTGAAAAACGACGAGCAGCAGCAGGTTGGCATTGTGGATGATGAAGGC
AATGTCTATCTGGCGGGAGTCAATGCGGGTGAGCATATGACGGTGTTCTGGGAAGGCAGCGCACAATGCGAGATCGTATT
GCCGAAGCCGCTACCTGCCGATCTGTTCAGCGGCCTGTTGTTGCCGTGTGAACAAAAGGGAACGGCAGCCCCTGATTCTT
CAGCGCCAGAAATTAAGCCTGTTATTCAGGACCAGACGCGGCAAGTCACACCAACGGAAGCGCCGACGTCAATTTCAGCT
ACTCAATAA

Protein sequence :
MPNHSNFRLRGIACYIALAISGGSVNAWADDSIQFDPRFLELKGDTKIDLGKFSKKGYVDAGKYNLRVFINKQPLSDEYD
INWYVSENDPTKTYACLTPELVAALGLKEGIAKSLQWTHNDECLKPGQLDGMEVENDLSQSALLLTVPQAYLEYTSSDWD
PPSRWDDGIPGLIADYSLNAQTRHQEQGGEDSHDISGNGTVGANLGAWRFRADWQSDYQHTRSNDDDDDSSNSTTSKNWD
WSRYYAWRALPSLKAKLSLGEDYLNSDIFDGFNYIGSSVSTDDQMLPPNLRGYAPDVSGVAHSSAKVTISQMGRVLYETQ
VPAGPFRIQDIGDSVSGTLHVRVEEQNGQVQEYDVTTASMPFLTRQGQVRYKVMMGRPEDWNHKTEGGFFSGGEASWGVA
DGWSLYGGALADEHYQSAAMGVGRDLAQFGALAFDVTHSHVNLDHDSAYGKGKLDGNSFRVSYAKDFDELNSRVTFAGYR
FSEKNFMTMSEYLDANQSDMARTGNDKEMYTITYNQNFAAAGVSIYLNYSHRTYWDRPEQTNYNLMFSHYFNMGSIRNVS
ISVTGYRYEYDDNADKGMYLSMSIPWSDSSTVTYNGSYGSGSDSSQVGYFKRVDDATHYQVNVGTSEQHGSVDGYLSHDG
SLAKVDLSANYHEGEYRSAGIALQGGATLTAHGGALHRTQSMGGTRLLIDADGIANVPVESNGAPVYTNMFGKAVVADIN
NYYRNQAYIDLNNLPEDAEATQSVVQATLTEGAIGYRKFKVISGQKAMAVLRLRDGSYPPFGAEVKNDEQQQVGIVDDEG
NVYLAGVNAGEHMTVFWEGSAQCEIVLPKPLPADLFSGLLLPCEQKGTAAPDSSAPEIKPVIQDQTRQVTPTEAPTSISA
TQ
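
The coordinate, length, and sequence fields above can be cross-checked against each other. The following is a minimal Python sketch of that check, not part of the database record: the function names are hypothetical, and it assumes that the Position field uses 1-based inclusive coordinates and that the final codon of the DNA sequence (TAA here) is a stop codon that is not translated.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the untranslated stop codon (assumption).
    return cds_length_bp // 3 - 1


# Values copied from the Position field of this record.
length_bp = gene_length(2541162, 2543810)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)
```

Under these assumptions the computed length matches the Length field (2649 bp), and the expected residue count (882) matches the protein sequence listed above.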

• Homologs from PAI DB

Gene   | GenBank Accn   | Product                           | Virulence or Resistance | PAI or REI    | Alignment Type | E-value | Identity
pixC   | CAE85159.1     | PixC protein                      | Not tested              | PAI V 536     | Protein        | 2e-145  | 43
prfC   | CAD42029.1     | PrfC protein                      | Virulence               | PAI II 536    | Protein        | 6e-139  | 42
papC   | YP_854254.1    | outer membrane usher protein PapC | Virulence               | PAI I APEC-O1 | Protein        | 9e-138  | 42
papC   | AAZ04424.1     | outer membrane usher protein      | Virulence               | PAI I APEC-O1 | Protein        | 7e-138  | 42
papC   | NP_755465.1    | PapC protein                      | Virulence               | PAI I CFT073  | Protein        | 8e-139  | 42
papC_2 | NP_757034.1    | PapC protein                      | Virulence               | PAI II CFT073 | Protein        | 1e-138  | 42
papC   | YP_002414015.1 | Outer membrane usher protein PapC | Not tested              | Not named     | Protein        | 1e-138  | 42

• Homologs from VFDB (virulence genes)

Gene         | GenBank Accn   | Product                | ID of source DB | Alignment Type | E-value | Identity
EcSMS35_2496 | YP_001744540.1 | fimbrial usher protein | VFG1548         | Protein        | 4e-139  | 42
EcSMS35_2496 | YP_001744540.1 | fimbrial usher protein | VFG0884         | Protein        | 3e-139  | 42
EcSMS35_2496 | YP_001744540.1 | fimbrial usher protein | VFG0895         | Protein        | 5e-139  | 42