Gene Information

Name : SN31241_11900 (SN31241_11900)
Accession : YP_008359108.1
Strain : Salmonella enterica USMARC-S3124.1
Genome accession : NC_021902
Putative virulence/resistance : Virulence
Product : outer membrane usher protein yfcU
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1220072 - 1222729 bp
Length : 2658 bp
Strand : +
Note : fimbrial outer membrane usher protein StfC PRK15284; outer membrane usher protein yfcU of Gammaproteobacteria UniRef RepID=YFCU_ECOLI
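
The Position, Length and Strand fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the gene is a complete coding sequence ending in a single stop codon; the function names are hypothetical and not part of the database.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(1220072, 1222729)
protein_len = expected_protein_length(length_bp)
print(length_bp, protein_len)  # 2658 885
```

Under these assumptions the computed length matches the Length field (2658 bp) and corresponds to an 885-residue protein.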

DNA sequence :
ATGGCTCACTATAAAAAATTTCGTCTGAGCACGCTTGCGGCCGTGGTGGGTATTGTTCTGGCTGTCGGTCCGGAAAATAG
CTATGCGGAAGCGCCCATTCAATTTAATACCCGATTTCTTGATGTTAAAGATGATGCCAGCCTGGATCTCTCCCGTTTTT
CCCGTAAAGGCTACATTATGCCGGGGAGCTATCATCTCCAGGTGCTGGTCAATCAGAGTCAAATTGCCCAGGATAATGTT
ATTACGTATTCCGTTGATAATAACGATCCTGATAACACCTATCCCTGTTTATCGCCTGAACTGGTATCGCTGCTGGGGTT
AAAACCTGAAATAGCAGATAAAATGATCTGGATAAATGCCGGTCAGTGTCTGCAACCAGATCAACTGGAAGGGATGGAAA
CCCAAACGGATTTAAGCCAGTCAACGCTGACGGTGATTATTCCGCAGGCCTATCTGGAATACAGCGATGAAGAGTGGGAT
CCACCTTCCCGCTGGGATGAAGGGATTCCCGGCGTATTATTTGACTACAACGTTAACAGCCAGTGGCGACATGCTGAACA
TGATGACGGCGATGAGTATGACATCAGCGGCAACGGCACGGTGGGTGCCAACCTCGGCGCGTGGCGTTTGCGCGCGGACT
GGCAGGCTAACTATCGTCACGAAAATGACAGCGAAGATAAAGACAACTTTGGCTCCAGTTCCGAACAGAACTGGGACTGG
AACCGCTATTACGCCTGGCGGGCGATCCCGCAGCTCCGGGCGCAGCTAACGCTGGGCGAAGGATCGCTGGAATCCGATAT
TTTCGACGGCTTTAACTATGTTGGCGGCAGCCTTATCACCGACGATCAGATGTTACCGCCTAATCTGCGCGGCTACGCCC
CGGATATATCAGGCGTGGCGCGCACCAACGCCAAAGTCACCGTGACCCAGCGCGGGCGGGTGATTTATGAGTCGCAGGTC
CCGGCTGGGCCGTTTCGCATTCAGGATATTAATGAAACGGTATCCGGCGATCTACACGTCAAAATTGAAGAACAAAGCGG
TCAGGTGCAGGAATATGACGTCAGCACCGCCTCCATTCCGTTTCTGACCCGTCCCGGCCAGGTGCGCTACAAGTTGGCAG
CCGGGCGACCGCAGGACTGGGATCACAATATGGAAGGCGGCTTTTTCACCTCAGCCGAAGCCTCCTGGGGGATCGCTAAC
GGCTGGTCGCTGTACGGCGGTGCCATCGGTGAGCAGGATTATCAGGCGCTGGCGTTAGGACTGGGGCGCGATCTGGCGCT
GCTGGGCGCGTTTTCCGTCGATGTCACCCATTCCCGTGCGACGCTGCCAGAGGGTAGCGCCTACGGCGACGGCACCATTC
AGGGTAACTCGTTCCGTGCCAGCTACGCTAAAGATTTTGATGATATAGACAGCCGTCTGACGTTTGCTGGTTATCGCTTT
TCCGAAGAAAACTACATGACGATGGACGAGTTTATCGATACGCATAATGACGATAACGATCGTCAGCGTACCGGCCACGA
TAAAGAGATGTATACCCTGACGTACAGCCAGAACTTTTCGGCAATAAACGTCAACGCCTATATCAACTACACCCATCGCA
CCTACTGGAATCAACCTAACCAGGACAGCTATAACCTGACGCTGTCGCACTATTTCGATGTCGGTGAGGTGCGCGGGATC
AGTCTGTCGGTGAACGGTTTTCGCAACGAATATGACAATGAGCGTGATGACGGCGTGTACGTCTCGCTCAGTATTCCGTG
GGGCAACAACCGCACGCTGAGCTACAACGGCTCCTTTAGCGATGACAACAACAGCAATCAGGTCGGCTATTACGAGCGCA
TTGACGATCGCAATAACTACCAGATCAACGCTGGTCGCGCGGATAACGGTGCGACTCTCGACGGCTACTACCGTCATCAA
GCGAGCTATGCCGACATTGACGTCAGTGCGAACTATCAGGAAGGCGACTATACCTCCGGCGGGCTGAACATCCAGGGCGG
CGCGACGCTGACTGCTAAAGGCGGGGCGCTGCACCGCACCAGCGTCAACGGCGGCTCGCGGCTGCTGGTGGATGTCGGCG
ATGAAGCGAACGTACCCATCTCCGGCTACAGCACGCCGGTATATACCAACGCGTTTGGTAAAGCCGTCATTGTCGACGTC
AACGACTACTACCGCAACCTGGTGAAAATCGACATTACCCAGTTGCCGGAAGACGCGGAAGCAACTCTCTCCATCGCTCA
GGCGACCCTGACGGAAGGGGCGATCGGTTATCGCCGCATGGAGGTGCTCAGCGGTAAAAAAGCCATGGCCAGTATCCGCC
TGCGCGATGGCGGCACGCCGCCCTTCGGCGCAGAGGTTTACAACAGCCGCCAGCAACAGTTAGGGATCGTAGGTGAAGAC
GGCAGCGTTTATCTGATCGGCATTAATCCCGGCGAGCGGTTGCAGGTGACATGGGAAGGTAAAACGCAGTGTGAAGCGGC
GTTGCCCGATCCGCTGCCGGGCGATCTGTTTAGCGGCCTGTTGCTGCCGTGCATCGGCGACGCTTCATCACCTGAGGCAA
CGCAGCCCGAAGAGAAACCATTACTCCAGCTTCATACGCAGCGGCGGACGTCTTCGACGCAACCCGAAGCGCTCTCTTCT
CGTTATCCAACACATTGA

Protein sequence :
MAHYKKFRLSTLAAVVGIVLAVGPENSYAEAPIQFNTRFLDVKDDASLDLSRFSRKGYIMPGSYHLQVLVNQSQIAQDNV
ITYSVDNNDPDNTYPCLSPELVSLLGLKPEIADKMIWINAGQCLQPDQLEGMETQTDLSQSTLTVIIPQAYLEYSDEEWD
PPSRWDEGIPGVLFDYNVNSQWRHAEHDDGDEYDISGNGTVGANLGAWRLRADWQANYRHENDSEDKDNFGSSSEQNWDW
NRYYAWRAIPQLRAQLTLGEGSLESDIFDGFNYVGGSLITDDQMLPPNLRGYAPDISGVARTNAKVTVTQRGRVIYESQV
PAGPFRIQDINETVSGDLHVKIEEQSGQVQEYDVSTASIPFLTRPGQVRYKLAAGRPQDWDHNMEGGFFTSAEASWGIAN
GWSLYGGAIGEQDYQALALGLGRDLALLGAFSVDVTHSRATLPEGSAYGDGTIQGNSFRASYAKDFDDIDSRLTFAGYRF
SEENYMTMDEFIDTHNDDNDRQRTGHDKEMYTLTYSQNFSAINVNAYINYTHRTYWNQPNQDSYNLTLSHYFDVGEVRGI
SLSVNGFRNEYDNERDDGVYVSLSIPWGNNRTLSYNGSFSDDNNSNQVGYYERIDDRNNYQINAGRADNGATLDGYYRHQ
ASYADIDVSANYQEGDYTSGGLNIQGGATLTAKGGALHRTSVNGGSRLLVDVGDEANVPISGYSTPVYTNAFGKAVIVDV
NDYYRNLVKIDITQLPEDAEATLSIAQATLTEGAIGYRRMEVLSGKKAMASIRLRDGGTPPFGAEVYNSRQQQLGIVGED
GSVYLIGINPGERLQVTWEGKTQCEAALPDPLPGDLFSGLLLPCIGDASSPEATQPEEKPLLQLHTQRRTSSTQPEALSS
RYPTH
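
The protein sequence can be checked against the DNA sequence by translating the open reading frame. The following is a minimal sketch, assuming the standard codon assignments (which bacterial translation table 11 shares for all sense and stop codons) and a sequence read on the + strand from its first base; `translate` is a hypothetical helper, not a function of any particular library.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the DNA sequence in this record.
print(translate("ATGGCTCACTATAAAAAATTTCGTCTGAGC"))  # MAHYKKFRLS
print(translate("CGTTATCCAACACATTGA"))              # RYPTH
```

Both fragments agree with the start (MAHYKKFRLS) and end (RYPTH) of the protein sequence listed above.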

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
pixC | CAE85159.1 | PixC protein | Not tested | PAI V 536 | Protein | 3e-159 | 46
prfC | CAD42029.1 | PrfC protein | Virulence | PAI II 536 | Protein | 6e-154 | 43
papC | AAZ04424.1 | outer membrane usher protein | Virulence | PAI I APEC-O1 | Protein | 1e-153 | 43
papC | YP_854254.1 | outer membrane usher protein PapC | Virulence | PAI I APEC-O1 | Protein | 2e-153 | 43
papC | NP_755465.1 | PapC protein | Virulence | PAI I CFT073 | Protein | 5e-154 | 43
papC | YP_002414015.1 | Outer membrane usher protein PapC | Not tested | Not named | Protein | 7e-154 | 43
papC_2 | NP_757034.1 | PapC protein | Virulence | PAI II CFT073 | Protein | 6e-154 | 43

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
SN31241_11900 | YP_008359108.1 | outer membrane usher protein yfcU | VFG1548 | Protein | 4e-154 | 43
SN31241_11900 | YP_008359108.1 | outer membrane usher protein yfcU | VFG0884 | Protein | 2e-154 | 43
SN31241_11900 | YP_008359108.1 | outer membrane usher protein yfcU | VFG0895 | Protein | 3e-154 | 43