Gene Information

Name : SN31241_40600 (SN31241_40600)
Accession : YP_008361975.1
Strain : Salmonella enterica USMARC-S3124.1
Genome accession: NC_021902
Putative virulence/resistance : Virulence
Product : outer membrane usher protein yfcU
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4106419 - 4109064 bp
Length : 2646 bp
Strand : +
Note : fimbrial outer membrane usher protein SteB PRK15273; outer membrane usher protein yfcU of Gammaproteobacteria UniRef RepID=YFCU_ECOLI

DNA sequence :
ATGCTTCTAAGCGTCTCCCCTTATAGCGCGTCAGGCAAAGACATCGAATTTAATACCGATTTCCTCGATGTAAAAAATCG
CGATAACGTTAACATTGCACAGTTTTCTCGTAAGGGTTTTATTCTGCCAGGCGTCTACCTTTTACAAATTAAAATTAACG
GACAGACTCTGCCGCAGGAATTTCCTGTTAACTGGGTTATTCCAGAACATGATCCACAAGGAAGTGAGGTTTGCGCAGAA
CCAGAATTAGTTACGCAATTGGGTATAAAGCCGGAACTCGCGGAAAAACTCGTCTGGATAACGCACGGCGAACGACAATG
TCTGGCGCCAGATTCACTGAAAGGCATGGATTTTCAGGCCGACCTGGGGCACTCCACGCTGTTGGTGAATTTACCCCAGG
CGTATATGGAATACAGCGATGTCGACTGGGACCCACCCGCCCGCTGGGATAATGGTATTCCCGGCATCATTCTGGATTAC
AACATTAATAATCAGCTCCGCCACGATCAAGAAAGCGGCAGCGAAGAGCAAAGCATCAGCGGCAACGGGACGTTAGGCGC
GAACCTGGGCGCATGGCGACTGCGGGCCGACTGGCAGGCCAGCTACGACCATCGTGACGATGACGAGAACACTTCCACTC
TCCACGATCAGAGCTGGAGCCGCTACTACGCCTATCGCGCACTACCGACGCTCGGGGCCAAACTTACGCTGGGCGAAAGC
TATCTCCAGTCCGATGTTTTCGACAGCTTTAACTATATCGGTGCCAGCGTCGTTTCTGACGATCAGATGCTGCCGCCGAA
ACTGCGCGGCTATGCGCCGGAGATCGTGGGTATTGCGCGCTCTAATGCAAAAGTCAAAGTCTCCTGGCAGGGGCGCGTAC
TGTATGAAACGCAGGTGCCCGCAGGACCGTTCCGTATTCAGGATCTCAACCAGTCCGTTTCCGGTACGTTGCACGTCACC
GTGGAAGAGCAGAACGGTCAGACCCAGGAGTTTGACGTTAACACCGCATCGGTTCCCTTCCTGACGCGCCCCGGCATGGT
GCGCTACAAGATGGCGCTGGGCCGCCCGCAGGACTGGGATCATCACCCTATTACCGGCACATTCGCCTCGGCGGAAGCTT
CGTGGGGGGTCACCAACGGCTGGTCGCTATATGGCGGCGCAATTGGAGAAAGCAGCTATCAGGCCGTGGCGTTGGGAAGC
GGTAAGGATCTTGGCGTGGTGGGCGCGGTGGCGGTTGACATTACGCACTCCATCGCCCACATGCCGCAAGACGACGGGTT
TGACGGCGAAACGCTGCAGGGTAACTCATATCGCATCAGCTACTCCCGTGACTTTGATGAAATCGACAGCCGACTAACCT
TTGCCGGATACCGCTTCTCAGAAAAGAACTTTATGAGCATGAGCGACTATCTGGATGCGAAAACCTATCATCATCTCAAT
GCCGGTCACGAAAAAGAACGCTATACGGTCACCTATAACCAGAACTTCCGTGAACAGGGCATGAGCGCCTATTTCAGCTA
CTCACGCAGTACCTTCTGGGACAGCCCGGATCAGAGTAACTATAACCTGTCTCTTTCCTGGTACTTCGACTTAGGGTCGA
TAAAAAATCTCAGTGCGTCGCTGAACGGCTATCGCAGCGAATATAACGGTGATAAAGATGATGGCGTCTATATCTCGCTG
TCTGTTCCCTGGGGCAATGATTCCATCAGCTACAACGGTACGTTTAACGGTAGTCAACACCGTAATCAGCTCGGCTATTC
CGGCCACAGCCAGAACGGCGATAACTGGCAGCTTCACGTCGGGCAGGATGAACAAGGCGCACAGGCAGACGGTTATTACA
GCCATCAGGGCGCGCTGACGGACATCGATCTGAGCGCGGATTATGAAGAAGGATCGTACCGTTCGCTGGGCATGTCGCTG
CGCGGCGGCATGACGCTGACCACCCAGGGCGGCGCGCTACACCGGGGAAGTTTAGCGGGCAGCACACGTTTGCTGGTTGA
TACCGACGGCATTGCGGACGTCCCCGTTAGCGGTAACGGCTCGCCAACCTCAACCAACATTTTCGGCAAGGCCGTGATTG
CGGATGTCGGAAGCTATTCGCGCAGCCTGGCGCGTATCGATCTGAACAAATTGCCGGAGAAGGCGGAAGCTACTAAGTCG
GTTGTGCAGATCACGCTCACCGAAGGCGCCATCGGCTACCGTCACTTTGACGTGGTCAGCGGCGAGAAAATGATGGCGGT
TTTCCGGCTGGCAGACGGCGACTTCCCACCGTTCGGCGCCGAAGTGAAAAACGAGCGCCAGCAGCAGTTGGGCCTGGTGG
CCGATGACGGCAACGCGTGGCTGGCGGGCGTAAAAGCCGGGGAAACATTGAAAGTATTCTGGGACGGCGCGGCGCAGTGT
GAAGCATCACTCCCGCCCACGTTTACACCGGAGCTATTGGCTAACGCGCTATTGCTGCCGTGCAAAATTCTGGAAGGTCA
GCCCCCCACCGCACCGCAGAAAAGTTCTCCGCTGCCTGCGCAACCGCTAATCCAGGAACATACGCAAACCGATGGCCAAC
CGGCCGCGCCGGTGGCGACAACCACTCAAACCCCGCCCATACCGCTGGCTGACAACCATGCGGTGAATCGCAAGGATATG
GAATAA

Protein sequence :
MLLSVSPYSASGKDIEFNTDFLDVKNRDNVNIAQFSRKGFILPGVYLLQIKINGQTLPQEFPVNWVIPEHDPQGSEVCAE
PELVTQLGIKPELAEKLVWITHGERQCLAPDSLKGMDFQADLGHSTLLVNLPQAYMEYSDVDWDPPARWDNGIPGIILDY
NINNQLRHDQESGSEEQSISGNGTLGANLGAWRLRADWQASYDHRDDDENTSTLHDQSWSRYYAYRALPTLGAKLTLGES
YLQSDVFDSFNYIGASVVSDDQMLPPKLRGYAPEIVGIARSNAKVKVSWQGRVLYETQVPAGPFRIQDLNQSVSGTLHVT
VEEQNGQTQEFDVNTASVPFLTRPGMVRYKMALGRPQDWDHHPITGTFASAEASWGVTNGWSLYGGAIGESSYQAVALGS
GKDLGVVGAVAVDITHSIAHMPQDDGFDGETLQGNSYRISYSRDFDEIDSRLTFAGYRFSEKNFMSMSDYLDAKTYHHLN
AGHEKERYTVTYNQNFREQGMSAYFSYSRSTFWDSPDQSNYNLSLSWYFDLGSIKNLSASLNGYRSEYNGDKDDGVYISL
SVPWGNDSISYNGTFNGSQHRNQLGYSGHSQNGDNWQLHVGQDEQGAQADGYYSHQGALTDIDLSADYEEGSYRSLGMSL
RGGMTLTTQGGALHRGSLAGSTRLLVDTDGIADVPVSGNGSPTSTNIFGKAVIADVGSYSRSLARIDLNKLPEKAEATKS
VVQITLTEGAIGYRHFDVVSGEKMMAVFRLADGDFPPFGAEVKNERQQQLGLVADDGNAWLAGVKAGETLKVFWDGAAQC
EASLPPTFTPELLANALLLPCKILEGQPPTAPQKSSPLPAQPLIQEHTQTDGQPAAPVATTTQTPPIPLADNHAVNRKDM
E

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
pixC CAE85159.1 PixC protein Not tested PAI V 536 Protein 2e-172 46
prfC CAD42029.1 PrfC protein Virulence PAI II 536 Protein 2e-168 45
papC YP_854254.1 outer membrane usher protein PapC Virulence PAI I APEC-O1 Protein 1e-166 45
papC AAZ04424.1 outer membrane usher protein Virulence PAI I APEC-O1 Protein 7e-167 45
papC NP_755465.1 PapC protein Virulence PAI I CFT073 Protein 4e-168 45
papC_2 NP_757034.1 PapC protein Virulence PAI II CFT073 Protein 2e-168 45
papC YP_002414015.1 Outer membrane usher protein PapC Not tested Not named Protein 2e-168 45

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
SN31241_40600 YP_008361975.1 outer membrane usher protein yfcU VFG1548 Protein 1e-168 45
SN31241_40600 YP_008361975.1 outer membrane usher protein yfcU VFG0884 Protein 2e-168 45
SN31241_40600 YP_008361975.1 outer membrane usher protein yfcU VFG0895 Protein 1e-168 45