Gene Information

Name : sfmD (ECUMN_0572)
Accession : YP_002411335.1
Strain : Escherichia coli UMN026
Genome accession: NC_011751
Putative virulence/resistance : Virulence
Product : putative outer membrane export usher protein
Function : -
COG functional category : N : Cell motility
COG ID : COG3188
EC number : -
Position : 639818 - 642427 bp
Length : 2610 bp
Strand : +
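
The Position and Length fields above are consistent with 1-based, inclusive coordinates. A minimal sketch of that arithmetic follows; the function names are hypothetical and not part of any database API, and the protein-length helper assumes an uninterrupted reading frame that ends in a single stop codon.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = feature_length(639818, 642427)
print(length_bp)                           # 2610
print(expected_protein_length(length_bp))  # 869
```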
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; PubMedId : 1970114; Product type pf : putative factor

DNA sequence :
ATGAAAATACCCACTACTACGGATATTCCGCAGAGGTATACCTGGTGTCTGGCCGGAATTTGTTATTCATCTCTTGCCAT
TTTACCCTCCTTTTTAAGCTATGCGGAAAGTTATTTCAACCCGGCATTTTTATTAGAGAATGGCACATCCGTTGCTGATT
TATCGCGCTTTGAGAGAGGTAATCATCAACCTGCGGGCGTGTATCGGGTGGATCTCTGGCGTAATGATGAGTTCATTGGT
TCACAGGATATCGTATTTGAATCGACGACAGTAAATACAGGTGATAAATCAGGTGGGTTAATGCCTTGTTTTAATCAGGC
ACTCCTTGAACGAATTGGCCTTAATAGCAGTGCATTCCCCGAGTTAGCCCAGCAGCAAAATAATAAATGCATTAATTTAC
TGAAAGCTGTACCTGATGCCACAATTAACTTTGATTTTGCAGCGATGCGCCTGAACATCACTATTCCTCAGATAGCGTTG
TTGAGTAGCGCTCACGGTTACATTCCGCCTGAAGAGTGGGATGAAGGTATTCCTGCTTTACTCCTGAATTATAATTTCAC
CGGTAACAGAGGTAATGGTAACGATAGCTATTTTTTTAGTGAGCTCAGCGGGATTAATATTGGCCCGTGGCGTTTACGCA
ACAATGGTTCCTGGAACTATTTTCGCGGAAATGGATATCATTCAGAACAGTGGAATAATATTGGCACCTGGGTACAGCGC
GCCATTATTCCGCTGAAAAGTGAACTGGTAATGGGAGACGGCAATACAGGAAGTGATATTTTCGATGGCGTTGGATTTCG
TGGTGTACGGCTTTATTCTTCCGATAATATGTATCCTGATAGCCAGCAAGGGTTTGCCCCAACGGTACGTGGGATTGCCC
GTACGGCGGCCCAGCTAACGATTCGGCAAAATGGTTTTATTATCTATCAAAGCTATGTTTCCCCCGGCGCTTTTGAAATT
ACAGATTTGCACCCGACATCTTCAAATGGCGATCTGGATGTCACCATCGACGAGCGCGATGGCAATCAGCAGAATTATAC
AATTCCGTATTCAACAGTGCCGATTTTACAACGCGAAGGGCGTTTCAAATTTGACCTGACGGCGGGCGATTTTCGTAGCG
GTAATAGTCAGCAATCATCGCCTTTCTTTTTTCAGGGTACGGCACTCGGCGGTTTACCACAGGAATTTACTGCCTACGGC
GGGACGCAATTATCTGCAAATTACACCGCCTTTTTGTTAGGGTTGGGGCGCAACCTCGGGAACTGGGGCGCAGTGTCGCT
GGATGTAACGCATGCGCGCAGTCAGTTAGCCGACGACAGTCGTCATGAGGGGGATTCCATTCGCTTCCTCTATGCGAAAT
CAATGAACACTTTCGGCACCAATTTTCAGTTAATGGGTTACCGCTATTCGACACAAGGTTTTTATACCCTTGATGATGTT
GCGTATCGTCGAATGGAGGGGTACGAATATGATTACGATTATGACGGTGAGCATCGCGATGAACCGATAATCGTGAATTA
CCACAATTTACGCTTTAGCCGTAAAGACCGTTTGCAGTTAAATATTTCACAATCACTTAATGACTTTGGCTCGCTTTATA
TTTCTGGTACCCATCAAAAATACTGGAATACTTCGGATTCAGATACGTGGTATCAGGTGGGGTATACCAGCAGCTGGGTT
GGCATCAGTTATTCGCTCTCATTTTCGTGGAATGAATCTGTAGGGATTCCCGATAACGAACGTATTGTCGGACTTAATGT
TTCAGTGCCTTTCAATGTTTTGACCAAACGTCGCTACACCCGGGAAAATGCGCTCGACCGCGCTTATGCCTCCTTTAACG
CCAACCGTAACAGCAACGGGCAAAATAGCTGGCTGGCAGGTGTAGGTGGGACCTTACTGGAAGGCCACAACCTGAGTTAT
CACGTAAGCCAGGGCGATACCTCGAATAATGGGTATACGGGTAGCGCTACGGCAAACTGGCAGGCCACTTACGGTACGCT
GGGGGTCGGGTATAACTACGACCGCGATCAACATGACGTTAACTGGCAGCTGTCTGGCGGTGTGGTCGGGCATGAAAATG
GTATAACGCTGAGCCAGCCTTTAGGGGATACCAATGTTTTGATTAAAGCGCCTGGCGCAGGCGGTGTACGCATTGAAAAT
CAAACTGGCATTTTAACCGACTGGCGCGGCTATGCGGTGATGCCGTATGCCACGGTTTATCGGTATAACCGTATCGCGCT
TGATACCAATACGATGGGGAATTCCATCGATGTTGAAAAAAATATTAGCAGCGTTGTGCCGACGCAAGGCGCGTTGGTTC
GTGCCAATTTTGATACCCGCATAGGCGTGCGGGCGCTCATTACCGTTACCCAGGGCGGAAAACCGGTGCCGTTTGGATCA
CTGGTACGGGAAAACAGTACCGGAATAACCAGTATGGTGGGTGATGACGGGCAAGTTTATTTAAGCGGTGCGCCATTGTC
TGGTGAATTACTGGTTCAGTGGGGAGACGGTGCAAACTCACGCTGCATTGCGCACTATGTATTGCCGAAGCAAAGCTTAC
AGCAAGCCGTCACTGTTATTTCGGCAGTTTGCACACATCCTGGCTCATAA

Protein sequence :
MKIPTTTDIPQRYTWCLAGICYSSLAILPSFLSYAESYFNPAFLLENGTSVADLSRFERGNHQPAGVYRVDLWRNDEFIG
SQDIVFESTTVNTGDKSGGLMPCFNQALLERIGLNSSAFPELAQQQNNKCINLLKAVPDATINFDFAAMRLNITIPQIAL
LSSAHGYIPPEEWDEGIPALLLNYNFTGNRGNGNDSYFFSELSGINIGPWRLRNNGSWNYFRGNGYHSEQWNNIGTWVQR
AIIPLKSELVMGDGNTGSDIFDGVGFRGVRLYSSDNMYPDSQQGFAPTVRGIARTAAQLTIRQNGFIIYQSYVSPGAFEI
TDLHPTSSNGDLDVTIDERDGNQQNYTIPYSTVPILQREGRFKFDLTAGDFRSGNSQQSSPFFFQGTALGGLPQEFTAYG
GTQLSANYTAFLLGLGRNLGNWGAVSLDVTHARSQLADDSRHEGDSIRFLYAKSMNTFGTNFQLMGYRYSTQGFYTLDDV
AYRRMEGYEYDYDYDGEHRDEPIIVNYHNLRFSRKDRLQLNISQSLNDFGSLYISGTHQKYWNTSDSDTWYQVGYTSSWV
GISYSLSFSWNESVGIPDNERIVGLNVSVPFNVLTKRRYTRENALDRAYASFNANRNSNGQNSWLAGVGGTLLEGHNLSY
HVSQGDTSNNGYTGSATANWQATYGTLGVGYNYDRDQHDVNWQLSGGVVGHENGITLSQPLGDTNVLIKAPGAGGVRIEN
QTGILTDWRGYAVMPYATVYRYNRIALDTNTMGNSIDVEKNISSVVPTQGALVRANFDTRIGVRALITVTQGGKPVPFGS
LVRENSTGITSMVGDDGQVYLSGAPLSGELLVQWGDGANSRCIAHYVLPKQSLQQAVTVISAVCTHPGS
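
As a sanity check, the protein sequence can be reproduced from the DNA sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions: it uses the standard codon assignments (which the bacterial code, NCBI table 11, shares for sense codons), reads in frame from the first base, and stops at the first stop codon. The `translate` helper is hypothetical and is not the method used to generate this record.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a plus-strand DNA string in frame 1 up to the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons and last six codons of the DNA sequence above.
print(translate("ATGAAAATACCCACTACTACG"))  # MKIPTTT
print(translate("ACACATCCTGGCTCATAA"))     # THPGS
```

Passing the full 2610 bp sequence to the same function should give a string that can be compared directly against the protein sequence listed above.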

• Homologs from PAI DB

Gene   GenBank Accn  Product                          Virulence or Resistance  PAI or REI   Alignment Type  E-val   Identity (%)
fim2D  AFH78429.1    putative fimbrial usher protein  Virulence                KpGI-5       Protein         1e-177  47
sfaF   CAC16953.2    SfaF protein                     Virulence                PAI III 536  Protein         5e-174  46

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn    Product                                       ID of source DB  Alignment Type  E-val   Identity (%)
sfmD  YP_002411335.1  putative outer membrane export usher protein  VFG0446          Protein         0.0     72
sfmD  YP_002411335.1  putative outer membrane export usher protein  VFG0876          Protein         0.0     48
sfmD  YP_002411335.1  putative outer membrane export usher protein  VFG1645          Protein         3e-174  46
sfmD  YP_002411335.1  putative outer membrane export usher protein  VFG0912          Protein         4e-174  46