Gene Information

Name : sfmD (EC55989_0546)
Accession : YP_002401662.1
Strain : Escherichia coli 55989
Genome accession: NC_011748
Putative virulence/resistance : Virulence
Product : outer membrane export usher protein
Function : -
COG functional category : N : Cell motility
COG ID : COG3188
EC number : -
Position : 587502 - 590111 bp
Length : 2610 bp
Strand : +
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; PubMedId : 1970114; Product type pf : factor

DNA sequence :
ATGAAAATACCCACTACTACGGATATTCCGCAGAGGTATACCTGGTGTCTGGCCGGAATTTGTTATTCATCTCTTGCCAT
TTTACCCTCCTTTTTAAGCTATGCGGAAAGTTATTTCAACCCGGCATTTTTATTAGAGAATGGCACATCCGTTGCTGATT
TATCGCGCTTTGAGAGAGGTAATCATCAACCTGCGGGCGTGTATCGGGTGGATCTCTGGCGTAATGATGAGTTCATTGGT
TCACAGGATATCGTATTTGAATCGACAACAGAAAATACAGGTGATAAATCAGGTGGGTTAATGCCCTGTTTTAACCAGGT
ACTCCTTGAACGAATTGGCCTTAATAGCAGTGCATTTCCCGAGTTAGCCCAGCAGCAAAACAATAAATGCATCAATTTAC
TGAAAGCTGTACCTGATGCCACAATTAACTTTGATTTTGCAGCGATGCGCCTGAACATCACTATTCCTCAGATAGCGTTG
TTGAGTAGCGCTCACGGTTACATTCCGCCTGAAGAGTGGGATGAAGGTATTCCTGCTTTACTCCTGAATTATAATTTCAC
CGGTAACAGAGGTAATGGTAACGATAGCTATTTTTTTAGTGAGCTCAGCGGGATTAATATTGGCCCGTGGCGTTTACGCA
ACAATGGTTCCTGGAACTATTTTCGCGGAAATGGATATCATTCAGAGCAGTGGAATAATATTGGCACCTGGGTACAGCGC
GCCATTATTCCGCTGAAAAGTGAACTGGTAATGGGAGACGGCAATACAGGAAGTGATATTTTCGATGGTGTTGGATTTCG
TGGTATACGGCTTTACTCTTCCGATAATATGTATCCTGATAGCCAGCAAGGGTTTGCCCCAACGGTACGTGGGATTGCCC
GTACGGCGGCCCAGCTAACGATTCGGCAAAATGGTTTGATTATCTATCAAAGCTATGTTTCCCCCGGCGCTTTTGAAATT
ACAGATTTGCACCCGACATCTTCAAATGGCGATCTGGATGTCACCATCGACGAGCGCGATGGCAATCAGCAGAATTACAC
AATTCCGTATTCAACAGTGCCGATTTTACAACGCGAAGGGCGTTTCAAATTTGACCTGACGGCGGGCGATTTTCGTAGCG
GTAATAGTCAGCAATCATCACCTTTCTTTTTTCAGGGCACGGCACTCGGCGGTTTACCACAGGAATTTACTGCCTACGGC
GGGACGCAATTATCTGCCAATTACACCGCCTTTTTATTAGGACTGGGGCGCAATCTCGGAAACTGGGGGGCAGTGTCGCT
GGATGTGACCCATGCGCGCAGTCAGTTAGCCGACGACAGTCGTCATGAGGGGGATTCCATTCGCTTCCTCTATGCGAAAT
CGTTGAACACCTTCGGCACCAATTTTCAGTTAATGGGTTACCGCTATTCGACACAAGGTTTTTATACCCTTGATGATGTT
GCGTATCGTCGAATGGAGGGGTACGAATATGATTACGATTACGACGGTGAGCATCGGGATGAACCGATAATCGTGAATTA
CCACAATTTACGCTTTAGCCGTAAAGACCGTTTGCAGTTAAATATTTCACAATCACTTAATGACTTTGGCTCGCTTTATA
TCTCTGGTACCCATCAAAAATACTGGAATACTTCGGATTCAGATACCTGGTATCAGGTGGGGTATACCAGCAGCTGGGTT
GGTATCAGTTATTCACTCTCATTTTCGTGGAATGAATCTGTAGGGATCCCCGATAACGAACGTATTGTCGGACTTAATGT
TTCAGTGCCTTTCAATGTTCTGACCAAACGTCGCTACACCCGGGAAAATGCGCTCGACCGCGCTTATGCCTCCTTTAACG
CCAACCGTAACAGCAACGGGCAAAATAGCTGGCTGGCAGGTGTAGGTGGGACCTTACTGGAAGGCCACAACCTGAGTTAT
CACGTAAGCCAGGGTGATACCTCGAATAATGGGTATACGGGCAGCGCCACGGCAAACTGGCAGGCCACTTACGGTACGCT
GGGGGTCGGGTATAACTACGACCGCGATCAACATGACGTTAACTGGCAGCTGTCTGGCGGTGTGGTCGGGCACGAAAATG
GCATAACGCTGAGCCAGCCTTTAGGGGATACCAATGTTTTGATTAAAGCGCCTGGCGCAGGCGGTGTACGCATTGAAAAT
CAAACTGGCATTTTAACCGACTGGCGCGGCTATGCGGTGATGCCGTATGCCACGGTTTATCGGTATAACCGTATCGCGCT
TGATACCAATACGATGGGGAATTCCATCGATGTTGAAAAAAATATTAGCAGCGTTGTGCCGACGCAAGGCGCGTTGGTTC
GTGCCAATTTTGATACCCGCATAGGCGTGCGGGCGCTCATTACCGTTACCCAGGGCGGAAAACCGGTGCCGTTTGGATCA
CTGGTACGGGAAAACAGTACCGGAATAACCAGTATGGTGGGTGATGACGGGCAAGTTTATTTAAGTGGTGCGCCATTGTC
TGGTGAATTACTGGTTCAGTGGGGAGACGGCGCGAACTCACGCTGCATTGCGCACTATGTATTGCCGAAGCAAAGCTTAC
AGCAAGCCGTCACTGTTATTTCGGCAGTTTGCACACATCCTGGCTCATAA

Protein sequence :
MKIPTTTDIPQRYTWCLAGICYSSLAILPSFLSYAESYFNPAFLLENGTSVADLSRFERGNHQPAGVYRVDLWRNDEFIG
SQDIVFESTTENTGDKSGGLMPCFNQVLLERIGLNSSAFPELAQQQNNKCINLLKAVPDATINFDFAAMRLNITIPQIAL
LSSAHGYIPPEEWDEGIPALLLNYNFTGNRGNGNDSYFFSELSGINIGPWRLRNNGSWNYFRGNGYHSEQWNNIGTWVQR
AIIPLKSELVMGDGNTGSDIFDGVGFRGIRLYSSDNMYPDSQQGFAPTVRGIARTAAQLTIRQNGLIIYQSYVSPGAFEI
TDLHPTSSNGDLDVTIDERDGNQQNYTIPYSTVPILQREGRFKFDLTAGDFRSGNSQQSSPFFFQGTALGGLPQEFTAYG
GTQLSANYTAFLLGLGRNLGNWGAVSLDVTHARSQLADDSRHEGDSIRFLYAKSLNTFGTNFQLMGYRYSTQGFYTLDDV
AYRRMEGYEYDYDYDGEHRDEPIIVNYHNLRFSRKDRLQLNISQSLNDFGSLYISGTHQKYWNTSDSDTWYQVGYTSSWV
GISYSLSFSWNESVGIPDNERIVGLNVSVPFNVLTKRRYTRENALDRAYASFNANRNSNGQNSWLAGVGGTLLEGHNLSY
HVSQGDTSNNGYTGSATANWQATYGTLGVGYNYDRDQHDVNWQLSGGVVGHENGITLSQPLGDTNVLIKAPGAGGVRIEN
QTGILTDWRGYAVMPYATVYRYNRIALDTNTMGNSIDVEKNISSVVPTQGALVRANFDTRIGVRALITVTQGGKPVPFGS
LVRENSTGITSMVGDDGQVYLSGAPLSGELLVQWGDGANSRCIAHYVLPKQSLQQAVTVISAVCTHPGS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
fim2D AFH78429.1 putative fimbrial usher protein Virulence KpGI-5 Protein 1e-176 47
sfaF CAC16953.2 SfaF protein Virulence PAI III 536 Protein 1e-173 46

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
sfmD YP_002401662.1 outer membrane export usher protein VFG0446 Protein 0.0 72
sfmD YP_002401662.1 outer membrane export usher protein VFG0876 Protein 0.0 48
sfmD YP_002401662.1 outer membrane export usher protein VFG1645 Protein 8e-174 46
sfmD YP_002401662.1 outer membrane export usher protein VFG0912 Protein 1e-173 46