Gene Information

Name : focD (c1242)
Accession : NP_753156.1
Strain : Escherichia coli CFT073
Genome accession: NC_004431
Putative virulence/resistance : Virulence
Product : F1C fimbrial usher
Function : -
COG functional category : N : Cell motility
COG ID : COG3188
EC number : -
Position : 1190123 - 1192801 bp
Length : 2679 bp
Strand : +
Note : Escherichia coli K-12 ortholog: b4317; Escherichia coli O157:H7 ortholog: z5915

DNA sequence :
TTGTGTTTGGCTGCTGATTTTATTTTTTCGCTCAGTGTGGGGGCTGACATGTTTTTCGGAGATGGCGGACAGCTCCTGTC
TGATAAATCACTGACCGGCTCTGCTGGCGGGGGTAATAACAGAATGAAATTTAATATACTGCCGCTGGCATTTTTTATCG
GGATAATTGTTTCTCCGGCCCGGGCTGAGCTCTATTTTAATCCGCGTTTTCTGTCTGATGACCCGGATGCGGTTGCGGAC
CTGTCTGCATTTACACAGGGGCAGGAGCTTCCGCCAGGCGTGTACCGGGTGGATATTTATCTGAATGATACGTATATCTC
AACCCGTGACGTGCAGTTTCAGATGAGTCAGGACGGGAAGCAACTTGCCCCCTGCCTGAGTCCGGAGCATATGAGTGCGA
TGGGGGTAAACCGTTATGCCGTGCCGGGTATGGAGAGACTGCCGGCTGACACCTGCACATCACTGAATTCCATGATTCAG
GGTGCCACATTCCGGTTTGATGTCGGGCAGCAGCGATTGTACCTGACCGTTCCTCAGTTATATATGAGTAATCAGGCCCG
TGGTTATATTGCGCCGGAATACTGGGATAACGGAATTACAGCGGCATTGCTGAATTATGACTTCAGTGGTAATCGGGTGC
GCGACAGCTATGGTGGCACCAGCGATTATGCCTATCTGAATCTGAAAACAGGACTGAATATCGGGAGCTGGCGTCTTCGG
GACAACACCAGCTGGAGTTACAGTGCCGGAAAGGGGTACAGCCAGAATAACTGGCAGCACATTAACACCTGGCTGGAGCG
GGATATAGTTTCCCTGCGCTCCCGTCTGACGATGGGGGACAGTTATACCCGGGGGGATATTTTTGACGGTGTAAATTTCA
GGGGGATTCAACTGGCCTCTGATGACAATATGGTTCCGGACAGCCAGAGAGGGTATGCACCGACAATTCACGGTATTTCC
CGGGGAACATCCCGGATTAGTATCCGCCAGAACGGGTATGAAATTTACCAGAGTACGCTACCTCCAGGCCCGTTCGAGAT
AAACGATATTTATCCGGCCGGGAGTGGAGGTGACTTACAGGTAACCCTGCAGGAAGCTGATGGCAGTGTACAGCGATTTA
ATGTGCCATGGTCTTCGGTGCCTGTACTGCAACGTGAAGGGCATCTGAAATATGCTCTCAGTGCCGGAGAGTTTCGCAGT
GGCGGACATCAGCAGGATAATCCCCGTTTTGCTGAAGGAACGCTGAAGTATGGTCTGCCGGCAGGCTGGACGGTGTATGG
AGGAGCCTGGATAGCTGAACGCTACCGTGCCTTCAATCTGGGGGTGGGTAAAAACATGGGCTGGCTGGGCGCTGTTTCAC
TGGATGCCACCCGTGCGAATGCCCGGTTACCGGATGAAAGCCGGCATGACGGTCAGTCATATCGTTTTCTGTATAACAAG
TCACTGACGGAAACAGGGACAAATATCCAGTTAATCGGATACCGCTACTCCACACGGGGCTATTTCAGCTTTGCTGATAC
CGCCTGGAAGAAAATGAGTGGTTACAGTGTTCTTACCCAGGACGGAGTGATACAGATACAGCCGAAGTATACGGATTACT
ACAATCTGGCTTACAACAAGCGGGGAAGGGTGCAGGTGAGTATCAGCCAGCAGACGGGGGAATCGTCAACGTTGTATCTG
AGTGGCAGCCACCAGAGTTACTGGGGAACGGACAGGACGGACCGTCAGCTTAATGCTGGCTTTAACTCATCTGTGAATGA
CATCAGCTGGTCCCTGAACTACAGCCTGTCCCGGAATGCATGGCAACATGAAACTGACCGGATACTGTCATTTGATGTCA
GCATTCCGTTCAGCCACTGGATGCGTTCGGACAGCACATCTGCATGGAGAAACGCCAGTGCCCGTTACAGTCAGACCCTG
GAGGCTCACGGACAGGCTGCCAGTACAGCAGGGCTGTATGGCACATTGCTGGGGGACAATAATCTGGGATACAGCATCCA
GAGTGGTTATACCCGGGGAGGGTATGAGGGGAGCAGTAAAACAGGATATGCCTCTCTGAATTACAGAGGGGGATACGGTA
ATGCCAGTGCAGGATACAGTCACAGTGGGGGCTACCGGCAACTGTATTATGGGCTGAGTGGCGGTATTCTGGCACATGCC
AATGGTCTGACACTGAGCCAGCCTCTGGGAGATACGCTGATTCTTGTCCGTGCACCAGGTGCATCTGATACGCGCATTGA
AAACCAGACGGGGGTGTCTACAGACTGGAGGGGGTATGCTGTACTGCCTTATGCAACAGACTATCGTGAAAACCGGGTGG
CACTGGATACCAACACCCTTGCAGATAATGTTGATATAGAGAATACCGTGGTCAGCGTCGTGCCCACGCATGGTGCCGTT
GTGCGGGCTGACTATAAAACCCGTGTGGGTGTGAAAGTGCTGATGACGCTGATGAGAAACGGGAAAGCAGTTCCTTTTGG
CTCTGTCGTTACGGCCAGAAACGGGGGGAGCAGTATCGCCGGAGAGAATGGTCAGGTTTATCTGAGCGGAATGCCTCTGT
CAGGACAGGTTAGTGTGAAATGGGGGAGTCAGACCACGGACCAGTGTACTGCAGATTATAAATTACCGAAGGAAAGCGCC
GGACAGATATTAAGTCATGTCACGGCTAGTTGCAGGTAA

Protein sequence :
MCLAADFIFSLSVGADMFFGDGGQLLSDKSLTGSAGGGNNRMKFNILPLAFFIGIIVSPARAELYFNPRFLSDDPDAVAD
LSAFTQGQELPPGVYRVDIYLNDTYISTRDVQFQMSQDGKQLAPCLSPEHMSAMGVNRYAVPGMERLPADTCTSLNSMIQ
GATFRFDVGQQRLYLTVPQLYMSNQARGYIAPEYWDNGITAALLNYDFSGNRVRDSYGGTSDYAYLNLKTGLNIGSWRLR
DNTSWSYSAGKGYSQNNWQHINTWLERDIVSLRSRLTMGDSYTRGDIFDGVNFRGIQLASDDNMVPDSQRGYAPTIHGIS
RGTSRISIRQNGYEIYQSTLPPGPFEINDIYPAGSGGDLQVTLQEADGSVQRFNVPWSSVPVLQREGHLKYALSAGEFRS
GGHQQDNPRFAEGTLKYGLPAGWTVYGGAWIAERYRAFNLGVGKNMGWLGAVSLDATRANARLPDESRHDGQSYRFLYNK
SLTETGTNIQLIGYRYSTRGYFSFADTAWKKMSGYSVLTQDGVIQIQPKYTDYYNLAYNKRGRVQVSISQQTGESSTLYL
SGSHQSYWGTDRTDRQLNAGFNSSVNDISWSLNYSLSRNAWQHETDRILSFDVSIPFSHWMRSDSTSAWRNASARYSQTL
EAHGQAASTAGLYGTLLGDNNLGYSIQSGYTRGGYEGSSKTGYASLNYRGGYGNASAGYSHSGGYRQLYYGLSGGILAHA
NGLTLSQPLGDTLILVRAPGASDTRIENQTGVSTDWRGYAVLPYATDYRENRVALDTNTLADNVDIENTVVSVVPTHGAV
VRADYKTRVGVKVLMTLMRNGKAVPFGSVVTARNGGSSIAGENGQVYLSGMPLSGQVSVKWGSQTTDQCTADYKLPKESA
GQILSHVTASCR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
sfaF CAC16953.2 SfaF protein Virulence PAI III 536 Protein 0.0 99
fim2D AFH78429.1 putative fimbrial usher protein Virulence KpGI-5 Protein 0.0 64

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
focD NP_753156.1 F1C fimbrial usher VFG0912 Protein 0.0 100
focD NP_753156.1 F1C fimbrial usher VFG1645 Protein 0.0 99
focD NP_753156.1 F1C fimbrial usher VFG0876 Protein 0.0 65
focD NP_753156.1 F1C fimbrial usher VFG0446 Protein 5e-180 47
focD NP_753156.1 F1C fimbrial usher VFG0454 Protein 9e-170 41