Gene Information

Name : UMNK88_3721 (UMNK88_3721)
Accession : YP_006135420.1
Strain : Escherichia coli UMNK88
Genome accession: NC_017641
Putative virulence/resistance : Virulence
Product : accessory colonization factor protein AcfD
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3594925 - 3599493 bp
Length : 4569 bp
Strand : -
Note : -

DNA sequence :
ATGAATAAGAAATTTAAATATAAGAAATCGCTTTTAGCGGCTATTTTGAGCGCAACCCTGTTAGCCGGTTGTGATGGCGG
TGGTTCCGGATCTTCCTCCGATACGCCGCCTGTAGATTCTGGAACAGGGTCTTTGCCGGAAGTGAAACCTGATCCAACAC
CAAACCCGGAGCCGACGCCTGAGCCAACGCCGGAGCCTGAACCTACGCCGGAACCGACACCTGATCCTGAGCCAACACCA
GAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACCCTGGGCGGAAGCCTGCGGGTAACTGGTGATATCAC
CTGTAATGATGAATCCAGCGATGGCTTTACCTTTACACCAGGCGACAAAGTCACCTGTGTGGCAGGGAACAACACGACAA
TTGCTACCTTCGACACCCAGTCAGAAGCTGCGCGTAGCCTGCGTGCGGTTGAAAAAGTGTCGTTTAGTCTTGAGGACGCG
CAAGAACTGGCGGGTTCCGACAACAAGAAAAGCAATGCGCTCTCGCTGGTCACCTCCATGAACAGTTGCCCGGCGAATAC
AGAACAGGTGTGCCTGGAGTTCTCCTCGGTGATCGAGAGTAAACGTTTCGACTCGCTGTATAAGCAAATCGATCTGGCAC
CGGAAGAGTTCAAAAAGCTGGTCAATGAAGAGGTGGAAAACAATGCCGCGACTGATAAAGCGCCATCCACTCATACTTCA
CCGGTAGTCCCCGTCACGACTCCGGGAACAAAACCGGATCTGAACGCCTCCTTCGTGTCGGCTAACGCGGAAGAGTTTTA
TCAGTATCAACCTTCGGAAATCATTCTCTCAGAAGGTCGCCTGGTGGATAGCCAGGGATATGGTGTTGCTGGCGTCAACT
ACTACACCAATTCAGGCCGTGGCGTGACAGGGGAAAATGGTGAATTTTCTTTTAGCTGGGGCGAAACCATCTCCTTTGGT
ATCGATACCTTTGAACTGGGTTCAGTGCGCGGCAATAAGTCGACCATTGCGCTGACTGAACTGGGTGATGAAGTTCGCGG
GGCGAATATCGATCAGCTTATTCATCGCTATTCGACGACCGGGCAAAATAATACCCGTGTAGTTCCGGATGAAGTACGCA
AGGTTTTTGCTGAATATCCCAACGTGATTAACGAGATTATCAATCTCTCGTTATCCAATGGTGCGACGCTGGGGGAAGGT
GAGCAAGTCGTTAATCTGCCTAACGAATTTATCGAGCAGTTTAAGACGGGTCAGGCCAAAGAGATCGATACCGCGATTTG
TGCGAAAACCGACGGTTGTAACGAGGCTCGCTGGTTCTCGCTGACGACGCGCAATGTTAATGACGGCCAGATTCAGGGCG
TTATCAACAAGCTGTGGGGCGTGGATACGAACTACAAATCTGTCAGTAAGTTCCATGTATTCCATGACTCCACCAACTTC
TATGGCAGCACGGGTAATGCGCGCGGTCAGGCGGTGGTGAATATCTCCAACGCGGCCTTCCCGATTCTGATGGCGCGTAA
TGATAAAAACTACTGGCTGGCCTTCGGCGAAAAACGCGCCTGGGATAAAAACGAGCTGGCGTACATTACGGAAGCGCCTT
CTCTTGTTGAGCCGGAAAACGTTACGCGCGATACCGCCACCTTTAACCTGCCGTTTATTTCGCTGGGGCAAGTCGGTGAG
GGCAAACTGATGGTTATCGGTAACCCACACTACAACAGCATTTTGCGTTGCCCGAACGGTTACAGCTGGAACGGGGGCGT
TAATAAAGATGGGCAGTGTACGCTCAACAGCGACCCGGATGACATGAAGAACTTCATGGAGAACGTGCTGCGCTATCTGT
CCGACGATAAATGGAAGCCGGACGCGAAAGCCAGCATGATCGTAGGCACCAACCTGGATACTGTCTATTTTAAACGTCAT
GGTCAGGTTACAGGAAACAGCGCTGCGTTCGACTTCCATCCGGATTTTGCGGGCATCTCTGTTGAGCATTTAAGTAGCTA
TGGCGATCTCGATCCGCAGGAAATGCCGCTGCTGATCCTTAACGGCTTTGAATATGTGACTCAGGTGGGTAACGATCCTT
ATGAAATCCCGCTGCGTGCAGATACCAGCAAACCGAAGCTGACTCAGCAGGATGTGACCGATCTGATCGCCTATCTGAAC
AAAGGTGGATCGGTGCTGATCATGGAAAACGTGATGAGCAATCTTAAGGAAGAGAGCGCGTCTGGTTTTGTGCGTCTGTT
GGATGCCGCAGGTCTGTCGATGGCACTGAACAAGTCGGTAGTAAATAACGATCCGCAAGGGTATCCGAACCGCGTTCGTC
AGCAGCGCGCAACGGGCATTTGGGTCTATGAACGTTATCCTGCCGTAGATGGTGCGCTGCCGTACACCATCGATAGTAAG
ACAGGGGAAGTTAAGTGGAAATATCAGGTAGAAAACAAACCTGATGACAAACCGAAGCTGGAAGTTGCCAGCTGGCTGGA
AGATGTAGATGGCAAACAGGAAACGCGTTATGCCTTTATTGATGAGGCCGATCATAAAACAGAGGATTCTCTGAAGGCTG
CGAAGGAGAAAATTTTCGCCGCGTTCCCGGGGCTGAAAGAGTGTACTAATCCGGCATATCACTATGAGGTCAACTGCCTG
GAATATCGTCCTGGCACGGGGGTTCCGGTTACTGGTGGCATGTATGTTCCACAGTATACGCAACTAAGCCTTAACGCCGA
CACTGCAAAAGCGATGGTGCAGGCTGCGGATTTAGGCACCAACATTCAGCGTCTGTATCAGCATGAGCTCTACTTCCGGA
CCAATGGTCGCAAAGGTGAGCGTCTGAGCAGCGTCGATCTGGAACGTCTGTACCAGAACATGTCGGTCTGGCTGTGGAAC
GATACGAGCTATCGTTATGAAGAAGGCAAAAATGACGAGCTGGGCTTTAAAACGTTCACCGAGTTCCTGAACTGCTACGC
CAATGATGCCTATGCAGGCGGCACCAAGTGTTCTGCAGATCTGAAAAAATCGCTGGTCGATAACAACATGATCTACGGTG
ACGGTAGCAGCAAAGCGGGCATGATGAACCCGAGCTACCCGCTCAACTATATGGAAAAACCGCTGACACGCCTGATGCTG
GGCCGTTCCTGGTGGGATCTGAACATCAAAGTTGATGTCGAGAAGTATCCGGGAGCAGTATCGGAAGAGGGACAGAACGT
TACTGAAACCATCAGCCTGTACTCGAATCCGACCAAATGGTTTGCAGGTAACATGCAGTCAACTGGCCTGTGGGCACCGG
CTCAGAAAGAGGTCACCATTAAGTCCAATGCGAACGTTCCTGTGACCGTCACCGTGGCGCTGGCTGACGACCTGACCGGA
CGTGAGAAGCATGAAGTTGCGCTGAACCGTCCGCCAAGAGTGACTAAAACGTACTCTCTGGACGCTAGCGGTACGGTGAA
GTTCAAGGTGCCTTACGGTGGCCTGATTTATATCAAGGGCAATAGCTCTACCAATGAATCTGCCAGCTTCACCTTTACTG
GCGTGGTAAAAGCACCGTTCTATAAAGACGGCGCATGGAAAAACGATCTGAACTCACCGGCTCCGCTGGGTGAGCTGGAA
TCAGACGCTTTCGTCTATACCACACCGAAGAAGAACCTGAATGCCAGCAATTACACTGGCGGACTGGAGCAATTCGCTAA
CGATCTGGATACCTTTGCCAGCTCGATGAATGACTTCTACGGCCGTGATAGCGAAGACGGTAAGCACCGGATGTTTACCT
ATAAAAACTTGCCGGGCCACAAACATCGTTTCACCAACGATGTGCAGATCTCCATCGGTGATGCGCACTCTGGTTATCCG
GTGATGAACAGCAGCTTCTCGCCGAACAGCACCACGCTGCCGACGACGCCGCTGAACGACTGGCTGATCTGGCATGAAGT
CGGTCATAACGCCGCAGAAACGCCGTTGACTGTACCGGGTGCAACTGAAGTCGCTAACAACGTGCTGGCGCTGTACATGC
AGGATCGTTATCTCGGCAAGATGAACCGTGTCGCTGACGATATTACCGTCGCACCGGAATATCTGGAGGAGAGCAACAAC
CAGGCATGGGCACGCGGCGGTGCGGGTGACCGTCTGCTGATGTACGCACAGCTGAAGGAATGGGCAGAGAAAAACTTTGA
TATCAAGAAATGGTATCCAGATGGCACTCCTCTGCCAGAGTTTTACAGCGAGCGTGAAGGGATGAAAGGCTGGAACCTGT
TCCAGTTGATGCATCGTAAAGCACGCGGCGATGAGGTCAGCAATGACAAGTTTGGCGGCAAGAATTACTGTGCTGAATCC
AACGGTAACGCAGCGGACACGCTGATGCTGTGTGCCTCCTGGGTCGCCCAGACGGATCTTTCGGAGTTCTTTAAGAAATG
GAATCCGGGCGCGAATGCTTACCAGCTGCCGGGGGCGAGCGAGATGAGCTTCGAGGGCGGTGTGAGCCAGTCGGCTTACA
ACACGCTCGCGTCACTCGATCTGCCGAAACCGGAACAGGGACCGGAAACCATTAATCAGGTTACCGAGCATAAGATGTCT
GCCGAGTAA

Protein sequence :
MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPEPEPTPEPTPDPEPTP
EPEPEPVPTKTGYLTLGGSLRVTGDITCNDESSDGFTFTPGDKVTCVAGNNTTIATFDTQSEAARSLRAVEKVSFSLEDA
QELAGSDNKKSNALSLVTSMNSCPANTEQVCLEFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTS
PVVPVTTPGTKPDLNASFVSANAEEFYQYQPSEIILSEGRLVDSQGYGVAGVNYYTNSGRGVTGENGEFSFSWGETISFG
IDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQNNTRVVPDEVRKVFAEYPNVINEIINLSLSNGATLGEG
EQVVNLPNEFIEQFKTGQAKEIDTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNF
YGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPENVTRDTATFNLPFISLGQVGE
GKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDPDDMKNFMENVLRYLSDDKWKPDAKASMIVGTNLDTVYFKRH
GQVTGNSAAFDFHPDFAGISVEHLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYEIPLRADTSKPKLTQQDVTDLIAYLN
KGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATGIWVYERYPAVDGALPYTIDSK
TGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDEADHKTEDSLKAAKEKIFAAFPGLKECTNPAYHYEVNCL
EYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWN
DTSYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLML
GRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTG
REKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELE
SDAFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFTNDVQISIGDAHSGYP
VMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNN
QAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAES
NGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPEQGPETINQVTEHKMS
AE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed CAE85238.1 hypothetical protein Not tested PAI V 536 Protein 0.0 92
VC0395_A0370 YP_001216326.1 lipoprotein Not tested VPI-1 Protein 0.0 50
VC0845 NP_230493.1 hypothetical protein Not tested VPI-1 Protein 0.0 49
acfD AAK20802.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75649.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75646.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75664.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75670.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75655.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75652.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75661.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75667.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75658.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
UMNK88_3721 YP_006135420.1 accessory colonization factor protein AcfD VFG0106 Protein 0.0 49