Gene Information

Name : ECP_3050 (ECP_3050)
Accession : YP_670934.1
Strain : Escherichia coli 536
Genome accession: NC_008253
Putative virulence/resistance : Virulence
Product : lipoprotein AcfD
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG0810
EC number : -
Position : 3209277 - 3213842 bp
Length : 4566 bp
Strand : -
Note : -

DNA sequence :
ATGAATAAGAAATTTAAATATAAGAAATCGCTTTTAGCGGCTATTTTGAGTGCAACCCTGTTAGCCGGTTGTGATGGCGG
TGGCTCCGGATCTTCCTCCGATACGCCGCCTGTAGATTCTGGAACAGGGTCTTTGCCGGAAGTGAAACCTGATCCAACAC
CAAACCCGGAGCCGACGCCTGAGCCAACGCCGGACCCAGAACCTACGCCGGAACCGACACCTGATCCTGAGCCAACACCA
GAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACCCTGGGCGGAAGCCAGCGGATAACTGGTGCTACTTG
TAATGGTGAATCCAGCGATGGCTTTACCTTTACGCCAGGCGACAAAGTCACCTGTGTGGCAGGGAACAACACGACAATTG
CTACCTTCGACACCCAGTCAGAAGCTGCGCGTAGCCTGCGTGCGGTTGAAAAAGTGTCGTTTAGTCTTGAGGACGCGCAA
GAACTGGCAGCTTCCGATGACAAGAAAAGCAATGCGGTTTCGCTGGTAACGTCCAGTAACAGCTGTCCGGCGGATACAGA
ACAGGTTTGCCTGACGTTCTCCTCGGTGATCGAGAGTAAACGTTTCGACTCGCTGTATAAGCAAATCGATCTGGCACCGG
AAGAGTTCAAAAAGCTGGTCAATGAAGAGGTGGAAAACAATGCTGCGACCGATAAAGCGCCATCCACTCATACCTCACCG
GTCGTGCCTGTCACCACGCCGGGAACAAAACCGGATCTGAACGCGTCCTTCGTGTCGGCTAACGCGGAACAGTTTTATCA
GTATCAACCCACTGAAATCATTCTTTCCGAAGGCCGACTGGTGGATAGCATGGGCAATGGTGTGGTTGGCGTAAATTACT
ACACCAGCTCAGGCCGTGGCGTAACTGGCGAAAACGGCAAATTCAACTTCAGCTGGGGCGAAACCATCTCCTTTGGTATC
GACACCTTTGAACTGGGCTCAGTGCGCGGCAATAAGTCGACCATTGCGTTGACTGAACTGGGTGACGAAGTTCGCGGCGC
GAATATTGATCAGCTTATTCATCGTTACTCCCAGGCCGGAAAAAATGATGAGCGTGAAGTGCCGGACGTAGTGCGCAAGG
TCTTTGCCGAGTATCCCAACGTAATCAACGAGATTATCAATCTCTCGTTATCCAATGGCGAGGCGTTGAGCGAAGGCGAT
CAAACCTTTGAGCGGACAAACGAATTTCTTGAGCAGTTTGAATCCGGGCAGGCTAAAGAGATTGATACGGCGATTTGTGA
CTCCCTTGGGGGCTGCAACTCTCAGCGTTGGTTCTCGTTGACAGCACGCAATGTTAATGACGGCCAGATTCAGGGCGTTA
TTAACAAGCTGTGGGGGGTGGATACGAACTACAAATCTGTCAGCAAGTTCCATGTATTCCATGACTCTACCAACTTCTAT
GGCAGCACCGGTAATGCGCGCGGTCAGGCAGTGGTGAATATCTCCAACGCGGCATTCCCGATTCTGATGGCGCGTAATGA
TAAAAACTACTGGCTGGCCTTCGGCGAAAAACGCGCCTGGGATAAAAACGAGCTGGCGTACATTACGGAAGCGCCTTCTC
TCGTTGAGCCGGAAAACGTTACGCGCGATACCGCCACCTTTAACCTGCCGTTTATTTCGCTGGGGCAAGTCGGTGAGGGC
AAACTGATGGTTATCGGTAACCCGCACTACAACAGCATTTTGCGTTGTCCGAACGGTTACAGCTGGGAAGGCGGTGTTGA
TAAAAACGGTCAGTGTACGCGTAACAGTGATTCTAATGATATGAAGCACTTTATGCAGAACGTGTTGCGCTATCTGTCCG
ACGATAAATGGACGCCGGACGCGAAAGCCAGCATGACCGTAGGTACCAACCTGGATACTGTCTATTTCAAACGTCATGGT
CAGGTTACAGGAAACAGCGCTGAGTTCGGCTTTCATCCGGATTTTGCGGGTATCTCTGTTGAGCATTTAAGTAGCTATGG
CGATCTCGACCCGCAGGAAATGCCGCTGCTGATCCTTAACGGCTTTGAATATGTGACTCAGGTTGGTAACGATCCTTATG
CAATCCCGCTGCGAGCAGATACCAGCAAACCGAAGCTGACTCAGCAGGATGTGACCGATCTGATCGCCTATCTGAACAAA
GGTGGATCGGTGCTGATCATGGAAAACGTGATGAGCAATCTTAAGGAAGAGAGCGCGTCTGGCTTTGTACGTCTGCTTGA
TGCCGCAGGTCTGTCGATGGCACTGAACAAGTCGGTAGTAAATAACGATCCGCAAGGGTATCCGAACCGCGTTCGTCAGC
AGCGCGCAACGGGCATTTGGGTCTATGAACGTTATCCTGCCGTAGATGGTGCGCTGCCGTACACCATCGATAGTAAGACA
GGGGAAGTTAAGTGGAAATATCAGGTAGAAAACAAACCTGATGACAAACCGAAGCTGGAAGTTGCCAGCTGGCTGGAAGA
TGTAGATGGCAAACAGGAAACGCGTTATGCCTTTATTGATGAGGCCGATCATAAAACAGAGGATTCTCTGAAGGCTGCGA
AGGCAAAAATCTTTGAGAAGTTTCCTGGATTAAAGGAGTGTAAGGACCCAACTTACCACTACGAGGTCAACTGTCTGGAA
TATCGTCCTGGCACGGGGGTTCCGGTTACTGGTGGCATGTATGTTCCACAGTATACGCAACTAAGCCTTAACGCCGACAC
GGCAAAAGCGATGGTGCAGGCTGCGGATTTAGGCACCAACATTCAGCGTCTGTATCAGCATGAGCTCTACTTCCGGACCA
ATGGTCGCAAAGGTGAGCGTCTGAGCAGCGTCGATCTGGAACGTCTGTACCAGAACATGTCGGTCTGGCTGTGGAATAAA
ATTGAATATCGTTATGAAAACGACAAGGATGACGAGCTGGGCTTTAAAACGTTCACCGAGTTCCTGAACTGCTACGCCAA
CGATGCCTATACTGGCGGCACGCAGTGTTCTGATGAGCTGAAAAAATCGCTGGTCGATAACAACATGATCTACGGCGAGA
AGAGCGTTAATAAAGCGGGCATGATGAACCCGAGCTATCCGCTCAACTATATGGAAAAACCGCTGACGCGCCTGATGCTG
GGTCGTTCCTGGTGGGATCTGAACATCAAAGTTGATGTCGAGAAGTATCCGGGAGCGGTATCGGCAGAAGGTGAGAAGGT
TACTGAAACCATCAGCCTGTACTCGAATCCGACCAAATGGTTTGCGGGTAACATGCAGTCTACTGGCCTGTGGGCTCCGG
CTCAGAAAGAGGTCACCATTGAGTCTACTGCATCAGTTGCTGTGACTGTCACCGTGGCGCTGGCCGACGACCTTACCGGA
CGTGAGAAGCATGAAGTCGCTCTGAACCGTCCGCCAAAAGTGACGAAAACCTATGAGCTGAAAGCCAATGGTGAGGTGAA
GTTTACGGTTCCTTACGGTGGTCTGATTTATATCAAGGGCAACAGCCCACAGAATGAGTCAGCCGAATTCACCTTTACTG
GTGTGGTGAAAGCGCCGTTCTATAAAGATGGCGCATGGAAAAACGCTCTGAACTCCCCTGCGCCGTTGGGCGAGCTGGAG
TCAGACGCTTTCGTCTACACCACGCCGAAGAAGAACCTTGAGGCCAGCAATTACAAGGGCGGTCAGGAACAATTCGCTGA
GGAACTGGATACCTTTGCCAGCTCGATGAATGACTTCTACGGTCGTAATGATGAAGACGGTAAGCACCGGATGTTTACCT
ATAAAAACTTGACGGGGCACAAGCATCGTTTCACCAACGATGTGCAGATCTCCATCGGTGATGCGCACTCTGGTTATCCG
GTAATGAACAGCAGCTTCTCGACGAACAGCACCACGCTGCCGACGACGCCGCTGAACGACTGGCTGATCTGGCACGAAGC
TGGTCACAACGCCGCCGAAACGCCGTTGACTGTACCGGGCGCGACCGAAGTAGCGAACAACGTGCTGGCGCTGTACATGC
AGGATCGCTATCTCGGCAAGATGAACCGTGTCGCTGACGATATTACCGTCGCACCGGAATATCTGGAGGAGAGCAACAAC
CAGGCATGGGCACGCGGCGGTGCGGGTGACCGTCTGCTGATGTACGCGCAGCTGAAGGAATGGGCAGAGAAAAACTTTGA
TATCACGAAGTGGTATCCAGAAGGTAACCTGCCTAAGTTCTACAGCGAGCGTGAAGGGATGAAAGGCTGGAACCTGTTCC
AGTTGATGCACCGTAAAGCGCGCGGCGATGAGGTTGGCAAAACCAAGTTTGGAGAAAGAAATTACTGTGCCGAATCCAAC
GGTAACGCTGCCGACACGCTGATGCTGTGTGCCTCCTGGGTCGCCCAGACGGATCTTTCGGCGTTCTTTAAGAAATGGAA
TCCGGGCGCGAATGCTTACCAGTTGCCGGGAGCGAGCGAGATGAACTTCGAGGGCGGTGTGAGCCAGTCGGCTTACGAGA
CGCTGGCGGCGCTTAATCTGCCGAAACCGCAGCAAGGGCCGGAAACCATTAATAAAGTTACCGAGTATTCGATGCCTGCT
GAATAA

Protein sequence :
MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPTPDPEPTP
EPEPEPVPTKTGYLTLGGSQRITGATCNGESSDGFTFTPGDKVTCVAGNNTTIATFDTQSEAARSLRAVEKVSFSLEDAQ
ELAASDDKKSNAVSLVTSSNSCPADTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSP
VVPVTTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSMGNGVVGVNYYTSSGRGVTGENGKFNFSWGETISFGI
DTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSQAGKNDEREVPDVVRKVFAEYPNVINEIINLSLSNGEALSEGD
QTFERTNEFLEQFESGQAKEIDTAICDSLGGCNSQRWFSLTARNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFY
GSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPENVTRDTATFNLPFISLGQVGEG
KLMVIGNPHYNSILRCPNGYSWEGGVDKNGQCTRNSDSNDMKHFMQNVLRYLSDDKWTPDAKASMTVGTNLDTVYFKRHG
QVTGNSAEFGFHPDFAGISVEHLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLNK
GGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATGIWVYERYPAVDGALPYTIDSKT
GEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDEADHKTEDSLKAAKAKIFEKFPGLKECKDPTYHYEVNCLE
YRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNK
IEYRYENDKDDELGFKTFTEFLNCYANDAYTGGTQCSDELKKSLVDNNMIYGEKSVNKAGMMNPSYPLNYMEKPLTRLML
GRSWWDLNIKVDVEKYPGAVSAEGEKVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIESTASVAVTVTVALADDLTG
REKHEVALNRPPKVTKTYELKANGEVKFTVPYGGLIYIKGNSPQNESAEFTFTGVVKAPFYKDGAWKNALNSPAPLGELE
SDAFVYTTPKKNLEASNYKGGQEQFAEELDTFASSMNDFYGRNDEDGKHRMFTYKNLTGHKHRFTNDVQISIGDAHSGYP
VMNSSFSTNSTTLPTTPLNDWLIWHEAGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNN
QAWARGGAGDRLLMYAQLKEWAEKNFDITKWYPEGNLPKFYSEREGMKGWNLFQLMHRKARGDEVGKTKFGERNYCAESN
GNAADTLMLCASWVAQTDLSAFFKKWNPGANAYQLPGASEMNFEGGVSQSAYETLAALNLPKPQQGPETINKVTEYSMPA
E

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed CAE85238.1 hypothetical protein Not tested PAI V 536 Protein 0.0 100
VC0395_A0370 YP_001216326.1 lipoprotein Not tested VPI-1 Protein 0.0 49
VC0845 NP_230493.1 hypothetical protein Not tested VPI-1 Protein 0.0 48
acfD AAK20802.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75652.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75655.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75646.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75664.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75670.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75649.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75661.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75667.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75658.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
ECP_3050 YP_670934.1 lipoprotein AcfD VFG0106 Protein 0.0 48