Gene Information

Name : ECSE_3249 (ECSE_3249)
Accession : YP_002294524.1
Strain : Escherichia coli SE11
Genome accession: NC_011415
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3373686 - 3378254 bp
Length : 4569 bp
Strand : -
Note : -

DNA sequence :
ATGAATAAGAAATTTAAATATAAGAAATCGCTTTTAGCGGCTATTTTGAGTGCAACCCTGTTAGCCGGTTGTGATGGCGG
TGGCTCCGGATCTTCCTCCGATACGCCGCCTGTAGATTCTGGAACAGGGTCTTTGCCGGAAGTGAAACCTGATCCAACAC
CAAACCCGGAGCCGACGCCTGAGCCAACGCCGGACCCAGAACCTACACCGGAACCGACACCTGATCCTGAGCCAACACCA
GAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACCCTGGGCGGAAGCCTGCGGGTAACTGGTGATATCAC
CTGTAATGATGAATCCAGCGATGGCTTTACCTTTACACCAGGCGACAAAGTCACCTGTGTGGCAGGGAACAACACGACAA
TTGCTACCTTCGACACCCAGTCAGAAGCTGCGCGTAGCCTGCGTGCGGTTGAAAAAGTGTCGTTTAGTCTTGAGGACGCG
CAAGAACTGGCGGGTTCCGACAACAAGAAAAGCAATGCGCTCTCGCTGGTCACCTCCATGAACAGTTGCCCGGCGAATAC
AGAACAGGTGTGCCTGGAGTTCTCCTCGGTGATCGAGAGTAAACGTTTCGACTCGCTGTATAAGCAAATCGATCTGGCAC
CGGAAGAGTTCAAAAAGCTGGTCAATGAAGAGGTGGAAAACAATGCCGCGACTGATAAAGCGCCATCCACTCATACTTCA
CCGGTAGTCCCCGTCACGACTCCGGGAACAAAACCGGATCTAAACGCCTCCTTCGTGTCGGCTAACGCGGAACAGTTTTA
TCAGTATCAACCTTCGGAAATCATTCTCTCAGAAGGTCGCCTGGTGGATAGCCAGGGATATGGTGTTGCTGGCGTCAACT
ACTACACCAATTCAGGCCGTGGCGTGACAGGGGAAAATGGTGAATTTTCTTTTAGCTGGGGCGAAACCATCTCCTTTGGT
ATCGATACCTTTGAACTGGGTTCAGTGCGCGGCAATAAGTCGACCATTGCGCTGACTGAACTGGGTGATGAAGTTCGCGG
GGCGAATATCGATCAGCTTATTCATCGCTATTCGACGACCGGGCAAAATAATACCCGTGTTGTTCCGGATGATGTACGCA
AGGTCTTTGCCGAATATCCCAACGTGATTAACGAGATTATCAATCTCTCGTTATCCAACGGTGCGACGCTGGATGAAGGT
GAGCAAGTTGTTAATCTGCCAAACGAATTTATCGAGCAGTTTAAGACGGGTCAGGCCAAAGAGATCGATACCGCGATTTG
TGCGAAAACCGACGGTTGTAACGAGGCTCGCTGGTTCTCGCTGACGACGCGCAATGTTAATGACGGCCAGATTCAGGGCG
TTATCAACAAGCTGTGGGGCGTGGATACGAACTACAAATCTGTCAGCAAGTTCCATGTATTCCATGACTCTACTAACTTC
TATGGCAGCACGGGTAATGCGCGCGGTCAGGCGGTGGTGAATATCTCCAACGCGGCCTTCCCGATTCTGATGGCGCGTAA
TGATAAAAACTACTGGCTGGCGTTTGGCGAAAAACGCGCCTGGGATAAAAACGAGCTGGCGTACATTACGGAAGCGCCTT
CTCTTGTTGAGCCGGAAAACGTTACGCGCGATACCGCCACTTTCAACCTGCCGTTTATTTCGCTGGGGCAAGTCGGTGAG
GGCAAACTGATGGTTATCGGAAACCCACACTACAACAGCATTTTGCGTTGCCCGAACGGTTACAGCTGGAACGGGGGCGT
TAATAAAGATGGGCAGTGTACGCTCAACAGCGACCCGGATGACATGAAGAACTTCATGGAGAACGTGCTGCGCTATCTGT
CCGACGATAAATGGACGCCGGACGCGAAAGCCAGCATGACCGTAGGCACCAACCTAGATACTGTCTATTTTAAACGTCAT
GGTCAGGTTACAGGAAACAGCGCTGCGTTCGACTTCCATCCGGATTTTGCGGGCATCTCTGTTGAGCATTTAAGTAGCTA
TGGCGATCTCGATCCGCAGGAAATGCCGCTACTGATCCTTAACGGCTTTGAATATGTGACTCAGGTGGGTAACGATCCTT
ATGCAATCCCGCTGCGTGCAGATACCAGCAAACCGAAGCTGACTCAGCAGGATGTGACCGATCTGATCGCCTATCTGAAC
AAAGGTGGATCGGTGCTGATCATGGAAAACGTGATGAGCAATCTTAAGGAAGAGAGCGCGTCTGGTTTTGTGCGTCTGTT
GGATGCCGCAGGTCTGTCGATGGCACTGAACAAGTCGGTAGTAAATAACGATCCGCAAGGGTATCCGAACCGCGTTCGTC
AGCAGCGCGCAACGGGCATTTGGGTCTATGAACGTTATCCTGCCGTAGATGGTGCGCTGCCGTACACCATCGATAGTAAG
ACAGGGGAAGTTAAGTGGAAATATCAGGTAGAAAACAAACCTGATGACAAACCGAAGCTGGAAGTTGCCAGCTGGCTGGA
AGATGTAGATGGCAAACAGGAAACGCGTTATGCCTTTATTGATGAGGCCGATCATAAAACAGAGGATTCTCTGAAGGCTG
CGAAGGAGAAAATTTTCGCCGCGTTCCCGGGGCTGAAAGAGTGTACTAATCCGGCATATCACTATGAGGTCAACTGCCTG
GAATATCGTCCTGGCACGGGGGTTCCGGTTACTGGTGGCATGTATGTTCCACAGTATACGCAACTAAGCCTTAACGCCGA
CACTGCAAAAGCGATGGTGCAGGCTGCGGATTTAGGCACCAACATTCAGCGTCTGTATCAGCATGAGCTCTACTTCCGGA
CCAATGGTCGCAAAGGTGAGCGTCTGAGCAGCGTCGATCTGGAACGTCTGTACCAGAACATGTCGGTCTGGCTGTGGAAC
GATACGAGCTATCGTTATGAAGAAGGCAAAAATGACGAGCTGGGCTTTAAAACGTTCACCGAGTTCCTGAACTGCTACGC
CAATGATGCCTATGCAGGCGGCACCAAGTGTTCTGCAGATCTGAAAAAATCGCTGGTCGATAACAACATGATCTACGGTG
ACGGTAGCAGCAAAGCGGGCATGATGAACCCGAGCTACCCGCTCAACTATATGGAAAAACCGCTGACACGCCTGATGCTG
GGCCGTTCCTGGTGGGATCTGAACATCAAAGTTGATGTCGAGAAGTATCCGGGAGCAGTATCGGAAGAGGGACAGAACGT
TACTGAAACCATCAGCCTGTACTCGAATCCGACCAAATGGTTTGCAGGTAACATGCAGTCAACTGGCCTGTGGGCACCGG
CTCAGAAAGAGGTCACCATTAAGTCCAATGCGAACGTTCCTGTGACCGTCACCGTGGCGCTGGCTGACGACCTGACCGGA
CGTGAGAAGCATGAAGTTGCGCTGAACCGTCCGCCAAGAGTGACTAAAACGTACTCTCTGGACGCTAGCGGTACGGTGAA
GTTCAAGGTGCCTTACGGTGGCCTGATTTATATCAAGGGCAATAGCTCTACCAATGAATCTGCCAGCTTCACCTTTACTG
GCGTGGTAAAAGCACCGTTCTATAAAGACGGCGCATGGAAAAACGATCTGAACTCACCGGCTCCGCTGGGTGAGCTGGAA
TCAGACGCTTTCGTCTATACCACACCGAAGAAGAACCTGAATGCCAGCAATTACACTGGCGGACTGAAGCAATTCGCTAA
CGATCTGGACACCTTTGCCAGCTCGATGAATGACTTCTACGGTCGTGATGAAACGAGCGGTAAGCACCGGATGTTTACCT
ATAAAGCATTGACGGGACATAAACATCGTTTCACAAACGATGTGCAGATCTCCATCGGTGATGCGCACTCGGGTTATCCG
GTAATGAACAGCAGCTTCTCGACGAACAGCACCACGCTGCCGACGACGCCGCTGAACGACTGGCTGATCTGGCACGAAGT
AGGGCACAACGCCGCAGAAACGCCGCTGACTGTTCCAGGCGCGACCGAAGTGGCGAACAACGTGCTGGCGCTGTACATGC
AGGATCGTTATCTCGGCAAGATGAACCGTGTCGCTGACGATATTACCGTCGCACCGGAATATCTGGAGGAGAGCAACGGT
CAGGCATGGGCGCGCGGCGGTGCGGGTGACCGTCTGCTGATGTACGCACAGCTGAAGGAATGGGCAGAGAAAAACTTTGA
TATCAAGAAATGGTATCCAGATGGCACTCCTCTGCCAGAGTTTTACAGCGAGCGTGAAGGGATGAAAGGCTGGAACCTGT
TCCAGTTGATGCATCGTAAAGCACGCGGCGATGAGGTCAGCAATGACAAGTTTGGCGGCAAGAATTACTGTGCTGAATCC
AACGGTAACGCAGCGGACACGCTGATGCTGTGTGCCTCCTGGGTCGCCCAGACGGATCTTTCGGAGTTCTTTAAGAAATG
GAATCCGGGCGCGAATGCTTACCAGTTGCCGGGAGCGACGGAGATGAGCTTCGAAGGCGGTGTGAGCCAGTCGGCGTACA
ACACGCTGGCGTCACTCAATCTGCCGAAACCGAAGCAAGGGCCGGAAACCATTAACAAGGTTACCGAGTATTCGATGCCT
GCTGAATAA

Protein sequence :
MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPTPDPEPTP
EPEPEPVPTKTGYLTLGGSLRVTGDITCNDESSDGFTFTPGDKVTCVAGNNTTIATFDTQSEAARSLRAVEKVSFSLEDA
QELAGSDNKKSNALSLVTSMNSCPANTEQVCLEFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTS
PVVPVTTPGTKPDLNASFVSANAEQFYQYQPSEIILSEGRLVDSQGYGVAGVNYYTNSGRGVTGENGEFSFSWGETISFG
IDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLDEG
EQVVNLPNEFIEQFKTGQAKEIDTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNF
YGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPENVTRDTATFNLPFISLGQVGE
GKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDPDDMKNFMENVLRYLSDDKWTPDAKASMTVGTNLDTVYFKRH
GQVTGNSAAFDFHPDFAGISVEHLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAYLN
KGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATGIWVYERYPAVDGALPYTIDSK
TGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDEADHKTEDSLKAAKEKIFAAFPGLKECTNPAYHYEVNCL
EYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWN
DTSYRYEEGKNDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLML
GRSWWDLNIKVDVEKYPGAVSEEGQNVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTG
REKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELE
SDAFVYTTPKKNLNASNYTGGLKQFANDLDTFASSMNDFYGRDETSGKHRMFTYKALTGHKHRFTNDVQISIGDAHSGYP
VMNSSFSTNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNG
QAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAES
NGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGATEMSFEGGVSQSAYNTLASLNLPKPKQGPETINKVTEYSMP
AE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed CAE85238.1 hypothetical protein Not tested PAI V 536 Protein 0.0 92
VC0395_A0370 YP_001216326.1 lipoprotein Not tested VPI-1 Protein 0.0 50
VC0845 NP_230493.1 hypothetical protein Not tested VPI-1 Protein 0.0 50
acfD AAK20802.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 50
acfD ACK75664.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75670.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75649.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75655.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75652.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75646.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75661.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75667.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75658.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
ECSE_3249 YP_002294524.1 hypothetical protein VFG0106 Protein 0.0 50