Gene Information

Name : O3K_04095 (O3K_04095)
Accession : YP_006777549.1
Strain : Escherichia coli 2011C-3493
Genome accession: NC_018658
Putative virulence/resistance : Virulence
Product : inner membrane lipoprotein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 840099 - 844532 bp
Length : 4434 bp
Strand : +
Note : -
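
The Position and Length fields above agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. The short check below is only a sketch of that arithmetic; the function name `feature_length` is hypothetical and not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


# Values copied from the Position and Length fields of this record.
start_bp, end_bp = 840099, 844532
length_bp = feature_length(start_bp, end_bp)

# A coding sequence of this length holds length_bp // 3 codons;
# the last one is the stop codon, so it encodes one residue fewer.
codons = length_bp // 3
residues = codons - 1

print(length_bp, codons, residues)
```

Under this reading the computed length matches the listed 4434 bp, and the implied residue count matches the number of residues in the protein sequence given in this record.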

DNA sequence :
TTGCCGGAAGTGAAACCTGATCCAACACCAAACCCGGAGCCGACACCTGAGCCAACGCCGGACCCAGAACCTACGCCGGA
ACCGACACCTGATCCTGAGCCAACACCAGAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACTCTGGGCG
GAAGCCTGCGGGTAACTGGTGATATCACCTGTAATGATGAATCCAGCGATGGCTTTACCTTTACACCAGGCGACAAAGTC
ACCTGTGTGGCAGGGAACAACACGACAATTGCTACCTTCGACACCCAGTCAGAAGCTGCGCGTAGCCTGCGTGCGGTTGA
AAAAGTGTCGTTTAGTCTTGAGGACGCGCAAGAACTGGCGGGTTCCGACAACAAGAAAAGCAATGCGCTCTCGCTGGTCA
CCTCCATGAACAGTTGCCCGGCGAATACAGAACAGGTGTGCCTGGAGTTCTCCTCGGTGATCGAGAGTAAACGTTTCGAC
TCGCTGTATAAGCAAATCGATCTGGCACCGGAAGAATTCAAAAAGCTGGTCAATGAAGAGGTGGAAAACAATGCCGCGAC
CGATAAAGCGCCATCCACTCATACTTCACCGGTCGTGCCCGCCACCACTCCGGGAACAAAACCGGATCTAAACGCTTCCT
TCGTGTCGGCTAACGCGGAACAGTTTTATCAGTATCAACCCACTGAAATCATTCTCTCTGAAGGTCGACTGGTCGATAGT
CAGGGGGATGGTGTTGTTGGTGTCAACTATTACACCAATTCCGGCCGTGGTGTAACCGGAGAAAACGGGGAATTTTCCTT
TAGTTGGGGGGAAACCATCTCCTTTGGCATCGACACTTTTGAGCTTGGTTCTGTGCGTGGTAACAAGTCGACTATTGCAT
TGACTGAACTGGGTGATGAAGTTCGCGGGGCAAATATCGATCAGTTGATTCACCGCTATTCGAAGGCTGGACAAAATCAC
ACGCGTGTAGTTCCGGATGAAGTGCGCAAGGTTTTTGCTGAATATCCCAACGTGATTAACGAGATTATCAATCTCTCGTT
ATCCAATGGTGCGACGCTGGGGGAAGGTGAGCAAGTCGTTAATCTGCCTAACGAATTTATCGAGCAGTTTAAGACGGGTC
AGGCCAAAGAGATCGATACCGCGATTTGTGCGAAAACCGATGGTTGTAACGAGGCTCGCTGGTTCTCGCTGACAACGCGC
AATGTTAATGACGGCCAGATTCAGGGCGTTATCAACAAGCTGTGGGGCGTGGATACGAACTACAAATCTGTCAGTAAGTT
CCATGTATTCCATGACTCTACCAACTTCTATGGCAGCACGGGTAATGCGCGCGGTCAGGCGGTGGTGAATATCTCCAACG
CGGCCTTCCCGATTCTGATGGCGCGTAATGATAAAAACTACTGGCTGGCGTTTGGCGAAAAACGCGCCTGGGATAAAAAT
GAGCTGGCGTACATTACTGAAGCGCCTTCCATTGTGCGACCAGAGAACGTGACACGCGAAACCGCCACCTTCAACCTGCC
GTTTATCTCGCTGGGGCAAGTGGGCGATGGCAAGCTGATGGTTATCGGTAACCCACACTACAACAGCATCCTGCGTTGCC
CGAACGGCTACAGCTGGAACGGGGGCGTTAATAAAGACGGACAGTGTACGCTCAACAGCGACCCGGATGACATGAAGAAC
TTCATGGAGAACGTGCTGCGCTATCTGTCAAATGATCGCTGGTTGCCGGATGCAAAATCCAGTATGACCGTGGGTACTAA
CCTGGACACGGTATATTTCAAAAAACATGGTCAGGTGCTGGGAAATAGCGCACCGTTTGCGTTCCACAAGGATTTCACTG
GCATCACGGTCAAACCAATGACCAGCTATGGCAATCTGAATCCAGATGAAGTTCCTCTGTTGATCCTCAATGGCTTTGAA
TACGTCACACAATGGGGTAGCGATCCTTACTCCATTCCTCTGCGCGCAGATACCAGCAAACCGAAGCTGACCCAGCAGGA
TGTGACCGATTTGATCGCCTATATGAACAAAGGTGGATCGGTGCTGATCATGGAAAACGTGATGAGCAATCTTAAGGAAG
AGAGCGCATCTGGCTTTGTACGTCTGCTTGATGCCGCAGGTTTGTCGATGGCGCTTAACAAGTCGGTAGTAAATAACGAT
CCGCAAGGCTACCCTGACCGTGTTCGTCAACGACGTTCAACGCCAATTTGGGTCTATGAGCGTTATCCGGCTGTCGATGG
TAAACCACCGTATACCATTGATGACACCACGAAAGAAGTTATCTGGAAATATCAGCAAGAAAACAAACCTGATGACAAAC
CGAAGCTGGAAGTTGCCAGCTGGCAGGAAGAAGTTGAGGGTAAACAGGTAACCCAATTCGCCTTTATTGATGAAGCCGAC
CACAAAACGCCTGAGTCACTGGCTGCGGCAAAACAGAGAATTCTGGACGCGTTCCCAGGGCTGGAAGTGTGTAAGGATTC
TGACTATCACTATGAGGTCAACTGTCTGGAATACCGCCCAGGCACGGGTGTGCCGGTAACCGGTGGCATGTATGTTCCGC
AGTATACGCAGCTGGATCTTGGAGCTGACACTGCGAAAGCGATGCTGCAGGCTGCGGATTTAGGCACCAATATTCAGCGC
CTGTATCAGCATGAGCTTTATTTCCGTACCAATGGCCTCCAGGGTGAGCGTCTCAACAGCGTTGATCTGGAACGTTTATA
CCAAAACATGTCCGTCTGGCTGTGGAACGAGACGAAATATCGTTATGAAGAGGGTAAAGAAGACGAGCTGGGCTTTAAAA
CGTTCACTGAGTTTCTGAACTGCTACACCAACAATGCATACGTTGGCACGCAGTGTTCTGCTGAGCTGAAAAAATCGCTG
ATCGATAACAAGATGATTTATGGTGAAGAAAGCAGCAAAGCGGGCATGATGAACCCGAGCTACCCGCTCAACTATATGGA
AAAACCGCTGACACGCCTGATGCTGGGCCGTTCCTGGTGGGATCTGAACATCAAAGTTGATGTTGAGAAGTATCCGGGAG
TGGTGAATACAAACGGCGAAACCGTCACACAAAACATTAACTTGTACTCAGCTCCAACCAAATGGTTTGCAGGTAACATG
CAGTCAACTGGCCTGTGGGCACCTGCCCAGCAGGAAGTCAGCATTGAGTCAAAGGCGACAGTTCCTGTGACCGTGACTGT
TGCGCTGGCCGACGACCTGACAGGACGAGAGAAGCATGAAGTTAGCCTGAATCGTCCACCCAGAGTGACAAAAACCTATG
ACCTGAAAGCCAATGATAAGGTGACGTTCAAAGTCCCTTACGGTGGTCTGATTTACATCAAGGGCGACAGCAAAGAGGTG
CAATCAGCTGACTTCACCTTTACCGGTGTAGTAAAAGCGCCGTTCTATAAAGACGGTAAGTGGCAACACGATCTGAACTC
CCCTGCCCCGCTGGGCGAACTGGAGTCTGCCTCGTTCGTCTATACCACACCGAAGAAGAACCTGAATGCCAGCAATTACA
CTGGCGGACTGGAGCAATTCGCTAACGATCTGGATACCTTTGCCAGCTCGATGAATGACTTCTACGGCCGTGATAGCGAA
GACGGTAAGCACCGGATGTTTACCTATAAAAACTTGCCGGGCCACAAACATCGTTTCGCCAACGATGTGCAGATCTCCAT
CGGTGATGCGCATTCGGGTTATCCGGTAATGAACAGCAGCTTCTCGCCGAACAGCACCACGCTGCCGACGACGCCGCTGA
ACGACTGGCTGATCTGGCATGAAGTCGGTCATAACGCCGCAGAAACGCCGTTGACTGTACCGGGTGCAACTGAAGTCGCT
AACAACGTGCTGGCGCTGTACATGCAGGATCGTTATCTCGGCAAGATGAACCGTGTCGCTGACGATATTACCGTCGCACC
GGAATATCTGGAGGAAAGCAACGGTCAGGCATGGGCGCGCGGCGGTGCGGGTGACCGTCTGCTGATGTACGCACAGCTGA
AGGAATGGGCAGAGAAAAACTTTGATATCAAGAAATGGTATCCAGATGGCACTCCTCTGCCAGAGTTTTACAGCGAGCGT
GAAGGGATGAAAGGCTGGAACCTGTTCCAGTTGATGCATCGTAAAGCACGCGGCGATGAGGTCAGCAATGACAAGTTTGG
CGGCAAGAATTACTGTGCTGAATCCAACGGTAACGCAGCGGACACGCTGATGCTGTGTGCCTCCTGGGTCGCCCAGACGG
ATCTTTCGGAGTTCTTTAAGAAATGGAATCCGGGCGCGAATGCTTACCAGTTGCCGGGAGCGACGGAGATGAGCTTCGAA
GGCGGTGTGAGCCAGTCGGCGTACAACACGCTGGCGTCACTCAATCTGCCGAAACCGAAGCAAGGGCCGGAAACCATTAA
CAAGGTTACCGAGTATTCGATGCCTGCTGAATAA

Protein sequence :
MPEVKPDPTPNPEPTPEPTPDPEPTPEPTPDPEPTPEPEPEPVPTKTGYLTLGGSLRVTGDITCNDESSDGFTFTPGDKV
TCVAGNNTTIATFDTQSEAARSLRAVEKVSFSLEDAQELAGSDNKKSNALSLVTSMNSCPANTEQVCLEFSSVIESKRFD
SLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPVVPATTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDS
QGDGVVGVNYYTNSGRGVTGENGEFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSKAGQNH
TRVVPDEVRKVFAEYPNVINEIINLSLSNGATLGEGEQVVNLPNEFIEQFKTGQAKEIDTAICAKTDGCNEARWFSLTTR
NVNDGQIQGVINKLWGVDTNYKSVSKFHVFHDSTNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKN
ELAYITEAPSIVRPENVTRETATFNLPFISLGQVGDGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDPDDMKN
FMENVLRYLSNDRWLPDAKSSMTVGTNLDTVYFKKHGQVLGNSAPFAFHKDFTGITVKPMTSYGNLNPDEVPLLILNGFE
YVTQWGSDPYSIPLRADTSKPKLTQQDVTDLIAYMNKGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNND
PQGYPDRVRQRRSTPIWVYERYPAVDGKPPYTIDDTTKEVIWKYQQENKPDDKPKLEVASWQEEVEGKQVTQFAFIDEAD
HKTPESLAAAKQRILDAFPGLEVCKDSDYHYEVNCLEYRPGTGVPVTGGMYVPQYTQLDLGADTAKAMLQAADLGTNIQR
LYQHELYFRTNGLQGERLNSVDLERLYQNMSVWLWNETKYRYEEGKEDELGFKTFTEFLNCYTNNAYVGTQCSAELKKSL
IDNKMIYGEESSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGVVNTNGETVTQNINLYSAPTKWFAGNM
QSTGLWAPAQQEVSIESKATVPVTVTVALADDLTGREKHEVSLNRPPRVTKTYDLKANDKVTFKVPYGGLIYIKGDSKEV
QSADFTFTGVVKAPFYKDGKWQHDLNSPAPLGELESASFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSE
DGKHRMFTYKNLPGHKHRFANDVQISIGDAHSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVA
NNVLALYMQDRYLGKMNRVADDITVAPEYLEESNGQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSER
EGMKGWNLFQLMHRKARGDEVSNDKFGGKNYCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGATEMSFE
GGVSQSAYNTLASLNLPKPKQGPETINKVTEYSMPAE
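
The DNA sequence above begins with TTG rather than ATG, while the protein sequence begins with M. This is consistent with TTG acting as an alternative start codon that is read as methionine at the initiation position. The sketch below illustrates that on the first 30 nucleotides of the record. It assumes the standard codon assignments and treats only ATG, GTG and TTG as start codons, which is a simplification; `translate` and `START_CODONS` are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard codon assignments, enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a reduced set of bacterial start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3].upper()
        aa = CODON_TABLE[codon]
        # An initiator codon is read as methionine whatever it encodes internally.
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides of the DNA sequence in this record.
head = "TTGCCGGAAGTGAAACCTGATCCAACACCA"
print(translate(head))
```

With `first_is_start=False` the same fragment would begin with L instead of M, which is how the TTG codon is read at internal positions.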

• Homologs from PAI DB

Gene          GenBank Accn    Product                             Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity (%)
unnamed       CAE85238.1      hypothetical protein                Not tested               PAI V 536   Protein         0.0    87
VC0395_A0370  YP_001216326.1  lipoprotein                         Not tested               VPI-1       Protein         0.0    50
VC0845        NP_230493.1     hypothetical protein                Not tested               VPI-1       Protein         0.0    49
acfD          AAK20802.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75655.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75652.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75649.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75646.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75664.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75670.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75658.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75661.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75667.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49

• Homologs from VFDB (virulence genes)

Gene       GenBank Accn    Product                     ID of source DB  Alignment Type  E-val  Identity (%)
O3K_04095  YP_006777549.1  inner membrane lipoprotein  VFG0106          Protein         0.0    49