Gene Information

Name : yghJ (ECS88_3348)
Accession : YP_002392955.1
Strain : Escherichia coli S88
Genome accession: NC_011742
Putative virulence/resistance : Virulence
Product : inner membrane lipoprotein
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG0810
EC number : -
Position : 3305869 - 3310449 bp
Length : 4581 bp
Strand : -
Note : Evidence 2b : Function of strongly homologous gene; PubMedId : 1644747; Product type lp : lipoprotein

DNA sequence :
TTGTCACTTGCGTTATTAATGAATAAGAAATTTAAATATAAGAAATCGCTTTTAGCGGCTATTTTAAGCGCAACCCTGTT
AGCCGGTTGTGATGGTGGTGGTTCAGGATCGTCCTCCGATACGCCGTCTGTAGATTCTGGATCAGGGACTTTGCCGGAAG
TGAAACCCGATCCAACACCAACCCCGGAGCCGACACCTGAGCCGACGCCGGACCCAGAACCTACGCCGGATCCAACACCT
GATCCTGAGCCGACACCAGAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACCCTGGGCGGAAGCCAGCG
GGTAACTGGTGCTACCTGTAATGGTGAATCCAGCGATGGCTTTACCTTTACGCCAGGCAATACCGTGAGTTGTGTGGTGG
GCAGTACGACCATTGCAACATTCAACACCCAGTCAGAAGCTGCGCGTAGCCTGCGTGCGGTTGACAAAGTGTCGTTTAGC
CTGGAGGACGCGCAGGAGCTGGCGAATTCTGAAAATAAGAAAACCAACGCCATCTCTCTGGTGACGTCCAGCGACAGTTG
CCCCGCAGATGCAGAACAGCTTTGTCTTACTTTCTCGTCAGTGGTTGATCGCGCGCGATTTGAAAAACTGTATAAGCAAA
TTGATCTGGCAACAGACAATTTCAGCAAGCTGGTCAATGAAGAGGTGGAAAACAATGCTGCGACTGATAAAGCGCCGTCC
ACCCATACCTCAACGGTAGTGCCAGTCACGACAGAGGGAACAAAACCGGATCTGAACGCGTCCTTCGTGTCGGCTAACGC
GGAACAGTTTTATCAGTATCAACCCACTGAAATCATTCTTTCCGAAGGCCAACTGGTGGATAGCCTGGGGAACGGTGTTG
CTGGCGTTGACTACTACACCAATTCAGGCCGTGGCGTAACTGACGAAAACGGTAAATTTTCCTTTAGCTGGGGCGAAACC
ATCTCCTTTGGTATCGATACCTTTGAACTGGGCTCAGTACGTGGCAATAAGTCGACCATTGCGCTGACTGAATTGGGTGA
TGAAGTTCGCGGGGCAAATATCGATCAGCTCATTCATCGTTATTCGACGACTGGTCAAAATAATACTCGTGTTGTTCCGG
ACGATGTACGCAAGGTCTTTGCCGAATATCCCAACGTGATCAACGAGATAATCAATCTTTCGTTATCCAACGGTGCGACG
CTGGATGAAGGCGATCAAAACGTTGTGCTGCCTAACGAATTTATCGAGCAGTTTAAGACGGGTCAGGCCAAAGAGATCGA
TACCGCGATTTGTGCGAAAACCGACGGTTGTAACGAGGCTCGCTGGTTCTCGCTGACAACGCGCAATGTTAATGACGGCC
AGATTCAGGGCGTTATTAACAAGCTGTGGGGCGTGGATACGAACTATCAGTCTGTCAGCAAGTTCCACGTCTTCCATGAC
TCTACCAACTTCTATGGCAGTACCGGTAACGCGCGCGGTCAGGCGGTGGTAAATATCTCCAACTCGGCATTCCCGATTCT
GATGGCGCGTAATGATAAAAACTACTGGCTGGCGTTTGGCGAAAAACGCGCCTGGGATAAAAATGAGCTGGCGTACATTA
CGGAAGCGCCTTCCATTGTGCAGCCAGAGAATGTTACGCGCGATACTGCGACTTTCAACCTGCCGTTTATTTCGCTGGGG
CAAGTCGGTGAAGGCAAACTGATGGTTATCGGTAACCCGCACTACAACAGCATCCTGCGTTGCCCGAACGGTTACAGTTG
GGGCGGTGGTGTTAATAGTAAAGGTGAGTGTACGCTCAGCGGTGATTCTGATGACATGAAGCACTTTATGCAGAACGTAC
TGCGCTACTTGTCAAATGACATCTGGCAGCCAAATACCAAGAGCATCATGACTGTCGGCACCAACCTGGAGAACGTTTAT
TTCAAAAAAGCGGGCCAGGTATTGGGAAATAGTGCACCATTTGCTTTCCATGAGGATTTCACTGGTATCACGGTTAAACA
GTTGACCAGCTATGGCGATCTGAATCCGGAAGAGATTCCGTTGCTGATCCTCAACGGCTTTGAATATGTGACTCAGTGGT
CTGGCGATCCCTATGCTGTGCCTCTGCGTGCAGATACCAGCAAACCGAAGCTGACTCAGCAGGATGTGACCGATCTGATC
GCTTATCTGAACAAAGGTGGCTCGGTGCTGATCATGGAAAACGTGATGAGCAATCTTAAGGAAGAGAGCGCGTCCAGTTT
TGTGCGTCTGCTGGATGCCGCGGGTCTGTCAATGGCTCTGAACAAATCGGTGGTGAACAACGATCCGCAAGGGTATCCGG
ATCGCGTTCGTCAGCGTCGCGCGACTGGCATTTGGGTTTATGAACGTTATCCTGCTGCAGACGGCGCGCAACCGCCGTAC
ACCATCGACCCAAATACAGGGGAAGTGACCTGGAAATACCAGCAAGACAACAAGCCTGATGACAAGCCGAAACTGGAAGT
TGCGAGCTGGCAGGAGGAAGTTGAGGGCAAACAGGTAACGCGTTATGCCTTTATTGATGAAGCGGAATACACAACAGAAG
AATCTCTGGAAGCGGCAAAGGCAAAAATCTTTGAGAAGTTTCCTGGGTTACAGGAGTGTAAGGACTCGACTTACCATTAC
GAGATTAACTGTTTGGAGCGCCGCCCAGGCACGGATGTTCCGGTAACAGGTGGCATGTATGTTCCGCGCTATACGCAACT
GAATCTTGACGCCGACACCGCGAAAGCGATGGTGCAGGCGGCGGATTTAGGCACCAACATTCAGCGCCTGTATCAGCATG
AGCTTTATTTCCGTACCAAAGGCAGTAAAGGTGAGCGTCTGAACAGTGTTGATCTGGAACGTCTGTACCAGAACATGTCG
GTCTGGCTGTGGAACGATACGAAATATCGTTACGAAGAGGGCAAGGAAGATGAGCTGGGCTTTAAAACGTTCACCGAGTT
CCTGAACTGCTACGCCAATGATGCCTATGCAGGCGGCACCAAGTGCTCCGCAGATCTGAAAAAATCGCTGGTCGATAACA
ACATGATCTACGGTGACGGTAGCAGCAAAGCGGGCATGATGAACCCAAGCTATCCGCTCAACTATATGGAAAAACCGCTG
ACGCGTCTGATGCTGGGCCGTTCCTGGTGGGATCTGAACATTAAGGTTGATGTGGAGAAGTACCCAGGATCCGTATCGGC
AAAGGGTGAGAGCGTTACGGAAAACATCAGCCTGTACTCGAATCCGACCAAATGGTTTGCGGGTAACATGCAGTCAACCG
GCCTGTGGGCACCGGCCCAGCAGGACGTCACCATTAAGTCTTCGGCGTCAGTCCCAGTGACTGTTACCGTGGCGCTGGCT
GACGACCTGACTGGACGTGAGAAGCATGAAGTTGCGCTGAACCGTCCGCCAAGAGTGACTAAAACGTATACTCTGGAGGC
TAACGGTGAAGTGACCTTCAAGGTGCCTTATGGTGGTCTGATTTATATCAAGGGCGACAGTAAGGATGATGTTTCTGCTA
ACTTCACCTTTACCGGTGTAGTAAAAGCGCCGTTCTATAAAGACGGCGAATGGAAAAACGATCTGGACTCACCGGCGCCG
CTGGGCGAGCTGGAGTCTGCGTCGTTCGTCTATACCACGCCGAAGAAGAACCTTGAGGCCAGCAATTTCACTGGTGGTGT
AGCAGAATTCGCTAAAGATCTGGATACCTTTGCCAGCTCGATGAATGACTTCTACGGTCGTAATGATGAAGACGGTAAGC
ACCGGATGTTTACCTATAAAAACTTGACGGGGCACAAGCATCGTTTCACCAACGATGTGCAGATCTCCATCGGTGATGCG
CACTCGGGTTATCCGGTAATGAACAGCAGCTTCTCGACGAACAGCACCACGCTGCCGACGACGCCGCTGAACGACTGGCT
GATTTGGCACGAAGTCGGTCATAACGCTGCAGAAACACCGCTGAACGTACCGGGTGCAACTGAAGTGGCGAACAACGTGC
TGGCGCTGTACATGCAGGATCGCTATCTCGGTAAGATGAACCGTGTCGCTGACGACATTACCGTCGCGCCGGAATATCTG
GACGAGAGCAACGGTCAGGCCTGGGCGCGCGGCGGTGCGGGTGACCGTCTGCTGATGTACGCACAGTTGAAGGAGTGGGC
AGAGGAAAACTTTGATATCAAACAGTGGTATCCAGATGGTGAGCTGCCTAAGTTCTACAGCGATCGTAAAGGGATGAAGG
GCTGGAACCTGTTCCAGTTGATGCACCGTAAAGCGCGCGGCGATGATGTTGGTAACAGCACCTTTGGTGGCAAGAATTAC
TGTGCTGAATCCAATGGTAACGCTGCCGACACGCTGATGCTGTGTGCATCCTGGGTCGCTCAGGCGGATCTTTCGGAATT
CTTTAAGAAATGGAATCCGGGTGCAAGTGCTTACCAGTTGCCGGGAGCAACGGAGATGAGTTTCCAGGGCGGTGTGAGCT
CTTCGGCTTACAGCACGCTGGCGTCACTCAAGCTGCCGAAACCGGAAAAAGGGCCGGAAACCATTAACAAGGTTACCGAG
CATAAGATGTCTGCCGAGTAA

Protein sequence :
MSLALLMNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPSVDSGSGTLPEVKPDPTPTPEPTPEPTPDPEPTPDPTP
DPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFTPGNTVSCVVGSTTIATFNTQSEAARSLRAVDKVSFS
LEDAQELANSENKKTNAISLVTSSDSCPADAEQLCLTFSSVVDRARFEKLYKQIDLATDNFSKLVNEEVENNAATDKAPS
THTSTVVPVTTEGTKPDLNASFVSANAEQFYQYQPTEIILSEGQLVDSLGNGVAGVDYYTNSGRGVTDENGKFSFSWGET
ISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGAT
LDEGDQNVVLPNEFIEQFKTGQAKEIDTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYQSVSKFHVFHD
STNFYGSTGNARGQAVVNISNSAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSIVQPENVTRDTATFNLPFISLG
QVGEGKLMVIGNPHYNSILRCPNGYSWGGGVNSKGECTLSGDSDDMKHFMQNVLRYLSNDIWQPNTKSIMTVGTNLENVY
FKKAGQVLGNSAPFAFHEDFTGITVKQLTSYGDLNPEEIPLLILNGFEYVTQWSGDPYAVPLRADTSKPKLTQQDVTDLI
AYLNKGGSVLIMENVMSNLKEESASSFVRLLDAAGLSMALNKSVVNNDPQGYPDRVRQRRATGIWVYERYPAADGAQPPY
TIDPNTGEVTWKYQQDNKPDDKPKLEVASWQEEVEGKQVTRYAFIDEAEYTTEESLEAAKAKIFEKFPGLQECKDSTYHY
EINCLERRPGTDVPVTGGMYVPRYTQLNLDADTAKAMVQAADLGTNIQRLYQHELYFRTKGSKGERLNSVDLERLYQNMS
VWLWNDTKYRYEEGKEDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPL
TRLMLGRSWWDLNIKVDVEKYPGSVSAKGESVTENISLYSNPTKWFAGNMQSTGLWAPAQQDVTIKSSASVPVTVTVALA
DDLTGREKHEVALNRPPRVTKTYTLEANGEVTFKVPYGGLIYIKGDSKDDVSANFTFTGVVKAPFYKDGEWKNDLDSPAP
LGELESASFVYTTPKKNLEASNFTGGVAEFAKDLDTFASSMNDFYGRNDEDGKHRMFTYKNLTGHKHRFTNDVQISIGDA
HSGYPVMNSSFSTNSTTLPTTPLNDWLIWHEVGHNAAETPLNVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYL
DESNGQAWARGGAGDRLLMYAQLKEWAEENFDIKQWYPDGELPKFYSDRKGMKGWNLFQLMHRKARGDDVGNSTFGGKNY
CAESNGNAADTLMLCASWVAQADLSEFFKKWNPGASAYQLPGATEMSFQGGVSSSAYSTLASLKLPKPEKGPETINKVTE
HKMSAE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed CAE85238.1 hypothetical protein Not tested PAI V 536 Protein 0.0 86
VC0395_A0370 YP_001216326.1 lipoprotein Not tested VPI-1 Protein 0.0 49
VC0845 NP_230493.1 hypothetical protein Not tested VPI-1 Protein 0.0 49
acfD AAK20802.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75667.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75664.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75670.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75655.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75649.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75652.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75646.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75661.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75658.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
yghJ YP_002392955.1 inner membrane lipoprotein VFG0106 Protein 0.0 49