Gene Information

Name : yghJ (ECED1_3616)
Accession : YP_002399471.1
Strain : Escherichia coli ED1a
Genome accession : NC_011745
Putative virulence/resistance : Virulence
Product : inner membrane lipoprotein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3538679 - 3543259 bp
Length : 4581 bp
Strand : -
Note : Evidence 2b : Function of strongly homologous gene; PubMedId : 1644747; Product type lp : lipoprotein
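
The Length field is consistent with reading Position as an inclusive interval, and the length of the protein sequence follows from dividing the nucleotide length by three and dropping the terminal stop codon. A minimal sketch of that arithmetic in Python; the function names are illustrative and are not part of the database:

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(nt_length: int) -> int:
    """Residue count for a coding sequence that ends with one stop codon.

    Assumes the sequence is a complete open reading frame with no
    frameshift, so its length is a multiple of three.
    """
    if nt_length % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return nt_length // 3 - 1  # subtract the stop codon


length_bp = feature_length(3538679, 3543259)
print(length_bp)                           # 4581
print(expected_protein_length(length_bp))  # 1526
```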

DNA sequence :
TTGTCACTTGCGTTATTAATGAATAAGAAATTTAAATATAAGAAATCGCTTTTAGCGGCTATTTTAAGCGCAACCCTGTT
AGCCGGTTGTGATGGTGGTGGTTCAGGATCGTCCTCCGATACGCCGTCTGTAGATTCTGGATCAGGGACTTTGCCGGAAG
TGAAACCCGATCCAACACCAACCCCGGAGCCGACACCTGAGCCGACGCCGGACCCAGAACCTACGCCGGATCCAACACCT
GATCCTGAGCCGACACCAGAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACCCTGGGTGGAAGCCAGCG
GGTAACTGGTGCTACCTGTAATGGTGAATCCAGCGATGGCTTTACCTTTACGCCAGGCAATACCGTGAGTTGTGTGGTGG
GCAGTACGACCATTGCAACATTCAACACCCAGTCAGAAGCTGCGCGTAGCCTGCGTGCGGTTGACAAAGTGTCGTTTAGC
CTGGAGGACGCACAGGAGCTGGCGAATTCTGAAAATAAGAAAACCAACGCCATCTCTCTGGTGACGTCCAGCGACAGTTG
CCCCGCAGATGCAGAACAGCTTTGTCTTACTTTCTCGTCAGTGGTTGATCGCGCGCGATTTGAAAAACTGTATAAGCAAA
TTGATCTGGCAACAGACAATTTCAGCAAGCTGGTCAATGAAGAGGTGGAAAACAATGCTGCGACTGATAAAGCGCCGTCC
ACCCATACCTCAACGGTAGTGCCAGTCACGACAGAGGGAACAAAACCGGATCTGAACGCGTCCTTCGTGTCGGCTAACGC
GGAACAGTTTTATCAGTATCAACCCACTGAAATCATTCTTTCCGAAGGCCAACTGGTGGATAGCCTGGGGAACGGTGTTG
CTGGCGTTGACTACTACACCAATTCTGGCCGTGGCGTAACTGACGAAAACGGTAAATTCTCCTTTAGCTGGGGCGAAACC
ATCTCCTTTGGTATCGATACCTTTGAACTGGGCTCAGTACGTGGCAATAAGTCGACCATTGCGCTGACTGAATTGGGTGA
TGAAGTTCGCGGGGCAAATATCGATCAGCTCATTCATCGTTATTCGACGACTGGTCAAAATAATACTCGTGTTGTTCCGG
ACGATGTACGCAAGGTCTTTGCCGAATATCCCAACGTGATCAACGAGATAATCAATCTTTCGTTATCCAACGGTGCGACG
CTGGATGAAGGCGATCAAAACGTTGTGCTGCCTAACGAATTTATCGAGCAGTTTAAGACGGGTCAGGCCAAAGAGATCGA
TACCGCGATTTGTGCGAAAACCGACGGTTGCAACGAGGCTCGCTGGTTCTCGCTGACAACGCGCAATGTTAATGACGGCC
AGATTCAGGGCGTTATTAACAAGCTGTGGGGCGTGGATACGAACTATCAGTCTGTCAGCAAGTTCCACGTCTTCCATGAC
TCTACCAACTTCTATGGCAGCACCGGTAACGCGCGCGGTCAGGCGGTGGTAAATATCTCCAACGCGGCATTCCCGATTCT
GATGGCGCGTAATGATAAAAACTACTGGCTGGCGTTTGGCGAAAAACGCGCCTGGGATAAAAATGAGCTGGCGTACATTA
CGGAAGCGCCTTCCATTGTGCAGCCAGAGAACGTTACGCGCGATACCGCGACCTTCAACCTGCCGTTTATTTCGCTGGGG
CAAGTCGGTGAAGGCAAACTGATGGTTATCGGTAACCCGCACTACAACAGCATCCTGCGTTGCCCGAACGGTTACAGTTG
GGGCGGTGGTGTTAATAGTAAAGGTGAGTGTACGCTCAGCGGTGATTCTGATGACATGAAGCACTTTATGCAGAACGTCC
TGCGCTACTTGTCAAATGACATCTGGCAGCCAAATACCAAGAGCATCATGACTGTCGGCACCAACCTGGAGAACGTTTAT
TTCAAAAAAGCGGGCCAGGTATTGGGAAATAGTGCACCATTTGCTTTCCATGAGGATTTCACTGGTATCACGGTTAAACA
GTTGACCAGCTATGGCGATCTGAATCCGGAAGAGATTCCGTTGCTGATCCTCAACGGCTTTGAATATGTGACTCAGTGGT
CTGGCGATCCCTATGCTGTGCCTCTGCGTGCAGATACCAGCAAACCGAAGCTGACTCAGCAGGATGTGACCGATCTGATC
GCTTATCTGAACAAAGGTGGCTCGGTGCTGATCATGGAAAACGTGATGAGCAATCTTAAGGAAGAGAGCGCGTCCAGTTT
TGTGCGTCTGCTGGATGCCGCGGGTCTGTCAATGGCTCTGAACAAATCGGTGGTGAACAACGATCCGCAAGGGTATCCGG
ATCGCGTTCGTCAGCGTCGCGCGACTGGCATTTGGGTTTATGAACGTTATCCTGCTGCAGACGGCGCGCAACCGCCGTAC
ACCATCGACCCAAATACAGGGGAAGTGACCTGGAAATACCAGCAAGACAACAAGCCTGATGACAAGCCGAAACTGGAAGT
TGCGAGCTGGCAGGAGGAAGTTGAGGGCAAACAGGTAACGCGTTATGCCTTTATTGATGAAGCGGAATACACAACAGAAG
AATCTCTGGAAGCGGCAAAGGCAAAAATCTTTGAGAAGTTTCCTGGGTTACAGGAGTGTAAGGACTCGACTTACCATTAC
GAGATTAACTGTTTGGAGCGCCGCCCAGGCACGGATGTTCCGGTAACAGGTGGCATGTATGTTCCGCGCTATACGCAACT
GAATCTTGACGCCGACACCGCGAAAGCGATGGTGCAGGCGGCGGATTTAGGCACCAACATTCAGCGCCTGTATCAGCATG
AGCTTTATTTCCGTACCAAAGGCAGTAAAGGTGAGCGTCTGAACAGTGTTGATCTGGAACGTCTGTACCAGAACATGTCG
GTCTGGCTGTGGAACGATACGAAATATCGTTACGAAGAGGGCAAGGAAGATGAGCTGGGCTTTAAAACGTTCACCGAGTT
CCTGAACTGCTACGCCAATGATGCCTATGCAGGCGGCACCAAGTGCTCCGCAGATCTGAAAAAATCGCTGGTCGATAACA
ACATGATCTACGGTGACGGTAGCAGCAAAGCGGGCATGATGAACCCAAGCTATCCGCTCAACTATATGGAAAAACCGCTG
ACGCGTCTGATGCTGGGCCGTTCCTGGTGGGATCTGAACATTAAGGTTGATGTGGAGAAGTACCCAGGATCCGTATCGGC
AAAGGGTGAGAGCGTTACGGAAAACATCAGCCTGTACTCGAATCCGACCAAATGGTTTGCGGGTAACATGCAGTCAACCG
GCCTGTGGGCACCGGCCCTGCAGGACGTCACCATTAAGTCTTCGGCGTCAGTCCCAGTGACTGTTACCGTGGCGCTGGCT
GACGACCTGACTGGACGTGAGAAGCATGAAGTTGCGCTGAACCGTCCGCCAAGAGTGACTAAAACGTATACTCTGGAGGC
TAACGGTGAAGTGACCTTCAAGGTGCCTTATGGTGGTCTGATTTATATCAAGGGCGACAGTAAGGATGATGTTTCTGCTA
ACTTCACCTTTACCGGTGTAGTAAAAGCGCCGTTCTATAAAGACGGCGAATGGAAAAACGATCTGGACTCACCGGCGCCG
CTGGGCGAGCTGGAGTCTGCGTCGTTCGTCTATACCACGCCGAAGAAGAACCTTGAGGCCAGCAATTTCACTGGTGGTGT
AGCAGAATTCGCTAAAGATCTGGATACCTTTGCCAGCTCGATGAATGACTTCTACGGTCGCAATGAAACGAGCGGTAACC
ACCGGATGTTTACCTATGAAGCATTGACGGGGCATAAACACCGTTTCGCCAACGATGTGCAGATCTCCATCGGTGATGCG
CACTCGGGTTATCCGGTAATGAACAGCAGCTTCTCGACGAACAGCACCACGCTGCCGACAACGCCGCTGAACGACTGGCT
GATTTGGCACGAAGTCGGTCATAACGCTGCAGAAACACCGCTGAACGTACCGGGTGCAACTGAAGTGGCGAACAACGTGC
TGGCGCTGTACATGCAGGATCGCTATCTCGGCAAGATGAACCGTGTCGCTGACGACATTACCGTCGCGCCGGAATATCTG
GACGAGAGCAACGGTCAGGCCTGGGCGCGCGGCGGTGCGGGGGACCGTCTGCTGATGTACGCACAGCTGAAAGAGTGGGC
AGAGAAAAACTTTGATATCACGAAGTGGTATCCAGATGGCAAGCTGCCAGCGTTCTACAGCGAGCGTGAAGGGATGAAAG
GCTGGAACCTGTTCCAGTTGATGCATCGTAAAGCACGCGGCGATGATGTTGGTAACAGCACCTTTGGTGGCAAGAATTAC
TGTGCTGAATCCAATGGTAACGCTGCCGACACGCTGATGCTGTGTGCCTCCTGGGTCGCTCAGACGGATCTTTCGGAGTT
CTTTAAGAAATGGAATCCGGGTGCGAATGCTTACCAGCTACCGGGAGCGGCAGAGATGAGCTTCGAGGGCGGTGTGAGCT
CTTCGGCTTACAGCACGCTCGCGTCGCTTAATCTGCCGAAACCGGAAAAAGGGCCGGAAACCATTAACAAGGTTACCGAG
CATAAGATGTCTGCCGAGTAA

Protein sequence :
MSLALLMNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPSVDSGSGTLPEVKPDPTPTPEPTPEPTPDPEPTPDPTP
DPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFTPGNTVSCVVGSTTIATFNTQSEAARSLRAVDKVSFS
LEDAQELANSENKKTNAISLVTSSDSCPADAEQLCLTFSSVVDRARFEKLYKQIDLATDNFSKLVNEEVENNAATDKAPS
THTSTVVPVTTEGTKPDLNASFVSANAEQFYQYQPTEIILSEGQLVDSLGNGVAGVDYYTNSGRGVTDENGKFSFSWGET
ISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGAT
LDEGDQNVVLPNEFIEQFKTGQAKEIDTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYQSVSKFHVFHD
STNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSIVQPENVTRDTATFNLPFISLG
QVGEGKLMVIGNPHYNSILRCPNGYSWGGGVNSKGECTLSGDSDDMKHFMQNVLRYLSNDIWQPNTKSIMTVGTNLENVY
FKKAGQVLGNSAPFAFHEDFTGITVKQLTSYGDLNPEEIPLLILNGFEYVTQWSGDPYAVPLRADTSKPKLTQQDVTDLI
AYLNKGGSVLIMENVMSNLKEESASSFVRLLDAAGLSMALNKSVVNNDPQGYPDRVRQRRATGIWVYERYPAADGAQPPY
TIDPNTGEVTWKYQQDNKPDDKPKLEVASWQEEVEGKQVTRYAFIDEAEYTTEESLEAAKAKIFEKFPGLQECKDSTYHY
EINCLERRPGTDVPVTGGMYVPRYTQLNLDADTAKAMVQAADLGTNIQRLYQHELYFRTKGSKGERLNSVDLERLYQNMS
VWLWNDTKYRYEEGKEDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPL
TRLMLGRSWWDLNIKVDVEKYPGSVSAKGESVTENISLYSNPTKWFAGNMQSTGLWAPALQDVTIKSSASVPVTVTVALA
DDLTGREKHEVALNRPPRVTKTYTLEANGEVTFKVPYGGLIYIKGDSKDDVSANFTFTGVVKAPFYKDGEWKNDLDSPAP
LGELESASFVYTTPKKNLEASNFTGGVAEFAKDLDTFASSMNDFYGRNETSGNHRMFTYEALTGHKHRFANDVQISIGDA
HSGYPVMNSSFSTNSTTLPTTPLNDWLIWHEVGHNAAETPLNVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYL
DESNGQAWARGGAGDRLLMYAQLKEWAEKNFDITKWYPDGKLPAFYSEREGMKGWNLFQLMHRKARGDDVGNSTFGGKNY
CAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGAAEMSFEGGVSSSAYSTLASLNLPKPEKGPETINKVTE
HKMSAE
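
The protein sequence above corresponds to a translation of the DNA sequence with the standard codon assignments, with the initial TTG read as methionine because it serves as the start codon. A hedged sketch of that translation in Python: `translate` is a hypothetical helper, and limiting start codons to ATG, GTG, and TTG is a simplifying assumption rather than the full bacterial start set.

```python
from itertools import product

# Standard codon assignments, enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")         # an initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the DNA sequence listed above.
print(translate("TTGTCACTTGCGTTATTAATGAATAAGAAA"))  # MSLALLMNKK
```

Passing the full DNA sequence (line breaks are ignored) should give a string that can be compared directly against the listed protein sequence.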

• Homologs from PAI DB

Gene          GenBank Accn    Product                             Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity (%)
unnamed       CAE85238.1      hypothetical protein                Not tested               PAI V 536   Protein         0.0    86
VC0395_A0370  YP_001216326.1  lipoprotein                         Not tested               VPI-1       Protein         0.0    49
VC0845        NP_230493.1     hypothetical protein                Not tested               VPI-1       Protein         0.0    49
acfD          AAK20802.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75652.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75646.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75664.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75670.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75655.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75649.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75661.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75667.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75658.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn    Product                     ID of source DB  Alignment Type  E-val  Identity (%)
yghJ  YP_002399471.1  inner membrane lipoprotein  VFG0106          Protein         0.0    49