Gene Information

Name : yghJ (ECUMN_3448)
Accession : YP_002414113.1
Strain : Escherichia coli UMN026
Genome accession: NC_011751
Putative virulence/resistance : Virulence
Product : inner membrane lipoprotein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3564272 - 3568834 bp
Length : 4563 bp
Strand : -
Note : Evidence 2b : Function of strongly homologous gene; PubMedId : 1644747; Product type lp : lipoprotein

DNA sequence :
ATGAATAAGAAATTTAAATATAAGAAATCGCTTTTAGCGGCTATTTTGAGCGCAACCCTGTTAGCCGGTTGTGATGGCGG
TGGTTCCGGATCTTCCTCCGATACGCCGCCTGTAGATTCTGGAACAGGGTCTTTGCCGGAAGTGAAACCTGATCCAACAC
CAAACCCGGAGCCGACGCCTGAGCCAACGCCGGACCCAGAACCTACGCCGGAACCGATACCTGATCCTGAGCCAACACCA
GAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACCCTGGGCGGAAGCCAGCGGGTAACTGGTGCTACCTG
TAATGGTGAATCCAGCGATGGCTTTACATTTACACCTGGTGAGGACGTTACTTGCGTGGCGGGTAACACGACAATTGCCA
CCTTCAACACTCAGTCAGAAGCTGCGCGTAGCCTGCGTGCGGTTGAAAAAGTGTCGTTTAGTCTTGAGGACGCGCAAGAA
CTGGCGGCTTCCGACAATAAGAAAAGCAATGCGGTTTCGCTGGTAACGTCCAGTAACAGCTGTCCGGCGGATACAGAACA
GGTTTGCCTGACGTTCTCCTCTGTGATCGAGAGTAAACGTTTCGACTCGCTGTATAAGCAAATCGATCTGGCACCGGAAG
AGTTCAAAAAGCTGGTCAATGAAGAGGTGGAAAACAATGCCGCGACCGATAAAGCGCCATCCACTCATACTTCACCGGTC
GTGCCTGTCACCACGCCGGGAACAAAACCGGATCTGAACGCCTCCTTCGTGTCGGCTAACGCGGAACAGTTTTATCAGTA
TCAACCTACGGAAATCATTCTCTCCGAAGGCCGACTGGTAGATAGCCAGGGATATGGTGTTGCTGGCGTCAACTACTACA
CCAATTCAGGCCGTGGCGTGACAGGGGAAAATGGTGAATTTTCCTTTAGCTGGGGCGAAACCATCTCCTTTGGTATCGAT
ACCTTTGAACTGGGTTCAGTGCGCGGCAATAAGTCGACCATTGCGCTGACTGAACTGGGTGATGAAGTTCGCGGGGCGAA
TATTGATCAGCTTATTCATCGCTATTCGACGACCGGGCAAAATAATACCCGTGTTGTTCCGGACGATGTACGCAAGGTCT
TTGCCGAATATCCCAACGTGATCAACGAGATTATCAATCTCTCGTTATCCAACGGTGCGACGCTGGATGAAGGTGAGCAA
GTTGTTAATCTGCCAAACGAATTTATTGAGCAGTTTAAGACGGGTCAAGCCAAAGAGATCGATACCGCGATTTGTGCGAA
AACCGACGGTTGTAACGAGGCTCGCTGGTTCTCGCTGACGACGCGCAATGTTAATGACGGCCAGATTCAGGGCGTTATTA
ACAAGCTGTGGGGTGTGGATACAAGTTACAAATCTGTCAGCAAGTTCCATGTATTCCATGACTCTACCAACTTCTATGGC
AGCACCGGTAATGCGCGCGGTCAGGCGGTGGTGAATATCTCCAACGCGGCATTCCCGATTCTGATGGCGCGTAATGATAA
AAACTACTGGCTGGCCTTCGGCGAAAAACGCGCCTGGGATAAAAATGAGCTGGCGTACATTACGGAAGCGCCTTCCATTG
TGCGCCCGGAAAACGTGACTCGCGAAACTGCCACCTTCAACCTGCCGTTTATCTCGCTGGGGCAAGTCGGCAAAGGTAAG
CTGATGGTTATCGGTAACCCGCACTACAACAGCATTCTGCGCTGTCCGAACGGTTACAGCTGGGAAGGCGGTGTTGATAA
AAACGGTCAGTGTACGCGTAACAGTGATTCTAATGATATGAAGCACTTTATGCAGAACGTGTTGCGCTATCTGTCCAACG
ATAAATGGACGCCGGACGCGAAAGCCAGCATGACCGTAGGCACCAACCTGGATACTGTCTATTTCAAACGTCATGGTCAG
GTTACAGGAAACAGTGCTGAGTTCGGCTTCCATCCGGATTTTGCGGGTATCTCCGTTGAGCATTTAAGTAGCTATGGCGA
TCTCGATCCGCAGAAAATGCCGCTGCTGATCCTTAACGGCTTTGAGTATGTGACTCAGGTGGGGGGCGATCCCTATGCAG
TGCCTCTGCGTGCAGATACCAGCAAACCGAAGCTGACCCAGCAGGATGTGACCGATTTGATCGCCTATATGAACAAAGGT
GGCTCGGTGCTGATCATGGAAAACGTGATGAGCAATCTTAAGGAAGAGAGCGCGTCTGGCTTTGTGCGTCTGCTGGATGC
CGCAGGCCTGTCAATGGCACTGAACAAGTCGGTGGTGAATAACGATCCGCAGGGTTATCCAGACCGCGTGCGCCAGCGTC
GAGCGACTGGCATTTGGGTTTATGAACGTTATCCGGTTGTGGAGGGTGAGCTGCCGTACACCATTGATTCCAAAACAGGA
AAAGTTACCTGGAAATATCAGATTGATAACAAGCCTGATAAGAAACCGAAACTGGAAGTAGCCAGCTGGCAAGAAGAGGT
TGATGGTAAACAGGTAACTCAATTCGCCTTTATCGATGAAGCCGACCACAAAACGACTGAATCTCTGGATGCTGCGAAGA
AGAAAATTCTGGAAAAATTCAAGGGATTAGAGGAGTGTAAGGACTCGACATATCACTACGAAATCAACTGCCTGGAATAC
CGTCCTGGCACGAACGTTCCGGTTACTGGTGGCATGTATGTTCCGCGCTATACGCAACTGAATCTCAGCGCCGACACTGC
AAAAGCGATGGTGCAGGCGGCGGATTTAGGCACCAACATTCAGCGTCTGTATCAGCATGAGCTCTACTTCCGGACCAATG
GTCGCAAAGGTGAGCGTCTGAGCAGCGTCGATCTGGAACGTCTGTACCAGAACATGTCGGTCTGGCTGTGGAATGAAATT
GAATACAGCTATGACAGCAGCAAGGAAGACGAGCTGGGCTTTAAAACGTTCACCGAGTTCCTGAACTGCTACGCCAACGA
TGCCTATACTGGCGGCACGCAGTGTTCTGATGAGCTGAAAAAATCGCTGGTCGATAACAACATGATCTACGGCGAGAAGA
GCGTTAATAAAGCGGGCATGATGAACCCCAGCTATCCGCTCAACTATATGGAAAAACCGCTGACGCGCCTGATGCTGGGT
CGTTCCTGGTGGGATCTGAACATCAAAGTTGATGTCGAGAAGTATCCGGGAGCGGTATCGGCAGAAGGTGAGAAGGTTAC
TGAAACCATCAGCCTGTACTCGAATCCGACCAAATGGTTTGCGGGTAACATGCAGTCTACTGGCCTATGGGCTCCGGCTC
AGAAAGAGGTCACCATTGAGTCTTCTGCATCAGTTCCTGTGACTGTCACCGTGGCGCTGGCCGACGATCTGACCGGACGC
GAGAAGCATGAAGTCGCGCTGAACCGTCCGCCAAAAGTGACGAAAACCTATGATCTGAAAGCCAATGATAAGGTGACCTT
CAAGGTGCCTTACGGTGGTCTGATTTATATCAAGGGCAACAGCCCAAAGAATGAGTCAGCCGAATTCACCTTTACTGGTG
TGGTAAAAGCGCCGTTCTATAAAGACGGCGAATGGAAAAACGCTCTGAACTCCCCTGCGCCGCTGGGCGAGCTGGAATCA
GACGCTTTCGTTTATACCACGCCGAAGAAGAACCTTGAGGCCAGCAATTTCACTGGTGGTGTAGCAGAATTCGCTAAAGA
TCTGGATACCTTTGCCAGCTCGATGAATGACTTCTACGGTCGTAATGATGAAGACGGTAAGCACCGGATGTTTACCTATA
AAAACTTGACGGGGCACAAGCATCGTTTCACCAACGATGTGCAGATCTCCATCGGTGATGCGCACTCTGGTTATCCGGTA
ATGAACAGCAGCTTCTCGACGAACAGCACCACGCTGCCGACGACGCCGCTGAACGACTGGCTGATCTGGCACGAAGTTGG
TCACAACGCCGCCGAAACGCCGCTGACTGTTCCTGGCGCAACTGAAGTAGCGAACAACGTGCTGGCGCTGTACATGCAGG
ATCGCTATCTCGGCAAGATGAACCGTGTCGCTGACGACATTACCGTCGCGCCGGAATATCTGGAGGAGAGCAACGGTCAG
GCATGGGCGCGCGGCGGTGCGGGTGACCGTCTGCTGATGTACGCACAGCTGAAAGAGTGGGCAGAGAAAAACTTTGATAT
CAAGACGTGGTATCCAGATGGCAATCTGCCAGCGTTCTACAGCGAGCGTGAAGGGATGAAAGGCTGGAACCTGTTCCAGT
TGATGCATCGTAAAGCACGCGGCGATGAGGTTGGCAAAACCAAGTTTGGCGAAAGAAATTACTGTGCCGAATCCAACGGT
AACGCTGCCGACACGCTGATGCTGTGTGCATCCTGGGTCGCCCAGACGGATCTTTCGGAGTTCTTTAAGAAATGGAATCC
GGGCGCGAATGCTTACCAGTTGCCGGGAGCGAGCGAGATGAACTTCGAGGGCGGTGTGAGCCAGTCGGCTTACGAGACGC
TGGCGGCGCTTAATCTGCCGAAACCGCAGCAAGGGCCGGAAACCATTAATCAGGTTACCGAGCATAAGATGTCAGCTGAG
TAA

Protein sequence :
MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPIPDPEPTP
EPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFTPGEDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQE
LAASDNKKSNAVSLVTSSNSCPADTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPV
VPVTTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGYGVAGVNYYTNSGRGVTGENGEFSFSWGETISFGID
TFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLDEGEQ
VVNLPNEFIEQFKTGQAKEIDTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTSYKSVSKFHVFHDSTNFYG
STGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSIVRPENVTRETATFNLPFISLGQVGKGK
LMVIGNPHYNSILRCPNGYSWEGGVDKNGQCTRNSDSNDMKHFMQNVLRYLSNDKWTPDAKASMTVGTNLDTVYFKRHGQ
VTGNSAEFGFHPDFAGISVEHLSSYGDLDPQKMPLLILNGFEYVTQVGGDPYAVPLRADTSKPKLTQQDVTDLIAYMNKG
GSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPDRVRQRRATGIWVYERYPVVEGELPYTIDSKTG
KVTWKYQIDNKPDKKPKLEVASWQEEVDGKQVTQFAFIDEADHKTTESLDAAKKKILEKFKGLEECKDSTYHYEINCLEY
RPGTNVPVTGGMYVPRYTQLNLSADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNEI
EYSYDSSKEDELGFKTFTEFLNCYANDAYTGGTQCSDELKKSLVDNNMIYGEKSVNKAGMMNPSYPLNYMEKPLTRLMLG
RSWWDLNIKVDVEKYPGAVSAEGEKVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIESSASVPVTVTVALADDLTGR
EKHEVALNRPPKVTKTYDLKANDKVTFKVPYGGLIYIKGNSPKNESAEFTFTGVVKAPFYKDGEWKNALNSPAPLGELES
DAFVYTTPKKNLEASNFTGGVAEFAKDLDTFASSMNDFYGRNDEDGKHRMFTYKNLTGHKHRFTNDVQISIGDAHSGYPV
MNSSFSTNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNGQ
AWARGGAGDRLLMYAQLKEWAEKNFDIKTWYPDGNLPAFYSEREGMKGWNLFQLMHRKARGDEVGKTKFGERNYCAESNG
NAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMNFEGGVSQSAYETLAALNLPKPQQGPETINQVTEHKMSAE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed CAE85238.1 hypothetical protein Not tested PAI V 536 Protein 0.0 93
VC0395_A0370 YP_001216326.1 lipoprotein Not tested VPI-1 Protein 0.0 49
VC0845 NP_230493.1 hypothetical protein Not tested VPI-1 Protein 0.0 49
acfD AAK20802.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75655.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75649.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75646.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75664.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75670.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75652.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75658.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75661.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75667.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
yghJ YP_002414113.1 inner membrane lipoprotein VFG0106 Protein 0.0 49