Gene Information

Name : yghJ (UTI89_C3389)
Accession : YP_542367.1
Strain : Escherichia coli UTI89
Genome accession: NC_007946
Putative virulence/resistance : Virulence
Product : lipoprotein AcfD-like
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG0810
EC number : -
Position : 3318170 - 3322732 bp
Length : 4563 bp
Strand : -
Note : -
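
The Position and Length fields above are related by simple arithmetic. The following minimal Python sketch checks that relationship for this record. It assumes that the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are illustrative and not part of PAI DB.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Coordinates taken from the Position field of this record.
length_bp = span_length(3318170, 3322732)
print(length_bp)                           # 4563
print(expected_protein_length(length_bp))  # 1520
```

Under these assumptions the computed span of 4563 bp agrees with the Length field, and the expected product is 1520 residues long.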

DNA sequence :
ATGAATAAGAAATTTAAATATAAGAAATCGCTTTTAGCGGCTATTTTAAGCGCAACCCTGTTAGCCGGTTGTGATGGTGG
TGGTTCAGGATCGTCCTCCGATACGCCGTCTGTAGATTCTGGATCAGGGACTTTGCCGGAAGTGAAACCCGATCCAACAC
CAACCCCGGAGCCGACACCTGAGCCGACGCCGGACCCAGAACCTACGCCGGATCCAACACCTGATCCTGAGCCGACACCA
GAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACCCTGGGCGGAAGCCAGCGGGTAACTGGTGCTACCTG
TAATGGTGAATCCAGCGATGGCTTTACCTTTACGCCAGGCAATACCGTGAGTTGTGTGGTGGGCAGTACGACCATTGCAA
CATTCAACACCCAGTCAGAAGCTGCGCGTAGCCTGCGTGCGGTTGACAAAGTGTCGTTTAGCCTGGAGGACGCGCAGGAG
CTGGCGAATTCTGAAAATAAGAAAACCAACGCCATCTCTCTGGTGACGTCCAGCGACAGTTGCCCCGCAGATGCAGAACA
GCTTTGTCTTACTTTCTCGTCAGTGGTTGATCGCGCGCGATTTGAAAAACTGTATAAGCAAATTGATCTGGCAACAGACA
ATTTCAGCAAGCTGGTCAATGAAGAGGTGGAAAACAATGCTGCGACTGATAAAGCGCCGTCCACCCATACCTCAACGGTA
GTGCCAGTCACGACAGAGGGAACAAAACCGGATCTGAACGCGTCCTTCGTGTCGGCTAACGCGGAACAGTTTTATCAGTA
TCAACCCACTGAAATCATTCTTTCCGAAGGCCAACTGGTGGATAGCCTGGGGAACGGTGTTGCTGGCGTTGACTACTACA
CCAATTCAGGCCGTGGCGTAACTGACGAAAACGGTAAATTTTCCTTTAGCTGGGGCGAAACCATCTCCTTTGGTATCGAT
ACCTTTGAACTGGGCTCAGTACGTGGCAATAAGTCGACCATTGCGCTGACTGAATTGGGTGATGAAGTTCGCGGGGCAAA
TATCGATCAGCTCATTCATCGTTATTCGACGACTGGTCAAAATAATACTCGTGTTGTTCCGGACGATGTACGCAAGGTCT
TTGCCGAATATCCCAACGTGATCAACGAGATAATCAATCTTTCGTTATCCAACGGTGCGACGCTGGATGAAGGCGATCAA
AACGTTGTGCTGCCTAACGAATTTATCGAGCAGTTTAAGACGGGTCAGGCCAAAGAGATCGATACCGCGATTTGTGCGAA
AACCGACGGTTGTAACGAGGCTCGCTGGTTCTCGCTGACAACGCGCAATGTTAATGACGGCCAGATTCAGGGCGTTATTA
ACAAGCTGTGGGGCGTGGATACGAACTATCAGTCTGTCAGCAAGTTCCACGTCTTCCATGACTCTACCAACTTCTATGGC
AGCACCGGTAACGCGCGCGGTCAGGCGGTGGTAAATATCTCCAACTCGGCATTCCCGATTCTGATGGCGCGTAATGATAA
AAACTACTGGCTGGCGTTTGGCGAAAAACGCGCCTGGGATAAAAATGAGCTGGCGTACATTACGGAAGCGCCTTCCATTG
TGCAGCCAGAGAACGTTACGCGCGATACTGCGACTTTCAACCTGCCGTTTATTTCGCTGGGGCAAGTCGGTGAAGGCAAA
CTGATGGTTATCGGTAACCCGCACTACAACAGCATCCTGCGTTGCCCGAACGGTTACAGTTGGGGCGGTGGTGTTAATAG
TAAAGGTGAGTGTACGCTCAGCGGTGATTCTGATGACATGAAGCACTTTATGCAGAACGTACTGCGCTACTTGTCAAATG
ACATCTGGCAGCCAAATACCAAGAGCATCATGACTGTCGGCACCAACCTGGAGAACGTTTATTTCAAAAAAGCGGGCCAG
GTATTGGGAAATAGTGCACCATTTGCTTTCCATGAGGATTTCACTGGTATCACGGTTAAACAGTTGACCAGCTATGGCGA
TCTGAATCCGGAAGAGATTCCGTTGCTGATCCTCAACGGCTTTGAATATGTGACTCAGTGGTCTGGCGATCCCTATGCTG
TGCCTCTGCGTGCAGATACCAGCAAACCGAAGCTGACTCAGCAGGATGTGACCGATCTGATCGCTTATCTGAACAAAGGT
GGCTCGGTGCTGATCATGGAAAACGTGATGAGCAATCTTAAGGAAGAGAGCGCGTCCAGTTTTGTGCGTCTGCTGGATGC
CGCGGGTCTGTCAATGGCTCTGAACAAATCGGTGGTGAACAACGATCCGCAAGGGTATCCGGATCGCGTTCGTCAGCGTC
GCGCGACTGGCATTTGGGTTTATGAACGTTATCCTGCTGCAGACGGCGCGCAACCGCCGTACACCATCGACCCAAATACA
GGGGAAGTGACCTGGAAATACCAGCAAGACAACAAGCCTGATGACAAGCCGAAACTGGAAGTTGCGAGCTGGCAGGAGGA
AGTTGAGGGCAAACAGGTAACGCGTTATGCCTTTATTGATGAAGCGGAATACACAACAGAAGAATCTCTGGAAGCGGCAA
AGGCAAAAATCTTTGAGAAGTTTCCTGGGTTACAGGAGTGTAAGGACTCGACTTACCATTACGAGATTAACTGTTTGGAG
CGCCGCCCAGGCACGGATGTTCCGGTAACAGGTGGCATGTATGTTCCGCGCTATACGCAACTGAATCTTGACGCCGACAC
CGCGAAAGCGATGGTGCAGGCGGCGGATTTAGGCACCAACATTCAGCGCCTGTATCAGCATGAGCTTTATTTCCGTACCA
AAGGCAGTAAAGGTGAGCGTCTGAACAGTGTTGATCTGGAACGTCTGTACCAGAACATGTCGGTCTGGCTGTGGAACGAT
ACGAAATATCGTTACGAAGAGGGCAAGGAAGATGAGCTGGGCTTTAAAACGTTCACCGAGTTCCTGAACTGCTACGCCAA
TGATGCCTATGCAGGCGGCACCAAGTGCTCCGCAGATCTGAAAAAATCGCTGGTCGATAACAACATGATCTACGGTGACG
GTAGCAGCAAAGCGGGCATGATGAACCCAAGCTATCCGCTCAACTATATGGAAAAACCGCTGACGCGTCTGATGCTGGGC
CGTTCCTGGTGGGATCTGAACATTAAGGTTGATGTGGAGAAGTACCCAGGATCCGTATCGGCAAAGGGTGAGAGCGTTAC
GGAAAACATCAGCCTGTACTCGAATCCGACCAAATGGTTTGCGGGTAACATGCAGTCAACCGGCCTGTGGGCACCGGCCC
AGCAGGACGTCACCATTAAGTCTTCGGCGTCAGTCCCAGTGACTGTTACCGTGGCGCTGGCTGACGACCTGACTGGACGT
GAGAAGCATGAAGTTGCGCTGAACCGTCCGCCAAGAGTGACTAAAACGTATACTCTGGAGGCTAACGGTGAAGTGACCTT
CAAGGTGCCTTATGGTGGTCTGATTTATATCAAGGGCGACAGTAAGGATGATGTTTCTGCTAACTTCACCTTTACCGGTG
TAGTAAAAGCGCCGTTCTATAAAGACGGCGAATGGAAAAACGATCTGGACTCACCGGCGCCGCTGGGCGAGCTGGAGTCT
GCGTCGTTCGTCTATACCACGCCGAAGAAGAACCTTGAGGCCAGCAATTTCACTGGTGGTGTAGCAGAATTCGCTAAAGA
TCTGGATACCTTTGCCAGCTCGATGAATGACTTCTACGGTCGTAATGATGAAGACGGTAAGCACCGGATGTTTACCTATA
AAAACTTGACGGGGCACAAGCATCGTTTCACCAACGATGTGCAGATCTCCATCGGTGATGCGCACTCGGGTTATCCGGTA
ATGAACAGCAGCTTCTCGACGAACAGCACCACGCTGCCGACGACGCCGCTGAACGACTGGCTGATTTGGCACGAAGTCGG
TCATAACGCTGCAGAAACACCGCTGAACGTACCGGGTGCAACTGAAGTGGCGAACAACGTGCTGGCGCTGTACATGCAGG
ATCGCTATCTCGGTAAGATGAACCGTGTCGCTGACGACATTACCGTCGCGCCGGAATATCTGGACGAGAGCAACGGTCAG
GCCTGGGCGCGCGGCGGTGCGGGTGACCGTCTGCTGATGTACGCACAGTTGAAGGAGTGGGCAGAGGAAAACTTTGATAT
CAAACAGTGGTATCCAGATGGTGAGCTGCCTAAGTTCTACAGCGATCGTAAAGGGATGAAGGGCTGGAACCTGTTCCAGT
TGATGCACCGTAAAGCGCGCGGCGATGATGTTGGTAACAGCACCTTTGGTGGCAAGAATTACTGTGCTGAATCCAATGGT
AACGCTGCCGACACGCTGATGCTGTGTGCATCCTGGGTCGCTCAGGCGGATCTTTCGGAATTCTTTAAGAAATGGAATCC
GGGTGCAAGTGCTTACCAGTTGCCGGGAGCAACGGAGATGAGTTTCCAGGGCGGTGTGAGCTCTTCGGCTTACAGCACGC
TGGCGTCACTCAAGCTGCCGAAACCGGAAAAAGGGCCGGAAACCATTAACAAGGTTACCGAGCATAAGATGTCTGCCGAG
TAA

Protein sequence :
MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPSVDSGSGTLPEVKPDPTPTPEPTPEPTPDPEPTPDPTPDPEPTP
EPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFTPGNTVSCVVGSTTIATFNTQSEAARSLRAVDKVSFSLEDAQE
LANSENKKTNAISLVTSSDSCPADAEQLCLTFSSVVDRARFEKLYKQIDLATDNFSKLVNEEVENNAATDKAPSTHTSTV
VPVTTEGTKPDLNASFVSANAEQFYQYQPTEIILSEGQLVDSLGNGVAGVDYYTNSGRGVTDENGKFSFSWGETISFGID
TFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLDEGDQ
NVVLPNEFIEQFKTGQAKEIDTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYQSVSKFHVFHDSTNFYG
STGNARGQAVVNISNSAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSIVQPENVTRDTATFNLPFISLGQVGEGK
LMVIGNPHYNSILRCPNGYSWGGGVNSKGECTLSGDSDDMKHFMQNVLRYLSNDIWQPNTKSIMTVGTNLENVYFKKAGQ
VLGNSAPFAFHEDFTGITVKQLTSYGDLNPEEIPLLILNGFEYVTQWSGDPYAVPLRADTSKPKLTQQDVTDLIAYLNKG
GSVLIMENVMSNLKEESASSFVRLLDAAGLSMALNKSVVNNDPQGYPDRVRQRRATGIWVYERYPAADGAQPPYTIDPNT
GEVTWKYQQDNKPDDKPKLEVASWQEEVEGKQVTRYAFIDEAEYTTEESLEAAKAKIFEKFPGLQECKDSTYHYEINCLE
RRPGTDVPVTGGMYVPRYTQLNLDADTAKAMVQAADLGTNIQRLYQHELYFRTKGSKGERLNSVDLERLYQNMSVWLWND
TKYRYEEGKEDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLG
RSWWDLNIKVDVEKYPGSVSAKGESVTENISLYSNPTKWFAGNMQSTGLWAPAQQDVTIKSSASVPVTVTVALADDLTGR
EKHEVALNRPPRVTKTYTLEANGEVTFKVPYGGLIYIKGDSKDDVSANFTFTGVVKAPFYKDGEWKNDLDSPAPLGELES
ASFVYTTPKKNLEASNFTGGVAEFAKDLDTFASSMNDFYGRNDEDGKHRMFTYKNLTGHKHRFTNDVQISIGDAHSGYPV
MNSSFSTNSTTLPTTPLNDWLIWHEVGHNAAETPLNVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLDESNGQ
AWARGGAGDRLLMYAQLKEWAEENFDIKQWYPDGELPKFYSDRKGMKGWNLFQLMHRKARGDDVGNSTFGGKNYCAESNG
NAADTLMLCASWVAQADLSEFFKKWNPGASAYQLPGATEMSFQGGVSSSAYSTLASLKLPKPEKGPETINKVTEHKMSAE
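
The DNA sequence listed above begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene is annotated on the minus strand. As a rough consistency check between the two sequences, the sketch below translates the first 30 and the last 12 bases with the standard genetic code. The use of the standard code and the helper name `translate` are assumptions made for illustration. This is not the method used by the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate coding-strand DNA, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the DNA sequence in this record.
print(translate("ATGAATAAGAAATTTAAATATAAGAAATCG"))  # MNKKFKYKKS
print(translate("TCTGCCGAGTAA"))                    # SAE
```

Both results match the corresponding ends of the protein sequence (MNKKFKYKKS at the start, SAE at the end). Passing the full DNA sequence to the same function would be expected to reproduce the whole protein.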

• Homologs from PAI DB

Gene          GenBank Accn    Product                             Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity (%)
unnamed       CAE85238.1      hypothetical protein                Not tested               PAI V 536   Protein         0.0    86
VC0395_A0370  YP_001216326.1  lipoprotein                         Not tested               VPI-1       Protein         0.0    49
VC0845        NP_230493.1     hypothetical protein                Not tested               VPI-1       Protein         0.0    49
acfD          AAK20802.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75667.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75649.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75652.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75646.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75664.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75670.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75655.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75661.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75658.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn  Product                ID of source DB  Alignment Type  E-val  Identity (%)
yghJ  YP_542367.1   lipoprotein AcfD-like  VFG0106          Protein         0.0    49