Gene Information

Name : yghJ (G2583_3690)
Accession : YP_003501165.1
Strain : Escherichia coli CB9615
Genome accession: NC_013941
Putative virulence/resistance : Virulence
Product : lipoprotein acfD-like protein precursor
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3737467 - 3742026 bp
Length : 4560 bp
Strand : -
Note : -

DNA sequence :
ATGAATAAGAAATTTAAATATAAGAAATCGCTTTTAGCGGCTATTTTGAGTGCAACCCTGTTAGCCGGTTGTGATGGCGG
TGGCTCCGGATCTTCCTCCGATACGCCGCCTGTAGATTCTGGAACAGGGTCTTTGCCGGAAGTGAAACCTGATCCAACAC
CAAACCCGGAGCCGACGCCTGAGCCAACGCCGGACCCAGAGCCTACGCCAGAACCGACACCTGATCCTGAACCAACACCA
GAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACCCTGGGCGGAAGCCAGCGGGTAACTGGTGCTACCTG
TAATGGTGAATCCAGCGATGGCTTTACATTTAAACCTGGCGAGGACGTTACTTGCGTGGCGGGTAACACGACAATTGCCA
CCTTCAACACTCAGTCAGAAGCTGCGCGTAGCCTGCGTGCGGTTGAAAAAGTGTCGTTTAGTCTTGAGGACGCGCAAGAA
CTGGCGGCTTCCGACAATAAGAAAAGCAATGCGGTTTCGCTGGTAACGTCCAGTAACAGCTGTCCGGCGGATACAGAACA
GGTTTGCCTGGAGTTCTCCTCAGTGATCGAGAGTAAACGCTTCGACTCGCTGTATAAGCAAATCGATCTGGCACCGGAAG
AGTTCAAAAAGCTGGTCAATGAAGAGGTGGAAAACAATGCCGCGACCGATAAAGCGCCATCCACTCATACTTCACCGGTC
GTGCCCGCCACCACTCCGGGAACAAAACCGGATCTAAACGCTTCCTTCGTGTCGGCTAACGCGGAACAGTTTTATCAGTA
TCAACCTACTGAAATTATTCGCTCCGAAGGCCGACTGGTAGATAGCCAGGGATATGGTGTTGCTGGCGTCAACTACTACA
CCAATTCAGGCCGTGGCGTGACAGGGGAAAATGGTGAATTTTCCTTTAGCTGGGGCGAAACCATCTCCTTTGGTATCGAT
ACCTTTGAACTGGGGTCAGTGCGCGGCAATAAGTCGACCATTGCATTGACTGAACTGGGTGATGAAGTTCGCGGGGCGAA
TATCGATCAGCTTATTCATCGCTATTCGACGACCAGGCAAAATAATACCCGTGTTGTGCCGGACGATGTACGCAAGGTCT
TTGCCGAATATCCCAACGTGATCAACGAGATTATCAATCTCTCGTTATCCAACGGTGCGACGCTGGATGAAGGTGAGCAA
GTTGTTAATCTGCCAAACGAATTTATTGAGCAGTTTAATACGGGTCAGGCCAAAGAGATCGATACCGCGATTTGTGCGAA
AACCGACGGTTGTAACGAGGTTCGCTGGTTCTCGCTGACGACGCGCAATGTTAATGACGGCCAGATTCAGGGCGTTATTA
ACAAGCTGTGGGGTGTGGATGAAGATTACAAATCGGTGACGAAATTCCACGTCTTTCATGACTCTACCAACTTCTATGGT
AGCACCGGTAATGCGCGCGGTCAGGCGGTGGTGAATATCTCCAACGCGGCATTCCCGATTCTGATGGCGCGTAATGATAA
AAACTACTGGCTGGCCTTCGGCGAAAAACGCGCCTGGGATAAAAATGAGCTGGCGTACATTACTGAAGCGCCTTCCATTG
TGCAACCTGAGAACGTGACACGCGAAACTGCGACCTTCAACCTGCCGTTTATCCCGCTGGGGCAAGTCGGCAAAGGCAAG
CTGATGGTTATCGGTAACCCGCACTACAACAGCATTTTGCGTTGCCCGAACGGTTACAGCTGGAACGGGAGCGTTAATAA
AGACGGACAGTGTACGCTCAACAGCGACTCGGATGACATGAAGAACTTCATGGAGAACGTGTTGCGCTATCTGTCAAATG
ATCGCTGGTTGCCGGATGCAAAATCCAATATGACCGTGGGTACTAACCTGGACACGGTGTATTTCAAAAAACACGGGCAG
GTTACAGGAAACAGTGCTGCGTTTGGCTTCCATCCGGATTTTGCGGGTATCTCTGTTGAGCATTTAAGTAGCTATGGCGA
TCTCGATCCGCAGGACATGCCACTGCTGATCCTCAACGGCTTTGAGTATGTGACTCAGGTTGGGGGCGATCCCTATGCAG
TGCCTCTGCGTGCAGATACCAGCAAACCGAAGCTGACCCAGCAGGATGTGACCGATCTGATCGCCTATCTGAACAAAGGT
GGCTCGGTGCTGATCATGGAAAACGTGATGAGCAATCTTAAGGAAGAGAGCGCGTCTGGCTTTGTGCGTCTGCTGGATGC
CGCAGGCCTGTCAATGGCACTGAACAAGTCGGTGGTGAATAACGATCCGCAGGGGTATCCGGATCGCGTACGCCAACAAC
GCGCAACGGGTATTTGGGTCTATGAACGTTATCCGTTTGTTGATGGTAAACCGCCGTATACCATTGATGAAACAACGAAA
GAAGTTATCTGGAAATACCAGCAAGACAACAAGCCTGATGATAAGCCGAAACTGGAAGTTGCCAGCTGGCAGGAGGAAGT
TGAGGGCAAACAGGTAACGCGTTATGCCTTTATTGATGAGGCGGAGTTTAAAACAAAAGAGTCTCTGGAGGCTGCAAAGG
CAAAAATCTTTGAGAAGTTTCCTGGATTAAAGGAGTGTAAGGACCCAACTTACCACTACGAGGTCAACTGTCTGGAATAT
CGTCCTGGCACGGGGGTTCCGGTTACTGGTGGCATGTATGTTCCACAGTATACGCAATTAAGCCTTAACGCCGACACGGC
AAAAGCGATGGTGCAGGCTGCGGATTTAGGCACCAACATTCAGCGTCTGTATCAGCATGAGCTCTACTTCCGGACCAATG
GTCGCAAAGGTGAGCGTCTGAGCAGCGTCGATCTGGAACGTCTGTACCAGAACATGTCGGTCTGGCTGTGGAACGATACG
AGCTATCGTTATGAAGAAGGCAAAAATGACGAGCTGGGCTTTAAAACGTTCACCGAGTTCCTGAACTGCTATACCAACGA
TGCCTATACCGATGGCACACGGTGCTCCGCAGATCTGAAAAAATCGCTGGTCGATAACAACATGATCTACGGTGACGGTA
GCAGCAAAGCGGGCATGATGAACCCGAGCTATCCACTCAACTATATGGAAAAACCGCTGACGCGCCTGATGCTGGGCCGT
TCCTGGTGGGATCTGAACATTAAGGTTGATGTGGAGAAGTACCCTGGAGCGGTATCTGTAGGGGGAGAAGAGGTTACTGA
AACCATCAGCCTGTACTCGAATCCGACCAAATGGTTTGCAGGTAACATGCAGTCAACTGGCCTGTGGGCACCGGCTCAGA
AAGAGGTCACCATTAAGTCCAATGCGAACGTTCCTGTGACCGTCACCGTGGCGCTGGCTGACGACCTGACCGGACGTGAG
AAGCATGAAGTTGCGCTGAACCGTCCGCCAAGAGTGACTAAAACGTACTCTCTGGACGCTAGCGGTACGGTGAAGTTCAA
GGTGCCTTACGGTGGCCTGATTTATATCAAGGGCAATAGCTCTACCAATGAATCTGCCAGCTTCACCTTTACTGGCGTGG
TAAAAGCACCGTTCTATAAAGACGGCGCATGGAAAAACGATCTGAACTCACCGGCTCCGCTGGGTGAGCTGGAATCAGAC
GCTTTCGTCTATACCACACCGAAGAAGAACCTGAATGCCAGCAATTACACTGGCGGATTGGAGCAATTCGCTAACGATCT
GGACACCTTTGCCAGCTCGATGAATGACTTCTACGGTCGTAATGATGAAGACGGTAAGCACCGGATGTTTACCTTTAAAA
ACTTGCCGGGTCACAAACACCGTTTCACCAACGATGTGCAGATCTCCATCGGTGATGCGCACTCTGGTTACCCGGTGATG
AACAGCAGCTTCTCGCCGAACAGCACCACGCTGCCGACGACGCCGCTGAACGACTGGCTGATCTGGCATGAAGTCGGTCA
TAACGCCGCAGAAACGCCGTTGACTGTACCGGGTGCTACTGAAGTTGCGAACAACGTGCTGGCGCTGTACATGCAGGATC
GCTATCTCGGCAAGATGAACCGTGTCGCTGACGATATTACCGTTGCGCCGGAATATCTGGAGGAGAGCAACGGTCAGGCA
TGGGCGCGTGGCGGTGCGGGTGATCGTCTGCTGATGTACGCACAACTGAAGGAATGGGCAGAGAAAAACTTTGATATCAC
GAAGTGGTATCCAGAAGGTAACCTGCCTAAGTTCTACAGCGAGCGTGAAGGGATGAAAGGCTGGAACCTGTTCCAGTTGA
TGCATCGTAAAGCACGCGGCGATGAGGTCAGCAATGACAAGTTTGGCGGCAGAAATTACTGTGCTGAATCCAACGGTAAC
GCTGCAGACACGCTGATGCTGTGTGCCTCCTGGGTCGCCCAGACGGATCTTTCGGAGTTCTTTAAGAAATGGAATCCAGG
TGCGAATGCTTACCAGTTGCCGGGGGCGAGCGAGATGAGCTTCGAGGGCGGTGTGAGCCAGTCGGCTTACAACACGCTCG
CGTCACTCGATCTGCCGAAACCGAAGCAAGGGCCGGAAACCATTAACAAGGTTACCGAGCATAAGATGTCTGTCGAGTAA

Protein sequence :
MNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPTPDPEPTP
EPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFKPGEDVTCVAGNTTIATFNTQSEAARSLRAVEKVSFSLEDAQE
LAASDNKKSNAVSLVTSSNSCPADTEQVCLEFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTHTSPV
VPATTPGTKPDLNASFVSANAEQFYQYQPTEIIRSEGRLVDSQGYGVAGVNYYTNSGRGVTGENGEFSFSWGETISFGID
TFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTRQNNTRVVPDDVRKVFAEYPNVINEIINLSLSNGATLDEGEQ
VVNLPNEFIEQFNTGQAKEIDTAICAKTDGCNEVRWFSLTTRNVNDGQIQGVINKLWGVDEDYKSVTKFHVFHDSTNFYG
STGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSIVQPENVTRETATFNLPFIPLGQVGKGK
LMVIGNPHYNSILRCPNGYSWNGSVNKDGQCTLNSDSDDMKNFMENVLRYLSNDRWLPDAKSNMTVGTNLDTVYFKKHGQ
VTGNSAAFGFHPDFAGISVEHLSSYGDLDPQDMPLLILNGFEYVTQVGGDPYAVPLRADTSKPKLTQQDVTDLIAYLNKG
GSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPDRVRQQRATGIWVYERYPFVDGKPPYTIDETTK
EVIWKYQQDNKPDDKPKLEVASWQEEVEGKQVTRYAFIDEAEFKTKESLEAAKAKIFEKFPGLKECKDPTYHYEVNCLEY
RPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWLWNDT
SYRYEEGKNDELGFKTFTEFLNCYTNDAYTDGTRCSADLKKSLVDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGR
SWWDLNIKVDVEKYPGAVSVGGEEVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGRE
KHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTNESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESD
AFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRNDEDGKHRMFTFKNLPGHKHRFTNDVQISIGDAHSGYPVM
NSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEESNGQA
WARGGAGDRLLMYAQLKEWAEKNFDITKWYPEGNLPKFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGRNYCAESNGN
AADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMSFEGGVSQSAYNTLASLDLPKPKQGPETINKVTEHKMSVE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed CAE85238.1 hypothetical protein Not tested PAI V 536 Protein 0.0 90
VC0395_A0370 YP_001216326.1 lipoprotein Not tested VPI-1 Protein 0.0 49
acfD AAK20802.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
VC0845 NP_230493.1 hypothetical protein Not tested VPI-1 Protein 0.0 48
acfD ACK75655.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75646.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75664.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75670.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75667.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75658.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75661.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75652.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 47
acfD ACK75649.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 47

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
yghJ YP_003501165.1 lipoprotein acfD-like protein precursor VFG0106 Protein 0.0 48