Gene Information

Name : yghJ (ECIAI39_3460)
Accession : YP_002409370.1
Strain : Escherichia coli IAI39
Genome accession: NC_011750
Putative virulence/resistance : Virulence
Product : inner membrane lipoprotein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3612166 - 3616740 bp
Length : 4575 bp
Strand : -
Note : Evidence 2b : Function of strongly homologous gene; PubMedId : 1644747; Product type lp : lipoprotein

DNA sequence :
TTGTCACTTGCGTTATTAATGAATAAGAAATTTAAATATAAGAAATCGCTTTTAGCGGCTATTTTGAGTGCAACCCTGTT
AGCCGGTTGTGATGGCGGTGGCTCCGGATCTTCCTCCGATACGCCGCCTGTAGATTCTGGAACAGGGTCTTTGCCGGAAG
TGAAACCTGATCCAACACCAAACCCGGAGCCGACGCCTGAGCCAACGCCGGACCCAGAACCTACGCCGGAACCGACACCT
GATCCTGAGCCAGAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACCCTGGGCGGAAGCCAGCGGGTAAC
TGGTGCTACTTGTAATGGTGAATCCAGCGATGGCTTTACATTTACACCTGGTGAGGACGTTACTTGCGTGGCGGGTAACA
CGACAATTGCTACCTTCGACACCCAGTCAGAAGCTGCGCGTAGCCTGCGTGCGGTTGAAAAAGTGTCGTTTAGTCTTGAG
GACGCGCAAGAACTGGCAGCTTCCGATGACAAGAAAAGCAATGCGGTTTCGCTGGTAACGTCCAGTAACAGCTGTCCGGC
GAATACAGAACAGGTTTGCCTGACGTTCTCCTCAGTGATCGAGAGTAAACGTTTCGACTCGCTGTATAAGCAAATCGATC
TGGCACCGGAAGAGTTCAAAAAGCTGGTCAATGAAGAGGTGGAAAACAATGCCGCGACCGATAAAGCGCCATCCACTCAT
ACTTCACCGGTTGTGCCCGTCACCACGCCGGGAACAAAACCGGATCTGAACGCTTCCTTCGTGTCGGCTAACGCGGAACA
GTTTTATCAGTATCAACCCACTGAAATCATTCTCTCCGAAGGCCGACTGGTAGATAGCATGGGCAATGGTGTGGTTGGCG
TAAATTACTACACCAGCTCAGGCCGTGGCGTAACTGGCGAAAACGGCAAATTCAACTTCAGCTGGGGCGAAACCATCTCC
TTTGGTATCGACACCTTTGAACTGGGCTCAGTGCGCGGCAATAAGTCGACCATTGCGTTGACTGAACTGGGTGACGAAGT
TCGCGGCGCGAATATTGATCAGCTTATTCATCGTTACTCCCAGGCCGGAAAAAATGATGAGCGTGAAGTGCCGGACGTAG
TGCGCAAGGTCTTTGCCGAGTATCCCAACGTAATCAACGAGATTATCAATCTCTCGTTATCCAATGGCGAGGCGTTGAGC
GAAGGCGATCAAACCTTTGAGCGGACAAACGAATTTCTTGAGCAGTTTGAATCCGGGCAGGCTAAAGAGATTGATACGGC
GATTTGTGACTCCCTTGGGGGCTGCAACTCTCAGCGTTGGTTCTCGTTGACAGCACGCAATGTTAACGAAGGTCAGATTC
AGGGCGTTATCAACAAGCTGTGGGGTGTGGATAAAGATTACAAATCAGTGACGAAATTCCACGTCTTCCATGACTCTACC
AACTTCTATGGCAGTACCGGTAATGCGCGCGGTCAGGCAGTGGTGAATATCTCCAACGCGGCCTTCCCGATTCTGATGGC
GCGTAATGATAAAAACTACTGGCTGGCGTTTGGCGAAAAACGCGCCTGGGATAAAAACGAGCTGGCGTACATTACGGAAG
CGCCTTCTCTCGTTGAGCCGGAAAACGTTACGCGCGATACCGCCACCTTTAACCTGCCGTTTATTTCGCTGGGGCAAGTC
GGTGAGGGCAAACTGATGGTTATCGGTAACCCGCACTACAACAGCATTTTGCGTTGTCCGAACGGTTACAGCTGGGAAGG
CGGTGTTGATAAAAACGGTCAGTGTACGCGTAACAGTGATTCTAATGATATGAAGCACTTTATGCAGAACGTGTTGCGCT
ATCTGTCCGACGATAAATGGACGCCGGACGCGAAAGCCAGCATGACCGTAGGCACCAACCTGGATACTGTCTATTTCAAA
CGTCATGGTCAGGTTACAGGAAACAGCGCTGAGTTCGGCTTTCATCCGGATTTTGCGGGTATCTCTGTTGAGCATTTAAG
TAGCTATGGCGATCTCGACCCGCAGGAAATGCCGCTGCTGATCCTCAACGGCTTTGAGTATGTGACTCAGGTTGGTAACG
ATCCTTATGCAATCCCGCTGCGAGCAGATACCAGCAAACCGAAGCTGACTCAGCAGGATGTGACCGATCTGATCGCCTAT
CTGAACAAAGGTGGATCGGTGCTGATCATGGAAAACGTGATGAGCAATCTTAAGGAAGAGAGCGCGTCTGGCTTTGTACG
TCTGCTTGATGCCGCAGGTCTGTCGATGGCACTGAACAAGTCGGTAGTAAATAACGATCCGCAAGGGTATCCGAACCGCG
TTCGTCAGCAGCGCGCAACGGGCATTTGGGTCTATGAACGTTATCCTGCCGTAGATGGTGCGCTGCCGTACACCATCGAT
AGTAAGACAGGGGAAGTTAAGTGGAAATATCAGGTAGAAAACAAACCTGATGACAAACCGAAGCTGGAAGTTGCCAGCTG
GCTGGAAGATGTAGATGGCAAACAGGAAACGCGTTATGCCTTTATTGATGAGGCCGATCATAAAACAGAGGATTCTCTGA
AGGCTGCGAAGGCAAAAATCTTTGAGAAGTTTCCTGGATTAAAGGAGTGTAAGGACCCAACTTACCACTACGAGGTCAAC
TGTCTGGAATATCGTCCTGGCACGGGGGTTCCGGTTACTGGTGGCATGTATGTTCCACAGTATACGCAACTAAGCCTTAA
CGCCGACACGGCAAAAGCGATGGTGCAGGCTGCGGATTTAGGCACCAACATTCAGCGTCTGTATCAGCATGAGCTCTACT
TCCGGACCAATGGTCGCAAAGGTGAGCGTCTGAGCAGCGTCGATCTGGAACGTCTGTACCAGAACATGTCGGTCTGGCTG
TGGAATAAAATTGAATATCGCTATGAAAACGACAAGGATGACGAGCTGGGCTTTAAAACGTTCACCGAGTTCCTGAACTG
CTACGCCAACGATGCCTATACTGGCGGCACGCAGTGTTCTGATGAGCTGAAAAAATCGCTGGTCGATAACAACATGATCT
ACGGCGAGAAGAGCGTTAATAAAGCGGGCATGATGAACCCGAGCTATCCGCTCAACTATATGGAAAAACCGCTGACGCGC
CTGATGCTGGGTCGTTCCTGGTGGGATCTAAACATCAAAGTTGATGTCGAGAAGTATCCGGGAGCGGTATCGGCAGAAGG
TGAGAAGGTTACTGAAACCATCAGCCTGTACTCCAATCCGACCAAATGGTTTGCGGGTAACATGCAGTCTACTGGCCTGT
GGGCTCCGGCTCAGAAAGAGGTCACCATTGAGTCTTCTGCATCAGTTCCTGTGACTGTCACCGTGGCGCTGGCCGACGAT
CTGACCGGACGCGAGAAGCATGAAGTCGCGCTGAACCGTCCGCCAAAAGTGACGAAAACCTATGATCTGAAAGCCAATGA
TAAGGTGACCTTCAAGGTGCCTTACGGTGGTCTGATTTATATCAAGGGCAACAGCCCAAAGAATGAGTCAGCCGAATTCA
CCTTTACTGGTGTGGTAAAAGCGCCGTTCTATAAAGACGGCGAATGGAAAAACGCTCTGAACTCCCCTGCGCCGCTGGGC
GAGCTGGAATCAGACGCTTTCGTTTATACCACGCCGAAGAAGAACCTTGAGGCTAGCAATTTCACTGGTGGTGTAGCAGA
ATTCGCTAAAGATCTGGATACCTTTGCCAGCTCGATGAATGACTTCTACGGTCGTAATGATGAAGACGGTAAGCACCGGA
TGTTTACCTATAAAAACTTGACGGGGCACAAGCATCGTTTCACCAACGATGTGCAGATCTCCATCGGTGATGCGCACTCG
GGTTATCCGGTAATGAACAGCAGCTTCTCGACGAACAGCACCACGCTGCCGACGACGCCGCTGAACGACTGGTTGATCTG
GCACGAAGTAGGGCACAACGCCGCCGAAACGCCGCTGACTGTTCCTGGCGCAACTGAAGTAGCGAACAACGTTCTGGCGC
TGTACATGCAGGATCGCTATCTCGGCAAGATGAACCGTGTCGCTGACGATATTACTGTCGCGCCGGAATATCTGGAGGAG
AGCAACGGTCAGGCATGGGCGCGCGGCGGTGCGGGCGACCGTCTGCTGATGTACGCGCAGCTGAAGGAGTGGGCAGAGAA
AAACTTTGATATCAAACAGTGGTATCCAGAAGGTGACCTGCCTAAGTTCTACAGCGATCGTGAAGGGATGAAAGGCTGGA
ACCTGTTCCAGTTGATGCACCGTAAAGCACGCGGCGATGATGTTGGCAAAACCAAGTTTGGCGAAAGAAATTACTGTGCC
GAATCCAACGGTAACGCTGCCGACAAGCTGATGCTGTGTGCCTCCTGGGTCGCTCAGACGGATCTTTCGGAGTTCTTTAA
GAAATGGAATCCGGGCGCGAATGCTTACCAGTTACCGGGAGCGAGCGAGATGAACTTCGAGGGCGGTGTGAGCCAGTCGG
CTTACGAGACGCTGGCGGCGCTTAATCTGCCGAAACCGCAGCAAGGGCCGGAAACCATTAATCAGGTTACCGAGCATAAG
ATGTCAGCTGAGTAA

Protein sequence :
MSLALLMNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPTP
DPEPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFTPGEDVTCVAGNTTIATFDTQSEAARSLRAVEKVSFSLE
DAQELAASDDKKSNAVSLVTSSNSCPANTEQVCLTFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKAPSTH
TSPVVPVTTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSMGNGVVGVNYYTSSGRGVTGENGKFNFSWGETIS
FGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSQAGKNDEREVPDVVRKVFAEYPNVINEIINLSLSNGEALS
EGDQTFERTNEFLEQFESGQAKEIDTAICDSLGGCNSQRWFSLTARNVNEGQIQGVINKLWGVDKDYKSVTKFHVFHDST
NFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSLVEPENVTRDTATFNLPFISLGQV
GEGKLMVIGNPHYNSILRCPNGYSWEGGVDKNGQCTRNSDSNDMKHFMQNVLRYLSDDKWTPDAKASMTVGTNLDTVYFK
RHGQVTGNSAEFGFHPDFAGISVEHLSSYGDLDPQEMPLLILNGFEYVTQVGNDPYAIPLRADTSKPKLTQQDVTDLIAY
LNKGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPNRVRQQRATGIWVYERYPAVDGALPYTID
SKTGEVKWKYQVENKPDDKPKLEVASWLEDVDGKQETRYAFIDEADHKTEDSLKAAKAKIFEKFPGLKECKDPTYHYEVN
CLEYRPGTGVPVTGGMYVPQYTQLSLNADTAKAMVQAADLGTNIQRLYQHELYFRTNGRKGERLSSVDLERLYQNMSVWL
WNKIEYRYENDKDDELGFKTFTEFLNCYANDAYTGGTQCSDELKKSLVDNNMIYGEKSVNKAGMMNPSYPLNYMEKPLTR
LMLGRSWWDLNIKVDVEKYPGAVSAEGEKVTETISLYSNPTKWFAGNMQSTGLWAPAQKEVTIESSASVPVTVTVALADD
LTGREKHEVALNRPPKVTKTYDLKANDKVTFKVPYGGLIYIKGNSPKNESAEFTFTGVVKAPFYKDGEWKNALNSPAPLG
ELESDAFVYTTPKKNLEASNFTGGVAEFAKDLDTFASSMNDFYGRNDEDGKHRMFTYKNLTGHKHRFTNDVQISIGDAHS
GYPVMNSSFSTNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYLEE
SNGQAWARGGAGDRLLMYAQLKEWAEKNFDIKQWYPEGDLPKFYSDREGMKGWNLFQLMHRKARGDDVGKTKFGERNYCA
ESNGNAADKLMLCASWVAQTDLSEFFKKWNPGANAYQLPGASEMNFEGGVSQSAYETLAALNLPKPQQGPETINQVTEHK
MSAE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed CAE85238.1 hypothetical protein Not tested PAI V 536 Protein 0.0 98
VC0395_A0370 YP_001216326.1 lipoprotein Not tested VPI-1 Protein 0.0 49
VC0845 NP_230493.1 hypothetical protein Not tested VPI-1 Protein 0.0 49
acfD AAK20802.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 49
acfD ACK75664.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75670.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75649.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75655.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75652.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75646.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75661.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75667.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48
acfD ACK75658.1 accessory colonization factor AcfD Virulence VPI Protein 0.0 48

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
yghJ YP_002409370.1 inner membrane lipoprotein VFG0106 Protein 0.0 49