Gene Information

Name : yghJ (EC55989_3382)
Accession : YP_002404344.1
Strain : Escherichia coli 55989
Genome accession : NC_011748
Putative virulence/resistance : Virulence
Product : inner membrane lipoprotein
Function : -
COG functional category : M : Cell wall/membrane/envelope biogenesis
COG ID : COG0810
EC number : -
Position : 3462403 - 3466986 bp
Length : 4584 bp
Strand : -
Note : Evidence 2b : Function of strongly homologous gene; PubMedId : 1644747; Product type lp : lipoprotein
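
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the listed length but is not stated in the record; the function name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
start, end = 3462403, 3466986

length = feature_length(start, end)

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(length, 3)

# Assumption: the final codon is a stop codon and is not counted as a residue.
residues = codons - 1

print(length, codons, remainder, residues)
```

Under these assumptions the computed length agrees with the Length field (4584 bp), and the codon count leaves no remainder.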

DNA sequence :
TTGTCACTTGCGTTATTAATGAATAAGAAATTTAAATATAAGAAATCGCTTTTAGCGGCTATTTTGAGTGCAACCCTGTT
AGCCGGTTGTGATGGCGGTGGCTCCGGATCTTCCTCCGATACGCCGCCTGTAGATTCTGGAACAGGGTCTTTGCCGGAAG
TGAAACCTGATCCAACACCAAACCCGGAGCCGACACCTGAGCCAACGCCGGACCCAGAACCTACGCCGGAACCGACACCT
GATCCTGAGCCAACACCAGAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACTCTGGGCGGAAGCCTGCG
GGTAACTGGTGATATCACCTGTAATGATGAATCCAGCGATGGCTTTACCTTTACACCAGGCGACAAAGTCACCTGTGTGG
CAGGGAACAACACGACAATTGCTACCTTCGACACCCAGTCAGAAGCTGCGCGTAGCCTGCGTGCGGTTGAAAAAGTGTCG
TTTAGTCTTGAGGACGCGCAAGAACTGGCGGGTTCCGACAACAAGAAAAGCAATGCGCTCTCGCTGGTCACCTCCATGAA
CAGTTGCCCGGCGAATACAGAACAGGTGTGCCTGGAGTTCTCCTCGGTGATCGAGAGTAAACGTTTCGACTCGCTGTATA
AGCAAATCGATCTGGCACCGGAAGAATTCAAAAAGCTGGTCAATGAAGAGGTGGAAAACAATGCCGCGACCGATAAAGCG
CCATCCACTCATACTTCACCGGTCGTGCCCGCCACCACTCCGGGAACAAAACCGGATCTAAACGCTTCCTTCGTGTCGGC
TAACGCGGAACAGTTTTATCAGTATCAACCCACTGAAATCATTCTCTCTGAAGGTCGACTGGTCGATAGTCAGGGGGATG
GTGTTGTTGGTGTCAACTATTACACCAATTCCGGCCGTGGTGTAACCGGAGAAAACGGGGAATTTTCCTTTAGTTGGGGG
GAAACCATCTCCTTTGGCATCGACACTTTTGAGCTTGGTTCTGTGCGTGGTAACAAGTCGACTATTGCATTGACTGAACT
GGGTGATGAAGTTCGCGGGGCAAATATCGATCAGTTGATTCACCGCTATTCGAAGGCTGGACAAAATCACACGCGTGTAG
TTCCGGATGAAGTGCGCAAGGTTTTTGCTGAATATCCCAACGTGATTAACGAGATTATCAATCTCTCGTTATCCAATGGT
GCGACGCTGGGGGAAGGTGAGCAAGTCGTTAATCTGCCTAACGAATTTATCGAGCAGTTTAAGACGGGTCAGGCCAAAGA
GATCGATACCGCGATTTGTGCGAAAACCGATGGTTGTAACGAGGCTCGCTGGTTCTCGCTGACAACGCGCAATGTTAATG
ACGGCCAGATTCAGGGCGTTATCAACAAGCTGTGGGGCGTGGATACGAACTACAAATCTGTCAGTAAGTTCCATGTATTC
CATGACTCTACCAACTTCTATGGCAGCACGGGTAATGCGCGCGGTCAGGCGGTGGTGAATATCTCCAACGCGGCCTTCCC
GATTCTGATGGCGCGTAATGATAAAAACTACTGGCTGGCGTTTGGCGAAAAACGCGCCTGGGATAAAAATGAGCTGGCGT
ACATTACTGAAGCGCCTTCCATTGTGCGACCAGAGAACGTGACACGCGAAACCGCCACCTTCAACCTGCCGTTTATCTCG
CTGGGGCAAGTGGGCGATGGCAAGCTGATGGTTATCGGTAACCCACACTACAACAGCATCCTGCGTTGCCCGAACGGCTA
CAGCTGGAACGGGGGCGTTAATAAAGACGGACAGTGTACGCTCAACAGCGACCCGGATGACATGAAGAACTTCATGGAGA
ACGTGCTGCGCTATCTGTCAAATGATCGCTGGTTGCCGGATGCAAAATCCAGTATGACCGTGGGTACTAACCTGGACACG
GTATATTTCAAAAAACATGGTCAGGTGCTGGGAAATAGCGCACCGTTTGCGTTCCACAAGGATTTCACTGGCATCACGGT
CAAACCAATGACCAGCTATGGCAATCTGAATCCAGATGAAGTTCCTCTGTTGATCCTCAATGGCTTTGAATACGTCACAC
AATGGGGTAGCGATCCTTACTCCATTCCTCTGCGCGCAGATACCAGCAAACCGAAGCTGACCCAGCAGGATGTGACCGAT
TTGATCGCCTATATGAACAAAGGTGGATCGGTGCTGATCATGGAAAACGTGATGAGCAATCTTAAGGAAGAGAGCGCATC
TGGCTTTGTACGTCTGCTTGATGCCGCAGGTTTGTCGATGGCGCTTAACAAGTCGGTAGTAAATAACGATCCGCAAGGCT
ACCCTGACCGTGTTCGTCAACGACGTTCAACGCCAATTTGGGTCTATGAGCGTTATCCGGCTGTCGATGGTAAACCACCG
TATACCATTGATGACACCACGAAAGAAGTTATCTGGAAATATCAGCAAGAAAACAAACCTGATGACAAACCGAAGCTGGA
AGTTGCCAGCTGGCAGGAAGAAGTTGAGGGTAAACAGGTAACCCAATTCGCCTTTATTGATGAAGCCGACCACAAAACGC
CTGAGTCACTGGCTGCGGCAAAACAGAGAATTCTGGACGCGTTCCCAGGGCTGGAAGTGTGTAAGGATTCTGACTATCAC
TATGAGGTCAACTGTCTGGAATACCGCCCAGGCACGGGTGTGCCGGTAACCGGTGGCATGTATGTTCCGCAGTATACGCA
GCTGGATCTTGGAGCTGACACTGCGAAAGCGATGCTGCAGGCTGCGGATTTAGGCACCAATATTCAGCGCCTGTATCAGC
ATGAGCTTTATTTCCGTACCAATGGCCTCCAGGGTGAGCGTCTCAACAGCGTTGATCTGGAACGTTTATACCAAAACATG
TCCGTCTGGCTGTGGAACGAGACGAAATATCGTTATGAAGAGGGTAAAGAAGACGAGCTGGGCTTTAAAACGTTCACTGA
GTTTCTGAACTGCTACACCAACAATGCATACGTTGGCACGCAGTGTTCTGCTGAGCTGAAAAAATCGCTGATCGATAACA
AGATGATTTATGGTGAAGAAAGCAGCAAAGCGGGCATGATGAACCCGAGCTACCCGCTCAACTATATGGAAAAACCGCTG
ACACGCCTGATGCTGGGCCGTTCCTGGTGGGATCTGAACATCAAAGTTGATGTTGAGAAGTATCCGGGAGTGGTGAATAC
AAACGGCGAAACCGTCACACAAAACATTAACTTGTACTCAGCTCCAACCAAATGGTTTGCAGGTAACATGCAGTCAACTG
GCCTGTGGGCACCTGCCCAGCAGGAAGTCAGCATTGAGTCAAAGGCGACAGTTCCTGTGACCGTGACTGTTGCGCTGGCC
GACGACCTGACAGGACGAGAGAAGCATGAAGTTAGCCTGAATCGTCCACCCAGAGTGACAAAAACCTATGACCTGAAAGC
CAATGATAAGGTGACGTTCAAAGTCCCTTACGGTGGTCTGATTTACATCAAGGGCGACAGCAAAGAGGTGCAATCAGCTG
ACTTCACCTTTACCGGTGTAGTAAAAGCGCCGTTCTATAAAGACGGTAAGTGGCAACACGATCTGAACTCCCCTGCCCCG
CTGGGCGAACTGGAGTCTGCCTCGTTCGTCTATACCACACCGAAGAAGAACCTGAATGCCAGCAATTACACTGGCGGACT
GGAGCAATTCGCTAACGATCTGGATACCTTTGCCAGCTCGATGAATGACTTCTACGGCCGTGATAGCGAAGACGGTAAGC
ACCGGATGTTTACCTATAAAAACTTGCCGGGCCACAAACATCGTTTCGCCAACGATGTGCAGATCTCCATCGGTGATGCG
CATTCGGGTTATCCGGTAATGAACAGCAGCTTCTCGCCGAACAGCACCACGCTGCCGACGACGCCGCTGAACGACTGGCT
GATCTGGCATGAAGTCGGTCATAACGCCGCAGAAACGCCGTTGACTGTACCGGGTGCAACTGAAGTCGCTAACAACGTGC
TGGCGCTGTACATGCAGGATCGTTATCTCGGCAAGATGAACCGTGTCGCTGACGATATTACCGTCGCACCGGAATATCTG
GAGGAAAGCAACGGTCAGGCATGGGCGCGCGGCGGTGCGGGTGACCGTCTGCTGATGTACGCACAGCTGAAGGAATGGGC
AGAGAAAAACTTTGATATCAAGAAATGGTATCCAGATGGCACTCCTCTGCCAGAGTTTTACAGCGAGCGTGAAGGGATGA
AAGGCTGGAACCTGTTCCAGTTGATGCATCGTAAAGCACGCGGCGATGAGGTCAGCAATGACAAGTTTGGCGGCAAGAAT
TACTGTGCTGAATCCAACGGTAACGCAGCGGACACGCTGATGCTGTGTGCCTCCTGGGTCGCCCAGACGGATCTTTCGGA
GTTCTTTAAGAAATGGAATCCGGGCGCGAATGCTTACCAGTTGCCGGGAGCGACGGAGATGAGCTTCGAAGGCGGTGTGA
GCCAGTCGGCGTACAACACGCTGGCGTCACTCAATCTGCCGAAACCGAAGCAAGGGCCGGAAACCATTAACAAGGTTACC
GAGTATTCGATGCCTGCTGAATAA

Protein sequence :
MSLALLMNKKFKYKKSLLAAILSATLLAGCDGGGSGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPTP
DPEPTPEPEPEPVPTKTGYLTLGGSLRVTGDITCNDESSDGFTFTPGDKVTCVAGNNTTIATFDTQSEAARSLRAVEKVS
FSLEDAQELAGSDNKKSNALSLVTSMNSCPANTEQVCLEFSSVIESKRFDSLYKQIDLAPEEFKKLVNEEVENNAATDKA
PSTHTSPVVPATTPGTKPDLNASFVSANAEQFYQYQPTEIILSEGRLVDSQGDGVVGVNYYTNSGRGVTGENGEFSFSWG
ETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSKAGQNHTRVVPDEVRKVFAEYPNVINEIINLSLSNG
ATLGEGEQVVNLPNEFIEQFKTGQAKEIDTAICAKTDGCNEARWFSLTTRNVNDGQIQGVINKLWGVDTNYKSVSKFHVF
HDSTNFYGSTGNARGQAVVNISNAAFPILMARNDKNYWLAFGEKRAWDKNELAYITEAPSIVRPENVTRETATFNLPFIS
LGQVGDGKLMVIGNPHYNSILRCPNGYSWNGGVNKDGQCTLNSDPDDMKNFMENVLRYLSNDRWLPDAKSSMTVGTNLDT
VYFKKHGQVLGNSAPFAFHKDFTGITVKPMTSYGNLNPDEVPLLILNGFEYVTQWGSDPYSIPLRADTSKPKLTQQDVTD
LIAYMNKGGSVLIMENVMSNLKEESASGFVRLLDAAGLSMALNKSVVNNDPQGYPDRVRQRRSTPIWVYERYPAVDGKPP
YTIDDTTKEVIWKYQQENKPDDKPKLEVASWQEEVEGKQVTQFAFIDEADHKTPESLAAAKQRILDAFPGLEVCKDSDYH
YEVNCLEYRPGTGVPVTGGMYVPQYTQLDLGADTAKAMLQAADLGTNIQRLYQHELYFRTNGLQGERLNSVDLERLYQNM
SVWLWNETKYRYEEGKEDELGFKTFTEFLNCYTNNAYVGTQCSAELKKSLIDNKMIYGEESSKAGMMNPSYPLNYMEKPL
TRLMLGRSWWDLNIKVDVEKYPGVVNTNGETVTQNINLYSAPTKWFAGNMQSTGLWAPAQQEVSIESKATVPVTVTVALA
DDLTGREKHEVSLNRPPRVTKTYDLKANDKVTFKVPYGGLIYIKGDSKEVQSADFTFTGVVKAPFYKDGKWQHDLNSPAP
LGELESASFVYTTPKKNLNASNYTGGLEQFANDLDTFASSMNDFYGRDSEDGKHRMFTYKNLPGHKHRFANDVQISIGDA
HSGYPVMNSSFSPNSTTLPTTPLNDWLIWHEVGHNAAETPLTVPGATEVANNVLALYMQDRYLGKMNRVADDITVAPEYL
EESNGQAWARGGAGDRLLMYAQLKEWAEKNFDIKKWYPDGTPLPEFYSEREGMKGWNLFQLMHRKARGDEVSNDKFGGKN
YCAESNGNAADTLMLCASWVAQTDLSEFFKKWNPGANAYQLPGATEMSFEGGVSQSAYNTLASLNLPKPKQGPETINKVT
EYSMPAE
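
The protein sequence above can be related to the DNA sequence by a codon-by-codon translation. The following is a minimal sketch, not the method used by the database: it assumes the DNA is shown in the coding orientation (the gene is annotated on the minus strand, but the listed sequence begins with a TTG start codon), that the standard codon assignments apply, and that the initiator codon is read as methionine. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a complete coding sequence given in the coding orientation.

    Assumptions: the first codon is the initiator and is read as Met even
    when it is an alternative start such as TTG; a trailing stop codon is
    dropped from the returned protein.
    """
    dna = "".join(dna.split()).upper()  # remove line breaks and spaces
    if not dna or len(dna) % 3:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    residues[0] = "M"  # initiator codon read as methionine (assumption)
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First 30 and last 24 bases of the DNA sequence in this record.
n_terminal = translate_cds("TTGTCACTTGCGTTATTAATGAATAAGAAA")
c_terminal_codons = "GAGTATTCGATGCCTGCTGAATAA"
c_terminal = "".join(
    CODON_TABLE[c_terminal_codons[i:i + 3]]
    for i in range(0, len(c_terminal_codons), 3)
)
print(n_terminal, c_terminal)
```

Passing the full DNA sequence (with or without line breaks) to `translate_cds` applies the same procedure to the whole gene; the two fragments here correspond to the start and end of the listed protein sequence.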

• Homologs from PAI DB

Gene          GenBank Accn    Product                             Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity
unnamed       CAE85238.1      hypothetical protein                Not tested               PAI V 536   Protein         0.0    87
VC0395_A0370  YP_001216326.1  lipoprotein                         Not tested               VPI-1       Protein         0.0    50
acfD          AAK20802.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
VC0845        NP_230493.1     hypothetical protein                Not tested               VPI-1       Protein         0.0    49
acfD          ACK75670.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75655.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75652.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75649.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75646.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75664.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75661.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75667.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75658.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49

• Homologs from VFDB (virulence genes)

Gene  GenBank Accn    Product                     ID of source DB  Alignment Type  E-val  Identity
yghJ  YP_002404344.1  inner membrane lipoprotein  VFG0106          Protein         0.0    49