Gene Information

Name : NRG857_14720 (NRG857_14720)
Accession : YP_006121288.1
Strain : Escherichia coli NRG 857C
Genome accession : NC_017634
Putative virulence/resistance : Virulence
Product : inner membrane lipoprotein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3115357 - 3119787 bp
Length : 4431 bp
Strand : -
Note : -
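
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the listed length but is not stated in the record. The function name `gene_length` is illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1

# Coordinates taken from the Position field of this record.
start, end = 3115357, 3119787

length_bp = gene_length(start, end)
codons = length_bp // 3          # number of complete codons, stop codon included
residues = codons - 1            # amino acids once the stop codon is excluded

print(length_bp)   # 4431
print(codons)      # 1477
print(residues)    # 1476
```

The result agrees with the Length field (4431 bp), and the length is a multiple of three, as expected for a complete coding sequence.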

DNA sequence :
TTGCCGGAAGTGAAACCCGATCCAACACCAACCCCGGAGCCGACACCTGAGCCGACGCCGGACCCAGAACCTACGCCGGA
TCCAACACCTGATCCTGAGCCGACACCAGAACCGGAGCCAGAACCTGTTCCTACGAAAACGGGTTATCTGACCCTGGGCG
GAAGCCAGCGGGTAACTGGTGCTACCTGTAATGGTGAATCCAGCGATGGCTTTACCTTTACGCCAGGCAATACCGTGAGT
TGTGTGGTGGGCAGTACGACCATTGCAACATTCAACACCCAGTCAGAAGCTGCGCGTAGCCTGCGTGCGGTTGACAAAGT
GTCGTTTAGCCTGGAGGACGCGCAGGAGCTGGCGAATTCTGAAAATAAGAAAACCAACGCCATCTCTCTGGTGACGTCCA
GCGACAGTTGCCCCGCAGATGCAGAACAGCTTTGTCTTACTTTCTCGTCAGTGGTTGATCGCGCGCGATTTGAAAAACTG
TATAAGCAAATTGATCTGGCAACAGACAATTTCAGCAAGCTGGTCAATGAAGAGGTGGAAAACAATGCTGCGACTGATAA
AGCGCCGTCCACCCATACCTCAACGGTAGTGCCAGTCACGACAGAGGGAACAAAACCGGATCTGAACGCGTCCTTCGTGT
CGGCTAACGCGGAACAGTTTTATCAGTATCAACCCACTGAAATCATTCTTTCCGAAGGCCAACTGGTGGATAGCCTGGGG
AACGGTGTTGCTGGCGTTGACTACTACACCAATTCAGGCCGTGGCGTAACTGACGAAAACGGTAAATTCTCCTTTAGCTG
GGGCGAAACCATCTCCTTTGGTATCGATACCTTTGAACTGGGCTCAGTACGTGGCAATAAGTCGACCATTGCGCTGACTG
AATTGGGTGATGAAGTTCGCGGGGCAAATATCGATCAGCTCATTCATCGTTATTCGACGACTGGTCAAAATAATACTCGT
GTTGTTCCGGACGATGTACGCAAGGTCTTTGCCGAATATCCCAACGTGATCAACGAGATAATCAATCTCTCGTTATCCAA
TGGTGCGACGCTGGATGAAGGCGATCAAAATGTTGTGCTGCCTAACGAATTTATCGAGCAGTTTAAGACGGGTCAGGCCA
AAGAGATCGATACCGCGATTTGTGCGAAAACCGACGGTTGTAACGAGGCTCGCTGGTTCTCGCTGACAACGCGCAATGTT
AATGACGGCCAGATTCAGGGCGTTATTAACAAGCTGTGGGGCGTGGATACGAACTATCAGTCTGTCAGCAAGTTCCACGT
CTTCCATGACTCTACCAACTTCTATGGCAGCACCGGTAACGCGCGCGGTCAGGCGGTGGTAAATATCTCCAACTCGGCAT
TCCCGATTCTGATGGCGCGTAATGATAAAAACTACTGGCTGGCGTTTGGCGAAAAACGCGCCTGGGATAAAAATGAGCTG
GCGTACATTACGGAAGCGCCTTCCATTGTGCAGCCAGAGAACGTTACGCGCGATACTGCGACTTTCAACCTGCCGTTTAT
TTCGCTGGGGCAAGTCGGTGAAGGCAAACTGATGGTTATCGGTAACCCGCACTACAACAGCATCCTGCGTTGCCCGAACG
GTTACAGTTGGGGCGGTGGTGTTAATAGTAAAGGTGAGTGTACGCTCAGCGGTGATTCTGATGACATGAAGCACTTTATG
CAGAACGTCCTGCGCTACTTGTCAAATGACATCTGGCAGCCAAATACCAAGAGCATCATGACTGTCGGCACCAACCTGGA
GAACGTTTATTTCAAAAAAGCGGGCCAGGTATTGGGAAATAGTGCACCATTTGCTTTCCATGAGGATTTCACTGGTATCA
CGGTTAAACAGTTGACCAGCTATGGCGATCTGAATCCGGAAGAGATTCCGTTGCTGATCCTCAACGGCTTTGAATATGTG
ACTCAGTGGTCTGGCGATCCCTATGCTGTGCCTCTGCGTGCAGATACCAGCAAACCGAAGCTGACTCAGCAGGATGTGAC
CGATCTGATCGCTTATCTGAACAAAGGTGGCTCGGTGCTGATCATGGAAAACGTGATGAGCAATCTTAAGGAAGAGAGCG
CGTCCAGTTTTGTGCGTCTGCTGGATGCCGCGGGTCTGTCAATGGCTCTGAACAAATCGGTGGTGAACAACGATCCGCAA
GGGTATCCGGATCGCGTTCGTCAGCGTCGCGCGACTGGCATTTGGGTTTATGAACGTTATCCTGCTGCAGACGGCGCGCA
ACCGCCGTACACCATCGACCCAAATACAGGGGAAGTGACCTGGAAATACCAGCAAGACAACAAGCCTGATGACAAGCCGA
AACTGGAAGTTGCGAGCTGGCAGGAGGAAGTTGAGGGCAAACAGGTAACGCGTTATGCCTTTATTGATGAAGCGGAATAC
ACAACAGAAGAATCTCTGGAAGCGGCAAAGGCAAAAATCTTTGAGAAGTTTCCTGGGTTACAGGAGTGTAAGGACTCGAC
TTACCATTACGAGATTAACTGTTTGGAGCGCCGCCCAGGCACGGATGTTCCGGTAACAGGTGGCATGTATGTTCCGCGCT
ATACGCAACTGAATCTTGACGCCGACACCGCGAAAGCGATGGTGCAGGCGGCGGATTTAGGCACCAACATTCAGCGCCTG
TATCAGCATGAGCTTTATTTCCGTACCAAAGGCAGTAAAGGTGAGCGTCTGAACAGTGTTGATCTGGAACGTCTGTACCA
GAACATGTCGGTCTGGCTGTGGAACGATACGAAATATCGTTACGAAGAGGGCAAGGAAGATGAGCTGGGCTTTAAAACGT
TCACCGAGTTCCTGAACTGCTACGCCAATGATGCCTATGCAGGCGGCACCAAGTGCTCCGCAGATCTGAAAAAATCGCTG
GTCGATAACAACATGATCTACGGTGACGGTAGCAGCAAAGCGGGCATGATGAACCCAAGCTATCCGCTCAACTATATGGA
AAAACCGCTGACGCGTCTGATGCTGGGCCGTTCCTGGTGGGATCTGAACATTAAGGTTGATGTGGAGAAGTACCCTGGAG
CGGTATCTGTAGGGGGAGAAGAGGTTACTGAAACCATCAGCCTGTACTCGAATCCGACCAAATGGTTTGCAGGTAACATG
CAGTCAACTGGCCTGTGGGCACCGGCTCAGAAAGAGGTCACCATTAAGTCCAATGCGAACGTTCCTGTGACCGTCACCGT
GGCGCTGGCTGACGACCTGACCGGACGTGAGAAGCATGAAGTTGCGCTGAACCGTCCGCCAAGAGTGACTAAAACGTACT
CTCTGGACGCTAGCGGTACGGTGAAGTTCAAGGTGCCTTACGGTGGCCTGATTTATATCAAGGGCAATAGCTCTACCAAT
GAATCTGCCAGCTTCACCTTTACTGGCGTGGTAAAAGCACCGTTCTATAAAGACGGCGCATGGAAAAACGATCTGAACTC
TCCTGCCCCGCTGGGCGAACTGGAGTCTGCGTCGTTCGTCTATACCACACCGAAGAAGAACCTGAATGCCAGCAATTACA
CGGGCGGACTGGATCAATTCGCTAAAGATCTGGATACCTTTGCCAGCTCGATGAATGATTTCTACGGTCGTAATGATGAA
GACGGTAAGCACCGGATGTTTACCTATAAAAACTTGACGGGCCACAAGCATCGTTTCACAAACGATGTGCAGATCTCCAT
CGGTGATGCGCACTCTGGTTATCCGGTAATGAACAGCAGCTTCTCGACGAACAGCACCACGCTGCCGACGACGCCGCTGA
ACGACTGGCTGATTTGGCACGAAGTCGGTCATAACGCTGCAGAAACACCGCTGAACGTACCGGGTGCAACTGAAGTGGCG
AACAACGTGCTGGCGCTGTACATGCAGGATCGCTATCTCGGCAAGATGAACCGTGTCGCTGACGACATTACCGTCGCGCC
GGAATATCTGGAGGAGAGCAACGGTCAGGCATGGGCGCGCGGCGGTGCGGGTGACCGTCTGCTGATGTACGCGCAGCTGA
AAGAGTGGGCAGAGAAAAACTTTGATATCAAACAGTGGTATCCAGAAGGTGACCTGCCTAAGTTCTACAGCGATCGTAAA
GGGATGAAGGGCTGGAACCTGTTCCAGTTGATGCACCGTAAAGCGCGCGGCGATGATGTCAGCAATGACAAGTTTGGCGG
CAGAAATTACTGTGCTGAGTCAAACGGTAACGCTGCTGACACGCTGATGCTGTGTGCATCCTGGGTCGCTCAGGCGGATC
TTTCGGAATTCTTTAAGAAATGGAATCCGGGCGCAAATGCTTACCAGCTTCCGGGGGCAAGTGAGATGAGCTTCGAAGGC
GGAGTGAGCCAGTCGGCTTACAACACGCTCGCGTCGCTCAAGCTGCCGAAACCGGAACAGGGGCCGGAAACCATTAACAA
GGTTACCGAGCATAAGATGTCTGCCGAGTAA

Protein sequence :
MPEVKPDPTPTPEPTPEPTPDPEPTPDPTPDPEPTPEPEPEPVPTKTGYLTLGGSQRVTGATCNGESSDGFTFTPGNTVS
CVVGSTTIATFNTQSEAARSLRAVDKVSFSLEDAQELANSENKKTNAISLVTSSDSCPADAEQLCLTFSSVVDRARFEKL
YKQIDLATDNFSKLVNEEVENNAATDKAPSTHTSTVVPVTTEGTKPDLNASFVSANAEQFYQYQPTEIILSEGQLVDSLG
NGVAGVDYYTNSGRGVTDENGKFSFSWGETISFGIDTFELGSVRGNKSTIALTELGDEVRGANIDQLIHRYSTTGQNNTR
VVPDDVRKVFAEYPNVINEIINLSLSNGATLDEGDQNVVLPNEFIEQFKTGQAKEIDTAICAKTDGCNEARWFSLTTRNV
NDGQIQGVINKLWGVDTNYQSVSKFHVFHDSTNFYGSTGNARGQAVVNISNSAFPILMARNDKNYWLAFGEKRAWDKNEL
AYITEAPSIVQPENVTRDTATFNLPFISLGQVGEGKLMVIGNPHYNSILRCPNGYSWGGGVNSKGECTLSGDSDDMKHFM
QNVLRYLSNDIWQPNTKSIMTVGTNLENVYFKKAGQVLGNSAPFAFHEDFTGITVKQLTSYGDLNPEEIPLLILNGFEYV
TQWSGDPYAVPLRADTSKPKLTQQDVTDLIAYLNKGGSVLIMENVMSNLKEESASSFVRLLDAAGLSMALNKSVVNNDPQ
GYPDRVRQRRATGIWVYERYPAADGAQPPYTIDPNTGEVTWKYQQDNKPDDKPKLEVASWQEEVEGKQVTRYAFIDEAEY
TTEESLEAAKAKIFEKFPGLQECKDSTYHYEINCLERRPGTDVPVTGGMYVPRYTQLNLDADTAKAMVQAADLGTNIQRL
YQHELYFRTKGSKGERLNSVDLERLYQNMSVWLWNDTKYRYEEGKEDELGFKTFTEFLNCYANDAYAGGTKCSADLKKSL
VDNNMIYGDGSSKAGMMNPSYPLNYMEKPLTRLMLGRSWWDLNIKVDVEKYPGAVSVGGEEVTETISLYSNPTKWFAGNM
QSTGLWAPAQKEVTIKSNANVPVTVTVALADDLTGREKHEVALNRPPRVTKTYSLDASGTVKFKVPYGGLIYIKGNSSTN
ESASFTFTGVVKAPFYKDGAWKNDLNSPAPLGELESASFVYTTPKKNLNASNYTGGLDQFAKDLDTFASSMNDFYGRNDE
DGKHRMFTYKNLTGHKHRFTNDVQISIGDAHSGYPVMNSSFSTNSTTLPTTPLNDWLIWHEVGHNAAETPLNVPGATEVA
NNVLALYMQDRYLGKMNRVADDITVAPEYLEESNGQAWARGGAGDRLLMYAQLKEWAEKNFDIKQWYPEGDLPKFYSDRK
GMKGWNLFQLMHRKARGDDVSNDKFGGRNYCAESNGNAADTLMLCASWVAQADLSEFFKKWNPGANAYQLPGASEMSFEG
GVSQSAYNTLASLKLPKPEQGPETINKVTEHKMSAE
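
The DNA sequence begins with TTG rather than ATG, yet the protein begins with M. This is consistent with TTG acting as an alternative initiation codon in bacteria, where the initiator is read as methionine. The following minimal sketch translates the first and last codons of the DNA sequence listed above and compares them with the ends of the protein sequence. The codon table is the standard genetic code. The set `START_CODONS` is a simplification assumed here (only ATG, GTG and TTG), and the function name `translate_cds` is illustrative, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (last base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a reduced set of bacterial initiation codons, each read as Met.
START_CODONS = {"ATG", "GTG", "TTG"}

def translate_cds(dna: str, has_start: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiation codon at position 0 is translated as methionine.
        if i == 0 and has_start and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 15 nt of the DNA sequence in this record.
head = "TTGCCGGAAGTGAAACCCGATCCAACACCA"
tail = "ATGTCTGCCGAGTAA"

print(translate_cds(head))                   # MPEVKPDPTP
print(translate_cds(tail, has_start=False))  # MSAE
```

Both fragments match the corresponding ends of the protein sequence (MPEVKPDPTP... and ...KMSAE), and the final TAA is the stop codon.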

• Homologs from PAI DB

Gene          GenBank Accn    Product                             Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity
unnamed       CAE85238.1      hypothetical protein                Not tested               PAI V 536   Protein         0.0    87
VC0395_A0370  YP_001216326.1  lipoprotein                         Not tested               VPI-1       Protein         0.0    49
VC0845        NP_230493.1     hypothetical protein                Not tested               VPI-1       Protein         0.0    49
acfD          AAK20802.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75646.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75664.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75670.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75661.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75667.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75658.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    49
acfD          ACK75655.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75649.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48
acfD          ACK75652.1      accessory colonization factor AcfD  Virulence                VPI         Protein         0.0    48

• Homologs from VFDB (virulence genes)

Gene          GenBank Accn    Product                     ID of source DB  Alignment Type  E-val  Identity
NRG857_14720  YP_006121288.1  inner membrane lipoprotein  VFG0106          Protein         0.0    49