Gene Information

Name : ECO103_5084 (ECO103_5084)
Accession : YP_003224897.1
Strain : Escherichia coli 12009
Genome accession: NC_013353
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG4365
EC number : -
Position : 5287467 - 5288288 bp
Length : 822 bp
Strand : +
Note : Integrative element ECO103_IE06

DNA sequence :
ATGCGATTAGCCAGCCGCTTTGGTCGTATAAACCAGATACGCCGTGACCGGCCTCTGACCCATGAAGAGCTGATGAGTCA
TGTACCCAGCGTGTTCGGGAGTGACAAGCATGAATCTCGTTCAGACCGTTATACCTACATCCCGACCATCACCATCCTCG
AAAGCCTGCAGCGTGAAGGCTTTGAGCCGTTCTTTGCCTGCCAGACAAAGGTCCGTGACCAGAGCAAGCGGGAACACACC
AAACACATGCTTCGCCTGCATCGTGCCGGTCAACTTACCGGGCATCAGGTACCGGAAATCATTCTGCTTAACAGCCACGA
CGGCTCATCAAGCTACCAGATGCTGCCGGGGCTGTTTCGTGGCGTTTGTACCAACGGCCTGGTCTGCGGTCAGTCGTTCG
GAGAGGTACGGGTGCCACATAAAGGCAATGTCGTTGAGAAGGTGATTGAAGGGGCTTACGAAGTGCTCGGGGTGTTTGAC
CGGGTGGAGGAGAAGCGCGATGCAATGCAGTCACTGGCGCTGCCAGCCCCGGCCCGTCACGCGCTGGCAAATGCTGCGTT
GGAATATCGCTTCGGTGAGGACCACCAGCCGGTCACGGTATCGCAGTTGCTGACCCCACGTCGCCGGGAGGACTACAGCG
ATGACCTGTGGACCGTCTACCAGCGCGTGCAGGAGAACCTGATGAAAGGTGGTCTGTCAGGGCGAACCGCTCAGGGGAAA
AGCAGCCGCACCCGTGCTGTTACCGGTATTGATGGCGATGTGAAGCTCAATCGCGCCCTGTGGGTGATGGCAGAAAACAT
GCTCGAGTTTTTCGGGCGTTAA

Protein sequence :
MRLASRFGRINQIRRDRPLTHEELMSHVPSVFGSDKHESRSDRYTYIPTITILESLQREGFEPFFACQTKVRDQSKREHT
KHMLRLHRAGQLTGHQVPEIILLNSHDGSSSYQMLPGLFRGVCTNGLVCGQSFGEVRVPHKGNVVEKVIEGAYEVLGVFD
RVEEKRDAMQSLALPAPARHALANAALEYRFGEDHQPVTVSQLLTPRRREDYSDDLWTVYQRVQENLMKGGLSGRTAQGK
SSRTRAVTGIDGDVKLNRALWVMAENMLEFFGR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
z1215 CAD33785.1 Z1215 protein Not tested PAI I 536 Protein 7e-100 77
yafZ CAE85199.1 YafZ protein Not tested PAI V 536 Protein 2e-99 76
Z1215 NP_286750.1 hypothetical protein Not tested TAI Protein 1e-98 76
unnamed CAD42096.1 hypothetical protein Not tested PAI II 536 Protein 1e-98 76
Z1655 NP_287158.1 hypothetical protein Not tested TAI Protein 1e-98 76
yafZ YP_854321.1 hypothetical protein Not tested PAI I APEC-O1 Protein 9e-94 75
ORF_47 AAZ04456.1 conserved hypothetical protein Not tested PAI I APEC-O1 Protein 7e-94 75
ECO103_3587 YP_003223444.1 hypothetical protein Not tested LEE Protein 3e-94 75
unnamed CAI43843.1 hypothetical protein Not tested LEE Protein 1e-94 75
S3199 NP_838482.1 hypothetical protein Not tested SHI-1 Protein 1e-94 75
aec71 AAW51754.1 Aec71 Not tested AGI-3 Protein 1e-94 75
SF2994 NP_708768.1 hypothetical protein Not tested SHI-1 Protein 1e-94 75
yafZ ADD91704.1 YafZ Not tested PAI-I AL862 Protein 1e-94 75
c5156 NP_757004.1 hypothetical protein Not tested PAI II CFT073 Protein 9e-94 74
unnamed CAD66201.1 hypothetical protein Not tested PAI III 536 Protein 3e-93 73
ORF C80 AAN62174.1 conserved hypothetical protein; associated with oriT region on plasmids Not tested PAGI-2(C) Protein 3e-74 56
ORF SG86 AAN62307.1 conserved hypothetical protein Not tested PAGI-3(SG) Protein 8e-72 54

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
ECO103_5084 YP_003224897.1 hypothetical protein VFG1526 Protein 4e-100 77
ECO103_5084 YP_003224897.1 hypothetical protein VFG1615 Protein 8e-99 76
ECO103_5084 YP_003224897.1 hypothetical protein VFG0658 Protein 5e-95 75
ECO103_5084 YP_003224897.1 hypothetical protein VFG1676 Protein 2e-93 73