Gene Information

Name : ECO103_3808 (ECO103_3808)
Accession : YP_003223653.1
Strain : Escherichia coli 12009
Genome accession: NC_013353
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG3436
EC number : -
Position : 3878931 - 3880469 bp
Length : 1539 bp
Strand : +
Note : Integrative element ECO103_IE04; predicted protein Orf3 in insertion sequence ISEc8

DNA sequence :
ATGAACGACATCTCTTCTGACGACATCTTCCTGCTGAAACAGCGCCTGGCCGAACAGGAAGCGCTGATCCACGCCCTGCA
GGAAAAGCTGAGCAACCGGGAGCGCGAAATAGACCATCTGCAGGCGCAGCTGGATAAACTCCGCCGGATGAACTTCGGCA
GTCGTTCCGAAAAAGTCTCCCGCCGTATCGCACAAATGGAAGCCGATCTGAACCGGCTTCAGAAAGAGAGCGATACGCTG
ACTGGTAGGGTGTATGACCCGGCAGTACAGCGTCCGTTGCGTCAGACCCGCCCCCGTAAGCCGTTCCCTGAATCACTACC
CCGTGACGAAAAGCGACTGTTGCCTGCGGCGCCGTGCTGCCCGAACTGCGGCGGTTCACTGAGCTATCTGGGCGAGGATA
CCGCCGAACAGCTGGAGTTGATGCGTAGCGCCTTCCGGGTTATCCGGACGGTACGGGAAAAACATGCCTGTACTCAGTGC
GATGCCATCGTGCAGGCACCTGCACCTTCGCGGCCCATCGAGCGGGGTATCGCCGGACCGGGGCTGCTGGCCCGCGTGCT
GACCTCGAAGTATGCAGAGCACACCCCGCTGTATCGCCAGTCAGAAATATACGGCCGGCAAGGTGTGGAGCTGAGCCGTT
CACTGCTGTCGGGCTGGGTGGATGCATGCTGCCGGCTGCTGTCTCCGCTGGAAGAGGCGCTTCATGGCTATGTCATGACT
GACGGCAAACTCCATGCCGATGATACCCCGGTCCAGGTACTGCTGCCGGGTAATAAGAAGACGAAGACCGGGCGGTTGTG
GGCGTATGTTCGTGATGACCGCAATGCCGGGTCAGCGTTGGCACCTGCAGTGAGGTTCGCTTACAGCCCGGACAGAAAAG
GCATCCATCCGCAGACTCATCTTGCCTGCTTCAGCGGTGTGCTGCAAGCGGATGCGTACGCCGGGTTCAACGAGCTGTAT
CGCAATGGTGGGATAACGGAAGCTGCCTGCTGGGCTCATGCCCGCCGAAAGATCCACGATGTGCACGTCCGCATCCCGTC
AGCACTGACGGAAGAAGCCCTGGAGCAGATCGGTCAGTTGTACGCCATAGAGGCGGATATAAGGGGAATGCCGGCAGAGC
AGCGGCTTGCTGAACGTCAGCGAAAAACGAAACCGCTGTTGAAATCCCTGGAAAGCTGGTTGCGTGAAAAGATGAAGACC
CTGTCGCGACACTCAGAGTTGGCGAAGGCGTTCGCGTACGCACTTAACCAGTGGCCGGCACTGACGTACTATGCGAACGA
TGGCTGGGTGGAAATCGACAACAACATCGCTGAAAATGCCCTGCGGGCGGTCAGTCTGGGTCGTAAAAACTTCCTGTTCT
TCGGCTCTGACCATGGTGGTGAGCGGGGAGCGCTAGTGTACAGCCTGATCGGGACGTGCAAACTGAATGACGTGGATCCA
GAAAGCTACCTTCGCCATGTGCCTGGCGTCATAGCAGACTGGCCGGTCAACCGGGTCAGCGAACTGCTTCCGTGGCGCAT
AGCACTGCCAGCTGAATAA

Protein sequence :
MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRMNFGSRSEKVSRRIAQMEADLNRLQKESDTL
TGRVYDPAVQRPLRQTRPRKPFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRTVREKHACTQC
DAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQSEIYGRQGVELSRSLLSGWVDACCRLLSPLEEALHGYVMT
DGKLHADDTPVQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVRFAYSPDRKGIHPQTHLACFSGVLQADAYAGFNELY
RNGGITEAACWAHARRKIHDVHVRIPSALTEEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKT
LSRHSELAKAFAYALNQWPALTYYANDGWVEIDNNIAENALRAVSLGRKNFLFFGSDHGGERGALVYSLIGTCKLNDVDP
ESYLRHVPGVIADWPVNRVSELLPWRIALPAE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed CAC39285.1 hypothetical protein Not tested LPA Protein 0.0 99
unnamed AAC31494.1 L0015 Not tested LEE Protein 0.0 99
ECs4547 NP_312574.1 hypothetical protein Not tested LEE Protein 0.0 99
st57 CAC81895.1 ST57 protein Not tested LEE II Protein 0.0 99
Z1131 NP_286666.1 hypothetical protein Not tested TAI Protein 0.0 99
Z1570 NP_287074.1 hypothetical protein Not tested TAI Protein 0.0 99
tnp AEA34686.1 transposase Not tested Not named Protein 0.0 99
Z4337 NP_289562.1 hypothetical protein Not tested OI-122 Protein 0.0 99
Z5098 NP_290249.1 prophage-associated protein Not tested LEE Protein 0.0 99
unnamed ACU09439.1 IS66 family element transposase Not tested LEE Protein 0.0 99
unnamed CAI43806.1 hypothetical protein Not tested LEE Protein 0.0 98
unnamed AAL57570.1 unknown Not tested LEE Protein 0.0 98
unnamed AAL08460.1 unknown Not tested SRL Protein 0.0 96
c3563 NP_755438.1 hypothetical protein Not tested PAI I CFT073 Protein 3e-163 92
Z4340 NP_289564.1 hypothetical protein Not tested OI-122 Protein 7e-150 64
Z1600 NP_287104.1 hypothetical protein Not tested TAI Protein 8e-154 63
Z1161 NP_286696.1 hypothetical protein Not tested TAI Protein 8e-154 63
s0025 CAD33772.1 IS66-like transposase Not tested PAI I 536 Protein 2e-105 62
aec53 AAW51736.1 Aec53 Not tested AGI-3 Protein 1e-139 60
tnp AEA34687.1 IS66 family transposase Not tested Not named Protein 6e-101 59
ECO103_3554 YP_003223421.1 hypothetical protein Not tested LEE Protein 5e-116 56
BCAM0248 YP_002232880.1 putative transposase Not tested BcenGI11 Protein 7e-127 55
Z4317 NP_289543.1 hypothetical protein Not tested OI-122 Protein 6e-116 54

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
ECO103_3808 YP_003223653.1 hypothetical protein VFG0793 Protein 0.0 99
ECO103_3808 YP_003223653.1 hypothetical protein VFG1051 Protein 0.0 96
ECO103_3808 YP_003223653.1 hypothetical protein VFG1700 Protein 1e-163 92
ECO103_3808 YP_003223653.1 hypothetical protein VFG1736 Protein 4e-123 63
ECO103_3808 YP_003223653.1 hypothetical protein VFG1513 Protein 1e-105 62