Gene Information

Name : ECO103_4908 (ECO103_4908)
Accession : YP_003224727.1
Strain : Escherichia coli 12009
Genome accession: NC_013353
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG3436
EC number : -
Position : 5090696 - 5091970 bp
Length : 1275 bp
Strand : -
Note : Integrative element ECO103_IE05; predicted protein Orf3 in insertion sequence IS682

DNA sequence :
TTGCGACAGTCGCGCCATCGTCGTCCGTTACCGGCACACCTTCCCCGTGAAATACAGCGTCTGGAATCTGAAGAAAGCTG
TTGCCCGGAGTGTGGCGGTGAGCTGGATTATCTGGGGGAAGTCAGTGCAGAACAACTGGAACTGGTGAGCAGCGCCCTGA
AAGTGATCCGCACAGAACGGGTAAAAAAAGCGTGTACAAAATGTGACTGCATCGTTGAAGCACCGGCGCCGTCCCGCCCG
ATAGAGCGTGGTATCGCGGGCCCCGGATTACTTGCCCGCGTGTTAACGGGAAAATACTGCGAACACCTGCCACTGTATCG
TCAGAGTGAAATCTTTGCCCGACAGGGTGTCGAACTGAGCCGTGCATTACTCTCCAACTGGGTTGACGCGTGCTGCCAGT
TAATGACGCCGCTGAATGATGCCCTGTACCGTTATGTGATGAATACCCGCAAGCTTCACACTGACGACACACCGGTAAAG
GTACTGGCACCGGGCCTGAAAAAGACGAAAACAGGGCGCATCTGGACGTATGTCCGGGATGATCGCAATGCGGGTTCGTC
ATCTCCTCCGGCGGTCTGGTTCGCGTACTCACCGAACCGGCAGGGGAAACACCCGGAGCAACACCTCCGCCCCTTCCGGG
GTATCCTGCAGGCGGATGCGTTCACAGGTTATGACAGGCTGTTCAGTGCAGAACGTGAAGGTGGTGCGCTGACAGAAGTT
GCGTGCTGGGCCCATGCCCGGAGAAAAATCCACGATGTATACATCAGCAGCAAAAGTGCGACGGCAGAAGAAGCCCTGAA
GCGAATCAGTGAACTGTACGCCATCGAGGATGAAATACGGGGATTACCAGAGTCAGAGCGTCTTGCAGTCAGACAGCAGC
GAAGCAAAGCGTTACTGACGTCGCTGCATGAATGGATGATGGAGAAGAATGGCACGCTGTCGAAAAAATCCAGACTGGGC
GAAGCGTTCAGCTATGTACTGAATCAGTGGGACGCCCTCTGTTATTACAGTGATGACGGACTGGCGGAGACGGACAATAA
CACAGCGGAAAGAGCGCTTCGTGCAGTCTGTCTTGGAAAGAAAAATTACGTGTTCTTCGGTAGCGATCACGGCGGCGAGC
GTGGTGCACTGCTATACGGGCTGATCGGCACCTGCCGTCTGAACGGTATCGATCCGGAAGCGTATCTGCGCCATATTCTG
AGCGTACTGCCGGAATGGCCCTCCAACCGGGTTGATGAACTCCTGCCATGGAACGTAGTACTCACCAATAAATAA

Protein sequence :
MRQSRHRRPLPAHLPREIQRLESEESCCPECGGELDYLGEVSAEQLELVSSALKVIRTERVKKACTKCDCIVEAPAPSRP
IERGIAGPGLLARVLTGKYCEHLPLYRQSEIFARQGVELSRALLSNWVDACCQLMTPLNDALYRYVMNTRKLHTDDTPVK
VLAPGLKKTKTGRIWTYVRDDRNAGSSSPPAVWFAYSPNRQGKHPEQHLRPFRGILQADAFTGYDRLFSAEREGGALTEV
ACWAHARRKIHDVYISSKSATAEEALKRISELYAIEDEIRGLPESERLAVRQQRSKALLTSLHEWMMEKNGTLSKKSRLG
EAFSYVLNQWDALCYYSDDGLAETDNNTAERALRAVCLGKKNYVFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHIL
SVLPEWPSNRVDELLPWNVVLTNK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
Z4340 NP_289564.1 hypothetical protein Not tested OI-122 Protein 0.0 99
Z1161 NP_286696.1 hypothetical protein Not tested TAI Protein 5e-176 95
Z1600 NP_287104.1 hypothetical protein Not tested TAI Protein 5e-176 95
c3563 NP_755438.1 hypothetical protein Not tested PAI I CFT073 Protein 8e-127 68
st57 CAC81895.1 ST57 protein Not tested LEE II Protein 2e-131 67
Z1570 NP_287074.1 hypothetical protein Not tested TAI Protein 5e-132 67
Z4337 NP_289562.1 hypothetical protein Not tested OI-122 Protein 1e-132 67
unnamed ACU09439.1 IS66 family element transposase Not tested LEE Protein 4e-132 67
Z5098 NP_290249.1 prophage-associated protein Not tested LEE Protein 1e-132 67
tnp AEA34686.1 transposase Not tested Not named Protein 4e-132 67
unnamed CAC39285.1 hypothetical protein Not tested LPA Protein 8e-133 67
unnamed AAC31494.1 L0015 Not tested LEE Protein 7e-133 67
ECs4547 NP_312574.1 hypothetical protein Not tested LEE Protein 6e-132 67
Z1131 NP_286666.1 hypothetical protein Not tested TAI Protein 5e-132 67
unnamed CAI43806.1 hypothetical protein Not tested LEE Protein 2e-132 67
unnamed AAL57570.1 unknown Not tested LEE Protein 6e-133 67
SF2972 NP_708746.1 hypothetical protein Not tested SHI-1 Protein 2e-80 57
unnamed AAK00463.1 unknown Not tested SHI-1 Protein 7e-83 57
aec53 AAW51736.1 Aec53 Not tested AGI-3 Protein 3e-101 57
s0025 CAD33772.1 IS66-like transposase Not tested PAI I 536 Protein 5e-80 56
BCAM0248 YP_002232880.1 putative transposase Not tested BcenGI11 Protein 6e-100 55
ECO103_3554 YP_003223421.1 hypothetical protein Not tested LEE Protein 4e-91 55
Z4317 NP_289543.1 hypothetical protein Not tested OI-122 Protein 2e-91 55

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
ECO103_4908 YP_003224727.1 hypothetical protein VFG1700 Protein 3e-127 68
ECO103_4908 YP_003224727.1 hypothetical protein VFG0793 Protein 4e-133 67
ECO103_4908 YP_003224727.1 hypothetical protein VFG0634 Protein 9e-81 57
ECO103_4908 YP_003224727.1 hypothetical protein VFG1513 Protein 3e-80 56