Gene Information

Name : ECO103_3554 (ECO103_3554)
Accession : YP_003223421.1
Strain : Escherichia coli 12009
Genome accession: NC_013353
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG3436
EC number : -
Position : 3630649 - 3632097 bp
Length : 1449 bp
Strand : +
Note : Integrative element ECO103_IE03; an intact form of IS683 has not been identified; predicted protein Orf3 in insertion sequence IS683
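The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the listed length; the function name `gene_length` is only illustrative and is not part of any database tooling.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
start, end = 3630649, 3632097
length_bp = gene_length(start, end)

# A coding sequence should be a whole number of codons; the final codon is
# assumed to be the stop codon and does not contribute a residue.
codons, remainder = divmod(length_bp, 3)
protein_length = codons - 1

print(length_bp, remainder, protein_length)
```

Under these assumptions the computed length matches the Length field (1449 bp), and the expected protein length is 482 residues.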

DNA sequence :
ATGAACAACACACTCCCCGACGACATCGAGCAACTGAAGGCCCTGCTGATCGCACAGCAGGCTGTTATCGTCCGTCTGTC
TGGTGAAATAACCGGCTATGCCCGCGAGATCAGCTCACTCAGAGCGCTGGTCGCTAAACTGCAGAGAATGTTGTTCGGTC
GCAGCAGCGAGAAAAGCCGCGAGAAGATAGAAAAGAAGATCGCACGGGCAGAAACGCGTATAACCGAGCTCCAGAACAGG
CTTGGTGAGGCGCAGTTGCAACTCACCTCAATGGCCGGAGAGACAGCGCCGAAAACATCAGACTCTCCCGTCCGCAAAGC
ACTTCCGGCAACACTTCCCCGTGACAGGCAGGTTATCTCCCCGGCAGAAACCGAATGCCCCGTCTGCAGCGGCAAACTGA
AACCGCTGGGAGAAAGCATCTCTGAACAACTGGATATCATCAACACCGCGTTCAGGGTAATCGAAACGGTTCGCCCAAAA
CGGGCCTGCAGCCGGTGCGACTGTATAGTTCAGGCTCCGCAGCCACCAAAACCCATCGAGCGCAGTTACGCCAGTCCGGC
TCTGCTGGCCCGCATAATCATGGCTAAGTTCGCCGAGCATCTGCCGCTGTACCGTCAGTCGGAAATCTATGCCCGCCAGG
GCGTGGAGCTGCACCGCAATACGATGGGGCGCTGGGTTGACATCATGGGAGAGCAGCTTCGCCCGCTGTATGATGAACTG
AAGCACTATGTGCTGATGCCGGGTAAAGTGCATGCCGATGACACGCCGGTAAATGTACTGGAGCCGGGTCAGGGTAAAAC
CCGTACCGGACGGCTGTGGGTCTATGTTCGTGACGATCGCAACGCCGGTTCGACCATGCCGGCAGCGGTGTGGTTCTCAT
ACTCTCCCGACCGCAAAGGCATCCACCCACAGCAACATCTGGCGGACTACAGAGGTATCCTGCAGGCCGATGCATATGCG
GGTTACAATGCTCTTTACGAAAGCGGTCAGGCAACCGAAGCGGCTTGTATGGCACATGCCCGACGCAAGATCCACGATGT
ACATGTCCGCCATCCAACGACAGTAACGGGAGAAGCGCTCCGTCGTATCGGGGAACTGTACGCTATCGAGGCTGAGATCC
GCGGCAGTCCGGCAGAAGAGCGACTGGCGGTCAGAAAAGCCAGAACGGTACCGCTAATGCAGTCGTTGTATGAGTGGCTC
CAGGGGCAGATGAACACGCTGTCGCGCCACTCGGATACAGCGAAAGCGTTCACCTATCTGCTGAAGCAATGGGACGCTCT
GAACGAATACTGCCGCAATGGCTGGGTGGAGATCGACAATAACCTGTGTGAAAACGCCCTCCGGGTAGTTGCACTGGGGC
GGCGTAACTACCGGACGTGGTTTCTGCCAGATAAAGGCAACGGAACAAAAGAATCATGGTTTTTCCGGAACCGGCCACAC
CATGAATGA

Protein sequence :
MNNTLPDDIEQLKALLIAQQAVIVRLSGEITGYAREISSLRALVAKLQRMLFGRSSEKSREKIEKKIARAETRITELQNR
LGEAQLQLTSMAGETAPKTSDSPVRKALPATLPRDRQVISPAETECPVCSGKLKPLGESISEQLDIINTAFRVIETVRPK
RACSRCDCIVQAPQPPKPIERSYASPALLARIIMAKFAEHLPLYRQSEIYARQGVELHRNTMGRWVDIMGEQLRPLYDEL
KHYVLMPGKVHADDTPVNVLEPGQGKTRTGRLWVYVRDDRNAGSTMPAAVWFSYSPDRKGIHPQQHLADYRGILQADAYA
GYNALYESGQATEAACMAHARRKIHDVHVRHPTTVTGEALRRIGELYAIEAEIRGSPAEERLAVRKARTVPLMQSLYEWL
QGQMNTLSRHSDTAKAFTYLLKQWDALNEYCRNGWVEIDNNLCENALRVVALGRRNYRTWFLPDKGNGTKESWFFRNRPH
HE
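The protein sequence corresponds to a codon-by-codon translation of the DNA sequence. The following is a minimal sketch of that translation, assuming the standard genetic code and a forward-strand, in-frame sequence; the helper `translate` is a hypothetical name, and only the first and last few codons of the record are used as input so the example stays short.

```python
# Standard genetic code, built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3].upper()]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 15 bases and last 9 bases of the DNA sequence in this record.
print(translate("ATGAACAACACACTC"))  # MNNTL
print(translate("CATGAATGA"))        # HE
```

The two fragments reproduce the first five residues (MNNTL) and the last two residues (HE) of the listed protein, with the terminal TGA read as the stop codon.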

• Homologs from PAI DB

Gene         GenBank Accn    Product                          Virulence or Resistance  PAI or REI  Alignment Type  E-value  Identity (%)
ECO103_3554  YP_003223421.1  hypothetical protein             Not tested               LEE         Protein         0.0      100
Z4317        NP_289543.1     hypothetical protein             Not tested               OI-122      Protein         0.0      99
s0025        CAD33772.1      IS66-like transposase            Not tested               PAI I 536   Protein         6e-121   69
aec53        AAW51736.1      Aec53                            Not tested               AGI-3       Protein         4e-140   64
st57         CAC81895.1      ST57 protein                     Not tested               LEE II      Protein         6e-110   56
ECs4547      NP_312574.1     hypothetical protein             Not tested               LEE         Protein         4e-116   55
unnamed      ACU09439.1      IS66 family element transposase  Not tested               LEE         Protein         3e-116   55
unnamed      AAC31494.1      L0015                            Not tested               LEE         Protein         2e-116   54
tnp          AEA34686.1      transposase                      Not tested               Not named   Protein         1e-116   54
Z4337        NP_289562.1     hypothetical protein             Not tested               OI-122      Protein         3e-116   54
Z5098        NP_290249.1     prophage-associated protein      Not tested               LEE         Protein         3e-116   54
unnamed      CAI43806.1      hypothetical protein             Not tested               LEE         Protein         4e-115   54
unnamed      AAL57570.1      unknown                          Not tested               LEE         Protein         1e-115   54
Z4340        NP_289564.1     hypothetical protein             Not tested               OI-122      Protein         9e-100   54
unnamed      CAC39285.1      hypothetical protein             Not tested               LPA         Protein         2e-116   53
Z1131        NP_286666.1     hypothetical protein             Not tested               TAI         Protein         2e-115   53
Z1570        NP_287074.1     hypothetical protein             Not tested               TAI         Protein         2e-115   53
l0015        CAD33775.1      L0015 protein                    Not tested               PAI I 536   Protein         3e-84    52
unnamed      AAL08460.1      unknown                          Not tested               SRL         Protein         2e-97    52
BCAM0248     YP_002232880.1  putative transposase             Not tested               BcenGI11    Protein         2e-112   50
Z1161        NP_286696.1     hypothetical protein             Not tested               TAI         Protein         2e-98    49
Z1600        NP_287104.1     hypothetical protein             Not tested               TAI         Protein         2e-98    49
tnp          AEA34687.1      IS66 family transposase          Not tested               Not named   Protein         1e-76    47

• Homologs from VFDB (virulence genes)

Gene         GenBank Accn    Product               ID of source DB  Alignment Type  E-value  Identity (%)
ECO103_3554  YP_003223421.1  hypothetical protein  VFG1513          Protein         3e-121   69
ECO103_3554  YP_003223421.1  hypothetical protein  VFG0793          Protein         1e-116   54
ECO103_3554  YP_003223421.1  hypothetical protein  VFG1516          Protein         2e-84    52
ECO103_3554  YP_003223421.1  hypothetical protein  VFG1051          Protein         1e-97    52
ECO103_3554  YP_003223421.1  hypothetical protein  VFG1736          Protein         3e-96    51
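
Rows in the homolog tables end with three fixed fields (alignment type, E-value, identity), while the earlier fields may contain spaces. A minimal sketch for reading those trailing fields is shown below; it assumes whitespace-separated rows as listed in this record, and the function name `parse_hit` and the 60% identity cutoff are purely illustrative choices, not thresholds used by the database.

```python
def parse_hit(row: str) -> dict:
    """Split a homolog row from the right, where the fields are unambiguous."""
    head, alignment_type, e_value, identity = row.rsplit(maxsplit=3)
    return {
        "head": head,  # gene, accession, product, etc. (left unparsed)
        "alignment_type": alignment_type,
        "e_value": float(e_value),
        "identity": int(identity),
    }


# Three rows copied from the PAI DB homolog table of this record.
rows = [
    "s0025 CAD33772.1 IS66-like transposase Not tested PAI I 536 Protein 6e-121 69",
    "aec53 AAW51736.1 Aec53 Not tested AGI-3 Protein 4e-140 64",
    "tnp AEA34687.1 IS66 family transposase Not tested Not named Protein 1e-76 47",
]
hits = [parse_hit(row) for row in rows]

# Keep the gene names of hits at or above an arbitrary identity cutoff.
strong = [hit["head"].split()[0] for hit in hits if hit["identity"] >= 60]
print(strong)
```

Splitting from the right avoids having to guess where multi-word fields such as the product name or the island name begin and end.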