Gene Information

Name : EC55989_3339 (EC55989_3339)
Accession : YP_002404305.1
Strain : Escherichia coli 55989
Genome accession: NC_011748
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3517
EC number : -
Position : 3427073 - 3428617 bp
Length : 1545 bp
Strand : -
Note : Evidence 4 : Homologs of previously reported genes of unknown function

DNA sequence :
ATGCTGATGTCTGTACAGAAAGAAAAGAACGTTGCAGAGAGCGTGGTATCTGAAGCGCATGCCGGCGACAGCGTATATGC
TTCCCTGTTTGAAAAAATTAACCTGAGTCCGGTATCTGCCCTGAGTGCACTGGATATCTGGCAGGATCCACAGGCCATGT
CAGAAGCATCTGCAGATGAACGTCTGACGGCAGGGATGCAGGTGTTTATGGAGTGTCTGGCAAAAGCCGGCACGCAGGTG
GAAAAACTCGATAAAGCGCTGATTGACCATCATATTGCAGAGCTTGATTATCAGATCAGCCGTCAACTGGATGCTGTGCT
TCATCATCCGGAATTTCAGAAAGTGGAGTCGCTGTGGCGGGGCGTGAAGTCCCTGGTGGATAAAACGGACTTCCGCCGTA
ATGTGAAAATTGAGCTGCTGGATCTGTCCAAAGACGATCTGCGTCAGGATTTTGAGGATGCGCCGGAAATTATTCAGAGC
GGCCTTTATCTGCAGACCTATGTGGCTGAGTATGACACGCCGGGTGGTGAGCCCATTGCCGCCCTGGTTTCAGCCTGGGA
GTTTGATGCATCTGCACAGGACGTGGCGTTGCTGAAAAATATTTCCCGGGTGGCGGCATCTGCACATATGCCGTTTATTG
GTTCAGTTGGCCCGGCGTTCTTCCAGAAAGAAACCATGGAAGAGGTGGCTGCCATCAAGGATATTGGCAACCACTTTGAG
CGTGCCGAGTACATCAAGTGGAATGCTTTCCGTGAGACGGACGATGCCCGCTATATTGGCCTGGTGATGCCCCGTGTACT
GGGACGCCTGCCGTATGGCCCGGATACCGTACCTGTCCGCAGCTTTAACTATGTTGAAGAGGTGAAAGGACCGGACCACC
ATAAATATCTCTGGACGAACGCGTCTTTTGCCTTTGCAGCCAACATGGTGCGCAGTTTTGTCACCAACGGCTGGTGTGTT
CAGATCCGTGGGCCTCAGGCCGGTGGTGCGGTGCAGGATTTGCCAATCCATTTGTATGATCTGGGAACCGGTAATCAGGT
CAAAATTCCGTCAGAGGTGATGATCCCGGAAACGCGGGAGTTTGAATTCGCGAATCTGGGCTTTATTCCGCTCTCTTACT
ACAAGAACCGGGATTATGCCTGCTTCTTCTCTGCTAACTCCGCCCAGAAACCGGCGCTGTACGATACGCCGGATGCAACG
GCGAACAGTCGTATTAATGCCCGTCTTCCCTATATCTTCCTGCTGTCCCGTATTGCGCATTACCTGAAGATCATTCAGCG
GGAAAATATCGGTACCACCAAAGATCGCCGTCTGCTTGAGCTGGAGCTGAATAACTGGATCCGGGGACTGGTGACGGAGA
TGACCGATCCGGGTGATGAACTGCAGGCCTCGCATCCGCTGCGTGACGGAAAAGTGGTGGTGGAAGATATCGAAGACAAT
CCGGGATTCTTCCGCGTGAAACTCTACGCCGTACCGCATTTCCAGGTAGAAGGTATGGACGTCAGCCTGTCACTGGTTTC
GCAGATGCCGAAGGCAAAAGCGTAA

Protein sequence :
MLMSVQKEKNVAESVVSEAHAGDSVYASLFEKINLSPVSALSALDIWQDPQAMSEASADERLTAGMQVFMECLAKAGTQV
EKLDKALIDHHIAELDYQISRQLDAVLHHPEFQKVESLWRGVKSLVDKTDFRRNVKIELLDLSKDDLRQDFEDAPEIIQS
GLYLQTYVAEYDTPGGEPIAALVSAWEFDASAQDVALLKNISRVAASAHMPFIGSVGPAFFQKETMEEVAAIKDIGNHFE
RAEYIKWNAFRETDDARYIGLVMPRVLGRLPYGPDTVPVRSFNYVEEVKGPDHHKYLWTNASFAFAANMVRSFVTNGWCV
QIRGPQAGGAVQDLPIHLYDLGTGNQVKIPSEVMIPETREFEFANLGFIPLSYYKNRDYACFFSANSAQKPALYDTPDAT
ANSRINARLPYIFLLSRIAHYLKIIQRENIGTTKDRRLLELELNNWIRGLVTEMTDPGDELQASHPLRDGKVVVEDIEDN
PGFFRVKLYAVPHFQVEGMDVSLSLVSQMPKAKA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec18 YP_851426.1 hypothetical protein Not tested PAI II APEC-O1 Protein 2e-99 45
aec18 AAQ96712.1 Aec18 Not tested AGI-1 Protein 2e-99 45

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
EC55989_3339 YP_002404305.1 hypothetical protein VFG2475 Protein 1e-111 49
EC55989_3339 YP_002404305.1 hypothetical protein VFG2093 Protein 3e-105 46
EC55989_3339 YP_002404305.1 hypothetical protein VFG2070 Protein 4e-85 43