Gene Information

Name : EcE24377A_3124 (EcE24377A_3124)
Accession : YP_001464139.1
Strain : Escherichia coli E24377A
Genome accession: NC_009801
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3517
EC number : -
Position : 3128789 - 3130327 bp
Length : 1539 bp
Strand : +
Note : identified by match to protein family HMM PF05943; match to protein family HMM TIGR03355

DNA sequence :
ATGTCTGTACAACAAGAACATGCCACCTCTGAAACTGCAACACTGACCACCACTGAGTCCGGCGGCGTTTATCAGTCCCT
GTTTGATAAAATTAATTTAACCCCGGTGTCTTCCATTCAGGAAATAGATTTATGGCAAAACAGCGAAACACTGGCGGATG
CCTCACCCGATGAGCGCGTGACAGCGGCGATTCACGTTCTACTTTCCTGTCTGGCGAAATCAGGCGAGAACGTGGTCAAG
CTCGACAAGAGCCTGCTGGATTTTCATATCGACGATTTGGATCAGAAAATCAGTAAACAGCTTGATGCGGTCATGCACCA
TCCTGAATTCCAGAAAGTCGAGTCGCTGTGGCGCGGCACATGGTTCGTCGTACAGCGCACTGATTTTCGCAAAAATGTCA
GAATTGAATTGCTGGATATCAGCAAAGAGCATCTGCGTCAGGATTTCGACGACTCCCCGGAAATCATTCAGAGCGGTTTA
TATCGTCATACATACATTCAGGAGTACGATACGCCGGGTGGCGAACCTGTTGCCTCATTAATTTCCAGCTATGAATTTGA
TAACAGCCCGCAGGATATTGCCCTGCTGCGCAATATTTCCAGAGTGTCTGCCGCTTCCCATATGCCTTTTATCGGTTCTG
TCGGGCCGAAATTCTTCCTTAAAAATTCGATGGAAGAAGTCGCCGCGATTAAAGATATCGGCAACTACTTTGACCGCGCA
GAATATATTAAATGGAAATCGTTTCGTGATACCGATGACAGCCGCTATGTGGGATTAGTGATGCCGCGAGTGCTGGGCCG
TCTGCCCTATGGGCCGGACACGGTGCCGGTACGCAGCTTTAACTATGTGGAAGAAGTCAAAGGCCCGGATCACGAAAAGT
ATCTCTGGACAAACGCCTCGTTCGCCTTTGCCGCCAATATGGTGAAGAGCTTTGTGAATAATGGCTGGTGCGTGCAGATC
CGTGGTCCACAAGCAGGTGGCGCAGTGGCCGATCTGCCCATCCATCTTTACGATCTCGGCACCGGCAATCAGGTCAAAAT
TCCGTCCGAAGTGATGATCCCGGAAACCCGCGAATTTGAATTTTCCAACCTTGGCTTTATTCCGCTCTCTTATTATAAGA
ATCGCGATTACGCCTGCTTCTTCTCGGCAAACTCTGCCCAGAAACCGGCGTTGTATGATACCGCTGACGCCACCGCCAAC
AGCCGTATTAACGCCCGTCTGCCCTATATCTTCCTGCTGTCCCGCATTGCGCATTACCTGAAAATCATCCAGCGCGAGAA
TATCGGTACCACCAAAGACCGCCGCGTACTGGAACTGGAACTGAACACCTGGATCCGCACACTGGTGACGGAGATGACCG
ATCCGGGTGATGAACTGCAGGCGTCTCACCCGCTGCGCGACGGTAAGGTTATCGTGGAAGATATTGAGGACAATCCGGGC
TTCTTCCGCGTCAGACTCTTTGCCGTGCCGCATTTCCAGATTGAAGGGATGGATATCAACCTGTCACTGGTTTCCCAGAT
GCCGAAAGCGAAAGCCTGA

Protein sequence :
MSVQQEHATSETATLTTTESGGVYQSLFDKINLTPVSSIQEIDLWQNSETLADASPDERVTAAIHVLLSCLAKSGENVVK
LDKSLLDFHIDDLDQKISKQLDAVMHHPEFQKVESLWRGTWFVVQRTDFRKNVRIELLDISKEHLRQDFDDSPEIIQSGL
YRHTYIQEYDTPGGEPVASLISSYEFDNSPQDIALLRNISRVSAASHMPFIGSVGPKFFLKNSMEEVAAIKDIGNYFDRA
EYIKWKSFRDTDDSRYVGLVMPRVLGRLPYGPDTVPVRSFNYVEEVKGPDHEKYLWTNASFAFAANMVKSFVNNGWCVQI
RGPQAGGAVADLPIHLYDLGTGNQVKIPSEVMIPETREFEFSNLGFIPLSYYKNRDYACFFSANSAQKPALYDTADATAN
SRINARLPYIFLLSRIAHYLKIIQRENIGTTKDRRVLELELNTWIRTLVTEMTDPGDELQASHPLRDGKVIVEDIEDNPG
FFRVRLFAVPHFQIEGMDINLSLVSQMPKAKA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec18 YP_851426.1 hypothetical protein Not tested PAI II APEC-O1 Protein 2e-98 43
aec18 AAQ96712.1 Aec18 Not tested AGI-1 Protein 1e-98 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
EcE24377A_3124 YP_001464139.1 hypothetical protein VFG2093 Protein 9e-104 44
EcE24377A_3124 YP_001464139.1 hypothetical protein VFG2475 Protein 9e-108 44
EcE24377A_3124 YP_001464139.1 hypothetical protein VFG2070 Protein 3e-86 42