Gene Information

Name : c3386 (c3386)
Accession : NP_755261.1
Strain : Escherichia coli CFT073
Genome accession: NC_004431
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3517
EC number : -
Position : 3223576 - 3225120 bp
Length : 1545 bp
Strand : +
Note : Escherichia coli O157:H7 ortholog: z0262

DNA sequence :
ATGCTGATGTCTGTACAACAAGAACATGCCACCTCTGAAACTGCAACACTCACCACCACTGAGTCCGGCGGCGTTTATCA
GTCCCTGTTCGATAAAATTAATTTAACCCCGGTGTCTTCCATTCAGGAAATCGATTTATGGCAAAACAGCGAAACGCTGG
CCGATGCCTCACCCGATGAGCGCGTGACGGCGGCGATTCACGTTCTGCTTTCCTGTCTGGCGAAATCAGGCGAGGACGTG
GTTAAGCTCGACAAGAGCCTGCTGGATTTTCATATCGACGATCTGGATCAGAAAATCAGTAAACAGCTTGATGCGGTCAT
GCACCACCCTGAATTCCAGAAAGTCGAGTCGCTGTGGCGTGGTACATGGTTCGTCGTACAGCGCACTGATTTTCGCAAAA
ATGTCAGAATTGAATTGCTGGATATCAGCAAAGAGCATCTGCGTCAGGATTTTGATGATTCACCGGAAATCATTCAGAGT
GGTTTATATCGCCATACATACATTCAGGAGTACGATACGCCGGGTGGCGAACCTGTAGCCTCATTAATATCCAGCTATGA
ATTTGATAACAGCCCGCAGGATATTGCCCTGCTGCGTAATATTTCCAGAGTGTCTGCCGCTTCCCATATGCCTTTTATCG
GTTCTGTCGGACCGAAATTCTTCCTTAAAAATTCGATGGAAGAAGTCGCCGCGATTAAAGATATCGGCAACTACTTTGAC
CGCGCAGAATATATTAAATGGAAGTCGTTCCGCGATACGGATGACAGCCGCTATGTGGGATTAGTGATGCCGCGCGTGCT
GGGCCGTCTGCCCTATGGGCCGGACACGGTGCCGGTACGCAGCTTTAACTATGTGGAAGAAGTCAAAGGCCCGGATCACG
AAAAATACCTGTGGACAAACGCCTCGTTCGCCTTTGCCGCCAATATGGTGAAGAGCTTTGTGAATAATGGCTGGTGCGTG
CAGATCCGTGGCCCACAGGCGGGCGGCGCAGTGGCCGATCTGCCGATCCATCTTTACGATCTCGGCACCGGCAATCAGGT
CAAAATTCCGTCCGAAGTGATGATCCCGGAAACCCGCGAGTTTGAGTTCGCCAACCTTGGCTTTATTCCACTCTCTTATT
ACAAAAACCGCGATTACGCCTGCTTTTTCTCGGCAAACTCCGCCCAGAAACCGGCGCTGTATGATACCGCTGACGCCACC
GCCAACAGCCGTATCAACGCCCGTTTGCCCTATATCTTCCTGCTGTCCCGCATTGCTCACTACCTGAAAATTATTCAGCG
CGAGAATATCGGCACCACCAAAGACCGCCGCGTGCTGGAACTGGAGCTTAACACCTGGATCCGCACGCTGGTGACGGAAA
TGACCGATCCGGGCGATGAACTTCAGGCTTCGCATCCACTGCGCGACGGGAAAGTTATCGTCGAGGACATAGAGGACAAT
CCGGGCTTTTTCCGCGTCAGACTCTTTGCCGTGCCGCATTTCCAGATCGAAGGGATGGACGTCAACCTTTCTCTGGTTTC
CCAGATGCCAAAAGCAAAAGCCTGA

Protein sequence :
MLMSVQQEHATSETATLTTTESGGVYQSLFDKINLTPVSSIQEIDLWQNSETLADASPDERVTAAIHVLLSCLAKSGEDV
VKLDKSLLDFHIDDLDQKISKQLDAVMHHPEFQKVESLWRGTWFVVQRTDFRKNVRIELLDISKEHLRQDFDDSPEIIQS
GLYRHTYIQEYDTPGGEPVASLISSYEFDNSPQDIALLRNISRVSAASHMPFIGSVGPKFFLKNSMEEVAAIKDIGNYFD
RAEYIKWKSFRDTDDSRYVGLVMPRVLGRLPYGPDTVPVRSFNYVEEVKGPDHEKYLWTNASFAFAANMVKSFVNNGWCV
QIRGPQAGGAVADLPIHLYDLGTGNQVKIPSEVMIPETREFEFANLGFIPLSYYKNRDYACFFSANSAQKPALYDTADAT
ANSRINARLPYIFLLSRIAHYLKIIQRENIGTTKDRRVLELELNTWIRTLVTEMTDPGDELQASHPLRDGKVIVEDIEDN
PGFFRVRLFAVPHFQIEGMDVNLSLVSQMPKAKA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec18 YP_851426.1 hypothetical protein Not tested PAI II APEC-O1 Protein 4e-98 43
aec18 AAQ96712.1 Aec18 Not tested AGI-1 Protein 3e-98 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
c3386 NP_755261.1 hypothetical protein VFG2093 Protein 1e-103 44
c3386 NP_755261.1 hypothetical protein VFG2475 Protein 2e-107 44
c3386 NP_755261.1 hypothetical protein VFG2070 Protein 2e-86 42