Gene Information

Name : NRG857_13790 (NRG857_13790)
Accession : YP_006121104.1
Strain : Escherichia coli NRG 857C
Genome accession: NC_017634
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2901188 - 2902726 bp
Length : 1539 bp
Strand : +
Note : COG3517 Uncharacterized protein conserved in bacteria

DNA sequence :
ATGTCTGTACAACAAGAACATGCCACCTCTGAAACTGCAACACTCACCACCACTGAGTCCGGCGGCGTTTATCAGTCCCT
GTTCGATAAAATTAATTTAACCCCGGTGTCTTCCATTCAGGAAATCGATTTATGGCAAAACAGCGAAACGCTGGCCGATG
CCTCACCCGATGAGCGCGTGACGGCGGCGATTCACGTTCTGCTTTCCTGTCTGGCGAAATCAGGCGAGGACGTGGTTAAG
CTCGACAAGAGCCTGCTGGATTTTCATATCGACGATCTGGATCAGAAAATCAGTAAACAGCTTGATGCGGTCATGCACCA
CCCTGAATTCCAGAAAGTCGAGTCGCTGTGGCGTGGTACATGGTTCGTCGTACAGCGCACTGATTTTCGCAAAAATGTCA
GAATTGAACTGCTGGATATCAGTAAAGAACATCTGCGGCAGGACTTTGATGATTCTCCGGAAATCATTCAAAGTGGTTTA
TATCGCCATACATACATTCAGGAGTACGATACGCCGGGTGGCGAACCTGTTGCCTCATTAATTTCCAGCTATGAATTTGA
TAACAGCCCGCAGGATATTGCCCTGCTGCGTAATATTTCCAGAGTGTCTGCCGCTTCCCATATGCCTTTTATCGGTTCTG
TCGGACCGAAATTCTTCCTTAAAAATTCGATGGAAGAAGTCGCCGCGATTAAAGATATCGGCAACTACTTTAACCGCGCA
GAATATATTAAATGGAAGTCGTTCCGCGATACGGATGACAGCCGCTATGTGGGATTAGTGATGCCGCGCGTGCTGGGCCG
TCTGCCCTATGGGCCGGACACGGTGCCGGTACGCAGCTTTAACTATGTGGAAGAAGTCAAAGGCCCGGATCACGAAAAAT
ACCTGTGGACAAACGCCTCGTTCGCCTTTGCCGCCAATATGGTGAAGAGCTTTGTGAATAATGGCTGGTGCGTGCAGATC
CGTGGTCCACAGGCAGGTGGCGCAGTGGCCGATCTGCCCATCCATCTTTACGATCTCGGCACCGGCAATCAGGTCAAAAT
TCCGTCCGAAGTGATGATCCCGGAAACCCGCGAATTTGAATTTGCCAACCTTGGCTTTATTCCGCTCTCTTATTATAAGA
ATCGCGATTACGCCTGCTTCTTCTCGGCGAACTCTGCCCAGAAACCGGCGTTGTACGATACCGCTGACGCCACCGCCAAC
AGCCGTATCAATGCCCGTCTGCCTTACATCTTCCTGCTGTCCCGCATTGCGCATTACCTGAAAATTATTCAGCGCGAGAA
TATCGGCACCACCAAAGACCGCCGCGTGCTGGAACTGGAGCTGAATACCTGGATCCGCACGCTGGTGACGGAGATGACCG
ATCCTGGCGATGAACTTCAGGCTTCGCATCCACTGCGCGACGGGAAAGTTATCGTCGAGGACATAGAGGACAATCCGGGC
TTCTTCCGCGTCAGACTCTTTGCCGTGCCGCATTTCCAGATCGAAGGGATGGACGTCAACCTTTCTCTGGTTTCCCAGAT
GCCAAAAGCAAAAGCCTGA

Protein sequence :
MSVQQEHATSETATLTTTESGGVYQSLFDKINLTPVSSIQEIDLWQNSETLADASPDERVTAAIHVLLSCLAKSGEDVVK
LDKSLLDFHIDDLDQKISKQLDAVMHHPEFQKVESLWRGTWFVVQRTDFRKNVRIELLDISKEHLRQDFDDSPEIIQSGL
YRHTYIQEYDTPGGEPVASLISSYEFDNSPQDIALLRNISRVSAASHMPFIGSVGPKFFLKNSMEEVAAIKDIGNYFNRA
EYIKWKSFRDTDDSRYVGLVMPRVLGRLPYGPDTVPVRSFNYVEEVKGPDHEKYLWTNASFAFAANMVKSFVNNGWCVQI
RGPQAGGAVADLPIHLYDLGTGNQVKIPSEVMIPETREFEFANLGFIPLSYYKNRDYACFFSANSAQKPALYDTADATAN
SRINARLPYIFLLSRIAHYLKIIQRENIGTTKDRRVLELELNTWIRTLVTEMTDPGDELQASHPLRDGKVIVEDIEDNPG
FFRVRLFAVPHFQIEGMDVNLSLVSQMPKAKA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec18 YP_851426.1 hypothetical protein Not tested PAI II APEC-O1 Protein 5e-98 43
aec18 AAQ96712.1 Aec18 Not tested AGI-1 Protein 3e-98 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
NRG857_13790 YP_006121104.1 hypothetical protein VFG2093 Protein 3e-103 44
NRG857_13790 YP_006121104.1 hypothetical protein VFG2475 Protein 2e-107 44
NRG857_13790 YP_006121104.1 hypothetical protein VFG2070 Protein 1e-86 42