Gene Information

Name : O3K_20400 (O3K_20400)
Accession : YP_006780744.1
Strain : Escherichia coli 2011C-3493
Genome accession: NC_018658
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4186551 - 4188692 bp
Length : 2142 bp
Strand : -
Note : COG3501 Uncharacterized protein conserved in bacteria
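
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the reported length includes the stop codon. The helper names `span_length` and `protein_length` are hypothetical and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Position and Length fields of this record.
start, end = 4186551, 4188692
gene_bp = span_length(start, end)
print(gene_bp)                  # 2142
print(protein_length(gene_bp))  # 713
```

Under these assumptions the span matches the reported 2142 bp, and 713 residues is the length of the protein sequence listed in this record.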

DNA sequence :
ATGTCAACCGGATTACGTTTCACGCTGGAAGTGGACGGCCTGCCACCGGATGCTTTTGCGGTGGTCTCCTTTCATCTGAA
CCAGTCACTCTCTTCGCTTTTTTCCCTCGATCTCTCTCTGGTCAGCCAGCAGTTTCTCTCCCTTGAATTCCAGCAGATCC
TCGACAAAATGGCCTACCTGACGATATGGCAGGGCGATGACGTACAGCGCCGGGTGAAAGGTATGGTGACCTGGTTTGAA
CTGGGGGAGAACGACAAAAACCAGATGCTGTACAGCATGAAGGTGTGCCCACCGCTGTGGCGCACAGGGCTGCGCCAGAA
CTTCCGTATCTTCCAGAATGAGGACATCGAAAGCATCCTCGGCACGATATTGCAGGAAAACGGGGTGACCGAGTGGAGCC
CGCTGTTCAGCGAGCCACATCCTTCCCGTGAGTTTTGTGTCCAGTACGGTGAGACTGATTACGATTTCCTGTGCCGGATG
GCGGCGGAGGAAGGCATCTTCTTTTATGAGGAGCACGCGCAAAAAAGTACCGACCAGAGCCTGGTCCTGTGCGATACCGT
GCGTTATCTGCCGGAGTCCTTTGAGATCCCCTGGAATCCGAACACCCGTACCGAGGTGAGCACCCTCTGCATCAGCCAGT
TTCGCTACAGCGCACAAATCCGCCCTTCTTCCGTGGTGACCAAAGACTACACCTTTAAACGCCCCGGCTGGGCCGGACGT
TTTGATCAGGAAGGCCAGCACCAGGATTACCAGCGCACACAGTATGAAGTGTATGACTACCCCGGACGTTTCAAGGGGGC
CCACGGGCAGAACTTTGCCCGCTGGCAGATGGACGGCTGGCGAAACAATGCAGAAACCGCGCGGGGAATGAGTCGCTCGC
CGGAGATATGGCCGGGACGACGAATTGTGCTGACGGGGCATCCGCAGGCGAACCTGAACCGGGAATGGCAGGTGGTGGCA
AGTGAACTGCACGGCGAACAGCCACAGGCGGTGCCAGGACGGCAGGGAGCGGGGACGGCGCTGGAGAACCATTTTGCGGT
GATCCCGGCAGACAGAACATGGCGACCACAGCCGTTGCTGAAACCGCTGGTGGACGGCCCGCAGAGCGCCGTCGTGACGG
GACCGGCAGGCGAGGAAATCTTCTGTGATGAACATGGCCGCGTGCGGGTGAAATTTAACTGGGACCGTTATAACCCGTCA
AACCAGGACAGTTCATGCTGGATCCGTGTGGCACAGGCGTGGGCAGGCACCGGATTCGGTAACCTGGCGATACCGCGTGT
GGGTCAGGAGGTGATTGTGGACTTCCTCAACGGCGATCCGGACCAGCCGATCATTATGGGGCGCACCTACCACCAGGAAA
ACCGCACACCCGGCAGCCTGCCGGGAACAAAGACGCAGATGACCATTCGTTCGAAAACCTATAAGGGCAGCGGGTTTAAT
GAACTGAAGTTTGACGATGCGACAGGGAAAGAACAGGTCTACATCCACGCGCAGAAGAACATGAACACCGAGGTGCTGAA
TAACCGCACCACTGATGTGATAAACAACCATGCCGAAAAAATAGGTAACAACCAGGCGATCACCGTTACCAATAACCAGA
TCCAGAACATTGGCGTTAATCAGATACAGACGGTTGGTGTCAACCAGGTGGAAACGGTGGGCAGTAACCAGATTATCAAA
GTGGGATCAAACCAGGTTGAAAAGGTGGGGATCATTCGTGCGCTGACGGTGGGTGTAGCTTACCAGACGACGGTAGGCGG
CATTATGAATACCTCGGTGGCGTTGTTGCAGTCCTCACAGGTAGGGCTGCATAAATCACTGATGGTGGGAATGGGCTACA
GCGTCAATGTGGGGAATAACGTCACCTTCTCGGTGGGCAAGACGATGAAGGAAAACACCGGACAAACAGCAGTTTATTCT
GCCGGTGAGCATCTTGAACTCTGCTGTGGTAAGGCAAGGCTGGTGCTGACGAAGGACGGAAGCATATTTCTCAACGGTAC
GCACATTCATCTGGAAGGGGAGTCGGATGTGAACGGTGATGCGCCAGTGATTAACTGGAACTGTGGTGCCACACAACCTG
TACCGGATGCGCCTGTGCCGAAAGATTTACCCCCCGGAATGCCGGATATGCGGCAATTTTGA

Protein sequence :
MSTGLRFTLEVDGLPPDAFAVVSFHLNQSLSSLFSLDLSLVSQQFLSLEFQQILDKMAYLTIWQGDDVQRRVKGMVTWFE
LGENDKNQMLYSMKVCPPLWRTGLRQNFRIFQNEDIESILGTILQENGVTEWSPLFSEPHPSREFCVQYGETDYDFLCRM
AAEEGIFFYEEHAQKSTDQSLVLCDTVRYLPESFEIPWNPNTRTEVSTLCISQFRYSAQIRPSSVVTKDYTFKRPGWAGR
FDQEGQHQDYQRTQYEVYDYPGRFKGAHGQNFARWQMDGWRNNAETARGMSRSPEIWPGRRIVLTGHPQANLNREWQVVA
SELHGEQPQAVPGRQGAGTALENHFAVIPADRTWRPQPLLKPLVDGPQSAVVTGPAGEEIFCDEHGRVRVKFNWDRYNPS
NQDSSCWIRVAQAWAGTGFGNLAIPRVGQEVIVDFLNGDPDQPIIMGRTYHQENRTPGSLPGTKTQMTIRSKTYKGSGFN
ELKFDDATGKEQVYIHAQKNMNTEVLNNRTTDVINNHAEKIGNNQAITVTNNQIQNIGVNQIQTVGVNQVETVGSNQIIK
VGSNQVEKVGIIRALTVGVAYQTTVGGIMNTSVALLQSSQVGLHKSLMVGMGYSVNVGNNVTFSVGKTMKENTGQTAVYS
AGEHLELCCGKARLVLTKDGSIFLNGTHIHLEGESDVNGDAPVINWNCGATQPVPDAPVPKDLPPGMPDMRQF
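
As a quick consistency check between the two sequences above, the following sketch translates the first and last few codons of the DNA sequence with the standard genetic code. It assumes the DNA is listed 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand). `translate` is a hypothetical helper, and only short excerpts copied from the record are used.

```python
# Standard genetic code in compact form, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[pos:pos + 3]] for pos in range(0, usable, 3))


first_codons = "ATGTCAACCGGATTACGTTTC"  # first 21 bases of the DNA sequence
last_codons = "CGGCAATTTTGA"            # last 12 bases of the DNA sequence

print(translate(first_codons))  # MSTGLRF
print(translate(last_codons))   # RQF*
```

The translated excerpts agree with the start (MSTGLRF) and end (RQF) of the protein sequence, with TGA as the stop codon.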

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
aec15 | YP_851429.1 | hypothetical protein | Not tested | PAI II APEC-O1 | Protein | 4e-162 | 54
aec15 | AAQ96709.1 | Aec15 | Not tested | AGI-1 | Protein | 3e-162 | 54
vgrG | AAN64196.1 | VgrG | Not tested | macrophage toxin pathogenicity island | Protein | 3e-124 | 50