Gene Information

Name : WFL_01050 (WFL_01050)
Accession : YP_006171665.1
Strain : Escherichia coli W
Genome accession: NC_017664
Putative virulence/resistance : Unknown
Product : IcmF-like protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 242268 - 245732 bp
Length : 3465 bp
Strand : -
Note : COG3523 Uncharacterized protein conserved in bacteria
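
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the stated length of 3465 bp. The helper name `feature_length` is hypothetical and not part of PAI DB.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(242268, 245732)   # 3465
codons = length_bp // 3                      # 1155, including the stop codon
protein_len = codons - 1                     # 1154 residues expected

print(length_bp, codons, protein_len)
```

A length that is a multiple of 3 is what one would expect for an intact coding sequence; 1154 residues matches the protein sequence listed in this record.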

DNA sequence :
ATGCCCCGGTTTAAAGTCTCTGCTTTCTGGCTGCTGATACTGGCGTGGATTTTTCTGCTGGTGTGGATCTGGTGGAAAGG
CCCGATGTGGACGCTGTATGAAGAACAGTGGCTCAAACCACTGGCGAACCGCTGGCTGGCAACGGCGGCGTGGGGGATTA
TTGCCCTGGTGTGGCTTACCGTCCGGGTGATGAAGCGCCTGCAACAGCTTGAAAAAATGCAAAAGCAACAGCGCGAGGAA
GCCGTTGATCCGCTCAGCGTGGAACTGAACGCCCAGCAGCGTTATCTTGACCGCTGGCTGCTGCGCCTGCAACGCCATCT
CGACAACCGCCGTTTCCTGTGGCAGTTGCCGTGGTATATGGTCATCGGCCCGGCGGGCAGTGGCAAAACGACACTGCTGC
GCGAGGGGTTTCCGTCCGACATTATTTATGCCCCGGAGGGCGCACGCGGCGCAGAACAACGCCTGTACCTCACGCCCCAT
GTCGGTAAACAGGCGGTGATCTTTGATATCGACGGCACACTCTGCGCTCCCGCTGATGCGGATATCCTGCATCGCCGTCT
GTGGGAACATGCCCTCGGCTGGCTGAAAGAAAAGCGCGCGCGCCAGCCGCTGAACGGGATTATTCTGACACTCGATTTAC
CCGATCTGCTTACCGCAGACAAACGCCGCCGCGAGCATCTGTTACAGACGCTGCGCAGCCGCTTACAGGATATACGCCAG
CATCTTCACTGCCAGTTACCGGTTTACGTGGTACTGACCCGGCTTGATTTATTACAGGGTTTTGCCGCCCTGTTCCAGTC
ACTCAACAGACAGGATCGCGATGCGATTCTGGGCGTCACGTTCACCCGCCGTGCCCATGAAAATGATGACTGGCGAACAG
AGTTGAATGCTTTCTGGCAGACATGGGTGGATCGAATGAATCTGGCGTTGCCGGATCTGATGGTCGCTCAGACTCACACC
CGCGCGTCTTTATTCAGTTTTTCCCGCCAGATGCAGGGAAGCCGTGAACCGCTGGTGTCACTGCTTGAGGGTCTGCTTGA
TGGCGAAAATATGAACGTGATGCTGCGTGGTGTCTATCTCACCTCTTCGCTTCAGCGTGGACAGATGGATGATATATTCA
CCCAGTCTGCCGCCCGCCAGTACCGGCTGGGCAATAACCCACTGGCGTCCTGGCCCCTGGTGGACACCGCGCCTTATTTC
ACCCGCAGCCTGTTCCCGCAGGCATTACTCGCAGAGCCTAATCTGGCAACAGAGAGCCGCGCCTGGCTGATACGTTCCCG
TCGCCGCCTGACGGTTTTCTCTGCCACAGGCGGCGTGGCAGCACTGCTGCTCATCACCGGCTGGCATCACTATTACAACG
GTAACTATCAGTCCGGCATCACCGTGCTTAAGCAGGCCAAAGCCTTTATGGACGTGCCGCCTCCGCAGGGGGAAGATGAC
TTTGGCAACCTGCAACTGCCGCTCCTGAACCCGGTACGCGATGCCACACTGGCCTATGGCGACTGGGGCGACCGCAGCCG
TCTGGCCGATATGGGACTGTACCAGGGACGACGTATCGGGCCTTATGTGGAACAGACCTATCTGCAACTGCTGGAGCAAC
GTTACCTGCCCTCGCTGTTTAACGGGCTGGTCAAAGCGATGAACGCCGCGCCGCCGGAGAGTGAAGAAAAACTCGCGGTG
CTGCGCGTAATGCGAATGCTGGAGGACAAAAGCGGACGTAACAATGAGGTGGTGAAGCAGTATATGGCAAAACGCTGGAG
CGAAAAATTCCACGGCCAGCGCGATATCCAGGCACAACTGATGTCCCATCTTGACTACGCGCTGGCTCATACTGACTGGC
ATGCAGAGCGTCAGGCGGGCGACGGTGACGCCATCAGCCGCTGGACGCCATATGACAAGCCCGTGGTATCAGCACAGAAA
GAACTGAGCAAACTGCCTGTCTACCAGCGGGTTTACCAGAGCCTGAAAACGCGGGCGCTGGGCGTTCTTCCTGCCGACCT
CAATCTGCGTGACCAGGTAGGGCCAACCTTTGACCAGGTGTTTACATCTGCCGATGACAACAAACTGGTTGTTCCACAGT
TTCTTACCCGTTACGGCCTGCAAAGCTATTTTGTAAAACAGCGCGATGAACTGGTTGAACTGACGGCGATGGATTCCTGG
GTACTTAACCTTACCCGCAGCGTGAAATACAGTGACGCCGACCGTGCGGAAATCCAGCGCCAGTTGACCGAGCAGTATAT
CAGCGACTACACCGCCACCTGGCGGGCCGGGATGGACAATCTGAATATCCGCAATTTTGAGTCCATCGGACAACTGACCG
GGGCGCTGGAGCAGGTTATCAGCGGCGACCTGCCTTTGCAGCGGGCGCTAACCGTGCTGCGTGACAACACACAGCCAGGC
GTCTTTTCTGAAAAACTCTCTGCCAAAGAACGGGAGGAAGCCCTGGCAGAGCCGGATTACCAGTTACTCACCCGCCTCGG
GCATGAATTCGCCCCGGAAAACAGTACCCTGGCAGTACAGAAAGACAAAGAAAGCACGATGCAGGCCGTGTATCAGCAAC
TCACCGAGTTGCACCGCTACCTGCTGGCAATCCAGAACGCGCCTGTACCAGGGAAATCGGCGCTGAAAGCCGTGCAGTTA
CGGCTTGATCAGAACAGCAGCGATCCGATATTCGCCACCCGCCAGATGGCAAAAACGCTGCCTGCTCCGCTCAACCGCTG
GGTTGGCAGGCTGGCTGACCAGGCCTGGCATGTGGTGATGGTGGAGGCTGTTCATTATATGGAAGTGGACTGGCGCGACA
GCGTGGTGAAACCGTTTAACGAGCAACTGGCAAATAACTATCCGTTTAATCCGCGTTCTGCACAGGATGCCTCACTGGAT
GCCTTCGAACGCTTCTTTAAACCGGATGGCATACTGGATACCTTCTACCAGCAGAACCTGAAGCTGTTTATCGATAATGA
CCTGAGTCTGGAGGATGGCGATAACAACGTCATTATTCGCGAAGATATTATTGCGCAACTGGAAACTGCGCAGAAAATCC
GTGACATCTTCTTCAGCAAACAGAACGGTCTGGGAACATCCTTTGCCGTGGAAACGGTATCGCTTTCAGGCAATAAACGC
CGCAGTGTACTGAACCTTGACGGTCAGTTAGTCGATTACAGCCAGGGCCGTAACTATACCGCCCATCTGGTCTGGCCTAA
CAACATGCGCGAAGGCAACGAAAGTAAGCTGACGCTCATCGGCACCAGCGGCAACGCGCCGCGCAGTATCAGCTTCAGCG
GGCCGTGGGCGCAGTTCCGCCTGTTCGGGGCCGGACAACTGACCGGAGTACAGGATGGCAACTTTACCGTGCGCTTTAGC
GTGGACGGTGGCGCGATGACCTACCGTGTGCATACCGACACGGAAGATAACCCGTTCAGCGGTGGGTTGTTCAGCCAGTT
TGGTCTGTCAGACACACTGTACTGA

Protein sequence :
MPRFKVSAFWLLILAWIFLLVWIWWKGPMWTLYEEQWLKPLANRWLATAAWGIIALVWLTVRVMKRLQQLEKMQKQQREE
AVDPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARGAEQRLYLTPH
VGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQTLRSRLQDIRQ
HLHCQLPVYVVLTRLDLLQGFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYF
TRSLFPQALLAEPNLATESRAWLIRSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDD
FGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQGRRIGPYVEQTYLQLLEQRYLPSLFNGLVKAMNAAPPESEEKLAV
LRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLMSHLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQK
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSW
VLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESIGQLTGALEQVISGDLPLQRALTVLRDNTQPG
VFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQL
RLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLD
AFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKR
RSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFS
VDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
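
The DNA sequence as listed starts with ATG and ends with TGA, so it can be translated directly to check it against the protein sequence. The following is a minimal sketch, assuming the standard codon assignments (which the bacterial genetic code shares for elongation codons); `translate` is a hypothetical helper, and only the first 30 and last 24 nucleotides of the record are used for illustration.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; unknown codons become 'X', stops become '*'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


# First 30 nt and last 24 nt of the DNA sequence in this record.
head = "ATGCCCCGGTTTAAAGTCTCTGCTTTCTGG"
tail = "GGTCTGTCAGACACACTGTACTGA"

print(translate(head))  # MPRFKVSAFW
print(translate(tail))  # GLSDTLY*
```

Both fragments agree with the start (MPRFKVSAFW) and end (GLSDTLY) of the protein sequence above, with the final TGA read as the stop codon.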

• Homologs from PAI DB

Gene   GenBank Accn  Product               Virulence or Resistance  PAI or REI                             Alignment Type  E-value  Identity
aec30  AAQ96724.1    Aec30                 Not tested               AGI-1                                  Protein         0.0      78
aec30  YP_851415.1   hypothetical protein  Not tested               PAI II APEC-O1                         Protein         0.0      78
pmt1   AAN64194.1    Pmt1                  Not tested               macrophage toxin pathogenicity island  Protein         0.0      57