Gene Information

Name : O3K_21030 (O3K_21030)
Accession : YP_006780870.1
Strain : Escherichia coli 2011C-3493
Genome accession: NC_018658
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4333143 - 4334528 bp
Length : 1386 bp
Strand : +
Note : COG2804 Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB

DNA sequence :
ATGAATATTCCACAGCTCACGGCCCTGTGTCTGCGTTATCAGGGAGTCTTGCTGGATGCCAGCGAAGAAGTGGTTCATGT
TGCGGTGGTCGATGCCCCCTCACATGAGTTGCTGGACGCATTGCATTTCGCTACCACCAAACGTATTGAGATCACCTGCT
GGACGCGCCAACAAATGGAAGGTCACGCCAGTCGCACACAACAGACATTGCCCGTAGCTGTTCAGGAGAAGCATCAGCCC
AAAGCAGAGTTGCTAGCTCGAACGTTACAATCTGCGCTGGAACAACGCGCGTCTGATATTCATATCGAACCAGCGGACAA
TGCCTACCGCATCCGCTTGCGTATCGACGGCGTATTGCATCCTTTACCGGATGTTTCACCGGATGCCGGAGTCGCATTAA
CCGCCAGATTAAAAGTGCTGGGAAACCTGGATATTGCGGAACATCGCCTGCCGCAGGACGGGCAATTCACTGTCGAACTG
GCAGGAAACGCCGTCTCATTTCGTATTGCGACCTTACCATGTCGGGGTGGTGAAAAGGTGGTATTAAGGTTGTTACAGCA
GGTGGGTCAGGCACTGGATGTCAACACGCTTGGAATGCAGCCGTTACAACTGGCGGACTTTGCTCATGCCTTGCAACAAC
CACAGGGACTGGTGCTGGTAACTGGCCCTACAGGCAGCGGCAAAACGGTCACGCTTTATAGTGCCCTGCAAACGCTGAAT
ACCGCTGACATTAATATTTGTAGCGTCGAAGATCCGGTTGAGATCCCCATAGCCGGACTAAACCAGACGCAAATCCATCC
GCGTGCCGGACTCACCTTTCAGGGCGTGTTGCGTGCGTTATTGCGCCAGGATCCTGACGTCATCATGATCGGAGAGATCC
GCGATGGCGAAACAGCAGAGATCGCTATTAAAGCGGCGCAAACTGGTCACCTGGTGTTGTCTACCCTACACACTAATTCC
ACCTGCGAAACGCTGGTACGTTTACAGCAAATGGGAGTCGCCCGCTGGATGCTCTCATCAGCGCTTACGCTGGTAATAGC
CCAGCGTCTGGTACGTAAACTTTGCCCACATTGTCGCCAGCAGCAAGGGGAGCCCATCCATATTCCAGTCAATGTATGGC
CGTCGCCGCTGCCCCACTGGCAGGCACCCGGTTGTGTACATTGCTACCACGGTTTTTATGGTCGTACGGCCTTATTTGAA
GTTCTGCCCATAACGCCGGTCATTCGTCAGCTTATTTCCGCTAATACCGACGTTGAATCGCTGGAAACGCACGCACGACA
GGCGGGTATGTGTACGCTTTTTGAAAACGGCTGCCTGGCCGTGGAGCAAGGCTTAACCACCTTTGAAGAGTTAATCCGCG
TACTGGGGATGCCGCATGGCGAGTAA

Protein sequence :
MNIPQLTALCLRYQGVLLDASEEVVHVAVVDAPSHELLDALHFATTKRIEITCWTRQQMEGHASRTQQTLPVAVQEKHQP
KAELLARTLQSALEQRASDIHIEPADNAYRIRLRIDGVLHPLPDVSPDAGVALTARLKVLGNLDIAEHRLPQDGQFTVEL
AGNAVSFRIATLPCRGGEKVVLRLLQQVGQALDVNTLGMQPLQLADFAHALQQPQGLVLVTGPTGSGKTVTLYSALQTLN
TADINICSVEDPVEIPIAGLNQTQIHPRAGLTFQGVLRALLRQDPDVIMIGEIRDGETAEIAIKAAQTGHLVLSTLHTNS
TCETLVRLQQMGVARWMLSSALTLVIAQRLVRKLCPHCRQQQGEPIHIPVNVWPSPLPHWQAPGCVHCYHGFYGRTALFE
VLPITPVIRQLISANTDVESLETHARQAGMCTLFENGCLAVEQGLTTFEELIRVLGMPHGE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
gspE YP_854406.1 type II secretion protein GspE Not tested PAI I APEC-O1 Protein 1e-69 45
gspE CAE85233.1 GspE, hypothetical type II secretion protein Not tested PAI V 536 Protein 9e-70 45

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
O3K_21030 YP_006780870.1 hypothetical protein VFG2048 Protein 3e-71 46
O3K_21030 YP_006780870.1 hypothetical protein VFG2426 Protein 3e-73 45
O3K_21030 YP_006780870.1 hypothetical protein VFG0182 Protein 4e-72 44
O3K_21030 YP_006780870.1 hypothetical protein VFG0233 Protein 3e-74 44
O3K_21030 YP_006780870.1 hypothetical protein VFG0112 Protein 3e-82 44
O3K_21030 YP_006780870.1 hypothetical protein VFG1876 Protein 3e-71 43