Gene Information

Name : ECOK1_4626 (ECOK1_4626)
Accession : YP_006103686.1
Strain : Escherichia coli IHE3034
Genome accession : NC_017628
Putative virulence/resistance : Resistance
Product : putative sulfatase
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4759973 - 4761646 bp
Length : 1674 bp
Strand : -
Note : identified by match to protein family HMM PF00884; match to protein family HMM PF08019
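
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the final codon is a stop codon; the function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
start, end = 4759973, 4761646

length_bp = feature_length(start, end)   # should agree with the Length field
codons = length_bp // 3                  # total codons, including the stop codon
residues = codons - 1                    # assumption: last codon is a stop codon

print(length_bp, codons, residues)
```

Under these assumptions the computed length matches the Length field, and the residue count gives the expected size of the translated product.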

DNA sequence :
ATGCGCACTTTGTTCGATGGAAACACCGTGATGTTGAAGCGCCTACTAAAAAGACCCTCTTTGAATTTACTCGCCTGGCT
ATTGTTAGCCGCTTTTTATATCTCTATCTGCCTGAATATTGCCTTTTTTAAACAGGTGTTGCAGGCGCTGCCGCTGGACT
CGCTGCATAACGTACTGGTTTTCTTGTCGATGCCGGTCGTCGCCTTCAGCGTGATTAATATTGTCCTGACACTAAGCTCT
TTCTTGTGGCTTAATCGACCGCTGGCCTGCCTGTTTATTCTGGTTGGCGCGGCTGCACAATATTTCATAATGACTTACGG
CATCGTCATCGACCGCTCGATGATTGCCAATATTATTGATACCACTCCGGCAGAAAGTTATGCGTTGATGACGCCGCAAA
TGTTATTAACGCTGGGATTCAGCGGCGTGCTTGCTGCGCTGATTGCCTGCTGGATCAAAATCAAACCCACCACCTCACGC
CTGCGCAGCGTTCTTTTCCGCGGGGCCAACATCCTGATTTCTGTACTGCTGATTCTGCTGGTTGCCGCACTGTTCTACAA
AGACTATGCATCGCTGTTTCGCAACAACAAAGAACTGGTGAAATCCTTAAGCCCGTCCAACAGCATTGTTGCTAGCTGGT
CGTGGTACTCTCACCAGCGGTCAGCCAACCTGCCACTGATACGCATTGGTGAAGATGCTCACCGCAACCCATTAATGCAA
AATGGGAAACGCAAAAACCTGACCATTTTGATCGTCGGAGAAACGTCGCGCGCAGAGAATTTCTCGCTGAATGGTTATCC
ACGCGAAACCAATCCTCGCCTGGCAAAAGATAACGTGGTCTATTTCCCTAATACCGCCTCTTGTGGCACAGCAACAGCCG
TCTCAGTGCCGTGCATGTTCTCGGATATGCCGCGTGAGCACTACAAAGAAGAGCTGGCACAGCACCAGGAAGGCGTGCTG
GATATCATTCAGCGAGCGGGCATCAACGTGCTGTGGAATGACAACGATGGCGGCTGTAAAGGCGTTTGCGATCGCGTACC
TCACCAGAACGTCACCGCGCTGAACCTGCCTGGTCAGTGCATCAACGGCGAATGCTATGACGAAGTGCTGTTCCACGGGC
TGGAAGAGTACATCAATAATCTGCAAAGCGATGGCCTCATTGTATTACACACTATCGGCAGCCACGGCCCGACCTATTAC
AACCGCTATCCGCCGCAGTTCAGGAAATTTACCCCAACCTGCGACACTAACGAGATCCAGACCTGTACCCAAGAGCAACT
GGTGAACACTTACGACAACACGCTGGTTTACGTCGACTATATTGTTGATAAAGCGATTAATCTGCTGAAAGAACATCAGG
ATAAATTTACCACCAGCCTGGTTTATCTTTCTGACCACGGTGAATCGTTAGGTGAAAATGGCATCTATCTGCACGGTCTG
CCTTATGCCATCGCCCCGGATAGCCAAAAACAGGTGCCGATGCTGCTGTGGCTATCGGAGGATTATCAAAAACGGTATCA
GGTTGACCAGAACTGCCTGCAAAAACAGGCGCAAACGCAACACTATTCACAAGACAATTTATTCTCAACCTTATTGGGCC
TGACTGGCGTTGAGACGAAGTATTACCAGGCTGCGGATGATATTCTGCAAACTTGCAGGAGAGTGAGTGAATGA

Protein sequence :
MRTLFDGNTVMLKRLLKRPSLNLLAWLLLAAFYISICLNIAFFKQVLQALPLDSLHNVLVFLSMPVVAFSVINIVLTLSS
FLWLNRPLACLFILVGAAAQYFIMTYGIVIDRSMIANIIDTTPAESYALMTPQMLLTLGFSGVLAALIACWIKIKPTTSR
LRSVLFRGANILISVLLILLVAALFYKDYASLFRNNKELVKSLSPSNSIVASWSWYSHQRSANLPLIRIGEDAHRNPLMQ
NGKRKNLTILIVGETSRAENFSLNGYPRETNPRLAKDNVVYFPNTASCGTATAVSVPCMFSDMPREHYKEELAQHQEGVL
DIIQRAGINVLWNDNDGGCKGVCDRVPHQNVTALNLPGQCINGECYDEVLFHGLEEYINNLQSDGLIVLHTIGSHGPTYY
NRYPPQFRKFTPTCDTNEIQTCTQEQLVNTYDNTLVYVDYIVDKAINLLKEHQDKFTTSLVYLSDHGESLGENGIYLHGL
PYAIAPDSQKQVPMLLWLSEDYQKRYQVDQNCLQKQAQTQHYSQDNLFSTLLGLTGVETKYYQAADDILQTCRRVSE
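
The relationship between the two sequences above can be checked by translating the DNA. The following is a minimal sketch, assuming the DNA sequence is already given in coding orientation (the gene is on the minus strand, but the listing starts with ATG) and that standard codon assignments apply. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":          # stop codon: end of the product
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the DNA sequence in this record.
head = "ATGCGCACTTTGTTCGATGGAAACACCGTG"
tail = "CAAACTTGCAGGAGAGTGAGTGAATGA"

print(translate(head))  # MRTLFDGNTV
print(translate(tail))  # QTCRRVSE
```

Both fragments agree with the start and end of the protein sequence listed above. To check the whole record, join the DNA lines into one string and pass it to `translate`.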

• Homologs from PAI DB

Gene GenBank Accn Product Virulence or Resistance PAI or REI Alignment Type E-value Identity (%)
Z1140 NP_286675.1 hypothetical protein Not tested TAI Protein 1e-92 43
Z1579 NP_287083.1 hypothetical protein Not tested TAI Protein 1e-92 43

• Homologs from CARD and BacMet (resistance genes)

Gene GenBank Accn Product ID of source DB Alignment Type E-value Identity (%)
ECOK1_4626 YP_006103686.1 putative sulfatase BAC0485 Protein 5e-154 61