Gene Information

Name : EcDH1_3878 (EcDH1_3878)
Accession : YP_006093768.1
Strain : Escherichia coli DH1
Genome accession: NC_017625
Putative virulence/resistance : Resistance
Product : sulfatase
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4178961 - 4180634 bp
Length : 1674 bp
Strand : +
Note : PFAM: sulfatase; protein of unknown function DUF1705; KEGG: sfx:S3621 putative cell division protein

DNA sequence :
ATGCGCACTTTGTTCGATGGAAACACCGTGATGTTGAAGCGCCTACTAAAAAGACCCTCTTTGAATTTACTCGCCTGGCT
ATTGTTGGCCGCTTTTTATATCTCTATCTGCCTGAATATTGCCTTTTTTAAACAGGTGTTGCAGGCGCTGCCGCTGGATT
CGCTGCATAACGTACTGGTTTTCTTGTCGATGCCGGTCGTCGCTTTCAGCGTGATTAATATTGTCCTGACACTAAGCTCT
TTCTTATGGCTTAATCGACCACTGGCCTGCCTGTTTATTCTGGTTGGCGCGGCTGCACAATATTTCATAATGACTTACGG
CATCGTCATCGACCGCTCGATGATTGCCAATATTATTGATACCACTCCGGCAGAAAGTTATGCGCTGATGACACCGCAAA
TGTTATTAACGCTGGGATTCAGCGGCGTGCTTGCTGCGCTGATTGCCTGCTGGATAAAAATCAAACCTGCCACCTCGCGT
CTGCGCAGTGTTCTTTTCCGTGGAGCCAATATTCTGGTTTCTGTACTACTGATTTTGCTGGTCGCCGCACTGTTTTATAA
AGACTACGCCTCGTTGTTCCGCAATAACAAAGAGCTGGTGAAATCCTTAAGCCCCTCTAACAGCATTGTTGCCAGCTGGT
CATGGTACTCCCATCAGCGACTGGCAAATCTGCCGCTGGTGCGAATTGGTGAAGACGCGCACCGCAACCCGTTAATGCAG
AACGAAAAACGTAAAAATTTGACCATCCTGATTGTCGGCGAAACCTCGCGGGCGGAGAACTTCTCCCTCAACGGCTACCC
GCGTGAAACTAACCCGCGGCTGGCGAAAGATAACGTGGTCTATTTCCCTAATACCGCATCTTGCGGCACGGCAACGGCAG
TTTCAGTACCGTGCATGTTCTCGGATATGCCGCGTGAGCACTACAAAGAAGAGCTGGCACAGCACCAGGAAGGCGTGCTG
GATATCATTCAGCGAGCGGGCATCAACGTGCTGTGGAATGACAACGATGGCGGCTGTAAAGGTGCCTGCGACCGCGTGCC
TCACCAGAACGTCACCGCGCTGAATCTACCTGATCAGTGCATCAACGGCGAATGCTATGACGAAGTGCTGTTCCACGGGC
TTGAAGAGTACATCAATAACCTGCAAGGTGATGGCGTGATTGTCTTACACACCATCGGCAGCCACGGTCCGACCTATTAC
AACCGCTATCCGCCTCAGTTCAGGAAATTTACCCCAACCTGCGACACCAATGAGATCCAGACCTGTACCAAAGAGCAACT
GGTGAACACTTACGACAACACGCTGGTTTACGTCGACTATATTGTTGATAAAGCGATTAATCTGCTGAAAGAACATCAGG
ATAAATTTACCACCAGCCTGGTTTATCTTTCTGACCACGGTGAATCGTTAGGTGAAAATGGCATCTATCTGCACGGTCTG
CCTTATGCCATCGCCCCGGATAGCCAAAAACAGGTGCCGATGCTGCTGTGGCTGTCGGAGGATTATCAAAAACGGTATCA
GGTTGACCAGAACTGCCTGCAAAAACAGGCGCAAACGCAACACTATTCACAAGACAATTTATTCTCCACGCTATTGGGAT
TAACTGGCGTTGAGACGAAGTATTACCAGGCTGCGGATGATATTCTGCAAACTTGCAGGAGAGTGAGTGAATGA

Protein sequence :
MRTLFDGNTVMLKRLLKRPSLNLLAWLLLAAFYISICLNIAFFKQVLQALPLDSLHNVLVFLSMPVVAFSVINIVLTLSS
FLWLNRPLACLFILVGAAAQYFIMTYGIVIDRSMIANIIDTTPAESYALMTPQMLLTLGFSGVLAALIACWIKIKPATSR
LRSVLFRGANILVSVLLILLVAALFYKDYASLFRNNKELVKSLSPSNSIVASWSWYSHQRLANLPLVRIGEDAHRNPLMQ
NEKRKNLTILIVGETSRAENFSLNGYPRETNPRLAKDNVVYFPNTASCGTATAVSVPCMFSDMPREHYKEELAQHQEGVL
DIIQRAGINVLWNDNDGGCKGACDRVPHQNVTALNLPDQCINGECYDEVLFHGLEEYINNLQGDGVIVLHTIGSHGPTYY
NRYPPQFRKFTPTCDTNEIQTCTKEQLVNTYDNTLVYVDYIVDKAINLLKEHQDKFTTSLVYLSDHGESLGENGIYLHGL
PYAIAPDSQKQVPMLLWLSEDYQKRYQVDQNCLQKQAQTQHYSQDNLFSTLLGLTGVETKYYQAADDILQTCRRVSE

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
Z1140 NP_286675.1 hypothetical protein Not tested TAI Protein 4e-92 43
Z1579 NP_287083.1 hypothetical protein Not tested TAI Protein 4e-92 43

• Homologs from CARD and BacMet (resistance genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
EcDH1_3878 YP_006093768.1 sulfatase BAC0485 Protein 9e-154 60