Gene Information

Name : EcSMS35_4754 (EcSMS35_4754)
Accession : YP_001746675.1
Strain : Escherichia coli SMS-3-5
Genome accession: NC_010498
Putative virulence/resistance : Virulence
Product : sulfatase family protein
Function : -
COG functional category : R : General function prediction only
COG ID : COG2194
EC number : 3.1.6.-
Position : 4853175 - 4854674 bp
Length : 1500 bp
Strand : -
Note : yjgX; identified by match to protein family HMM PF00884

DNA sequence :
ATGAATATCAGAAAACTGTTTTGTCCGGGAAACACACCCCGGATTTTATTGTTTTTATTCTTTTTTGTTGTTTCTGCAAT
AACCACAATTGCATGCGGATACACTGAGAAGAATGCCACAGGAAATGTGCTGCTTCTGTTTCTCCTTCTTCTCCTTGCAC
ACAGAAATACCCTCACATCCATTACAGCGCTGTTATTTCTGTTCTGTTGTGCACTGTATGCGCCTGCGGGTATGACGTAC
GGTAAAATCAACAACAGTTTTATTGTCGCGTTGCTGCAGACCACGGCTGATGAGGCTGCAGAGTTTACCGGGATGATTCC
TGTTTATCATTTTCTGGTCAGTGCCGCGATTCTGGTATTCATGGTGATTTTCTGGCGGACACACCACCGCGGTCACCGTA
ACTGGCTGGCACTGCTGCTATTCGTATTATGCTCTGTAAACAGCTGGCCGTTGCGGATGGTTAAAGGAACTGTTGTGGGG
ACAACTGACACATTGCGTGAAATGCAGCGTTATAAACAACTGAGTCAGCACGGGGCTGATAACTGGAAAATCCTGCCGGG
TGTGCCGTTGTATGACACGATTGTTATCGTTACTGGTGAGAGTGTGCGCAGGGATTATATGTCGGTATATGGCTATCCCG
TGCCAACCACGCCGTGGCTGAATACAGCTCCCGGTTTATTTATTGACGGCTATACATCGGCAGCAGCCAGTACCGTACCT
TCCCTGAGCCGGACACTGATTTATGACTATGAGCAGAACCCTGATTCCGGCAACAACGTGGTGGCGCTGGCAGCAAAAGC
AGGATACAGCACATGGTGGATATCCAATCAGGGAAAACTGGGAGAGCATGACACACGCATCTCTGTTATTGCTTCTGATG
CGGAGCATACCGTTTTCCTCAAGAAAGGCAGCTTCGCTTCCCGTAAAACGGATGACATGTTGTTGTTACAGGAAACAGAA
CGTGCGCTGGCGGATAAATCCTCGCCGAAGGTGATTTTCCTGCACATGATTGGCTCTCATCCGAATCCGTGTGACCGACT
TAACTCCTGGCCGAATTATTACCTGGAGCAGTATCCCCGAAAGATTGCCTGTTACCTCGCCAGCATCAGTAAACTGGATA
ACTTTCTCGGTCAGCTTGATGGTATCCTTCGCCGGCATTCCCGTCACTTTGCAATGCTTTACTTTTCTGACCATGGGCTG
TCGGTCAGCGACAGTGCTAATCCTGTTCATCATGATGGTCATGTGCAGGGGGGCTACAGCGTTCCCCTGATTATTACCGC
CAGTGACATCACGTCTCATCAGTCCGTCAGCAGGAAAATCAGTGCCCGTAATTTCGCAGGCATTTTTCAGTGGCTGACCG
GTATTCGTACCGAAAATATAACGCCATTCAATCCGCTGACAGACGAAGATAATGAACCCGTTATGGTTTTTAACGGAGAG
AAAAATGTGCCGGCAGACAGTCTGAAACCGCAGCCACTTATTCTTCCGGACCACAGGTAA

Protein sequence :
MNIRKLFCPGNTPRILLFLFFFVVSAITTIACGYTEKNATGNVLLLFLLLLLAHRNTLTSITALLFLFCCALYAPAGMTY
GKINNSFIVALLQTTADEAAEFTGMIPVYHFLVSAAILVFMVIFWRTHHRGHRNWLALLLFVLCSVNSWPLRMVKGTVVG
TTDTLREMQRYKQLSQHGADNWKILPGVPLYDTIVIVTGESVRRDYMSVYGYPVPTTPWLNTAPGLFIDGYTSAAASTVP
SLSRTLIYDYEQNPDSGNNVVALAAKAGYSTWWISNQGKLGEHDTRISVIASDAEHTVFLKKGSFASRKTDDMLLLQETE
RALADKSSPKVIFLHMIGSHPNPCDRLNSWPNYYLEQYPRKIACYLASISKLDNFLGQLDGILRRHSRHFAMLYFSDHGL
SVSDSANPVHHDGHVQGGYSVPLIITASDITSHQSVSRKISARNFAGIFQWLTGIRTENITPFNPLTDEDNEPVMVFNGE
KNVPADSLKPQPLILPDHR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed AFX83957.1 sulfatase family protein Not tested SE-PAI Protein 0.0 99
APECO1_3531 YP_854244.1 hypothetical protein Not tested PAI I APEC-O1 Protein 0.0 98
yjgX CAD42019.1 hypothetical protein Not tested PAI II 536 Protein 0.0 98
ORF_3 AAZ04414.1 conserved hypothetical protein Not tested PAI I APEC-O1 Protein 0.0 98

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
EcSMS35_4754 YP_001746675.1 sulfatase family protein VFG1538 Protein 0.0 98