Gene Information

Name : ECS88_3251 (ECS88_3251)
Accession : YP_002392872.1
Strain : Escherichia coli S88
Genome accession: NC_011742
Putative virulence/resistance : Virulence
Product : superfamily I DNA helicase
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG1112
EC number : -
Position : 3221776 - 3225303 bp
Length : 3528 bp
Strand : +
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; Product type pe : enzyme
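
The Position and Length fields above can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive (which is consistent with the reported length) and that the gene ends in a single stop codon; `feature_length` is a hypothetical helper name, not part of the database.

```python
# Values copied from the record above.
START, END = 3221776, 3225303
REPORTED_LENGTH = 3528


def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


length = feature_length(START, END)
codons, remainder = divmod(length, 3)
residues = codons - 1  # assumes exactly one terminal stop codon

print(length == REPORTED_LENGTH)  # True
print(remainder)                  # 0, so the length is a whole number of codons
print(residues)                   # 1175
```

The value of 1175 residues agrees with the protein sequence listed in this record (14 lines of 80 residues plus a final line of 55).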

DNA sequence :
ATGGGACTGGTAATGGATGAAAATGCTTTAGGGTTTGCCTCATACTGGCGCAACTCGCTTGCGGATGCTGAGTCAGGAAA
GGGCAGTTTTGAACGGAAAGACGCCAAAAATTTCACTCACTGGCATGGGATAGCGGCGGGACGTCTTGACGAAGCGATTG
TCAGTAAATTTTTTGAGGGAGAAAAAGACGATGTCGAAACGGTCGATGTCGTCTTGCGCCCAAAAGTTTATTTCCGGTTA
CTGCAGCATGGTAAGGACCGTTCCGCAGGCGCACCTGATATTGTTACCCCGCTAGTGACGCCAGCCTTGCTAAGCCGTGA
GGGTTTTTTATATCCGACGCCAGCGACCTCCATTCCCAGAGACCTGCTTGAACCTTTGCCAAAAGGAGCATTTTCGATTG
GTGAGATTGGGCAGTATGACAAATACAAGACAATCCATACCTCGTTCTCTATCAACTTTGATGACAGCATTGATAAGACT
GCCGAAACGGATGAAGAACGGGAAGCACGATATGCAGCCTTGCAGCAGGAGTGGCGTCAATATCTGGATGATTCAGAGAG
GCTGCTGAAGAACGTTGCCGGCGACTGGATTAAAAATCCTGAGCAATATGAACTCGCTGAGCACGGTTATATTGTTAAAA
CGGCGCAATCTGGTGGTGCCAGTTTCCATATCCTTTCGCTTTATGATCACCTGCTTGTTTGCAAGAAGGATGTGCCGCTC
TTCAATCGCTTCGCCTCGCGAGAGGTTCATGCTGCAGAGTCTTTGCTGGCCCCAGGAGCAAAATTCAGCGACAGGCTTGG
ACACTCCGGAGATAAGTTTCCGCTGGCAAAGGCTCAGCGCGATGCCTTAAGCCATTTTCTGGATGCAAGACATGGCGATA
TCCTTGCTGTTAATGGACCTCCGGGAACCGGAAAAACCACGCTGGTGCTTTCTATCATCGCCACGCAGTGGGCCAGAGCG
GCTCTCGAAAAATCTGAGCCTCCGGTTATTATCGCGACTTCAACGAACAACCAGGCTGTAACGAACATTATCGAAGCGTT
CGGGAAAGATTTTTCACAGGGCACTGGTGCAATGGCCGGACGATGGTTACCTGAGCTGAAAAGCTTCGGCGCTTATTTTC
CCTCAAGCACTCGTAAAGCTGAGGCAGCCAAAAAATATCAAACTGAAGATTTCTTCAACCAGGTTGAGTCAAAAGAGTAT
GTAGAGGATGCACTGCTGTTTTATCTCGAGAAAGCTAAGGCAGCTTTTCCTGAAAAAGAGTGTTCATCCCCTGAAAAGGT
CATTGAACTCCTGCATGGTCAGTTGGTAGCAAAATCCGAGCAATTGAAAAGACTGAACGCAACATGGCAAACGTTAAGCC
AGGTACGGGCTGCGCGTGAGCTTATTGCTAATGATATTGAGCAATATCTCGATAATTTAAATAAATTACTTTCCGGACAA
GAACAAAAAGTCACTCTACTGAAGAGTGCTAAAACGGAATGGAAAAAATATCGCGCCGGTGAATCACTGATCTATTCATT
ATTTTCCTGGCTCCCAGCGGTTCGTAGTAAGCGACAGTACCAAATACAGCTGTTTCTCGAAGATAAATTAGGCGCGCTGA
TTGCAGGAAATCAGTGGTCTGATCCTGAAACTATCGAACGTAATATTGATGGGCTGCTCAATTCCGCTGAGCGCGAGCAA
ACAACATACCGGCAGCAGATTGACTCCGCCCATGAAATCGTTCTTAAAGAACAGCAGGCGGTTCAGGAGTGGCAGAGGCT
GGCATTTGATTTAGGGTATGAGGGCGACGAGGAACTGAGCTTCTCACAGGCCGATGAACTGGCTGATACGCAGATTCGCT
TCCCTGCATTTTTACTGACGACTCACTACTGGGAAGGTCGTTGGCTGATGGATATGGCCAGAATTGATGATCTGCAGGAA
GAGAAGAAGAAAAAAGGCGCTAAAGGGGTAACCGCCCGTTGGCAACGTCGAATGAAACTCACTCCATGTGTGGTGATGAC
ATGCTATATGCTGCCCGGCAATATGCAGATAAGTGAGCACAAAGGACAACGTAAATTCGAGAAAAGTTATTTGTATGATT
TTGCCGATTTACTCATTGTCGATGAAGCCGGGCAGGTGCTTCCTGAAGTGGCTGCTGCCTCGTTTGCATTAGCTAAGAAG
GCATTAGTGATTGGCGATACGGAGCAGATCCCGCCAATATGGAGTATTGCTCCCGCGATTGATGTCGGTAACATGCTTGC
GGAAAAAATTCTGTCTGGCAGTACGCAAGAAGAGATTACCGAGAAATATACGGCAATCGCAGACCTTGGTAAAAGTGCCG
CATCTGGCAGCGTTATGAAAATAGCGCAGTTTGCTTCGCGCTATCAATATGATCCCGAACTGGCTCGTGGTATGTACCTA
TATGAACACCGCCGGTGCTTCGATAATATTATTGGATACTGTAATACGCTCTGCTATCACGGTAAGTTGTTGCCTAAAAG
AGGGCGTGAAGAGAGCAATTTAATGCCCGCAATGGGTTATCTCCATATTGATGGTAAAGGAGAGCAGGCAAGTAGTGGAA
GTAGATATAATTTGCTTGAGGCTGAAACGATAGCGGCCTGGTTGGCAGAGAACCAGCAAAATATTGAAGCGCATTACGGC
AAATCGCTTCATGAAGTTGTCGGTATCGTGACGCCTTTTAGCGCGCAGGTATCCACTATCAAACAGGCGCTGGGTAAACA
AGGTATCAGTACGGGCGCGAATGAAAAGTCGCTCACAGTGGGCACCGTGCACTCTCTTCAGGGAGCGGAAAGAGCGATTG
TTATATTCTCGCCAGTCTATTCAAAACATGAAGACGGCGGGTTTATTGATAGCGATAACAGCATGCTGAATGTTGCAGTC
TCCCGTGCGAAGGATAGCTTCCTGGTCTTCGGCGATATGGATCTGTTTGAGGTCCAGCCAGCCTCATCTCCACGGGGATT
ACTGGCAAAATACCTCTTTGAGTCAGAGAAGAATGCGCTCTCTTTTGATTATAAAGAGCGTAAGGATTTAAAAACTTCCG
AGACCAAAATCTACACACTTCATGGTGTGGAGCAGCATGATAACTTCCTGAATCAGACGTTCGAAAATACCGATAAACAC
ATCACGATAGTTTCTCCATGGCTGACCTGGCAAAAGCTGGAGCAAACCGGTTTTCTTGATTCTATGATTGCGGCGTGTTC
ACGTGGTATTAACGTCACGATAGTCACTGACAGAAGCTACAACACTGAACATAAAGATTTTGAGAAGCGAAAAGAGAAGC
AGCAGAACCTTAAAGCGGCGCTGGAGAAACTGAATGCGCTGGGTATTGCTACAAAGCTGGTAAACCGTGTTCATAGCAAA
ATTGTTATTGGTGATGATGGTTTGCTATGCGTGGGATCGTTCAACTGGTTTAGTGCGACACGGGAAGCGCGATATGAACG
ATACGATACCTCAATGGTTTATTGCGGTGATAACCTGAAGGGTGAGATTGAGGCTATTTATAATAGTCTTGAGAGGCGTC
AGGTTTAG
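
As a rough illustration of how the DNA sequence above maps to the protein sequence in this record, the sketch below translates the first 24 and last 12 nucleotides. It assumes the standard genetic code and a reading frame starting at the first base; `translate` is a hypothetical helper written for this example, not a tool provided by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGGACTGGTAATGGATGAAAAT"))  # first 24 nt -> MGLVMDEN
print(translate("CGTCAGGTTTAG"))              # last 12 nt  -> RQV
```

Both fragments match the start (MGLVMDEN) and end (RQV) of the protein sequence given in this record.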

Protein sequence :
MGLVMDENALGFASYWRNSLADAESGKGSFERKDAKNFTHWHGIAAGRLDEAIVSKFFEGEKDDVETVDVVLRPKVYFRL
LQHGKDRSAGAPDIVTPLVTPALLSREGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTIHTSFSINFDDSIDKT
AETDEEREARYAALQQEWRQYLDDSERLLKNVAGDWIKNPEQYELAEHGYIVKTAQSGGASFHILSLYDHLLVCKKDVPL
FNRFASREVHAAESLLAPGAKFSDRLGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGPPGTGKTTLVLSIIATQWARA
ALEKSEPPVIIATSTNNQAVTNIIEAFGKDFSQGTGAMAGRWLPELKSFGAYFPSSTRKAEAAKKYQTEDFFNQVESKEY
VEDALLFYLEKAKAAFPEKECSSPEKVIELLHGQLVAKSEQLKRLNATWQTLSQVRAARELIANDIEQYLDNLNKLLSGQ
EQKVTLLKSAKTEWKKYRAGESLIYSLFSWLPAVRSKRQYQIQLFLEDKLGALIAGNQWSDPETIERNIDGLLNSAEREQ
TTYRQQIDSAHEIVLKEQQAVQEWQRLAFDLGYEGDEELSFSQADELADTQIRFPAFLLTTHYWEGRWLMDMARIDDLQE
EKKKKGAKGVTARWQRRMKLTPCVVMTCYMLPGNMQISEHKGQRKFEKSYLYDFADLLIVDEAGQVLPEVAAASFALAKK
ALVIGDTEQIPPIWSIAPAIDVGNMLAEKILSGSTQEEITEKYTAIADLGKSAASGSVMKIAQFASRYQYDPELARGMYL
YEHRRCFDNIIGYCNTLCYHGKLLPKRGREESNLMPAMGYLHIDGKGEQASSGSRYNLLEAETIAAWLAENQQNIEAHYG
KSLHEVVGIVTPFSAQVSTIKQALGKQGISTGANEKSLTVGTVHSLQGAERAIVIFSPVYSKHEDGGFIDSDNSMLNVAV
SRAKDSFLVFGDMDLFEVQPASSPRGLLAKYLFESEKNALSFDYKERKDLKTSETKIYTLHGVEQHDNFLNQTFENTDKH
ITIVSPWLTWQKLEQTGFLDSMIAACSRGINVTIVTDRSYNTEHKDFEKRKEKQQNLKAALEKLNALGIATKLVNRVHSK
IVIGDDGLLCVGSFNWFSATREARYERYDTSMVYCGDNLKGEIEAIYNSLERRQV
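
The Note field says the function was proposed from a conserved amino acid motif, but it does not say which one. As one possible illustration only, the sketch below searches a fragment of the protein for the Walker A (P-loop) consensus G-x(4)-G-K-[ST], which is common in NTP-binding proteins such as helicases. The choice of motif and the name `WALKER_A` are assumptions made for this example, not something stated in the record.

```python
import re

# Fourth 80-residue line of the protein sequence above.
FRAGMENT = (
    "FNRFASREVHAAESLLAPGAKFSDRLGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGPPGTGKTTLVLSIIATQWARA"
)

# Walker A / P-loop consensus: G, any four residues, G, K, then S or T.
WALKER_A = re.compile(r"G.{4}GK[ST]")

match = WALKER_A.search(FRAGMENT)
print(match.group(0) if match else "no match")  # GPPGTGKT
```

A match of this kind is only weak evidence; it is consistent with the "limited homology" wording of the Note rather than a confirmation of helicase activity.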

• Homologs from PAI DB

Gene         GenBank Accn  Product                     Virulence or Resistance  PAI or REI     Alignment Type  E-val  Identity
ORF_2        AAZ04413.1    superfamily I DNA helicase  Not tested               PAI I APEC-O1  Protein         0.0    100
APECO1_3532  YP_854230.1   superfamily I DNA helicase  Not tested               PAI I APEC-O1  Protein         0.0    100
unnamed      CAD42018.1    hypothetical protein        Not tested               PAI II 536     Protein         0.0    98
S3169        NP_838460.1   superfamily I DNA helicase  Not tested               SHI-1          Protein         0.0    98
SF2965       NP_708739.1   superfamily I DNA helicase  Not tested               SHI-1          Protein         0.0    98

• Homologs from VFDB (virulence genes)

Gene        GenBank Accn    Product                     ID of source DB  Alignment Type  E-val  Identity
ECS88_3251  YP_002392872.1  superfamily I DNA helicase  VFG0627          Protein         0.0    98
ECS88_3251  YP_002392872.1  superfamily I DNA helicase  VFG1537          Protein         0.0    98