Name : ECS88_3251 (ECS88_3251) Accession : YP_002392872.1 Strain : Escherichia coli S88 Genome accession: NC_011742 Putative virulence/resistance : Virulence Product : superfamily I DNA helicase Function : - COG functional category : L : Replication, recombination and repair COG ID : COG1112 EC number : - Position : 3221776 - 3225303 bp Length : 3528 bp Strand : + Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; Product type pe : enzyme DNA sequence : ATGGGACTGGTAATGGATGAAAATGCTTTAGGGTTTGCCTCATACTGGCGCAACTCGCTTGCGGATGCTGAGTCAGGAAA GGGCAGTTTTGAACGGAAAGACGCCAAAAATTTCACTCACTGGCATGGGATAGCGGCGGGACGTCTTGACGAAGCGATTG TCAGTAAATTTTTTGAGGGAGAAAAAGACGATGTCGAAACGGTCGATGTCGTCTTGCGCCCAAAAGTTTATTTCCGGTTA CTGCAGCATGGTAAGGACCGTTCCGCAGGCGCACCTGATATTGTTACCCCGCTAGTGACGCCAGCCTTGCTAAGCCGTGA GGGTTTTTTATATCCGACGCCAGCGACCTCCATTCCCAGAGACCTGCTTGAACCTTTGCCAAAAGGAGCATTTTCGATTG GTGAGATTGGGCAGTATGACAAATACAAGACAATCCATACCTCGTTCTCTATCAACTTTGATGACAGCATTGATAAGACT GCCGAAACGGATGAAGAACGGGAAGCACGATATGCAGCCTTGCAGCAGGAGTGGCGTCAATATCTGGATGATTCAGAGAG GCTGCTGAAGAACGTTGCCGGCGACTGGATTAAAAATCCTGAGCAATATGAACTCGCTGAGCACGGTTATATTGTTAAAA CGGCGCAATCTGGTGGTGCCAGTTTCCATATCCTTTCGCTTTATGATCACCTGCTTGTTTGCAAGAAGGATGTGCCGCTC TTCAATCGCTTCGCCTCGCGAGAGGTTCATGCTGCAGAGTCTTTGCTGGCCCCAGGAGCAAAATTCAGCGACAGGCTTGG ACACTCCGGAGATAAGTTTCCGCTGGCAAAGGCTCAGCGCGATGCCTTAAGCCATTTTCTGGATGCAAGACATGGCGATA TCCTTGCTGTTAATGGACCTCCGGGAACCGGAAAAACCACGCTGGTGCTTTCTATCATCGCCACGCAGTGGGCCAGAGCG GCTCTCGAAAAATCTGAGCCTCCGGTTATTATCGCGACTTCAACGAACAACCAGGCTGTAACGAACATTATCGAAGCGTT CGGGAAAGATTTTTCACAGGGCACTGGTGCAATGGCCGGACGATGGTTACCTGAGCTGAAAAGCTTCGGCGCTTATTTTC CCTCAAGCACTCGTAAAGCTGAGGCAGCCAAAAAATATCAAACTGAAGATTTCTTCAACCAGGTTGAGTCAAAAGAGTAT GTAGAGGATGCACTGCTGTTTTATCTCGAGAAAGCTAAGGCAGCTTTTCCTGAAAAAGAGTGTTCATCCCCTGAAAAGGT CATTGAACTCCTGCATGGTCAGTTGGTAGCAAAATCCGAGCAATTGAAAAGACTGAACGCAACATGGCAAACGTTAAGCC AGGTACGGGCTGCGCGTGAGCTTATTGCTAATGATATTGAGCAATATCTCGATAATTTAAATAAATTACTTTCCGGACAA GAACAAAAAGTCACTCTACTGAAGAGTGCTAAAACGGAATGGAAAAAATATCGCGCCGGTGAATCACTGATCTATTCATT ATTTTCCTGGCTCCCAGCGGTTCGTAGTAAGCGACAGTACCAAATACAGCTGTTTCTCGAAGATAAATTAGGCGCGCTGA TTGCAGGAAATCAGTGGTCTGATCCTGAAACTATCGAACGTAATATTGATGGGCTGCTCAATTCCGCTGAGCGCGAGCAA ACAACATACCGGCAGCAGATTGACTCCGCCCATGAAATCGTTCTTAAAGAACAGCAGGCGGTTCAGGAGTGGCAGAGGCT GGCATTTGATTTAGGGTATGAGGGCGACGAGGAACTGAGCTTCTCACAGGCCGATGAACTGGCTGATACGCAGATTCGCT TCCCTGCATTTTTACTGACGACTCACTACTGGGAAGGTCGTTGGCTGATGGATATGGCCAGAATTGATGATCTGCAGGAA GAGAAGAAGAAAAAAGGCGCTAAAGGGGTAACCGCCCGTTGGCAACGTCGAATGAAACTCACTCCATGTGTGGTGATGAC ATGCTATATGCTGCCCGGCAATATGCAGATAAGTGAGCACAAAGGACAACGTAAATTCGAGAAAAGTTATTTGTATGATT TTGCCGATTTACTCATTGTCGATGAAGCCGGGCAGGTGCTTCCTGAAGTGGCTGCTGCCTCGTTTGCATTAGCTAAGAAG GCATTAGTGATTGGCGATACGGAGCAGATCCCGCCAATATGGAGTATTGCTCCCGCGATTGATGTCGGTAACATGCTTGC GGAAAAAATTCTGTCTGGCAGTACGCAAGAAGAGATTACCGAGAAATATACGGCAATCGCAGACCTTGGTAAAAGTGCCG CATCTGGCAGCGTTATGAAAATAGCGCAGTTTGCTTCGCGCTATCAATATGATCCCGAACTGGCTCGTGGTATGTACCTA TATGAACACCGCCGGTGCTTCGATAATATTATTGGATACTGTAATACGCTCTGCTATCACGGTAAGTTGTTGCCTAAAAG AGGGCGTGAAGAGAGCAATTTAATGCCCGCAATGGGTTATCTCCATATTGATGGTAAAGGAGAGCAGGCAAGTAGTGGAA GTAGATATAATTTGCTTGAGGCTGAAACGATAGCGGCCTGGTTGGCAGAGAACCAGCAAAATATTGAAGCGCATTACGGC AAATCGCTTCATGAAGTTGTCGGTATCGTGACGCCTTTTAGCGCGCAGGTATCCACTATCAAACAGGCGCTGGGTAAACA AGGTATCAGTACGGGCGCGAATGAAAAGTCGCTCACAGTGGGCACCGTGCACTCTCTTCAGGGAGCGGAAAGAGCGATTG TTATATTCTCGCCAGTCTATTCAAAACATGAAGACGGCGGGTTTATTGATAGCGATAACAGCATGCTGAATGTTGCAGTC TCCCGTGCGAAGGATAGCTTCCTGGTCTTCGGCGATATGGATCTGTTTGAGGTCCAGCCAGCCTCATCTCCACGGGGATT ACTGGCAAAATACCTCTTTGAGTCAGAGAAGAATGCGCTCTCTTTTGATTATAAAGAGCGTAAGGATTTAAAAACTTCCG AGACCAAAATCTACACACTTCATGGTGTGGAGCAGCATGATAACTTCCTGAATCAGACGTTCGAAAATACCGATAAACAC ATCACGATAGTTTCTCCATGGCTGACCTGGCAAAAGCTGGAGCAAACCGGTTTTCTTGATTCTATGATTGCGGCGTGTTC ACGTGGTATTAACGTCACGATAGTCACTGACAGAAGCTACAACACTGAACATAAAGATTTTGAGAAGCGAAAAGAGAAGC AGCAGAACCTTAAAGCGGCGCTGGAGAAACTGAATGCGCTGGGTATTGCTACAAAGCTGGTAAACCGTGTTCATAGCAAA ATTGTTATTGGTGATGATGGTTTGCTATGCGTGGGATCGTTCAACTGGTTTAGTGCGACACGGGAAGCGCGATATGAACG ATACGATACCTCAATGGTTTATTGCGGTGATAACCTGAAGGGTGAGATTGAGGCTATTTATAATAGTCTTGAGAGGCGTC AGGTTTAG Protein sequence : MGLVMDENALGFASYWRNSLADAESGKGSFERKDAKNFTHWHGIAAGRLDEAIVSKFFEGEKDDVETVDVVLRPKVYFRL LQHGKDRSAGAPDIVTPLVTPALLSREGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTIHTSFSINFDDSIDKT AETDEEREARYAALQQEWRQYLDDSERLLKNVAGDWIKNPEQYELAEHGYIVKTAQSGGASFHILSLYDHLLVCKKDVPL FNRFASREVHAAESLLAPGAKFSDRLGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGPPGTGKTTLVLSIIATQWARA ALEKSEPPVIIATSTNNQAVTNIIEAFGKDFSQGTGAMAGRWLPELKSFGAYFPSSTRKAEAAKKYQTEDFFNQVESKEY VEDALLFYLEKAKAAFPEKECSSPEKVIELLHGQLVAKSEQLKRLNATWQTLSQVRAARELIANDIEQYLDNLNKLLSGQ EQKVTLLKSAKTEWKKYRAGESLIYSLFSWLPAVRSKRQYQIQLFLEDKLGALIAGNQWSDPETIERNIDGLLNSAEREQ TTYRQQIDSAHEIVLKEQQAVQEWQRLAFDLGYEGDEELSFSQADELADTQIRFPAFLLTTHYWEGRWLMDMARIDDLQE EKKKKGAKGVTARWQRRMKLTPCVVMTCYMLPGNMQISEHKGQRKFEKSYLYDFADLLIVDEAGQVLPEVAAASFALAKK ALVIGDTEQIPPIWSIAPAIDVGNMLAEKILSGSTQEEITEKYTAIADLGKSAASGSVMKIAQFASRYQYDPELARGMYL YEHRRCFDNIIGYCNTLCYHGKLLPKRGREESNLMPAMGYLHIDGKGEQASSGSRYNLLEAETIAAWLAENQQNIEAHYG KSLHEVVGIVTPFSAQVSTIKQALGKQGISTGANEKSLTVGTVHSLQGAERAIVIFSPVYSKHEDGGFIDSDNSMLNVAV SRAKDSFLVFGDMDLFEVQPASSPRGLLAKYLFESEKNALSFDYKERKDLKTSETKIYTLHGVEQHDNFLNQTFENTDKH ITIVSPWLTWQKLEQTGFLDSMIAACSRGINVTIVTDRSYNTEHKDFEKRKEKQQNLKAALEKLNALGIATKLVNRVHSK IVIGDDGLLCVGSFNWFSATREARYERYDTSMVYCGDNLKGEIEAIYNSLERRQV |
Gene | GenBank Accn | Product | Virulance or Resistance | PAI or REI | Alignment Type | E-val | Identity |
ORF_2 | AAZ04413.1 | superfamily I DNA helicase | Not tested | PAI I APEC-O1 | Protein | 0.0 | 100 |
APECO1_3532 | YP_854230.1 | superfamily I DNA helicase | Not tested | PAI I APEC-O1 | Protein | 0.0 | 100 |
unnamed | CAD42018.1 | hypothetical protein | Not tested | PAI II 536 | Protein | 0.0 | 98 |
S3169 | NP_838460.1 | superfamily I DNA helicase | Not tested | SHI-1 | Protein | 0.0 | 98 |
SF2965 | NP_708739.1 | superfamily I DNA helicase | Not tested | SHI-1 | Protein | 0.0 | 98 |
Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity |
ECS88_3251 | YP_002392872.1 | superfamily I DNA helicase | VFG0627 | Protein | 0.0 | 98 |
ECS88_3251 | YP_002392872.1 | superfamily I DNA helicase | VFG1537 | Protein | 0.0 | 98 |