Gene Information

Name : APECO1_3532 (APECO1_3532)
Accession : YP_854230.1
Strain : Escherichia coli APEC O1
Genome accession: NC_008563
Putative virulence/resistance : Virulence
Product : superfamily I DNA helicase
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG1112
EC number : -
Position : 3306992 - 3310519 bp
Length : 3528 bp
Strand : +
Note : -
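
The Position, Length and Strand fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the database record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a coding sequence whose final codon is a stop codon.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates taken from the Position field of this record.
length_bp = gene_length(3306992, 3310519)
protein_len = expected_protein_length(length_bp)
print(length_bp, protein_len)
```

For this record the sketch gives 3528 bp, which agrees with the Length field, and 1175 residues for the encoded protein.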

DNA sequence :
ATGGGACTGGTAATGGATGAAAATGCTTTAGGGTTTGCCTCATACTGGCGCAACTCGCTTGCGGATGCTGAGTCAGGAAA
GGGCAGTTTTGAACGGAAAGACGCCAAAAATTTCACTCACTGGCATGGGATAGCGGCGGGACGTCTTGACGAAGCGATTG
TCAGTAAATTTTTTGAGGGAGAAAAAGACGATGTCGAAACGGTCGATGTCGTCTTGCGCCCAAAAGTTTATTTCCGGTTA
CTGCAGCATGGTAAGGACCGTTCCGCAGGCGCACCTGATATTGTTACCCCGCTAGTGACGCCAGCCTTGCTAAGCCGTGA
GGGTTTTTTATATCCGACGCCAGCGACCTCCATTCCCAGAGACCTGCTTGAACCTTTGCCAAAAGGAGCATTTTCGATTG
GTGAGATTGGGCAGTATGACAAATACAAGACAATCCATACCTCGTTCTCTATCAACTTTGATGACAGCATTGATAAGACT
GCCGAAACGGATGAAGAACGGGAAGCACGATATGCAGCCTTGCAGCAGGAGTGGCGTCAATATCTGGATGATTCAGAGAG
GCTGCTGAAGAACGTTGCCGGCGACTGGATTAAAAATCCTGAGCAATATGAACTCGCTGAGCACGGTTATATTGTTAAAA
CGGCGCAATCTGGTGGTGCCAGTTTCCATATCCTTTCGCTTTATGATCACCTGCTTGTTTGCAAGAAGGATGTGCCGCTC
TTCAATCGCTTCGCCTCGCGAGAGGTTCATGCTGCAGAGTCTTTGCTGGCCCCAGGAGCAAAATTCAGCGACAGGCTTGG
ACACTCCGGAGATAAGTTTCCGCTGGCAAAGGCTCAGCGCGATGCCTTAAGCCATTTTCTGGATGCAAGACATGGCGATA
TCCTTGCTGTTAATGGACCTCCGGGAACCGGAAAAACCACGCTGGTGCTTTCTATCATCGCCACGCAGTGGGCCAGAGCG
GCTCTCGAAAAATCTGAGCCTCCGGTTATTATCGCGACTTCAACGAACAACCAGGCTGTAACGAACATTATCGAAGCGTT
CGGGAAAGATTTTTCACAGGGCACTGGTGCAATGGCCGGACGATGGTTACCTGAGCTGAAAAGCTTCGGCGCTTATTTTC
CCTCAAGCACTCGTAAAGCTGAGGCAGCCAAAAAATATCAAACTGAAGATTTCTTCAACCAGGTTGAGTCAAAAGAGTAT
GTAGAGGATGCACTGCTGTTTTATCTCGAGAAAGCTAAGGCAGCTTTTCCTGAAAAAGAGTGTTCATCCCCTGAAAAGGT
CATTGAACTCCTGCATGGTCAGTTGGTAGCAAAATCCGAGCAATTGAAAAGACTGAACGCAACATGGCAAACGTTAAGCC
AGGTACGGGCTGCGCGTGAGCTTATTGCTAATGATATTGAGCAATATCTCGATAATTTAAATAAATTACTTTCCGGACAA
GAACAAAAAGTCACTCTACTGAAGAGTGCTAAAACGGAATGGAAAAAATATCGCGCCGGTGAATCACTGATCTATTCATT
ATTTTCCTGGCTCCCAGCGGTTCGTAGTAAGCGACAGTACCAAATACAGCTGTTTCTCGAAGATAAATTAGGCGCGCTGA
TTGCAGGAAATCAGTGGTCTGATCCTGAAACTATCGAACGTAATATTGATGGGCTGCTCAATTCCGCTGAGCGCGAGCAA
ACAACATACCGGCAGCAGATTGACTCCGCCCATGAAATCGTTCTTAAAGAACAGCAGGCGGTTCAGGAGTGGCAGAGGCT
GGCATTTGATTTAGGGTATGAGGGCGACGAGGAACTGAGCTTCTCACAGGCCGATGAACTGGCTGATACGCAGATTCGCT
TCCCTGCATTTTTACTGACGACTCACTACTGGGAAGGTCGTTGGCTGATGGATATGGCCAGAATTGATGATCTGCAGGAA
GAGAAGAAGAAAAAAGGCGCTAAAGGGGTAACCGCCCGTTGGCAACGTCGAATGAAACTCACTCCATGTGTGGTGATGAC
ATGCTATATGCTGCCCGGCAATATGCAGATAAGTGAGCACAAAGGACAACGTAAATTCGAGAAAAGTTATTTGTATGATT
TTGCCGATTTACTCATTGTCGATGAAGCCGGGCAGGTGCTTCCTGAAGTGGCTGCTGCCTCGTTTGCATTAGCTAAGAAG
GCATTAGTGATTGGCGATACGGAGCAGATCCCGCCAATATGGAGTATTGCTCCCGCGATTGATGTCGGTAACATGCTTGC
GGAAAAAATTCTGTCTGGCAGTACGCAAGAAGAGATTACCGAGAAATATACGGCAATCGCAGACCTTGGTAAAAGTGCCG
CATCTGGCAGCGTTATGAAAATAGCGCAGTTTGCTTCGCGCTATCAATATGATCCCGAACTGGCTCGTGGTATGTACCTA
TATGAACACCGCCGGTGCTTCGATAATATTATTGGATACTGTAATACGCTCTGCTATCACGGTAAGTTGTTGCCTAAAAG
AGGGCGTGAAGAGAGCAATTTAATGCCCGCAATGGGTTATCTCCATATTGATGGTAAAGGAGAGCAGGCAAGTAGTGGAA
GTAGATATAATTTGCTTGAGGCTGAAACGATAGCGGCCTGGTTGGCAGAGAACCAGCAAAATATTGAAGCGCATTACGGC
AAATCGCTTCATGAAGTTGTCGGTATCGTGACGCCTTTTAGCGCGCAGGTATCCACTATCAAACAGGCGCTGGGTAAACA
AGGTATCAGTACGGGCGCGAATGAAAAGTCGCTCACAGTGGGCACCGTGCACTCTCTTCAGGGAGCGGAAAGAGCGATTG
TTATATTCTCGCCAGTCTATTCAAAACATGAAGACGGCGGGTTTATTGATAGCGATAACAGCATGCTGAATGTTGCAGTC
TCCCGTGCGAAGGATAGCTTCCTGGTCTTCGGCGATATGGATCTGTTTGAGGTCCAGCCAGCCTCATCTCCACGGGGATT
ACTGGCAAAATACCTCTTTGAGTCAGAGAAGAATGCGCTCTCTTTTGATTATAAAGAGCGTAAGGATTTAAAAACTTCCG
AGACCAAAATCTACACACTTCATGGTGTGGAGCAGCATGATAACTTCCTGAATCAGACGTTCGAAAATACCGATAAACAC
ATCACGATAGTTTCTCCATGGCTGACCTGGCAAAAGCTGGAGCAAACCGGTTTTCTTGATTCTATGATTGCGGCGTGTTC
ACGTGGTATTAACGTCACGATAGTCACTGACAGAAGCTACAACACTGAACATAAAGATTTTGAGAAGCGAAAAGAGAAGC
AGCAGAACCTTAAAGCGGCGCTGGAGAAACTGAATGCGCTGGGTATTGCTACAAAGCTGGTAAACCGTGTTCATAGCAAA
ATTGTTATTGGTGATGATGGTTTGCTATGCGTGGGATCGTTCAACTGGTTTAGTGCGACACGGGAAGCGCGATATGAACG
ATACGATACCTCAATGGTTTATTGCGGTGATAACCTGAAGGGTGAGATTGAGGCTATTTATAATAGTCTTGAGAGGCGTC
AGGTTTAG

Protein sequence :
MGLVMDENALGFASYWRNSLADAESGKGSFERKDAKNFTHWHGIAAGRLDEAIVSKFFEGEKDDVETVDVVLRPKVYFRL
LQHGKDRSAGAPDIVTPLVTPALLSREGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTIHTSFSINFDDSIDKT
AETDEEREARYAALQQEWRQYLDDSERLLKNVAGDWIKNPEQYELAEHGYIVKTAQSGGASFHILSLYDHLLVCKKDVPL
FNRFASREVHAAESLLAPGAKFSDRLGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGPPGTGKTTLVLSIIATQWARA
ALEKSEPPVIIATSTNNQAVTNIIEAFGKDFSQGTGAMAGRWLPELKSFGAYFPSSTRKAEAAKKYQTEDFFNQVESKEY
VEDALLFYLEKAKAAFPEKECSSPEKVIELLHGQLVAKSEQLKRLNATWQTLSQVRAARELIANDIEQYLDNLNKLLSGQ
EQKVTLLKSAKTEWKKYRAGESLIYSLFSWLPAVRSKRQYQIQLFLEDKLGALIAGNQWSDPETIERNIDGLLNSAEREQ
TTYRQQIDSAHEIVLKEQQAVQEWQRLAFDLGYEGDEELSFSQADELADTQIRFPAFLLTTHYWEGRWLMDMARIDDLQE
EKKKKGAKGVTARWQRRMKLTPCVVMTCYMLPGNMQISEHKGQRKFEKSYLYDFADLLIVDEAGQVLPEVAAASFALAKK
ALVIGDTEQIPPIWSIAPAIDVGNMLAEKILSGSTQEEITEKYTAIADLGKSAASGSVMKIAQFASRYQYDPELARGMYL
YEHRRCFDNIIGYCNTLCYHGKLLPKRGREESNLMPAMGYLHIDGKGEQASSGSRYNLLEAETIAAWLAENQQNIEAHYG
KSLHEVVGIVTPFSAQVSTIKQALGKQGISTGANEKSLTVGTVHSLQGAERAIVIFSPVYSKHEDGGFIDSDNSMLNVAV
SRAKDSFLVFGDMDLFEVQPASSPRGLLAKYLFESEKNALSFDYKERKDLKTSETKIYTLHGVEQHDNFLNQTFENTDKH
ITIVSPWLTWQKLEQTGFLDSMIAACSRGINVTIVTDRSYNTEHKDFEKRKEKQQNLKAALEKLNALGIATKLVNRVHSK
IVIGDDGLLCVGSFNWFSATREARYERYDTSMVYCGDNLKGEIEAIYNSLERRQV
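
The protein sequence above corresponds to a forward-frame translation of the DNA sequence. The following is a minimal Python sketch of such a translation, not the method used by the database: the names `CODON_TABLE` and `translate` are hypothetical, the standard codon assignments are assumed, and the input is assumed to contain only A, C, G and T (any other symbol raises a `KeyError`).

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a + strand coding sequence from frame 0 up to the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the DNA sequence in this record.
print(translate("ATGGGACTGGTAATGGATGAAAATGCTTTA"))  # MGLVMDENAL
```

Applied to the last 18 nucleotides of the DNA sequence (GAGAGGCGTCAGGTTTAG), the same function returns ERRQV, which matches the C-terminal residues of the protein sequence.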

• Homologs from PAI DB

Gene         GenBank Accn  Product                     Virulence or Resistance  PAI or REI     Alignment Type  E-val  Identity
APECO1_3532  YP_854230.1   superfamily I DNA helicase  Not tested               PAI I APEC-O1  Protein         0.0    100
ORF_2        AAZ04413.1    superfamily I DNA helicase  Not tested               PAI I APEC-O1  Protein         0.0    100
S3169        NP_838460.1   superfamily I DNA helicase  Not tested               SHI-1          Protein         0.0    98
SF2965       NP_708739.1   superfamily I DNA helicase  Not tested               SHI-1          Protein         0.0    98
unnamed      CAD42018.1    hypothetical protein        Not tested               PAI II 536     Protein         0.0    98

• Homologs from VFDB (virulence genes)

Gene         GenBank Accn  Product                     ID of source DB  Alignment Type  E-val  Identity
APECO1_3532  YP_854230.1   superfamily I DNA helicase  VFG0627          Protein         0.0    98
APECO1_3532  YP_854230.1   superfamily I DNA helicase  VFG1537          Protein         0.0    98