Gene Information

Name : ECIAI39_4598 (ECIAI39_4598)
Accession : YP_002410462.1
Strain : Escherichia coli IAI39
Genome accession: NC_011750
Putative virulence/resistance : Virulence
Product : putative superfamily I DNA helicase
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG1112
EC number : -
Position : 4796684 - 4800211 bp
Length : 3528 bp
Strand : -
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology; Product type pe : putative enzyme

DNA sequence :
ATGGGACTGGTAATGGATGAAAATGCTTTAGGGTTTGCCTCATACTGGCGCAACTCGCTTGCGGATGCTGAGTCAGGAAA
GGGCAGTTTTGAACGGAAAGACGCCAAAAATTTCACTCACTGGCATGGGATAGCGGCGGGACGTCTTGACGAAGCGATTG
TCAGTAAATTTTTTGAGGGAGAAAAAGACGATGTCGAAACGGTCGATGTCGTCTTGCGCCCAAAAGTTTATTTCCGGTTA
CTGCAGCATGGTAAGGACCGTTCCGCAGGCGCACCTGATATTGTTACCCCGCTAGTGACGCCAGCCTTGCTAAGCCGTGA
GGGTTTTTTATATCCGACGCCAGCGACCTCCATTCCCAGAGACCTGCTTGAACCTTTGCCAAAAGGAGCATTTTCGATTG
GTGAGATTGGGCAGTATGACAAATACAAGACAATCCATACCTCGTTCTCTATCAACTTTGATGACAGCATTGATAAGACT
GCCGAAACGGATGAAGAACGGGAAGCACGATATGCAGCCTTGCAGCAGGAGTGGCGTCAATATCTGGATGATTCAGAGAG
GCTGCTGAAGAACGTTGCCGGCGACTGGATTAAAAATCCTGAGCAATATGAACTCGCTGAGCACGGTTATATTGTTAAAA
CGGCGCAATCTGGTGGTGCCAGTTTCCATATCCTTTCGCTTTATGATCACCTGCTTGTTTGCAAGAAGGATGTGCCGCTC
TTCAATCGCTTCGCCTCGCGAGAGGTTCATGCTGCAGAGTCTTTGCTGGCCCCAGGAGCAAAATTCAGCGACAGGCTTGG
ACACTCCGGAGATAAGTTTCCGCTGGCAAAGGCTCAGCGCGATGCCTTAAGCCATTTTCTGGATGCAAGACATGGCGATA
TCCTTGCTGTTAATGGACCTCCGGGAACCGGAAAAACCACGCTGGTGCTTTCTATCATCGCCACGCAGTGGGCCAGAGCG
GCTCTCGAAAAATCTGAGCCTCCGGTTATTATCGCGACTTCAACGAACAACCAGGCTGTAACGAACATTATCGAAGCGTT
CGGGAAAGATTTTTCACAGGGCACTGGTGCAATGGCCGGACGATGGTTACCTGAGCTGAAAAGCTTCGGCGCTTATTTTC
CCTCAAGCACTCGTAAAGCTGAGGCAGCCAAAAAATATCAAACTGAAGATTTCTTCAACCAGGTTGAGTCAAAAGAGTAT
GTAGAGGATGCACTGCTGTTTTATCTCGAGAAAGCTAAGGCAGCTTTTCCTGAAAAAGAGTGTTCATCCCCTGAAAAGGT
CATTGAACTCCTGCATGGTCAGTTGGTAGCAAAATCCGAGCAATTGAAAAGACTGAACGCAACATGGCAAACGTTAAGCC
AGGTACGGGCTGCGCGTGAGCTTATTGCTAATGATATTGAGCAATATCTCGATAATTTAAATAAATTACTTTCCGGACAA
GAACAAAAAGTCACTCTACTGAAGAGTGCTAAAACGGAATGGAAAAAATATCGCGCCGGTGAATCACTGATCTATTCATT
ATTTTCCTGGCTCCCAGCGGTTCGTAGTAAGCGACAGTACCAAATACAGCTGTTTCTCGAAGATAAATTAGGCGCGCTGA
TTGCAGGAAATCAGTGGTCTGATCCTGAAACTATCGAACGTAATATTGATGGGCTGCTCAATTCCGCTGAGCGCGAGCAA
ACAACATACCGGCAGCAGATTGACTCCGCCCATGAAATCGTTCTTAAAGAACAGCAGGCGGTTCAGGAGTGGCAGAGGCT
GGCATTTGATTTAGGGTATGAGGGCGACGAGGAACTGAGCTTCTCACAGGCCGATGAACTGGCTGATACGCAGATTCGCT
TCCCTGCATTTTTACTGACGACTCACTACTGGGAAGGTCGTTGGCTGATGGATATGGCCAGAATTGATGATCTGCAGGAA
GAGAAGAAGAAAAAAGGCGCTAAAGGGGTAACCGCCCGTTGGCAACGTCGAATGAAACTCACTCCATGTGTGGTGATGAC
ATGCTATATGCTGCCCGGCAATATGCAGATAAGTGAGCACAAAGGACAACGTAAATTCGAGAAAAGTTATTTGTATGATT
TTGCCGATTTACTCATTGTCGATGAAGCCGGGCAGGTGCTTCCTGAAGTGGCTGCTGCCTCGTTTGCATTAGCTAAGAAG
GCATTAGTGATTGGCGATACGGAGCAGATCCCGCCAATATGGAGTATTGCTCCCGCGATTGATGTCGGTAACATGCTTGC
GGAAAAAATTCTGTCTGGCAGTACGCAAGAAGAGATTACCGAGAAATATACGGCAATCGCAGACCTTGGTAAAAGTGCCG
CATCTGGCAGCGTTATGAAAATAGCGCAGTTTGCTTCGCGCTATCAATATGATCCCGAACTGGCTCGTGGTATGTACCTA
TATGAACACCGCCGGTGCTTCGATAATATTATTGGATACTGTAATACGCTCTGCTATCACGGTAAGTTGTTGCCTAAAAG
AGGGCGTGAAGAGAGCAATTTAATGCCCGCAATGGGTTATCTCCATATTGATGGTAAAGGAGAGCAGGCAAGTAGTGGAA
GTAGATATAATTTGCTTGAGGCTGAAACGATAGCGGCCTGGTTGGCAGAGAACCAGCAAAATATTGAAGCGCATTACGGC
AAATCGCTTCATGAAGTTGTCGGTATCGTGACGCCTTTTAGCGCGCAGGTATCCACTATCAAACAGGCGCTGGGTAAACA
AGGTATCAGTACGGGCGCGAATGAAAAGTCGCTCACAGTGGGCACCGTGCACTCTCTTCAGGGAGCGGAAAGAGCGATTG
TTATATTCTCGTCAGTCTATTCAAAACATGAAGACGGCGGGTTTATTGATAGCGATAACAGCATGCTGAATGTTGCAGTC
TCCCGTGCGAAGGATAGCTTCCTGGTCTTCGGCGATATGGATCTGTTTGAGGTCCAGCCAGCCTCATCTCCACGGGGATT
ACTGGCAAAATACCTCTTTGAGTCAGAGAAGAATGCGCTCTCTTTTGATTATAAAGAGCGTAAGGATTTAAAAACTTCCG
AGACCAAAATCTACACACTTCATGGTGTGGAGCAGCATGATAACTTCCTGAATCAGACGTTCGAAAATACCGATAAACAC
ATCACGATAGTTTCTCCATGGCTGACCTGGCAAAAGCTGGAGCAAACCGGTTTTCTTGATTCTATGATTGCGGCGTGTTC
ACGTGGTATTAACGTCACGATAGTCACTGACAGAAGCTACAACACTGAACATAAAGATTTTGAGAAGCGAAAAGAGAAGC
AGCAGAACCTTAAAGCGGCGCTGGAGAAACTGAATGCGCTGGGTATTGCTACAAAGCTGGTAAACCGTGTTCATAGCAAA
ATTGTTATTGGTGATGATGGTTTGCTATGCGTGGGATCGTTCAACTGGTTTAGTGCGACACGGGAAGCGCGATATGAACG
ATACGATACCTCAATGGTTTATTGCGGTGATAACCTGAAGGGTGAGATTGAGGCTATTTATAATAGTCTTGAGAGGCGTC
AGGTTTAG

Protein sequence :
MGLVMDENALGFASYWRNSLADAESGKGSFERKDAKNFTHWHGIAAGRLDEAIVSKFFEGEKDDVETVDVVLRPKVYFRL
LQHGKDRSAGAPDIVTPLVTPALLSREGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTIHTSFSINFDDSIDKT
AETDEEREARYAALQQEWRQYLDDSERLLKNVAGDWIKNPEQYELAEHGYIVKTAQSGGASFHILSLYDHLLVCKKDVPL
FNRFASREVHAAESLLAPGAKFSDRLGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGPPGTGKTTLVLSIIATQWARA
ALEKSEPPVIIATSTNNQAVTNIIEAFGKDFSQGTGAMAGRWLPELKSFGAYFPSSTRKAEAAKKYQTEDFFNQVESKEY
VEDALLFYLEKAKAAFPEKECSSPEKVIELLHGQLVAKSEQLKRLNATWQTLSQVRAARELIANDIEQYLDNLNKLLSGQ
EQKVTLLKSAKTEWKKYRAGESLIYSLFSWLPAVRSKRQYQIQLFLEDKLGALIAGNQWSDPETIERNIDGLLNSAEREQ
TTYRQQIDSAHEIVLKEQQAVQEWQRLAFDLGYEGDEELSFSQADELADTQIRFPAFLLTTHYWEGRWLMDMARIDDLQE
EKKKKGAKGVTARWQRRMKLTPCVVMTCYMLPGNMQISEHKGQRKFEKSYLYDFADLLIVDEAGQVLPEVAAASFALAKK
ALVIGDTEQIPPIWSIAPAIDVGNMLAEKILSGSTQEEITEKYTAIADLGKSAASGSVMKIAQFASRYQYDPELARGMYL
YEHRRCFDNIIGYCNTLCYHGKLLPKRGREESNLMPAMGYLHIDGKGEQASSGSRYNLLEAETIAAWLAENQQNIEAHYG
KSLHEVVGIVTPFSAQVSTIKQALGKQGISTGANEKSLTVGTVHSLQGAERAIVIFSSVYSKHEDGGFIDSDNSMLNVAV
SRAKDSFLVFGDMDLFEVQPASSPRGLLAKYLFESEKNALSFDYKERKDLKTSETKIYTLHGVEQHDNFLNQTFENTDKH
ITIVSPWLTWQKLEQTGFLDSMIAACSRGINVTIVTDRSYNTEHKDFEKRKEKQQNLKAALEKLNALGIATKLVNRVHSK
IVIGDDGLLCVGSFNWFSATREARYERYDTSMVYCGDNLKGEIEAIYNSLERRQV

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
APECO1_3532 YP_854230.1 superfamily I DNA helicase Not tested PAI I APEC-O1 Protein 0.0 99
ORF_2 AAZ04413.1 superfamily I DNA helicase Not tested PAI I APEC-O1 Protein 0.0 99
unnamed CAD42018.1 hypothetical protein Not tested PAI II 536 Protein 0.0 98
S3169 NP_838460.1 superfamily I DNA helicase Not tested SHI-1 Protein 0.0 97
SF2965 NP_708739.1 superfamily I DNA helicase Not tested SHI-1 Protein 0.0 97

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
ECIAI39_4598 YP_002410462.1 putative superfamily I DNA helicase VFG1537 Protein 0.0 98
ECIAI39_4598 YP_002410462.1 putative superfamily I DNA helicase VFG0627 Protein 0.0 97