Gene Information

Name : ECP_4523 (ECP_4523)
Accession : YP_672360.1
Strain : Escherichia coli 536
Genome accession: NC_008253
Putative virulence/resistance : Virulence
Product : superfamily I DNA helicases
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG1112
EC number : -
Position : 4737132 - 4740647 bp
Length : 3516 bp
Strand : +
Note : -

DNA sequence :
ATGGATAAAAATGCTTTAGGGTTTGCCTCATACTGGCGCAACTCGCTTGCGGATGCTGAGTCAGGAAAGGGCAGTTTTAA
ACGGAAAGACGCCCAAAATTTCACTCACTGGCATGGGATAGCGGCGGGACGTCTTGACGAAGCGATTGTCAGTAAATTTT
TTGAGGGAGAAAAAGACGATGTCGAAACGGTCGATGTCATCTTGCGCCCAAAAGTTTATTTCCGGTTACTGCAGCATGGT
AAGGACCGTTCCGCAGGCGCGCCTGATATTGTTACCCCGATAGTGACGCCAGCCTTGCTAAGCCGTGAGGGTTTTTTATA
TCCGACGCCAGCGACATCCATTCCCAGAGACCTGCTTGAACCTTTGCCAAAAGGAGCATTTTCGATTGGTGAGATTGGGC
AGTATGACAAATACAAGACAATCCATACCTCATTCTCTATCAACTTTGATGACGGCATTGATAAGACTGCCGAAACGGAT
GAAGAACGGGAAGCACGATATGCAGCCTTGCAGCAGGAGTGGCGTCAATATCTGGATGATTCAGAGAGGCTACTGAAGAA
CGTTGCCGGCGACTGGATTAAAAATCCTGAGCAATATGAACTCGCTGAGCACGGTTATATTGTTAAAACGGCGCAATCTG
GCGGTGCCAGTTTCCATATCCTTTCTCTTTATGAACACCTGCTTGTTTGCAAGAAGGATGTGCCGCTCTTCAATCGCTTC
GCCTCGCGAGAGGTTCATGCTGCAGAGTCTTTGCTGGCCCCAGGCGCAAAATTCAGCGACAGGCTTGGACACTCCGGAGA
TAAGTTTCCGCTGGCAAAGGCTCAGCGCGATGCCTTAAGCCATTTTCTGGATGCAAGACATGGCGATATCCTTGCTGTTA
ATGGCCCTCCGGGAACCGGAAAAACCACGCTGGTGCTTTCTATCATCGCCACGCAGTGGGCCAGAGCAGCTCTCGAAAAA
TCTGAGCCTCCGGTTATTATCGCGACTTCAACGAATAACCAGGCTGTAACGAACATTATTGAGGCATTCGGGAAAGACTT
TTCGCAAGGTTCAGGTGCGATGGCCGGGCGATGGTTGCCAGAGCTGAAAAGTTTCGGCGCTTATTTTCCCTCAAGCAGCC
GTAAAGCTGAAGCAGCCAAAAAATATCAAACTGAAGATTTCTTCAACCAGGTTGAGTCAAAAGAGTATGTAGAAGATGCA
CTGCTGTTTTATCTCGAGAAAGCTAAGGCAGCTTTTCCTGAAAAAGAGTGTTCATCCCCTGAAAAGGTCATTGAACTCCT
GCATGGTCAGTTGGCAGCAAAATCCGAGCAACTGATAAGACTGAACGCAACATGGCAAACGTTAAGCCAGGTACGGGCTG
CGCGTGAGCTTATTGCTAACGACATTGAGCAATATCTCGATAATTTAAATAAATTACTTTCCGGACAAGAACAAAAAATC
ACTCTACTGAAAAGTGCTAAAACGGAATGGAAAAAATATCGCGCCGGTGAATCACTGATCTATTCATTATTTTCCTGGCT
CCCGGCGGTTCGCAGTAAGCGACAGTACCAAATACAACTGTTTCTCGAAGATAAATTAGGTGCGCTGATTGCAGGAAATC
AGTGGTCTGATCCTGAAACTATCGAACGTAATATTGATGGGCTGCTCAATTCCGCTGAGCGCGAGCAAACAACATACCGG
CAGCAGATTGACTCCGCCCATGAAATCATTCTTAAAGAACAGCAGGCGGTTCAGGAGTGGCAGAGACTGGCTCTTGATTT
AGGGTATGAGGGCGACGAGGAACTGAGCTTCTCACTGGCCGATGAACTGGCTGATACGCAGATTCGCTTCCCTGCATTTT
TACTGACGACTCACTACTGGGAAGGTCGTTGGCTGATGGATATGGCCAGCATTGATGATCTGCAGGAAGAGAAGAAGAAA
AAAGGCGCTAAAGGGGTAACCGCCCGTTGGCAACGTCGAATGAAACTCACTCCATGTGTGGTGATGACATGCTATATGCT
GCCCGGCAATATGCAGATAAGTGAGCACAAAGGACAGCGTAAATTCGAGAAAAGTTATTTGTATGATTTTGCCGATTTAC
TCATTGTCGATGAAGCCGGGCAGGTGCTTCCTGAAGTGGCTGCTGCCTCGTTTGCATTAGCTAAGAAGGCATTAGTGATT
GGCGATACGGAGCAGCTCCCGCCAATATGGAGTATTGCTCCTGCGATTGATGTCGGTAACATGCTGGCGGAAAAAATTCT
GTCTGGCAGTACGCAAGAAGAGATTACCGAGAAATATACGGCAATCGCAGACCTTGGTAAAAGTGCCGCATCTGGCAGCG
TTATGAAAATAGCGCAGTTTGCTTCACGCTATCAATATGATCCCGAACTGGCTCGTGGTATGTACCTATATGAACACCGC
CGGTGCTACGACAATATTATTGGATACTGTAATACGCTCTGCTATCACGGTAAGTTGTTGCCTAAAAGAGGGCGTGAAGA
GAGCAATTTAATGCCCGAAATGGGGTATCTCCATATTGATGGTAAAGGTGAGCTGGCAAGTAGTGGAAGTCGATATAATT
TGCTTGAGGCTGAAACGATAGCGGTCTGGTTGGCAGAGAACCAGCAAAATATTGAAGCGCATTACGGTAAATCGCTTCAT
GAAGTTGTCGGTATCGTGACGCCTTTTAGCGCTCAGGTATCCACTATCAAACAGGTGCTGGGCAAACAAGGTATCAGTAC
AGGCGCGAATGAAAAATCGCTCACAGTGGGCACCGTGCACTCTCTTCAGGGAGCGGAAAGAGCGATTGTGATATTCTCGC
CAGTCTATTCAAAACATGAAGACGGCGGGTTTATTGATAGCGATAACAGCATGCTGAATGTTGCAGTCTCCCGTGCGAAG
GACAGTTTTCTGGTCTTCGGCGATATGGACCTGTTTGAGGTCCAGCCAGCCTCATCTCCACGGGGATTACTGGCAAAATA
CCTCTTTGAGTCAGAGAAGAATGCGCTCTCTTTTGATTATAAAGAGCGTAAGGATTTAAAAACCGCCGGGACCAAAATCT
ACACACTTCATGGTGTGGAGCAACATGATAATTTCCTGAATCAGACATTTGAAAATACCAGTAAACACATCACGATAGTT
TCTCCATGGCTGACCTGGCAAAGGCTGGAGCAAACCGGTTTTCTTGATTCCATGATTGCGGCGTGTTCACGTGGAATTAA
CGTCACGATAGTCACTGACAGAAGCTACAACACTGAACATAATGATTTTGAGATGCGAAAAGAGAAGCAGCAAAACTTTA
AAGCGGCGCTGGAGAAACTGAATGCGCTGGGTATTGCTACAAAGCTGGTAAACCGTGTTCATAGCAAAATTGTTATTGGT
GATGATGGTTTGCTGTGCGTGGGATCGTTCAACTGGTTTAGTGCGACACGGGAAGCGCGATATGAACGATACGATACCTC
AATGGTTTATTGCGGTGATAACCTGAAGGGTGAGATTGAGGCTATTTATAATAGTCTTGAGAGGCGTCAGGTTTAG

Protein sequence :
MDKNALGFASYWRNSLADAESGKGSFKRKDAQNFTHWHGIAAGRLDEAIVSKFFEGEKDDVETVDVILRPKVYFRLLQHG
KDRSAGAPDIVTPIVTPALLSREGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTIHTSFSINFDDGIDKTAETD
EEREARYAALQQEWRQYLDDSERLLKNVAGDWIKNPEQYELAEHGYIVKTAQSGGASFHILSLYEHLLVCKKDVPLFNRF
ASREVHAAESLLAPGAKFSDRLGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGPPGTGKTTLVLSIIATQWARAALEK
SEPPVIIATSTNNQAVTNIIEAFGKDFSQGSGAMAGRWLPELKSFGAYFPSSSRKAEAAKKYQTEDFFNQVESKEYVEDA
LLFYLEKAKAAFPEKECSSPEKVIELLHGQLAAKSEQLIRLNATWQTLSQVRAARELIANDIEQYLDNLNKLLSGQEQKI
TLLKSAKTEWKKYRAGESLIYSLFSWLPAVRSKRQYQIQLFLEDKLGALIAGNQWSDPETIERNIDGLLNSAEREQTTYR
QQIDSAHEIILKEQQAVQEWQRLALDLGYEGDEELSFSLADELADTQIRFPAFLLTTHYWEGRWLMDMASIDDLQEEKKK
KGAKGVTARWQRRMKLTPCVVMTCYMLPGNMQISEHKGQRKFEKSYLYDFADLLIVDEAGQVLPEVAAASFALAKKALVI
GDTEQLPPIWSIAPAIDVGNMLAEKILSGSTQEEITEKYTAIADLGKSAASGSVMKIAQFASRYQYDPELARGMYLYEHR
RCYDNIIGYCNTLCYHGKLLPKRGREESNLMPEMGYLHIDGKGELASSGSRYNLLEAETIAVWLAENQQNIEAHYGKSLH
EVVGIVTPFSAQVSTIKQVLGKQGISTGANEKSLTVGTVHSLQGAERAIVIFSPVYSKHEDGGFIDSDNSMLNVAVSRAK
DSFLVFGDMDLFEVQPASSPRGLLAKYLFESEKNALSFDYKERKDLKTAGTKIYTLHGVEQHDNFLNQTFENTSKHITIV
SPWLTWQRLEQTGFLDSMIAACSRGINVTIVTDRSYNTEHNDFEMRKEKQQNFKAALEKLNALGIATKLVNRVHSKIVIG
DDGLLCVGSFNWFSATREARYERYDTSMVYCGDNLKGEIEAIYNSLERRQV

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed CAD42018.1 hypothetical protein Not tested PAI II 536 Protein 0.0 100
S3169 NP_838460.1 superfamily I DNA helicase Not tested SHI-1 Protein 0.0 98
SF2965 NP_708739.1 superfamily I DNA helicase Not tested SHI-1 Protein 0.0 98
APECO1_3532 YP_854230.1 superfamily I DNA helicase Not tested PAI I APEC-O1 Protein 0.0 98
ORF_2 AAZ04413.1 superfamily I DNA helicase Not tested PAI I APEC-O1 Protein 0.0 98

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
ECP_4523 YP_672360.1 superfamily I DNA helicases VFG1537 Protein 0.0 100
ECP_4523 YP_672360.1 superfamily I DNA helicases VFG0627 Protein 0.0 98