Gene Information

Name : ECABU_c48370 (ECABU_c48370)
Accession : YP_006108747.1
Strain : Escherichia coli ABU 83972
Genome accession: NC_017631
Putative virulence/resistance : Virulence
Product : phospholipase D
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 4933674 - 4937189 bp
Length : 3516 bp
Strand : +
Note : -

DNA sequence :
ATGGATGAAAATGCTTTAGGGTTTACCTCATACTGGCGCAACTCGCTTGCGGATGCTGAGTCAGGAAAGGGCAGTTTTGA
ACGGAAAGACGCCAAAAATTTCACTCACTGGCATGGGATAGCGGCGGGACGTCTTGAGGAAGCGATTGTCAGTAAATTTT
TTAAGGGAGAAAAAGACGATGTCGAAACGGTCGATGTCATCTTGCGCCCAAAAGTTTATTTCCGGTTACTGCAGCATGGT
AAGGACCGTTCTGCAGGTGCGCCTGATATTGTTACCCCGATAGTGACGCCAGCCTTGCTAAGCCGTGAGGGTTTTTTATA
TCCGACGCCAGCGACCTCCATTCCCAGAGACCTGCTTGAACCTTTGCCAAAAGGAGCATTTTCGATTGGTGAGATTGGGC
AGTATGACAAATACAAGACGACCCATACCACGTTCTCTATCAACTTTGATGACAGCGTTGATAAGACTGCCGAAACGGAT
GAAGAACGGGAAGCACGATATGCCGCCTTGCAGCTGGAGTGGCGTCAATATCTGTATGACTCAGAGAGGCTACTGAAGAG
CGTTGCCGGCGACTGGATTGAAAAACCTGAGCAATATGAACTCGCTGAGCACGGTTATATTGTTAAAACGGCTCAATCTG
GCGGTGCCAGTTTCCATATCCTTTCTCTTTATGATCACCTGCTTGTTTGCAATAAGGATGTGCCGCTCTTCAATCGCTTC
GCCTCGCGAGAGGTTCATGCTGCAGAGTCTTTGCTGGCCCCTGGAGCAAAATTCAGCGACAGGCTTGGACACTCCGGAGA
TAAGTTTCCGCTGGCAAAGGCTCAGCGCGATGCCTTAAGCCATTTTCTGGATGCTAGACATGGCGATATCCTTGCTGTTA
ATGGCCCTCCGGGAACCGGAAAAACCACGCTGGTGCTTTCTATCATCGCCACGCAGTGGGCCAGAGCGGCTCTCGAAAAA
TCTGAGCCTCCGGTTATTATCGCGACTTCAACGAATAACCAGGCTGTAACGAACATTATTGAGGCATTCGGGAAAGACTT
TTCGCAAGGTTCAGGTGCGATGGCCGGGCGATGGTTGCCAGAGCTGAAAAGCTTCGGCGCTTATTTTCCCTCAAGCAGCC
GTAAAGCTGAAGCAGCCAAAAAATATCAAACTGAAGATTTCTTCAACCAGGTTGAGTCAAAAGAGTATGTAGAAGATGCA
CTGCTGTTTTATCTGGAAAAGGCTAAGGCAGCCTTTCCTGAAAAAGAGTGTTCATCCCCTGAAAAGGTCATTGAACTCCT
GCATGGTCAGTTGGCAGCAAAATCCGAGCAACTGATAAGACTGAACGCAACATGGCAAACGTTAAGCCAGATTCGGGCTG
CGCGTGAGCTTATTGCTAATGATATTGAGCAATATCTCAATAATTTAAATAAATTACTTTCCGGACAAGAACAAAAAATC
ACTCTACTGAAGAGTGCTAAAACGGAATGGAAAAAATATCGCGCCGGTGAATCACTGATCTATTCATTATTTTCCTGGCT
CCCGGCGGTTCGCAGTAAGCGACAGTACCAAATACAACTGTTTCTCGAAGATAAATTAGGTGCGCTGATTGCAGGAAATC
AGTGGTCTGATCCTGAAACTATCGAACGTAATATTGATGGGCTGCTCAATTCCGCTGAGCGCGAGCAAACAACATACCGG
CAGCAGATTGACTCCGCCCATGAAATCATTCTTAAAGAACAGCAGGCGGTTCAGGAGTGGCAGAGACTGGCTCTTGATTT
AGGGTATGAGGGCGACGAGGAACTGAGCTTCTCACAGGCCGATGAACTGGCTGATACGCAGATTCGCTTCCCTGCATTTT
TACTGACGACTCACTACTGGGAAGGTCGTTGGCTGATGGATATGGCCAGCATTGATGATCTGCAGGAAGAGAAGAAGAAA
AAAGGCGCTAAAGGGGTAACCGCCCGTTGGCAACGTCGAATGAAACTCACTCCATGTGTGGTGATGACATGCTATATGCT
GCCCGGCAATATGCAGATAAGTGAGCACAAAGGACAGCGTAAATTCGAGAAAAGTTATTTGTATGATTTTGCCGATTTAC
TCATTGTCGATGAAGCCGGGCAGGTGCTTCCTGAAGTGGCTGCTGCCTCGTTTGCATTAGCTAAGAAGGCATTAGTGATT
GGCGATACGGAGCAGCTCCCGCCAATATGGAGTATTGCTCCTGCGATTGATGTCGGTAACATGCTGGCGGAAAAAATTCT
GTCTGGCAGTACGCAAGAAGAGATTACCGCGAAATATACGGCAATCGCAGACCTTGGTAAAAGTGCCGCATCTGGCAGCG
TTATGAAAATAGCGCAGTTTGCTTCACGCTATCAATATGATCCCGAACTGGCTCGTGGTATGTACCTATATGAACACCGC
CGGTGCTACGACAATATTATTGGATACTGTAATACGCTCTGCTATCACGGTAAGTTGTTGCCTAAAAGAGGGCGTGAAGA
GAGCAATTTAATGCCCGAAATGGGGTATCTCCATATTGATGGTAAAGGTGAGCTGGCAAGTAGTGGAAGTCGATATAATT
TGCTTGAGGCTGAAACGATAGCGGTCTGGTTGGCAGAGAACCAGCAAAATATTGAAGCGCATTACGGTAAATCGCTTCAT
GAAGTTGTCGGTATCGTGACGCCTTTTAGCGCTCAGGTATCCACTATCAAACAGGTGCTGGGCAAACAAGGTATCAGTAC
AGGCGCGAATGAAAAATCGCTCACAGTGGGCACCGTGCACTCTCTTCAGGGAGCGGAAAGAGCGATTGTGATATTCTCGC
CAGTCTATTCAAAACATGAAGACGGCGGGTTTATTGATAGCGATAACAGCATGCTGAATGTTGCAGTCTCCCGTGCGAAG
GACAGTTTTCTGGTCTTCGGCGATATGGATCTGTTTGAGGTCCAGCCAGCCTCATCTCCACGGGGATTACTGGCAAAATA
CCTCTTTGAGTCAGAGAAGAATGCGCTCTCTTTTGATTATAAAGAGCGTAAGGATTTAAAAACCGCCGGGACCAAAATCT
ACACACTTCATGGTGTGGAGCAACATGATAATTTCCTGAATCAGACATTTGAAAATACCAGTAAACACATCACGATAGTT
TCTCCATGGCTGACCTGGCAAAGGCTGGAGCAAACCGGTTTTCTTGATTCCATGATTGCGGCGTGTTCACGTGGAATTAA
CGTCACGATAGTCACTGACAGAAGCTACAACACTGAACATAATGATTTTGAGAAGCGAAAAGAGAAGCAGCAGAACTTTA
AAGCGGCGCTGGAGAAACTGAATGCGCTGGGTATTGCTACAAAGCTAGTAAAACGTGTTCATAGCAAAATTGTTATTGGT
GATGATGGTTTGCTGTGCGTGGGATCGTTCAACTGGTTTAGTGCGACACGGGAAGCGCGATATGAACGATACGATACATC
AATGGTTTATTGCGGTGATAACCTGAAGGGTGAGATTGAGGCTATTTATAATAGTCTTGAGAGGCGTCAGGTTTAG

Protein sequence :
MDENALGFTSYWRNSLADAESGKGSFERKDAKNFTHWHGIAAGRLEEAIVSKFFKGEKDDVETVDVILRPKVYFRLLQHG
KDRSAGAPDIVTPIVTPALLSREGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTTHTTFSINFDDSVDKTAETD
EEREARYAALQLEWRQYLYDSERLLKSVAGDWIEKPEQYELAEHGYIVKTAQSGGASFHILSLYDHLLVCNKDVPLFNRF
ASREVHAAESLLAPGAKFSDRLGHSGDKFPLAKAQRDALSHFLDARHGDILAVNGPPGTGKTTLVLSIIATQWARAALEK
SEPPVIIATSTNNQAVTNIIEAFGKDFSQGSGAMAGRWLPELKSFGAYFPSSSRKAEAAKKYQTEDFFNQVESKEYVEDA
LLFYLEKAKAAFPEKECSSPEKVIELLHGQLAAKSEQLIRLNATWQTLSQIRAARELIANDIEQYLNNLNKLLSGQEQKI
TLLKSAKTEWKKYRAGESLIYSLFSWLPAVRSKRQYQIQLFLEDKLGALIAGNQWSDPETIERNIDGLLNSAEREQTTYR
QQIDSAHEIILKEQQAVQEWQRLALDLGYEGDEELSFSQADELADTQIRFPAFLLTTHYWEGRWLMDMASIDDLQEEKKK
KGAKGVTARWQRRMKLTPCVVMTCYMLPGNMQISEHKGQRKFEKSYLYDFADLLIVDEAGQVLPEVAAASFALAKKALVI
GDTEQLPPIWSIAPAIDVGNMLAEKILSGSTQEEITAKYTAIADLGKSAASGSVMKIAQFASRYQYDPELARGMYLYEHR
RCYDNIIGYCNTLCYHGKLLPKRGREESNLMPEMGYLHIDGKGELASSGSRYNLLEAETIAVWLAENQQNIEAHYGKSLH
EVVGIVTPFSAQVSTIKQVLGKQGISTGANEKSLTVGTVHSLQGAERAIVIFSPVYSKHEDGGFIDSDNSMLNVAVSRAK
DSFLVFGDMDLFEVQPASSPRGLLAKYLFESEKNALSFDYKERKDLKTAGTKIYTLHGVEQHDNFLNQTFENTSKHITIV
SPWLTWQRLEQTGFLDSMIAACSRGINVTIVTDRSYNTEHNDFEKRKEKQQNFKAALEKLNALGIATKLVKRVHSKIVIG
DDGLLCVGSFNWFSATREARYERYDTSMVYCGDNLKGEIEAIYNSLERRQV

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed CAD42018.1 hypothetical protein Not tested PAI II 536 Protein 0.0 99
S3169 NP_838460.1 superfamily I DNA helicase Not tested SHI-1 Protein 0.0 99
SF2965 NP_708739.1 superfamily I DNA helicase Not tested SHI-1 Protein 0.0 99
APECO1_3532 YP_854230.1 superfamily I DNA helicase Not tested PAI I APEC-O1 Protein 0.0 97
ORF_2 AAZ04413.1 superfamily I DNA helicase Not tested PAI I APEC-O1 Protein 0.0 97

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
ECABU_c48370 YP_006108747.1 phospholipase D VFG0627 Protein 0.0 99
ECABU_c48370 YP_006108747.1 phospholipase D VFG1537 Protein 0.0 99