Gene Information

Name : polA (CDR20291_0964)
Accession : YP_003217463.1
Strain : Clostridium difficile R20291
Genome accession: NC_013316
Putative virulence/resistance : Unknown
Product : DNA polymerase I
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0749
EC number : -
Position : 1182350 - 1184998 bp
Length : 2649 bp
Strand : +
Note : has 3'-5' exonuclease, 5'-3' exonuclease and 5'-3' polymerase activities; primarily functions to fill gaps during DNA replication and repair
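
The Position and Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual convention for genome annotations, but not stated in this record) and that the final codon is a stop codon. The variable and function names are illustrative only.

```python
# Values copied from the record's Position and Length fields.
start, end = 1182350, 1184998   # assumed 1-based, inclusive coordinates
reported_length = 2649          # "Length" field, in bp


def span_length(start: int, end: int) -> int:
    """Length of an inclusive 1-based interval."""
    return end - start + 1


gene_length = span_length(start, end)
codons = gene_length // 3
residues = codons - 1           # assumes the last codon is a stop codon

print(gene_length, gene_length % 3, residues)  # prints: 2649 0 882
```

Under these assumptions the span is a whole number of codons and agrees with the reported length of 2649 bp.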

DNA sequence :
TTGGAAAAAAAATTAATTATAATAGATGGAAATTCAATAATAAACAGAGCCTTTTATGCTCTGCCAGAAATGAATAATAA
AGAAGGGTTAAAGACCAATGCAATATATGGATTTACAACAATGCTATTTAAAATGATAGATATATATAAACCAACACATA
TAAGTGTAGCCTTTGACAGAAAAGCACCTACATTTAGACATTTAGAATATAAAGAGTATAAGGCTGGAAGAAAGGGAATG
CCAGATGAACTGGCAGAACAATTACAGCCATTAAAAGACCTTTTAGATAAATTCAAAATAAATAGATTGGAAATAGATGG
GTATGAGGCAGACGACATTATAGGAACTGTTTCTAAAAAAGCAGAAAATAATGGGTATAAGGTATACATAGTAACTGGAG
ACAAAGATGCTATTCAACTTGCTTCAGATAATACAACTACTCTTATAACAAAAAAAGGAGTTGGAGAAGTTGAAGAATAC
AACTTTAATTCTGTAATTGAAAAATACGAGATGACTCCAGCACAATTTATTGATTTAAAAGGTCTTATGGGAGATAAGTC
AGATAATATACCTGGAGTACCTGGAATCGGTGAAAAAACAGGTATAAAGCTTATAAAAGAGTTTTCTAGTATAGAGAATT
TAATAGAGAATACAGATAAGCTTAAAGGTAGTGTAAAAAAGAAAATAGAGGAAAATAAAGAAATAGCTATTCAAAGTAAA
AGACTTGCAACAATTATAAGAGATGTGCCAATAGAAGTGGATTTAGATAGTATGGTTTTTGGCGATTATGATAAAGATGA
ACTTTTAGATAAATTTAGATACTTTGGATTTACAAGTCTCTTATCAAGATTAGTTGATTTGGTCGATTCAGATGACACAG
AACACGTAGAAGAAAAAATAGAAATATTAAAATTAAAAGATACAGAAAAATTTATAGATGAAGTAAACAAAAAAGGACAA
GTTATTTTAAAAACTGTTAGAGGACAAGGTAATATACTTGAAAAAAATATAATATATATTTTTATAAGTGTAGATGGAGA
AAAAATATATTATGTAGATTCAAATGAAGTGTCAAAGTTAGATAGTATATTTTCAAATAATGAAATTAAAAAGTATGGAT
ATAATTTAAAAGATGATTATATATCTTTAAAACCATATAATATAAACTTAGAAAATATGTATTTTGATATAACAATAGCT
GAGTACTTAATAGATTCAACTTCATCTACCTCATATGATTGTAGTGCAATAGCTATGAAGTATTTATCAAAAAAGGTTAA
ATCAAAAGAAGAATTATTAGGAAAGGGTGTTAAATCTAAAAACTATGAGGATTTAGAATTTGAGGATTTAGCAGAGTATA
TGGGTCAAATAATATGCACAGTTAAAGAAACAATTCCTATGATGGAAAGAAGTCTAATGGATATGAGTATGGATGGTCTT
TTTTATCATGTAGAGATGCCTTTAGTAGAAGTCTTAGGTCATATGGAGTATGAAGGTATAAAAGTAGACAAAAGTATATT
GGATGAATTGAGTGTAGAGTTTAAAGAAATAATAGCTACACTAGAAAAGGAAATTTATGAATTATCAGGAGAGCCATTCA
ATATAAATTCACCAAAACAATTGGGTGTTATATTATTTGAAAAATTAGAGCTTCCAATAATTAAAAAAACTAAAACTGGA
TATTCAACTAATGCAGATGTTTTAGAAAAATTAAGAGATAAGCATCCAATAATAGATAAGATAACTGAGTATAGACAAAT
AGTAAAGTTAAATTCTACTTACGTTGAAGGACTTTTAAATATAATAAATCCAGAAAGTAATAGAATACATTCATCATTTA
ATCAAACTATAACGACTACAGGAAGAATTTCATCTACAGAACCAAATCTTCAAAATATACCTATAAAAATGGAAATGGGT
AGAAAAATAAGAAAAGTGTTTATTGCAGATGATAACTGTAAGTTGGTTGATGCCGATTACTCACAAGTAGAGCTAAGAGT
TCTTGCACATATGAGTCAGGATGAAAATATGATAGAAGCATTTAAACATCATGAAGATATTCATACCAAAACAGCTTCAC
AAGTTTTTAATGTTCCTATGGAAGATGTTACTAGTGAATTGAGAGGTGCAGCTAAAGCTGTAAACTTTGGTATTATATAT
GGTAAAAGTGACTTTGGTCTAGCTGATGATTTAAATATACCTGTTCCAAAGGCGAAAGAATATATCGAAAGTTATTTTGC
TAAATACCATAAAATAAAAGAGTTTATGGACACAACAGTAGAAAAGGCTAGTGAGGATGGATATGCTGTAACAATTTTAA
ACAGAAGAAGATATATACCAGAAATAAAGTCAAGCAATTTTATGGAAAGAAATAGAGGAAAGAGATTTGCAATGAATGCA
CCAATACAAGGAAGTGCAGCAGATATAATAAAAATTGCTATGATAAATGTTCATAAAAAATTGGAAGAGAACAATTTGAA
ATCAAAACTTATACTACAAGTTCACGACGAGTTGATAGTTGAAGCAATTGATGATGAAATAGATATAGTTAAGAAAATAG
TAAAAGAAGAAATGGAAAATGCTGTAAATATGGATGTTCACTTAGATGTAGATTTAAATGTAGGTAGTTCTTGGTATGAG
ACAAAGTAG

Protein sequence :
MEKKLIIIDGNSIINRAFYALPEMNNKEGLKTNAIYGFTTMLFKMIDIYKPTHISVAFDRKAPTFRHLEYKEYKAGRKGM
PDELAEQLQPLKDLLDKFKINRLEIDGYEADDIIGTVSKKAENNGYKVYIVTGDKDAIQLASDNTTTLITKKGVGEVEEY
NFNSVIEKYEMTPAQFIDLKGLMGDKSDNIPGVPGIGEKTGIKLIKEFSSIENLIENTDKLKGSVKKKIEENKEIAIQSK
RLATIIRDVPIEVDLDSMVFGDYDKDELLDKFRYFGFTSLLSRLVDLVDSDDTEHVEEKIEILKLKDTEKFIDEVNKKGQ
VILKTVRGQGNILEKNIIYIFISVDGEKIYYVDSNEVSKLDSIFSNNEIKKYGYNLKDDYISLKPYNINLENMYFDITIA
EYLIDSTSSTSYDCSAIAMKYLSKKVKSKEELLGKGVKSKNYEDLEFEDLAEYMGQIICTVKETIPMMERSLMDMSMDGL
FYHVEMPLVEVLGHMEYEGIKVDKSILDELSVEFKEIIATLEKEIYELSGEPFNINSPKQLGVILFEKLELPIIKKTKTG
YSTNADVLEKLRDKHPIIDKITEYRQIVKLNSTYVEGLLNIINPESNRIHSSFNQTITTTGRISSTEPNLQNIPIKMEMG
RKIRKVFIADDNCKLVDADYSQVELRVLAHMSQDENMIEAFKHHEDIHTKTASQVFNVPMEDVTSELRGAAKAVNFGIIY
GKSDFGLADDLNIPVPKAKEYIESYFAKYHKIKEFMDTTVEKASEDGYAVTILNRRRYIPEIKSSNFMERNRGKRFAMNA
PIQGSAADIIKIAMINVHKKLEENNLKSKLILQVHDELIVEAIDDEIDIVKKIVKEEMENAVNMDVHLDVDLNVGSSWYE
TK
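
The DNA sequence begins with TTG, while the protein sequence begins with M. This is consistent with TTG acting as an alternative initiation codon in the bacterial genetic code (NCBI translation table 11), where an initiating TTG is read as methionine. The sketch below is a minimal translator under that assumption; it is not the tool used to produce this record. The function name and the reduced set of start codons (ATG, GTG, TTG) are choices made for this illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino-acid
# assignments of table 11 are the same as the standard code; only the
# set of permitted initiation codons differs.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: only the three most common bacterial initiators are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initiator codon as M
    and stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the DNA sequence in this record.
print(translate_cds("TTGGAAAAAAAATTAATTATAATAGATGGA"))  # prints: MEKKLIIIDG
```

The output matches the first ten residues of the protein sequence listed above.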

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
polA | YP_281523.1 | DNA polymerase I | Not tested | Not named | Protein | 7e-137 | 43