Gene Information

Name : polA (SPCG_0032)
Accession : YP_001834749.1
Strain : Streptococcus pneumoniae CGSP14
Genome accession: NC_010582
Putative virulence/resistance : Unknown
Product : DNA polymerase I
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0749
EC number : -
Position : 32865 - 35534 bp
Length : 2670 bp
Strand : +
Note : has 3'-5' exonuclease, 5'-3' exonuclease and 5'-3'polymerase activities, primarily functions to fill gaps during DNA replication and repair

DNA sequence :
ATGGTATTTTTTGATTCTTTCCTTTATAATGGGTGTATGGATAAGAAAAAATTATTATTGATTGATGGGTCTTCTGTAGC
TTTTCGGGCGTTTTTTGCGCTGTATCAGCAGTTGGACCGTTTTAAGAATGTGGCTGGTTTGCATACCAATGCGATTTATG
GTTTTCAGTTGATGTTGAGTCATTTATTGGAGCGGGTTGAGCCGAGTCATATTTTGGTGGCTTTTGATGCGGGAAAGACG
ACCTTCCGGACAGAGATGTATGCGGACTATAAGGGTGGTCGGGCCAAGACTCCTGATGAGTTTCGTGAGCAATTTCCTTT
CATTCGTGAGTTGCTGGATCATATGGGGATTCGTCACTATGATCTGGCTCAGTATGAGGCGGATGACATCATTGGGACGC
TGGATAAGCTAGCAGAGCAGGATGGTTTTGATATTACTATTGTCAGTGGGGACAAGGATTTGATTCAGCTGACGGATGAG
CATACGGTGGTTGAAATTTCCAAGAAAGGTGTGTCTGAGTTTGAGGCCTTTACGCCAGATTACCTCATGGAAGAAATGGG
CCTCACACCAGCTCAGTTTATCGATCTCAAGGCGCTCATGGGTGATAAGTCGGATAATATCCCTGGGGTGACCAAAGTCG
GTGAAAAGACGGGTATTAAGCTCTTGCTGGAGCATGGTTCGCTTGAGGGGATTTATGAAAATATTGATGGAATGAAGACT
TCTAAGATGAAGGAAAATCTCATCAATGACAAGGAACAGGCCTTTTTGTCTAAAACACTAGCGACCATTGATACCAAGGC
ACCGATTGCGATTGGTTTAGAGGACTTGGTCTATAGTGGTCCAGATGTTGAAAATCTTGGGAAATTCTACGATGAGATGG
GCTTCAAACAGCTAAAGCAGGCTTTAAATGTGTCGTCAGCTGATGTGTCTGAGAGTTTGGATTTTACTATTGTTGACCAA
ATCAGTCAAGATATGCTGAGTGAAGAGTCTATCTTCCACTTTGAGCTTTTTGGTGAGAATTACCATACGGATAATTTGGT
TGGATTTGCCTGGTCTTGTGGGGATAAGCTCTATGCCACAGACAAGCTTGAGCTGTTGCAAGACCCGATTTTCAAGGATT
TCTTAGAAAAAACATCTCTGAGAGTTTATGACTTTAAGAAGGTTAAAGTTCTTTTGCAACGTTTTGGTGTGGATTTGCAG
GCGCCTGCTTTTGACATCCGTTTGGCTAAATACCTCCTTTCGACTGTGGAGGACAATGAAATTGCGACCATCGCTAGTCT
TTATGGTCAGACTTACTTGGTTGATGATGAAACTTTCTACGGTAAGGGTGTTAAAAAGGCCATTCCTGAACGTGAGAAAT
TCTTGGAACACTTAGCTTGTAAACTTGCTGTTTTGGTAGAAACAGAGCCTATTTTACTTGAAAAACTCAGCGAAAATGGG
CAATTAGAGCTTCTTTATGATATGGAGCAACCTCTGGCTTTTGTCCTTGCCAAGATGGAAATTGCTGGGATTATGGTCAA
GAAAGAGACCTTGCTTGAGATGCAGGCTGAAAATGAGCTTGTCATTGAAAAACTGACTCAAGAGATTTACGAGCTGGCTG
GTGAGGAGTTTAATGTCAACTCGCCTAAGCAGTTGGGCGTGCTTCTCTTTGAGAAATTGGGACTTCCTCTAGAATACACT
AAGAAAACCAAGACAGGTTATTCGACAGCAGTGGATGTTTTAGAGCGTCTCGCTCCTATTGCTCCGATTGTTAAGAAAAT
CCTGGATTACCGTCAAATTGCTAAGATTCAATCTACTTATGTAATTGGCTTGCAGGACTGGATTTTGGCTGATGGAAAGA
TTCATACTCGCTATGTGCAGGATTTGACCCAGACCGGGCGTTTGTCTAGTGTGGATCCAAACTTGCAAAATATTCCTGCC
CGATTGGAACAGGGGCGCTTGATTCGGAAGGCTTTTGTGCCAGAGTGGGAGGATAGTGTGCTACTCAGCTCTGACTATTC
ACAGATTGAATTGCGCGTTTTGGCGCATATTTCTAAGGATGAGCACTTGATTAAGGCCTTCCAAGAGGGGGCAGATATCC
ATACTTCGACAGCCATGCGGGTCTTTGGCATTGAGCGTCCTGATGATGTGACTGCAAACGACCGTCGCAATGCCAAGGCA
GTTAACTTTGGAGTGGTTTATGGGATTTCAGACTTTGGCTTGTCTAATAATTTGGGAATTAGTCGTAAGGAAGCCAAAGC
CTACATTGATACCTACTTTGAACGTTTTCCAGGTATTAAAAACTACATGGATGAAGTGGTGCGGGAGGCGCGTGATAAGG
GCTATGTAGAGACCCTCTTTAAGCGTCGCCGTGAGTTGCCAGATATCAATTCGCGCAACTTCAATATTCGTGGTTTTGCG
GAGCGAACTGCTATCAACTCACCTATCCAGGGTTCGGCAGCAGATATTCTCAAGATTGCCATGATTCAGCTGGATAAAGC
CTTAGTTGCAGGTGGTTATCAGACTAAGATGCTGTTACAAGTGCACGATGAAATCGTCCTTGAAGTGCCTAAATCTGAAT
TGGTAGAGATGAAAAAATTGGTGAAACAAACCATGGAAGAAGCCATTCAACTCAGTGTTCCTCTTATCGCAGATGAGAAT
GAAGGGGCAACCTGGTACGAGGCTAAATAA

Protein sequence :
MVFFDSFLYNGCMDKKKLLLIDGSSVAFRAFFALYQQLDRFKNVAGLHTNAIYGFQLMLSHLLERVEPSHILVAFDAGKT
TFRTEMYADYKGGRAKTPDEFREQFPFIRELLDHMGIRHYDLAQYEADDIIGTLDKLAEQDGFDITIVSGDKDLIQLTDE
HTVVEISKKGVSEFEAFTPDYLMEEMGLTPAQFIDLKALMGDKSDNIPGVTKVGEKTGIKLLLEHGSLEGIYENIDGMKT
SKMKENLINDKEQAFLSKTLATIDTKAPIAIGLEDLVYSGPDVENLGKFYDEMGFKQLKQALNVSSADVSESLDFTIVDQ
ISQDMLSEESIFHFELFGENYHTDNLVGFAWSCGDKLYATDKLELLQDPIFKDFLEKTSLRVYDFKKVKVLLQRFGVDLQ
APAFDIRLAKYLLSTVEDNEIATIASLYGQTYLVDDETFYGKGVKKAIPEREKFLEHLACKLAVLVETEPILLEKLSENG
QLELLYDMEQPLAFVLAKMEIAGIMVKKETLLEMQAENELVIEKLTQEIYELAGEEFNVNSPKQLGVLLFEKLGLPLEYT
KKTKTGYSTAVDVLERLAPIAPIVKKILDYRQIAKIQSTYVIGLQDWILADGKIHTRYVQDLTQTGRLSSVDPNLQNIPA
RLEQGRLIRKAFVPEWEDSVLLSSDYSQIELRVLAHISKDEHLIKAFQEGADIHTSTAMRVFGIERPDDVTANDRRNAKA
VNFGVVYGISDFGLSNNLGISRKEAKAYIDTYFERFPGIKNYMDEVVREARDKGYVETLFKRRRELPDINSRNFNIRGFA
ERTAINSPIQGSAADILKIAMIQLDKALVAGGYQTKMLLQVHDEIVLEVPKSELVEMKKLVKQTMEEAIQLSVPLIADEN
EGATWYEAK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
polA YP_281523.1 DNA polymerase I Not tested Not named Protein 0.0 73