Gene Information

Name : Teth514_2156 (Teth514_2156)
Accession : YP_001663764.1
Strain : Thermoanaerobacter sp. X514
Genome accession: NC_010320
Putative virulence/resistance : Unknown
Product : DNA polymerase I
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0749
EC number : 2.7.7.7
Position : 2159100 - 2161718 bp
Length : 2619 bp
Strand : -
Note : has 3'-5' exonuclease, 5'-3' exonuclease and 5'-3' polymerase activities; primarily functions to fill gaps during DNA replication and repair
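
The Position and Length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the gene is a stop codon. The function name `feature_length` is only illustrative and is not part of any database API.

```python
# Values copied from the Position and Strand fields of this record.
start, end = 2159100, 2161718   # assumed 1-based, inclusive coordinates
strand = "-"                    # gene lies on the reverse strand


def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


length_bp = feature_length(start, end)
codons = length_bp // 3
# Assumption: the last codon is a stop codon and is not translated.
protein_len = codons - 1

print(length_bp, codons, protein_len)  # prints: 2619 873 872
```

Under these assumptions the result agrees with the Length field (2619 bp) and implies a protein of 872 residues.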

DNA sequence :
ATGTCTAAATTTTTAGTAATTGACGGGAGTAGCCTCATGTACAGAGCCTATTATGCATTGCCTATGCTTACTACAAGTGA
AGGATTACATACAAATGCTTTGTACGGTTTTACTATGATGCTTATAAAACTTATTGAAGAGGAAAAACCTGATTACATAG
CTATTGCTTTTGACAAAAAAGCTCCTACTTTTAGACACAAAGAGTATCAAGACTACAAAGCTACAAGACAAGCAATGCCA
GAAGAATTGGCTGAACAGGTAGACCTTTTAAAAGAAATTATTGAGGGCTTTAATATAAAGATTTTAGAATTAGAAGGTTA
CGAAGCTGATGACATCATAGGTACTATTTCAAAGTTGGCAGAGGAAAAAGAAATGGAAGTGCTTGTAGTTACAGGAGATA
GGGACGCGCTTCAATTAGTTTCGGATAAAGTGAAAGTGAAGATTTCTAAAAAGGGTATAACTCAGATGGAAGAGTTTGAC
GAAAAGGCTGTTTTAGAAAGATATGAAATAACTCCTCACCAATTTATAGATTTAAAAGGACTAATGGGGGATAAATCTGA
CAACATCCCTGGAATACCTAATATAGGGGAAAAAACAGCAATTAAACTATTAAAAGACTTTGGAACAATTGAGAATTTAC
TACAAAATCTTTCTCAGCTCAAAGGTAAGATAAAAGAAAATATAGAAAATAATAAAGAATTAGCTATAATGAGTAAAAAA
CTTGTCACTATAAAAAGAGACATTCCCATTGAGATAGATTTTGAGGAATATAGAGTAAAAGAGTTTAATGAGGAGAAGCT
TTTAGAGATTTTTAATAAATTAGAATTCTTTAGTTTGATTGATAGCATAAAGAAAAAAAATGACGTAGAGATTGTAAATA
ATCATAAAGTTCAAAAATGGTCAAAAGTAGATATAAAAAAATTAATAGCTTTATTGCAAGATAGCAAAAGTATTGCTTTT
TATCCACTAATTTATGAAGGGGAAATAAAGAAAATAGCTTTTTCTTTTGGAAACGATACTGTTTATATTGATGGTTTTCA
AATAAAAGATTTAAAAGAGATTTTTGAAAAAGAAAAATTTGAATTTACAACCCATGAAATAAAAGATTTTTTAGTTAAGC
TTTCTTATAAAGGAATAGAGTGTAAAAGCAAGTACATGGATACTGCTATAATGGCTTATCTTTTAAATCCTTCTGAGTCT
AACTATGATTTAGATCGTGTGCTAAAAAAATATTTAAAGGTTGATGTTCCATCTTATGAAGAGGTATTTGGCAAAGGTAG
GGATAAAAAGAAACTTGAAGAAATAGGAGAAGATATACTTGCTGATTACATTTGCAGTAGATGTGTACATCTATTTGATT
TAAGAGAAAAGTTGATGAATTTTATTGAAGAAATGGATATGAAAAGACTTTTGTTGGAAATAGAAATGCCTCTTGTAGAA
GTCTTAAAATCAATGGAAGTAAGTGGTTTTACATTGGATAAAGAAGTCCTAAAAGAGCTTTCACAAAAAATAAATGATAG
AATAGCAGAAATACTAGATAAAATTTATAAAGAGGCAGGGTATCAATTTAATGTAAATTCTCCTAAGCAATTAAGTGAAT
TTTTGTTTGAAAAATTAAATTTACCAGTAATAAAGAAAACAAAAACAGGGTATTCTACAGATTCTGAAGTTTTAGAGCAA
TTAGTTCCTTACAATAATATTGTCAATGATATAATAGAGTATAGGCAACTTACAAAACTTAAATCTACTTATATAGATGG
ATTTTTGCCTCTCATGGATGAAAACAATAGAGTACATTCTAATTTTAAGCAAATGGTCACTTCTACAGGCAGAATAAGCA
GTACCGAGCCAAATCTACAAAATATACCTATAAGAGAAGAATTTGGAAGGCAAATTAGAAGAGCTTTTATTCCGCGGACT
AAAGATGGGTATATTGTCTCAGCTGATTATTCTCAGATTGAACTACGAGTTTTAGCACATGTTTCGGGAGATGAAAAGCT
AATAGAATCTTTTATGAATAATGAAGATATACATTTAAGGACTGCTTCGGAAGTTTTTAAAGTCCCAATGGAAAAAGTTA
CACCAGAGATGAGAAGAGTAGCAAAAGCCGTAAACTTTGGCATAATATATGGCATCAGCGATTACGGGCTTTCTCGAGAC
CTTAAAATATCAAGAAAAGAGGCAAAAGAGTATATAAATAATTATTTTGAAAGATACAAAGGAGTAAAAGAATATATTGA
AAAAATAGTTCGATTTGCAAAAGAAAATGGCTATGTGATTACAATAATGAACAGGAGAAGATATATTCCTGAGATAAACT
CCAGAAATTTTACTCAAAGGTCACAGGCTGAAAGGTTAGCGATGAATGCTCCAATACAGGGAAGTGCTGCCGATATAATA
AAAATGGCAATGGTTAGAGTGTACAACGATTTAGAAAAATTAAAGCTTAAGTCTAAGCTTATATTACAAGTTCATGACGA
ACTTGTAGTGGATACTTATAAAGATGAAGTAGAAATCGTAAAAAAGATACTCAAAGATAATATGGAAAATGTAGTACAAT
TAAAAGTTCCCCTTGTAGTGGAAATTGGAGTAGGACCCAATTGGTTTTTAGCCAAGTGA

Protein sequence :
MSKFLVIDGSSLMYRAYYALPMLTTSEGLHTNALYGFTMMLIKLIEEEKPDYIAIAFDKKAPTFRHKEYQDYKATRQAMP
EELAEQVDLLKEIIEGFNIKILELEGYEADDIIGTISKLAEEKEMEVLVVTGDRDALQLVSDKVKVKISKKGITQMEEFD
EKAVLERYEITPHQFIDLKGLMGDKSDNIPGIPNIGEKTAIKLLKDFGTIENLLQNLSQLKGKIKENIENNKELAIMSKK
LVTIKRDIPIEIDFEEYRVKEFNEEKLLEIFNKLEFFSLIDSIKKKNDVEIVNNHKVQKWSKVDIKKLIALLQDSKSIAF
YPLIYEGEIKKIAFSFGNDTVYIDGFQIKDLKEIFEKEKFEFTTHEIKDFLVKLSYKGIECKSKYMDTAIMAYLLNPSES
NYDLDRVLKKYLKVDVPSYEEVFGKGRDKKKLEEIGEDILADYICSRCVHLFDLREKLMNFIEEMDMKRLLLEIEMPLVE
VLKSMEVSGFTLDKEVLKELSQKINDRIAEILDKIYKEAGYQFNVNSPKQLSEFLFEKLNLPVIKKTKTGYSTDSEVLEQ
LVPYNNIVNDIIEYRQLTKLKSTYIDGFLPLMDENNRVHSNFKQMVTSTGRISSTEPNLQNIPIREEFGRQIRRAFIPRT
KDGYIVSADYSQIELRVLAHVSGDEKLIESFMNNEDIHLRTASEVFKVPMEKVTPEMRRVAKAVNFGIIYGISDYGLSRD
LKISRKEAKEYINNYFERYKGVKEYIEKIVRFAKENGYVITIMNRRRYIPEINSRNFTQRSQAERLAMNAPIQGSAADII
KMAMVRVYNDLEKLKLKSKLILQVHDELVVDTYKDEVEIVKKILKDNMENVVQLKVPLVVEIGVGPNWFLAK
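
The relationship between the DNA and protein sequences can be illustrated by translating the first and last few codons. This is a minimal sketch that assumes the DNA sequence above is the coding strand and uses the standard codon assignments. The helper `translate` is hypothetical and written here only for illustration.

```python
from itertools import product

# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' marks a stop)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 9 bases of the DNA sequence in this record.
first_30 = "ATGTCTAAATTTTTAGTAATTGACGGGAGT"
last_9 = "GCCAAGTGA"

print(translate(first_30))  # prints: MSKFLVIDGS
print(translate(last_9))    # prints: AK*
```

The two outputs match the start (MSKFLVIDGS) and end (AK, followed by a stop) of the protein sequence listed above.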

Homologs from PAI DB

Gene  GenBank accession  Product           Virulence or resistance  PAI or REI  Alignment type  E-value  Identity (%)
polA  YP_281523.1        DNA polymerase I  Not tested               Not named   Protein         1e-144   41