Gene Information

Name : Thet_0779 (Thet_0779)
Accession : YP_003903700.1
Strain : Thermoanaerobacter sp. X513
Genome accession: NC_014538
Putative virulence/resistance : Unknown
Product : DNA polymerase I
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0749
EC number : -
Position : 792624 - 795242 bp
Length : 2619 bp
Strand : +
Note : TIGRFAM: DNA polymerase I; PFAM: DNA-directed DNA polymerase; 5'-3' exonuclease, N-terminal resolvase-like domain; 5'-3' exonuclease, SAM-fold domain; 3'-5' exonuclease; KEGG: tpd:Teth39_1474 DNA polymerase I; SMART: 5'-3' exonuclease; Helix-hairpin-helix
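
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is the usual GenBank convention; the helper name feature_length is hypothetical and not part of this database.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(792624, 795242)

# A coding sequence should be a whole number of codons; the last one is the stop.
codon_count, remainder = divmod(length_bp, 3)
residue_count = codon_count - 1  # exclude the terminal stop codon

print(length_bp, codon_count, remainder, residue_count)
```

Under that assumption the result agrees with the Length field (2619 bp), which corresponds to 873 codons including the stop codon.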

DNA sequence :
ATGTCTAAATTTTTAGTAATTGACGGGAGTAGCCTCATGTACAGAGCCTATTATGCATTGCCTATGCTTACTACAAGTGA
AGGATTACATACAAATGCTTTGTACGGTTTTACTATGATGCTTATAAAACTTATTGAAGAGGAAAAACCTGATTACATAG
CTATTGCTTTTGACAAAAAAGCTCCTACTTTTAGACACAAAGAGTATCAAGACTACAAAGCTACAAGACAAGCAATGCCA
GAAGAATTGGCTGAACAGGTAGACCTTTTAAAAGAAATTATTGAGGGCTTTAATATAAAGATTTTAGAATTAGAAGGTTA
CGAAGCTGATGACATCATAGGTACTATTTCAAAGTTGGCAGAGGAAAAAGAAATGGAAGTGCTTGTAGTTACAGGAGATA
GGGACGCGCTTCAATTAGTTTCGGATAAAGTGAAAGTGAAGATTTCTAAAAAGGGTATAACTCAGATGGAAGAGTTTGAC
GAAAAGGCTGTTTTAGAAAGATATGAAATAACTCCTCACCAATTTATAGATTTAAAAGGACTAATGGGGGATAAATCTGA
CAACATCCCTGGAATACCTAATATAGGGGAAAAAACAGCAATTAAACTATTAAAAGACTTTGGAACAATTGAGAATTTAC
TACAAAATCTTTCTCAGCTCAAAGGTAAGATAAAAGAAAATATAGAAAATAATAAAGAATTAGCTATAATGAGTAAAAAA
CTTGTCACTATAAAAAGAGACATTCCCATTGAGATAGATTTTGAGGAATATAGAGTAAAAGAGTTTAATGAGGAGAAGCT
TTTAGAGATTTTTAATAAATTAGAATTCTTTAGTTTGATTGATAGCATAAAGAAAAAAAATGACGTAGAGATTGTAAATA
ATCATAAAGTTCAAAAATGGTCAAAAGTAGATATAAAAAAATTAATAGCTTTATTGCAAGATAGCAAAAGTATTGCTTTT
TATCCACTAATTTATGAAGGGGAAATAAAGAAAATAGCTTTTTCTTTTGGAAACGATACTGTTTATATTGATGGTTTTCA
AATAAAAGATTTAAAAGAGATTTTTGAAAAAGAAAAATTTGAATTTACAACCCATGAAATAAAAGATTTTTTAGTTAAGC
TTTCTTATAAAGGAATAGAGTGTAAAAGCAAGTACATGGATACTGCTATAATGGCTTATCTTTTAAATCCTTCTGAGTCT
AACTATGATTTAGATCGTGTGCTAAAAAAATATTTAAAGGTTGATGTTCCATCTTATGAAGAGGTATTTGGCAAAGGTAG
GGATAAAAAGAAACTTGAAGAAATAGGAGAAGATATACTTGCTGATTACATTTGCAGTAGATGTGTACATCTATTTGATT
TAAGAGAAAAGTTGATGAATTTTATTGAAGAAATGGATATGAAAAGACTTTTGTTGGAAATAGAAATGCCTCTTGTAGAA
GTCTTAAAATCAATGGAAGTAAGTGGTTTTACATTGGATAAAGAAGTCCTAAAAGAGCTTTCACAAAAAATAAATGATAG
AATAGCAGAAATACTAGATAAAATTTATAAAGAGGCAGGGTATCAATTTAATGTAAATTCTCCTAAGCAATTAAGTGAAT
TTTTGTTTGAAAAATTAAATTTACCAGTAATAAAGAAAACAAAAACAGGGTATTCTACAGATTCTGAAGTTTTAGAGCAA
TTAGTTCCTTACAATAATATTGTCAATGATATAATAGAGTATAGGCAACTTACAAAACTTAAATCTACTTATATAGATGG
ATTTTTGCCTCTCATGGATGAAAACAATAGAGTACATTCTAATTTTAAGCAAATGGTCACTTCTACAGGCAGAATAAGCA
GTACCGAGCCAAATCTACAAAATATACCTATAAGAGAAGAATTTGGAAGGCAAATTAGAAGAGCTTTTATTCCGCGGACT
AAAGATGGGTATATTGTCTCAGCTGATTATTCTCAGATTGAACTACGAGTTTTAGCACATGTTTCGGGAGATGAAAAGCT
AATAGAATCTTTTATGAATAATGAAGATATACATTTAAGGACTGCTTCGGAAGTTTTTAAAGTCCCAATGGAAAAAGTTA
CACCAGAGATGAGAAGAGTAGCAAAAGCCGTAAACTTTGGCATAATATATGGCATCAGCGATTACGGGCTTTCTCGAGAC
CTTAAAATATCAAGAAAAGAGGCAAAAGAGTATATAAATAATTATTTTGAAAGATACAAAGGAGTAAAAGAATATATTGA
AAAAATAGTTCGATTTGCAAAAGAAAATGGCTATGTGATTACAATAATGAACAGGAGAAGATATATTCCTGAGATAAACT
CCAGAAATTTTACTCAAAGGTCACAGGCTGAAAGGTTAGCGATGAATGCTCCAATACAGGGAAGTGCTGCCGATATAATA
AAAATGGCAATGGTTAGAGTGTACAACGATTTAGAAAAATTAAAGCTTAAGTCTAAGCTTATATTACAAGTTCATGACGA
ACTTGTAGTGGATACTTATAAAGATGAAGTAGAAATCGTAAAAAAGATACTCAAAGATAATATGGAAAATGTAGTACAAT
TAAAAGTTCCCCTTGTAGTGGAAATTGGAGTAGGACCCAATTGGTTTTTAGCCAAGTGA

Protein sequence :
MSKFLVIDGSSLMYRAYYALPMLTTSEGLHTNALYGFTMMLIKLIEEEKPDYIAIAFDKKAPTFRHKEYQDYKATRQAMP
EELAEQVDLLKEIIEGFNIKILELEGYEADDIIGTISKLAEEKEMEVLVVTGDRDALQLVSDKVKVKISKKGITQMEEFD
EKAVLERYEITPHQFIDLKGLMGDKSDNIPGIPNIGEKTAIKLLKDFGTIENLLQNLSQLKGKIKENIENNKELAIMSKK
LVTIKRDIPIEIDFEEYRVKEFNEEKLLEIFNKLEFFSLIDSIKKKNDVEIVNNHKVQKWSKVDIKKLIALLQDSKSIAF
YPLIYEGEIKKIAFSFGNDTVYIDGFQIKDLKEIFEKEKFEFTTHEIKDFLVKLSYKGIECKSKYMDTAIMAYLLNPSES
NYDLDRVLKKYLKVDVPSYEEVFGKGRDKKKLEEIGEDILADYICSRCVHLFDLREKLMNFIEEMDMKRLLLEIEMPLVE
VLKSMEVSGFTLDKEVLKELSQKINDRIAEILDKIYKEAGYQFNVNSPKQLSEFLFEKLNLPVIKKTKTGYSTDSEVLEQ
LVPYNNIVNDIIEYRQLTKLKSTYIDGFLPLMDENNRVHSNFKQMVTSTGRISSTEPNLQNIPIREEFGRQIRRAFIPRT
KDGYIVSADYSQIELRVLAHVSGDEKLIESFMNNEDIHLRTASEVFKVPMEKVTPEMRRVAKAVNFGIIYGISDYGLSRD
LKISRKEAKEYINNYFERYKGVKEYIEKIVRFAKENGYVITIMNRRRYIPEINSRNFTQRSQAERLAMNAPIQGSAADII
KMAMVRVYNDLEKLKLKSKLILQVHDELVVDTYKDEVEIVKKILKDNMENVVQLKVPLVVEIGVGPNWFLAK
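
The protein sequence above is the conceptual translation of the DNA sequence. A minimal sketch of that translation follows, assuming the standard genetic code (its codon assignments are the same as the bacterial table 11 for sense and stop codons). The function name translate is hypothetical, and only the first and last few codons of the record are used as examples.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and normalize case
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First nine codons of the DNA sequence in this record.
print(translate("ATGTCTAAATTTTTAGTAATTGACGGG"))
# Last three codons of the DNA sequence, ending in the TGA stop.
print(translate("GCCAAGTGA"))
```

Pasting the full DNA sequence into translate should reproduce the protein sequence listed above if the assumptions hold; line breaks in the pasted text are ignored.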

• Homologs from PAI DB

Gene  GenBank Accn  Product           Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity (%)
polA  YP_281523.1   DNA polymerase I  Not tested               Not named   Protein         1e-144  41