Gene Information

Name : Tthe_1101 (Tthe_1101)
Accession : YP_003851706.1
Strain : Thermoanaerobacterium thermosaccharolyticum DSM 571
Genome accession: NC_014410
Putative virulence/resistance : Unknown
Product : DNA polymerase I
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0749
EC number : -
Position : 1128453 - 1131056 bp
Length : 2604 bp
Strand : +
Note : TIGRFAM: DNA polymerase I; PFAM: DNA-directed DNA polymerase; 5'-3' exonuclease, SAM-fold domain; 5'-3' exonuclease, N-terminal resolvase-like domain; KEGG: tte:TTE0874 DNA polymerase I; SMART: 5'-3' exonuclease; DNA-directed DNA polymerase; Helix-hairpin
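
The Position and Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GenBank convention, and consistent with the 2604 bp reported here). The function names are hypothetical and not part of any PAI DB tool.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count for a CDS of the given length, excluding the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Position field of this record.
length_bp = feature_length(1128453, 1131056)
print(length_bp)                           # 2604
print(expected_protein_length(length_bp))  # 867
```

The result of 867 residues matches the length of the protein sequence listed in this record.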

DNA sequence :
ATGTCAAAATTTTTAATCATAGATGGAAATAGCTTAATGTACAGAGCGTATTTTGCGCTTCCAGATTTAATGAACAGTGA
GGGACTGCATACAAATGCCATATACGGTTTTTCAATGATGCTTTTAAAACTGCTTGATGAAGAAAAACCTGATTATATAG
CTGTTGCATTTGATAAAAAAGCTCCAACTTTTAGACATAAAGAGTACAGTGCTTATAAAGGTACCAGGCAGTCTATGCCT
GAGGAACTTATTGAGCAGGTTGATATTTTAAAAGATGTAATCAATGCATTTAATATAAAGACAATTGAGATTGAAGGATT
TGAAGCAGATGACATAATAGGCACTGTATCTAAAATTGCATCTGAAAATGGCTTAAAAGTATTGATTGTCACCGGTGATA
GAGATGCACTTCAACTTGTATCAGATGGTGTAAAAGTGAAGATATGCAAAAAAGGCATAACTCAGATGGAAGAATACGAT
GAAAGGGCAGTCGTTGAGAGGTATGAGGTTACTCCACGCCAGTTTATTGATTTGAAGGGCCTTATGGGAGATAAATCTGA
CAATATCCCTGGCGTGCCTAATATAGGAGAAAAGACAGCGATAAAACTTATTAAAGAATTTGGCTCTATTGAGAATGTCC
TTATGAATACTGATAAATTAAAAGGGAAAATAAGAGAAAATATTGAAAATAATACAGAAATGGCTATGTTAAGCAAAAAA
CTTGCTACTATAGAGAGAAATGTTCCAATTGAAATAGATTTAAGCGAATACCAGTTGAGGGATTATGATAGAGAAAAATT
AATAGATTTATTTGAGAAGTTGGAGTTTACCAGTTTAATTAATGACCTAAAAAAAGATGCTGATGATATTAGAGAAGTCA
AAGAGTGGCCTGTAAGAGACTTTAAATATATAAGAGAGCTGCTAAAGAGAGAAGATACACTATCATTTTATCCATTAATA
CTTGAAGGCGAAGTAAAAGCTGTATCATTTGCCACTGATGATGAATCATTTTTTGTTGAAGTAGATGACTATGAAGTGTT
TAAATTATTGGATAATGAGAAGCTTACGCTTATAGGTCATGATATAAAAGACTTTTTTGTTAATCTTTCATACCATGGCA
TAGAGCTGAATTGTAAATTTTACGATACCGCAATAATGACTTATCTGCTAAATCCTTCAGAATCCAATTACGATATAGGG
CGTGTCTTAAAGAAGTATTTAAAAGAGGACATACCTAATATTGAGGATATGCTGGGAAAAGGTAAAAGCAAAAAGAGTTA
TGATGACATAGATAAGAAGCTTTTAATTGATTATCTGTGTGCTACTGCATCAAAATTATCTAAATTAAAAGATAAGCTAA
TGTCCTTTATAAAAGAAATGGAAATGGAAGAGCTTCTTGATAATGTTGAGCTTCCACTGGTAGAAGTGTTAAAATCTATG
GAGGTATATGGATTTACATTAGATAAAGATGTATTAAAAGATTTATCCAAAGAAATAGGCGAAAAGACAGATAAAATAAT
AAAAGACATATATGACGCTGCGGGATATGAATTTAATATTAATTCTACCAAACAGTTATCAGAGTTTTTGTTTGATAAAT
TGAATTTGCCTCCAATAAAAAAGACTAAAACAGGATATTCCACTGATATGGAAGTTCTTGTGGAGCTTATACCGTACAAT
GAGATAGTTGGCGAGATAATCGAATATAGACAGCTTATGAAGCTGAAATCAACATATATAGATGGCTTCATGCCAATAAT
GGACAAGGATGACAAAGTTCATTCCACGTTTAAGCAGACAGTTGCGGCAACGGGTAGAATCAGCTCTACAGAGCCTAATT
TGCAAAATATACCAGTAAGAGATGAATTTGGCAGAAGAATAAGGAAAGCTTTTATATCCAGCTTTCAGGGAGGATATATT
GTATCTGCTGATTATTCACAGATAGAGTTGAGAGTTCTTGCACATCTTTCAGAGGATATAAAGCTTATTGAGTCATTTTT
AAACAATGAAGACATACATTTAAGGACAGCGTCAGAAGTGTTTAAAATTGCTCCTGAAGAAGTTACAGGTGAAATGAGAA
GGCGTGCGAAAGCTGTAAATTTCGGAATCGTGTATGGTATAAGCGATTATGGTCTCTCTAGAGATTTAAAGATCTCTCGT
AAAGAGGCAAAAGAATATATAGACAATTATTTTGATAGGTACAAAGGAGTTAAAAATTACATTGATTCGGTAGTCAAATT
TGCCAGAGAAAATGGATATGTGACGACAATTTTGAATAGAAGAAGGTATATACCGGAGATTAATTCAAAGAATTATAATC
AAAGGTCATTTGGCGAGAGAATGGCGATGAATACGCCTATTCAGGGAAGCGCGGCAGACATAATAAAAATGTCAATGGTG
AAAGTATACAATGAATTAAAGTCAAGATCATTAAAATCGAAGCTAATACTGCAGATCCATGACGAGCTTATTGTTGATAC
TTTTCCCGATGAAGTTGAAATCGTCAAAAACTTATTAAAGACCATAATGGAAAATGTCATAAAGCTAAGAGTCCCTTTGG
TTGTAGATATTGGCTATGGTAAAAATTGGTACGATGCAAAATAA

Protein sequence :
MSKFLIIDGNSLMYRAYFALPDLMNSEGLHTNAIYGFSMMLLKLLDEEKPDYIAVAFDKKAPTFRHKEYSAYKGTRQSMP
EELIEQVDILKDVINAFNIKTIEIEGFEADDIIGTVSKIASENGLKVLIVTGDRDALQLVSDGVKVKICKKGITQMEEYD
ERAVVERYEVTPRQFIDLKGLMGDKSDNIPGVPNIGEKTAIKLIKEFGSIENVLMNTDKLKGKIRENIENNTEMAMLSKK
LATIERNVPIEIDLSEYQLRDYDREKLIDLFEKLEFTSLINDLKKDADDIREVKEWPVRDFKYIRELLKREDTLSFYPLI
LEGEVKAVSFATDDESFFVEVDDYEVFKLLDNEKLTLIGHDIKDFFVNLSYHGIELNCKFYDTAIMTYLLNPSESNYDIG
RVLKKYLKEDIPNIEDMLGKGKSKKSYDDIDKKLLIDYLCATASKLSKLKDKLMSFIKEMEMEELLDNVELPLVEVLKSM
EVYGFTLDKDVLKDLSKEIGEKTDKIIKDIYDAAGYEFNINSTKQLSEFLFDKLNLPPIKKTKTGYSTDMEVLVELIPYN
EIVGEIIEYRQLMKLKSTYIDGFMPIMDKDDKVHSTFKQTVAATGRISSTEPNLQNIPVRDEFGRRIRKAFISSFQGGYI
VSADYSQIELRVLAHLSEDIKLIESFLNNEDIHLRTASEVFKIAPEEVTGEMRRRAKAVNFGIVYGISDYGLSRDLKISR
KEAKEYIDNYFDRYKGVKNYIDSVVKFARENGYVTTILNRRRYIPEINSKNYNQRSFGERMAMNTPIQGSAADIIKMSMV
KVYNELKSRSLKSKLILQIHDELIVDTFPDEVEIVKNLLKTIMENVIKLRVPLVVDIGYGKNWYDAK
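
The protein sequence corresponds to a direct translation of the DNA sequence on the + strand. The following is a minimal sketch of that translation, assuming the standard codon assignments (shared by NCBI translation tables 1 and 11) and a reading frame that starts at the first base. `CODON_TABLE` and `translate` are illustrative names, not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 12 bases of the DNA sequence in this record.
print(translate("ATGTCAAAATTTTTAATCATAGATGGAAAT"))  # MSKFLIIDGN
print(translate("GATGCAAAATAA"))                    # DAK*
```

Both fragments agree with the start (MSKFLIIDGN) and end (DAK) of the listed protein sequence; the final TAA is the stop codon.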

Homologs from PAI DB :

Gene  GenBank Accn  Product           Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity (%)
polA  YP_281523.1   DNA polymerase I  Not tested               Not named   Protein         5e-145  42