Gene Information

Name : Toce_1653 (Toce_1653)
Accession : YP_003826013.1
Strain : Thermosediminibacter oceani DSM 16646
Genome accession: NC_014377
Putative virulence/resistance : Unknown
Product : DNA polymerase I
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0749
EC number : 2.7.7.7
Position : 1631697 - 1634291 bp
Length : 2595 bp
Strand : -
Note : COGs: COG0749 DNA polymerase I - 3'-5' exonuclease and polymerase domains; InterProIPR018320:IPR020046:IPR020047:IPR002562:IPR 001098:IPR002298:IPR002421:IPR008918:IPR019760; KEGG: pth:PTH_1976 DNA polymerase I; PFAM: DNA-directed DNA polymerase; 5'-3' ex

DNA sequence :
ATGAAGAAAATAATGTTGATTGATGGAAACAGCCTGATCCACAGGGCCTTTCATGCCTTGCCGCCGCTGATGACATCCAA
GGGAGTGCATACCAACGCTGTATACGGCTTCATGAACATGCTGATGCGGATTCTGAAGGAACAGCGGCCCGACTACATAG
CGGTGGCCTTTGATAAAAAGTCACCCACATTCAGGCACCAGGAATTTATTGAGTATAAGGCCAACAGGGTCAGGACACCC
GAAGAGCTGGTGGGCCAATTCGACGTTTTAAAACAAATACTAAAGGCGATGAACATCCGATATATCGAAATAGACGGATA
CGAAGCCGATGACATACTCGGGAGCCTGTCGAAAAAAGCCGAAGAAGCAGGGATCTTTACGCTCATCGTAACAGGGGATA
AGGATACCCTTCAGCTGGTATCTCCCATGGTCCACGTGATGCTTACCAGGAAGGGTATATCGGAAATGGAGATTTACGAT
CCCGATAAGATGGCTGAGCGGTTCGGTATACCACCGCAGGCCATCCCCGATATGAAGGGGCTCATGGGCGACTCGTCCGA
CAATATACCCGGCATTCCGGGGGTAGGAGAAAAAACGGCCTTGAAACTTTTGCAGGAATACGGCTCCCTGGAAAATATCC
TGGAAAATGCAGAGAAGCTCAAGGGGAAGTTACGGGAAAATATATTAAAATACGGTGAGCAGGCTCGAGTCAGCAAGCGC
CTGGCGACCATCGTCAGGGATGTGGAGCTGGACGTCGACCTGGAAGAGATTGCTTTAACCGAACCCGATTACGCAGAGCT
CCTAAAAATTTTCAGGGAGCTGGAGTTTTACACTCTCATAAATAAACTGCCCCGCCCGCAAGAGAAAGAAGAACATCCTG
AGAAGCTGTCGTGCACCGTAATCGACTACAGCGGTTTCGGCCGGATGATGGAAAGGGTAAGAGCTGCGGGAGTTCTGGCC
GTGGAACTGAAAACCGACGGTAGAAATCCCATGGATGCCCACCTTATAGGCATAGGCTTTTCGCCTTCCCGCGGGGAAGG
GTTTTACGTACCGGCAGAAGTACTGGAAAAAAGCCCGGAGGTGAAATCCGATCTAAAAGCCGTACTGGCCGACCCGGGGA
TTACTAAGATAATCCACGACGGAAAGTACGCGAGGACGGTACTTGCTAAAATAGGAATGGATTTTGTATACAATTTTGAT
ACCATGCTGGCAGCTTACCTGCTGGACCCCTCAAAGCCCAGATATGACCTGGAGAGCGTAGTCTTCGATAACCTGGGCGT
GGAATTGAAAGGCACCGAAGATCCCGGCAGGAGGGTAGCGTACCTCATACCTTTGAAGGAGATAATGAGCGAAAAACTTA
AAAGCTGCGCAATGGAAGAGCTCTTTTTCGGAGTTGAGATGCCGCTGAGCTTCGTGCTGTCTGATATGGAGATGACGGGG
ATAAAAGTGGACCCCGAAAAGCTCGAATCTCTATCAAGGGAGTTCGGGGAAAAACTGGAGGAACTAACCGGAGAAATTTA
CCGGCTTGCGGGGGTGGAGTTTAACATAAACTCGCCGAAACAACTGGGGGAAGTGCTTTTTGAAAAGCTAAACCTGCCGG
TCATAAAGAAAAAAAAGAGCGGTTATTCCACCGATGCCGAAGTCCTGGAAAAACTGAAAAACGCCCATCCGGTCGTGGAG
AAGATTCTGGAATACCGGTTCTTGATGAAAATGAAGTCAACCTATGCCGATGGGCTGTTGGCCCTTGTTGACAAAAGCAC
TTATCGTATTCACAGCAATTTCAATCAGACTATAACCGCCACCGGCCGTATCAGCAGTACCGAGCCGAACCTGCAGAACA
TACCCGTCAAGACCGATATAGGCAGGAAGATAAGGGGGGTGTTTGTGGCGGAAAGTCCCGAGCATGTCTTGCTGTCCGGG
GATTATTCCCAGATAGAGCTGAGGGTCCTGGCACATTTATCGGGAGATGAGGGGCTTATCGAGGCTTTCATAAAAGGGGA
GGATATCCATACCAGGACCGCCAGCGAGGTCTTCGGGGTTCCTCCCGAACAGGTTACCCCTCTTTTAAGGGATAGAGCAA
AAGCCGTAAATTTTGGTATAATCTATGGTATAAGCGATTACGGCCTTGCCCAGAACCTGGGCATATCCACCGCGGAAGCC
CGGGAGTATATCGAAAATTACTTAAACAGGTATCCGAAGGTAAGGGATTATATCCGGGAGACCATCAGGAATGCCAGGAT
GAGCGGGTATGTGACCACCATCCTCAACCGCAGGCGCTACATTCCGGAAATTAACAGCAGGAACTACAACCTCAGGTCCT
TTGCCGAGAGGGTGGCGATGAACACCCCCATTCAGGGGAGTGCTGCAGATATAATAAAGGTTGCCATGGTGAAAATTACC
AACCACTTTCGTGAGTATGGACTCAAGGCCAAAATGCTAATTCAGGTTCACGACGAGCTGATCTTCGACGTTCCGAAGTC
CGAGCTTGAAGTCGTGAAAAATATCGTTAAAGATGATATGGAAAACGCCATTCCGCTGAAGGTCCCGCTGGTGGTCGATT
TTAAGGAGGGCTATACCTGGGAAGAAATAAGCTGA

Protein sequence :
MKKIMLIDGNSLIHRAFHALPPLMTSKGVHTNAVYGFMNMLMRILKEQRPDYIAVAFDKKSPTFRHQEFIEYKANRVRTP
EELVGQFDVLKQILKAMNIRYIEIDGYEADDILGSLSKKAEEAGIFTLIVTGDKDTLQLVSPMVHVMLTRKGISEMEIYD
PDKMAERFGIPPQAIPDMKGLMGDSSDNIPGIPGVGEKTALKLLQEYGSLENILENAEKLKGKLRENILKYGEQARVSKR
LATIVRDVELDVDLEEIALTEPDYAELLKIFRELEFYTLINKLPRPQEKEEHPEKLSCTVIDYSGFGRMMERVRAAGVLA
VELKTDGRNPMDAHLIGIGFSPSRGEGFYVPAEVLEKSPEVKSDLKAVLADPGITKIIHDGKYARTVLAKIGMDFVYNFD
TMLAAYLLDPSKPRYDLESVVFDNLGVELKGTEDPGRRVAYLIPLKEIMSEKLKSCAMEELFFGVEMPLSFVLSDMEMTG
IKVDPEKLESLSREFGEKLEELTGEIYRLAGVEFNINSPKQLGEVLFEKLNLPVIKKKKSGYSTDAEVLEKLKNAHPVVE
KILEYRFLMKMKSTYADGLLALVDKSTYRIHSNFNQTITATGRISSTEPNLQNIPVKTDIGRKIRGVFVAESPEHVLLSG
DYSQIELRVLAHLSGDEGLIEAFIKGEDIHTRTASEVFGVPPEQVTPLLRDRAKAVNFGIIYGISDYGLAQNLGISTAEA
REYIENYLNRYPKVRDYIRETIRNARMSGYVTTILNRRRYIPEINSRNYNLRSFAERVAMNTPIQGSAADIIKVAMVKIT
NHFREYGLKAKMLIQVHDELIFDVPKSELEVVKNIVKDDMENAIPLKVPLVVDFKEGYTWEEIS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
polA YP_281523.1 DNA polymerase I Not tested Not named Protein 1e-160 43