Gene Information

Name : COB47_1113 (COB47_1113)
Accession : YP_003840398.1
Strain : Caldicellulosiruptor obsidiansis OB47
Genome accession: NC_014392
Putative virulence/resistance : Unknown
Product : DNA polymerase I
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG0749
EC number : -
Position : 1232701 - 1235253 bp
Length : 2553 bp
Strand : +
Note : TIGRFAM: DNA polymerase I; PFAM: DNA-directed DNA polymerase; 5'-3' exonuclease, N-terminal resolvase-like domain; 5'-3' exonuclease, SAM-fold domain; KEGG: ate:Athe_1441 DNA polymerase I; SMART: 5'-3' exonuclease; Helix-hairpin-helix domain protein class

DNA sequence :
ATGAAATTAGTTATATTTGATGGTAACAGTATTTTATACAGAGCTTTTTTTGCTCTTCCTGAACTGACAACTTCGAGTAA
TATTCCTACAAATGCTATTTATGGGTTTATAAATGTAATATTGAAATATTTAGAGCAAGAGAATCCTGATTATGTTGCAG
TTGCATTTGATAAGAAAGGAAGACAGGCACGAAAAAGCGAATATGAAGAATACAAAGCTAACAGAAAACCTATGCCAGAT
AATCTTCAAGTACAAATCCCTTATGTTAGGGAGATTCTTTATGCTCTTAATATTCCAATTATTGAGTTTGAAGGTTATGA
AGCGGATGATGTAATTGGCTCGCTTGTAAACATGTTCAAAAATACTGATTTAGATATTGTTATTATTACTGGTGATAAGG
ATACTCTTCAGTTGTTAGATAAAAACGTAGTTGTGAAGATTGTTTCAACAAAATTTGATAAAACAACAGAAGATTTATAT
ACTGCAGAGAATGTAAAAGAAAAATATGGAGTTTGGGCAAATCAAGTACCGGATTATAAAGCGCTTGTTGGAGATCAATC
AGATAACATTCCAGGGGTAAAAGGAATTGGGGCAAAGAGTGCTCAAAAACTTTTGGAGGAGTATTCCTCTTTGGAAGAGA
TATATCAAAATCTAAATAACATCAAAGGTTCTATACGTGAAAAATTAGAGGCTAGCAAAGACATTGCGTTTTTATCCAAG
CGTTTAGCGACAATTGTATGTGATTTACCATTAAATATTAAACTTGAGGATTTAAGAACAAGAGAGTGGAACAAGAAAAG
ATTATATGAAATTTTACTCCAATTAGAATTCAAGAGTATAATAAAACGGTTAGGGCTGTCAGAAGAGGTTCAAGTTGAGT
TTGTTCAACAGCTAACTAATATTCACGATGTAGAGCAAAAAAAGCTTGAAGGAATATCACAGATAAGATCAAAAGATATC
TCATTAATGTTTGTGCCAGAAGAAAAATGTTTTTATTTATACGACCAAGAAAGTAATACTGTGTTTGTAACAGAAGAAAG
ACATTTAGTAGAGGAGATTTTAAAAAGTGAATCTGTAAAAATTGTATATGATTTGAAAAATATATTTCATGAACTCTACT
TAGAGAACACAGATAATATCAGAAATTGTGAGGATGTAATGATTGCTTCTTATGTTCTTGACAGCACAAGAAGTTCATAT
GAATTGGAAACATTGTTTATATCCTACTTAAATACCGACTTAGCAGCTGTGAAAAAGGATAAAAAAGTAACATCGGTAAT
ACTTTTGAAACGATTATGGGACGAACTTTCAAGCTTAATTGATTTAAATTCATGCCAGTTTGTGTATACAAATATAGAAC
GTCCTCTTATTCCTATTCTATACGAGATGGAAAAAGCAGGATTTAAGGTAGACAGAGATGCACTACTTCAGTATACTAAG
GAGATTGAAAGCAAAATATTAAATCTTGAGAAGCAGATATATCAAATTGCAGGTGAGTGGTTTAATATAAATTCACCCAA
ACAACTTTCTTACATTTTGTTTGAGAAATTAAAACTTCCTGTAATAAAAAAGACAAAAACAGGATATTCCACTGATGCTG
AGGTTTTAGAAGAGCTTTTTGACAAACATGAAATAGTTCCTCTTATTTTGGATTACAGGATGTATACCAAGATACTAACA
ACCTATTGTCAAGGATTACTCCAGGCAATAAATCCTTCTTCAGGCAGAGTCCATACAACATTTATCCAAACAGGCACAGC
GACAGGAAGACTTGCAAGTAGTGATCCTAATTTACAAAATATACCTGTGAAATACGATGAAGGAAGATTAATAAGAAAGG
TTTTTATACCTGAAGAAGGGCATGTCCTGATTGATGCGGATTATTCACAAATTGAACTGAGAATACTTGCTCATATTTCT
GAAGATGAGAGACTTATAAATGCCTTCAAAAATAACGTTGACATCCATTCGCAGACAGCAGCTGAGGTTTTTGGTGTAGA
TATAGACGATGTTACCCCAGAGATGAGAAGTCAAGCTAAAGCAGTAAATTTTGGTATCGTTTATGGGATTTCTGATTATG
GACTTGCAAGGGATATTAAGATTTCCAGGAAAGAAGCTGCAGAGTTTATAAACAGGTATTTTGAACGTTATCCTAAAGTT
AAAGAGTATTTAGATAATATTGTCAAATTTGCTCGTGATAACGGATATGTTTTAACCCTATTCAACAGAAGGAGATATGT
AAAGGACATAAAATCTGCAAACAGGAATGCAAGAAACTATGCCGAAAGGATTGCAATGAATTCGCCAATTCAGGGCAGCG
CTGCTGATATCATGAAATTGGCAATGATTAAAGTGTATCAAAAGCTCAAGGAGAATAATCTTAAATCAAAAATAATTTTG
CAGGTACACGATGAGCTTTTAATTGAAGCTCCATACGAAGAAAAGGATATAGTAAAAGAAATAGTAAAAAGAGAAATGGA
AAATGCAGTAGCTTTAAAAGTGCCTCTGGTAGTTGAGGTGAAAGAAGGACTGAACTGGTATGAGACAAAATAA

Protein sequence :
MKLVIFDGNSILYRAFFALPELTTSSNIPTNAIYGFINVILKYLEQENPDYVAVAFDKKGRQARKSEYEEYKANRKPMPD
NLQVQIPYVREILYALNIPIIEFEGYEADDVIGSLVNMFKNTDLDIVIITGDKDTLQLLDKNVVVKIVSTKFDKTTEDLY
TAENVKEKYGVWANQVPDYKALVGDQSDNIPGVKGIGAKSAQKLLEEYSSLEEIYQNLNNIKGSIREKLEASKDIAFLSK
RLATIVCDLPLNIKLEDLRTREWNKKRLYEILLQLEFKSIIKRLGLSEEVQVEFVQQLTNIHDVEQKKLEGISQIRSKDI
SLMFVPEEKCFYLYDQESNTVFVTEERHLVEEILKSESVKIVYDLKNIFHELYLENTDNIRNCEDVMIASYVLDSTRSSY
ELETLFISYLNTDLAAVKKDKKVTSVILLKRLWDELSSLIDLNSCQFVYTNIERPLIPILYEMEKAGFKVDRDALLQYTK
EIESKILNLEKQIYQIAGEWFNINSPKQLSYILFEKLKLPVIKKTKTGYSTDAEVLEELFDKHEIVPLILDYRMYTKILT
TYCQGLLQAINPSSGRVHTTFIQTGTATGRLASSDPNLQNIPVKYDEGRLIRKVFIPEEGHVLIDADYSQIELRILAHIS
EDERLINAFKNNVDIHSQTAAEVFGVDIDDVTPEMRSQAKAVNFGIVYGISDYGLARDIKISRKEAAEFINRYFERYPKV
KEYLDNIVKFARDNGYVLTLFNRRRYVKDIKSANRNARNYAERIAMNSPIQGSAADIMKLAMIKVYQKLKENNLKSKIIL
QVHDELLIEAPYEEKDIVKEIVKREMENAVALKVPLVVEVKEGLNWYETK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
polA YP_281523.1 DNA polymerase I Not tested Not named Protein 2e-132 42