Gene Information

Name : Geoth_2088 (Geoth_2088)
Accession : YP_004588104.1
Strain : Geobacillus thermoglucosidasius C56-YS93
Genome accession: NC_015660
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3002
EC number : -
Position : 2010969 - 2013605 bp
Length : 2637 bp
Strand : -
Note : KEGG: gmc:GY4MC1_1998 protein of unknown function DUF2309; HAMAP: UPF0753 protein; PFAM: Protein of unknown function DUF2309

DNA sequence :
ATGAGCACGACTATCATATCGCCCGAACGCCCGAATGCGCGCGAGCAACGGACAGGAGATGCCGCGGCAGCTATCAATGT
GGCCGAGCTTGTCCAAAATGCCAGCAAGGCAATCGCTCCGCTTTGGCCGATTGCCACGTTTATCGCCCGCCATCCGTGGA
TGGGGCTGGAACATCTTCCGTTTGAGCAAGTGGCCCGTCGTTTAAAGTCACTGAAAGATATTGACATTTATCCAAGTATG
TCCATGTTGCGAGCGGCCCAGCGGAAAGGGGAGTTGAATCCTAAGTTTTTGGAAATGCGGCTGCAGCGCTGGCTTGACGA
GCAGCTGCTGGCGCTGCCGCGCGAGGAAGCGGAACGTTTTTGCCGCGCCGCGTTGCAGCATGAAGAAATTCCCAATAAGT
TGTTAACGTCTTCAAGGCTAAAAAGCTTGGCTGCAAAAATGAAAGACATACAATTGCATGTCGATGGCGAACGTTTACCC
ATCCGGCCGATAAGCCTTCTTCTTGAAGAGCAAGGTGAAGGAAAATGGGCGCGTCTGCTTGACCATCATATGATTAAATG
GTGCAAATTATTTTTAGATGAATCGCAAGCTTCGTGGTCACTGCCATATCGGGAAAAAGGGTTTTACTGCGCATGGCGGA
AACTGGTGACCAACGACCCAGCTTTAAACAAAGAACAACGTGAGCGATTGAAAGATTTGCCGCAAAATGCTGAAGAGGCA
TTGCGGCAAGCGTTAATCATGCTCGGCATACCGCATGGCGCGATGAAAGACTACTTGGAAGCGCATCTTCTTTCCTTGCC
AGGATGGGCAGGAATGTTACAATGGCGTTCGCAGACGTCCGGCCAAGCGCATTTGCTGCTTGTGGATTATTTAGCTATCC
GCCTTTCGCTGGAATGGGCGTTGATTGCGCCGTATTTGCCGTTTGCAAAACAAAAAAAGGACGATGAAGCGTTTCTTCTT
CCGCTTCTCGCGGCTTGGATGCACTGGGGAGGGTTGACGCCGGAAGAATGGCTGCGGTTGCCGCAGGATGCACAACAAGC
GCGGTTATTTCTGGCTTATCGCTTTGATAAAATCGTTCGCAGCAAACTTTGGCTCGAAGCATGGGAAGATACGCAAGAGG
CACAGCTAAAGGAAAAAATAGCGTCTCATTCTCTTAACAGCGAGCAAAAACAAGCAATCGCCCAGTTTATCTTTTGTATC
GATGTTCGTTCCGAACCTTTCCGGCGCCATTTAGAGCAGGCAGGACCGTTTGAAACATACGGGTGCGCTGGTTTTTTCGG
CCTTCCCATCAAGACGCGCGAACTGGATAGCGATTATGCGCACGCTTCTTGCCCAGCCATCGTAGAGCCGCTTCATGAAG
TCCGTGAATATGCATCAGCGGCAACTGTCAAGGAATATCGCGGCCGCCGCAACGTGCGGCTTTCGCTTGGCTATATGTTT
AAAAAAATGAAACAGCATTTGTTTGCTAGTTTGTTGCTTCCGGAAGTGAGCGGACCGTGGCTCGGTTTGCACACACTGGC
CTGGAACATAGCGCCTAGCGGAGCGGGCCGCGCATTTCGGCAGTTTCAAGATAATTGGGCGCAAAAGCCGGAAACCGAAC
TTTCGCTTGATCGGGAATCTCCTTTGGAAGCAGCGGATCTTCCTGTAGGTTTTTCCACAGAGGAAAAAGTACAGTATGTT
TATCGCCTTTTAAAAGGAATAGGGCTTACCAGCCGCTTTGCGCCACTTGTCGTTGTCTGCGGCCATGAAAGTGAAACGGC
GAACAATCCTTATGCCTCGTCGCTTGATTGCGGTGCGTGCGGCGGAGCGGCGGGAGGATTCAACGCCCGGGTGTTCGCCG
CCCTTTGCAACTTGAAAGAAGTTCGCAAAGGCCTTGCGGAAAAAGGCATGGTTATTCCCGAGGACACTGTTTTTATCGCT
GCCGAGCATATAACAACGGTTGATGAACTTCGCTGGCTTTATGTGCCGACGCTTTCGGAAACGGCACAAAAAGCGTTTGA
GATGTTGCAAAGCAAGCTAAAAGAAGTAAGCCGCAACGCAAATCATGAGCGGCTGGCGAAACTGCCGGGATTGGTGCGGA
AAAAGCAAGATCCGCTTGCCGAGGCGCGCCGCCGCGCGGCAGACTGGAGCGAAATCCGCCCGGAATGGGGGCTTGCAGGG
AATGCCGCGTTTATTATCGGCCGCCGTCAGCTGACACAGCACTGCAATTTTGAAGGAAAAGTATTTTTGCATAGCTATGA
CTGGCGCGAGGATCCGTCCGCAGAGTCGCTTGCAAACATTATTGCTGGTCCGGTGACTGTTGCGCAATGGATTAACTTGC
AATATTATGCATCAACCGTCGTTCCGCATTACTACGGAAGCGGCAGCAAAACAACGCAAACGGTCACTGCGGGGATCGGA
GTCATGCAAGGAAATGCGAGCGACTTGCTCACCGGGCTTCCGTGGCAGTCGGTCATGTCGTCTGATTTTGAAATGTTCCA
TTCTCCGCTTCGTTTGCTCGTCATCATCGAAGCGCCACGGCAATATATCAAACGTTTGCTCGAAGACGATCCGCATTTCC
GGCAAAAGGTGCAAAACGGATGGCTCCGTTTAGCTTCCATTGACCCGGACAGCGGGCAATGGGAAAAATGGTCATAA

Protein sequence :
MSTTIISPERPNAREQRTGDAAAAINVAELVQNASKAIAPLWPIATFIARHPWMGLEHLPFEQVARRLKSLKDIDIYPSM
SMLRAAQRKGELNPKFLEMRLQRWLDEQLLALPREEAERFCRAALQHEEIPNKLLTSSRLKSLAAKMKDIQLHVDGERLP
IRPISLLLEEQGEGKWARLLDHHMIKWCKLFLDESQASWSLPYREKGFYCAWRKLVTNDPALNKEQRERLKDLPQNAEEA
LRQALIMLGIPHGAMKDYLEAHLLSLPGWAGMLQWRSQTSGQAHLLLVDYLAIRLSLEWALIAPYLPFAKQKKDDEAFLL
PLLAAWMHWGGLTPEEWLRLPQDAQQARLFLAYRFDKIVRSKLWLEAWEDTQEAQLKEKIASHSLNSEQKQAIAQFIFCI
DVRSEPFRRHLEQAGPFETYGCAGFFGLPIKTRELDSDYAHASCPAIVEPLHEVREYASAATVKEYRGRRNVRLSLGYMF
KKMKQHLFASLLLPEVSGPWLGLHTLAWNIAPSGAGRAFRQFQDNWAQKPETELSLDRESPLEAADLPVGFSTEEKVQYV
YRLLKGIGLTSRFAPLVVVCGHESETANNPYASSLDCGACGGAAGGFNARVFAALCNLKEVRKGLAEKGMVIPEDTVFIA
AEHITTVDELRWLYVPTLSETAQKAFEMLQSKLKEVSRNANHERLAKLPGLVRKKQDPLAEARRRAADWSEIRPEWGLAG
NAAFIIGRRQLTQHCNFEGKVFLHSYDWREDPSAESLANIIAGPVTVAQWINLQYYASTVVPHYYGSGSKTTQTVTAGIG
VMQGNASDLLTGLPWQSVMSSDFEMFHSPLRLLVIIEAPRQYIKRLLEDDPHFRQKVQNGWLRLASIDPDSGQWEKWS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
SAS0411 YP_042537.1 hypothetical protein Not tested vSa¥á Protein 7e-165 43
SAR0453 YP_039901.1 hypothetical protein Not tested vSa¥á Protein 2e-164 43