Gene Information

Name : GY4MC1_1998 (GY4MC1_1998)
Accession : YP_003989353.1
Strain : Geobacillus sp. Y4.1MC1
Genome accession: NC_014650
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : S : Function unknown
COG ID : COG3002
EC number : -
Position : 1973408 - 1976044 bp
Length : 2637 bp
Strand : -
Note : PFAM: protein of unknown function DUF2309; KEGG: gwc:GWCH70_1610 hypothetical protein

DNA sequence :
ATGAGCACGACTATCATATCGCCCGAACGCCCGAATGCGCGCGAGCAACGGACAGGTGATGCCGCGGCAGCTATCAATGT
GGCCGAGCTTGTCCAAAATGCCAGCAAGGCAATCGCTCCGCTTTGGCCGATTGCCACGTTTATCGCCCGCCATCCGTGGA
TGGGGCTGGAACATCTTCCGTTTGAGCAAGTGGCCCGCCGTTTAAAGTCACTGAAAGATATTGACATTTATCCAAGTATG
TCCATGTTACGGGCGGCCCAGCGGAAAGGGGAGTTGAATCCTAAGTTTTTGGAAATGCGGCTGCAGCGCTGGCTTGACGA
GCAGCAGCTGGCGCTGCCGCGCGAGGAAGCGGAACGTTTTTGCCGCGCCGCGTTGCAGCATGAAGAAATTCCCAATAAGT
TGTTAACGTCTTCAAGGCTAAAAAGCTTGGCTGCAAAAATGAAAGACATACAATTGCATGTCGATGGCGAACGTTTACCC
ATCCGGCCGATAAGCCTTCTTTTTGAAGAGCAAGGTGAAGGAAAATGGGCGCGTCTGCTTGACCATCATATGATTAAATG
GTGCAAATTATTTTTAGATGAATCGCAAGCTTCGTGGTCACTGCCATATCGGGAAAAAGGGTTTTACTGCGCATGGCGGA
AACTGGTGACCAACGACCCTGCTTTAAACAAAGAACAACGTGAGCGATTGAAAGATTTGCCGCAAGATGCTGAAGAGGCA
TTGCGGCAAGCGTTAATCATGCTCGGCATACCGCATGGCGCGATGAAAGACTACTTGGAAGCGCATCTTCTTTCCTTGCC
AGGATGGGCAGGAATGTTACAATGGCGTTCGCAGATGTCCGGCCAAGCGCATTTGCTGCTTGTGGATTATTTAGCTATCC
GCCTTTCGCTGGAATGGGCGTTGATTGCGCCGTATTTGCCGTTTGCAAAACAAAAAAAGGACGATGAAGCGTTTCTTCTT
CCGCTTCTCGCGGCTTGGATGCACTGGGGAGGGTTGACGCCGGAAGAATGGCTGCGGTTGCCGCGGGATGCACAACAAGC
GCGGTTATTTCTGGCTTATCGCTTTGATAAAATCGTTCGCAGCAAACTTTGGCTCGAAGCATGGGAAGATACGCAAGAGG
CACAGCTAAAGGAAAAAATAGCGTCTCATTCTCTTAACAGCGAGCAAAAACAAGCAATCGCCCAGCTTATCTTTTGTATC
GATGTTCGTTCCGAACCTTTCCGGCGCCATTTAGAGCAGGCAGGACCGTTTGAAACATACGGGTGCGCTGGTTTTTTCGG
CCTTCCCATCAAGACGCGCGAACTGGATAGCGATTATGCGCACGCTTCTTGCCCAGCCATCGTAGAGCCGCTTCATGAAG
TCCGTGAATATGCATCAGCGGCAACTGTCAAGGAATATCGCGGCCGCCGCAACGTGCGGCTTTCGCTTGGCTATATGTTT
AAAAAAATGAAACAGCATTTGTTTGCTAGTTTGTTGCTTCCAGAAGTGAGCGGACCGTGGCTCGGTTTGCACACACTGGC
CTGGAACATAGCGCCTAGCGGAACGGGCCGCGCATTTCGGCAGTTTCCAGATAATTGGGTGCAAAAGCCGGAAACCGAAC
TTTCGCTTGATCGGGAATCTCCTTTGGAAGCCGCGGATCTTCCTGTAGGTTTTTCCACAGAGGAAAAAGTACAGTATGTT
TATCGCCTTTTAAAAGGAATAGGGCTTACCAGCCGCTTTGCGCCACTTGTCGTTGTCTGCGGCCATGAAAGTGAAACGGC
GAACAATCCTTATGCCTCGTCGCTTGATTGCGGTGCGTGCGGCGGAGCAGCGGGAGGATTCAACGCCCGGGTGTTCGCCG
CCCTTTGCAACTTGAAAGAAGTTCGCAAAGGCCTTGCGGAAAAAGGCATGGTTATTCCCGAGGACACTGTTTTTATCGCT
GCCGAGCATATAACAACGGTTGATGAACTTCGCTGGCTTTATGTGCCGACACTTTCGGAAGCGGCACAAAAAGCGTTTGA
GATGTTGCAAGGCAAGCTAAAAGAAGTAAGCCGCAACGCAAATCATGAGCGGCTGGCGAAACTGCCGGGATTGGTGCGGA
AAAAGCAAGATCCGCTTGCCGAGGCGCGCCGCCGCGCGGCAGACTGGAGCGAAATCCGCCCGGAATGGGGGCTTGCGGGG
AATGCCGCGTTTATTATCGGCCGCCGTCAGCTGACACAGCACTGCAATTTTGAAGGAAAAGTATTTTTGCATAGCTATGA
CTGGCGCGAGGATCCGTCCGCAGAGTCGCTTGCAAACATTATTGCTGGTCCGGTGACTGTTGCGCAATGGATTAACTTGC
AATATTATGCATCAACCGTCGTTCCGCATTACTACGGAAGCGGCAGCAAAACAACGCAAACGGTCACTGCGGGGATCGGA
GTCATGCAAGGAAATGCGAGCGACTTGCTCACCGGGCTTCCGTGGCAGTCGGTCATGTCGTCTGATTTTGAAATGTTCCA
TTCTCCGCTTCGTTTGCTCGTCATCATCGAGGCGCCGCGGCAATATATCAAACGTTTGCTCGAAGACGATCCGCATTTCC
GGCAAAAGGTGCAAAACGGATGGCTCCGTTTAGCTTCCATTGACCCGGACAGCGGGCAATGGGAAAAATGGTCATAA

Protein sequence :
MSTTIISPERPNAREQRTGDAAAAINVAELVQNASKAIAPLWPIATFIARHPWMGLEHLPFEQVARRLKSLKDIDIYPSM
SMLRAAQRKGELNPKFLEMRLQRWLDEQQLALPREEAERFCRAALQHEEIPNKLLTSSRLKSLAAKMKDIQLHVDGERLP
IRPISLLFEEQGEGKWARLLDHHMIKWCKLFLDESQASWSLPYREKGFYCAWRKLVTNDPALNKEQRERLKDLPQDAEEA
LRQALIMLGIPHGAMKDYLEAHLLSLPGWAGMLQWRSQMSGQAHLLLVDYLAIRLSLEWALIAPYLPFAKQKKDDEAFLL
PLLAAWMHWGGLTPEEWLRLPRDAQQARLFLAYRFDKIVRSKLWLEAWEDTQEAQLKEKIASHSLNSEQKQAIAQLIFCI
DVRSEPFRRHLEQAGPFETYGCAGFFGLPIKTRELDSDYAHASCPAIVEPLHEVREYASAATVKEYRGRRNVRLSLGYMF
KKMKQHLFASLLLPEVSGPWLGLHTLAWNIAPSGTGRAFRQFPDNWVQKPETELSLDRESPLEAADLPVGFSTEEKVQYV
YRLLKGIGLTSRFAPLVVVCGHESETANNPYASSLDCGACGGAAGGFNARVFAALCNLKEVRKGLAEKGMVIPEDTVFIA
AEHITTVDELRWLYVPTLSEAAQKAFEMLQGKLKEVSRNANHERLAKLPGLVRKKQDPLAEARRRAADWSEIRPEWGLAG
NAAFIIGRRQLTQHCNFEGKVFLHSYDWREDPSAESLANIIAGPVTVAQWINLQYYASTVVPHYYGSGSKTTQTVTAGIG
VMQGNASDLLTGLPWQSVMSSDFEMFHSPLRLLVIIEAPRQYIKRLLEDDPHFRQKVQNGWLRLASIDPDSGQWEKWS

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
SAR0453 YP_039901.1 hypothetical protein Not tested vSa¥á Protein 7e-165 43
SAS0411 YP_042537.1 hypothetical protein Not tested vSa¥á Protein 9e-166 43