Gene Information

Name : Theco_3474 (Theco_3474)
Accession : YP_007214513.1
Strain : Thermobacillus composti KWC4
Genome accession : NC_019897
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3699595 - 3702330 bp
Length : 2736 bp
Strand : +
Note : PFAM: Uncharacterized protein conserved in bacteria (DUF2309)
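
The Position and Length fields above agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal sketch of the arithmetic, where `span_length` is a hypothetical helper name and not part of any database tool:

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a coordinate range, assuming 1-based inclusive ends."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


# Coordinates copied from the Position field of this record.
start, end = 3699595, 3702330

length_bp = span_length(start, end)   # should match the Length field
codons = length_bp // 3               # full codons, including the stop codon
residues = codons - 1                 # expected protein length without the stop

print(length_bp, codons, residues)    # prints: 2736 912 911
```

The result of 911 residues matches the length of the protein sequence listed in this record.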

DNA sequence :
ATGAATCGCTCCGCGATGTTGTCCGAACGGGCTGTGGATCATGTGGAAAACCGTGTGGAAAATAATGAAAAAAAATGCGA
TCGGGATTTTTCCGGCAACCTGCCGGTGGATGAGATGGTGGGAAGGGCGAGCCGGGTCATCGCTCCGCTGTGGCCGCTTC
GCACGTTTGTCGCTGTCCATCCCTGGGCGGGACTGGAGCATCTGACGTTTGAGGAAGTGGCGGAACGATTTCGGGATTCC
CGGGGTTTGGACCTGTTTCCCCCGATGGCGATGTTTCATGAAGCGCTGCGAAAAGGGGAGTTAAGTCCACGTAAGCTCGA
GGAACGTTTCGGGAAGTGGCTGAATGAGGAGGCGCATTCCATTCCCCGCAAAGAAGCGGAACGGCTGTGCTGCGGACTGC
TGTGGCACGAAAACGTTCCCCGCGAGGTCATGAATTCGTTAGAGATGAAACGTCTGGCCGTCCGGATGAAACACGCCAAA
ACGCTGCCGGATCGCCTTCCCGTCCGTTGTGTCCGAACCAAAAGCGCGATATTGGAAGCGCAAGGAGAACCCCGCTTTGC
GCGGGCGCTGGACCGCCATATGATCAAATGGTGCAAGCTGTTTCTGGACGAAGGGCAAGCCGCGTGGGGGATGCCGTTTC
GGGAATCGGGATTCTACTGCACCTGGCGGAAACTGGCGTGCCACGATCCGTCCCTGAGCCGGGCGGAGCGCGGGCGGATC
AAACAAACGCCGGCAAACGCGGAAGACGCGTTGAAGCAGGCGCTGTTTAACCTGAATGTTCCGCGGCCGGACATGGAGCG
TTATCTGGAAGAACACCTGATCGCGCTGCCGGGCTGGGCGGGGATGTTGCTGTGGCGCTCGCAAAAATCCGGACAGGAAT
ACCGCTTGCTGACGGAATATTTGGCGATTCGTCTGTCGACGGAATGGGCGCTGGTCGCCCCCCGTCTTCCCCTTCAGGAG
ACCGGTGATGCCGACCATGACGCGGAACTCTTACCCTGGATTGGTGCATGGTTGCATTGGGGAGGATGGACGTGCGAACA
ATGGTTCGGGATGTCGCCTGAAGAGCGGTTTATCCGGCTGGATTTCGCCCGCCGTTTCGCCTGGATGGTTCGTCCCAGAC
TGTGGCTGGAAGCCTGGGAGGATACACAAGAGGAGAAGTTGCGGGAAACGATATCGGCAACGCAAACAAACGGCCACGCC
CGACGGGCGGCCGCGCAGCTGCTCTTCTGCATCGACGTCCGTTCCGAACCTTTCCGCCGGCATCTCGAACGGGAAGGGCC
GTTTGAAACATTCGGCTGTGCGGGATTTTTCGGCCTTCCCATCCGGACGCGCCTGCCGGACGGGCACGTCCATGCGGCCT
GTCCGGCCATTGTCGAACCCCGCCATGAGGTACGGGAACAGCTTCCATCCGCCGGCAGCCAAACGTATCTTTGGGCGGAA
TCCGCGAAACTTTCCGTCGCCCGCGTATTTAAAAAGATGAAACAGGGGCTTGTCACCAGCCTGTTGTTGCCGGAAATGAG
CGGACCGTGGCTGGGATTGTACATGCTTATGCGGAATGCGGCTCCTGAACGCGTCATCGGTTCCATCCATCGTTGGCGGA
AACGGTCTGCCGGTAAAATGAAGACGCATCTGACCCTGGATCTCGAGAGTACTTGCGGGGACTCCGGTTTGCCGACAGGA
TTTTCCCTGGAAGAGAAAGTGAACTATGTGGGCAGCTTGTTGAGAAGCATCGGCCTGACGTCCGCATTTTCGCCTCTTGT
GGTCGTTTGCGGCCACAAAAGCGAGACGGCGAACAACCCTTACGCATCGGCTCTGGATTGCGGCGCTTGCGGCGGAGCCG
CCGGAGGGTTGAACGCCCGGGTATTCGCGGAACTCTGCAACCGGGAAGAGGTGCGGAGGGCTCTGGCCGGGCAAGGCATC
GTCATCCCGGAAGAGACGGTTTTTATCGCCGCGGAACATAGCACCACTGTCCATGAGTTGCGCTGGCTGCATGTGCCGGA
ACTTTCCCCGGCGGCCCGGGACGCTTTTGCCCTGCTTCAGGACAGGCTCCGGGCGGTAACCCGCAAGGTCAATCTGGAGC
AACTGGCCAAATTGCCGGGTGCTGGCGCCGCTGGACGCGATCCGGTTGCCGAGGCGCACCGGCGTGCGGCAGACTGGAGT
GAAGTTCGTCCGGAATGGGGGCTGGCGGGGAATTACGCGTTTGTCATCGGCAGGCGACACTTGACCGAATCGTGCAATCT
GGAAGGAAGGGTATTTTTGCACAGTTACGACTGGCGGGAAGACCCGGACGGAACTTTGCTGATGAATATTGCGGCCGGTC
CGGTGACCGTCGCCCAGTGGATTAACCTGCAATATTACGCTTCGACGGTCGCCCCCCATATCTACGGGAGCGGCAACAAG
GCCACCCAGACCGTCACGGCGGGGATCGGTGTGATGCAGGGCAACGGAAGCGATTTGCTTGCGGGGTTGCCGTGGCAGTC
GGTCATGGCAAGCGACAGGGAATGGTTTCATTCCCCGCTTCGCTTGCTCGTGGTCATCGAAGCCCCGCGGCCGTATATGG
TGAAGCTGCTCGAAGGGAATCCCGAATTCCGCAGAAAAGTGAGCAACGGATGGTTGCGCCTTGTGTCCGTCGACCCGGTG
AGCGGAATTTGGGAAAGATGGACGCCACGAAGCCTTGGCCCGGCACTGACGTTGGACGACGAGGCAGCAGGTTGTGCAGG
TTGTGCGTGTGATTGA

Protein sequence :
MNRSAMLSERAVDHVENRVENNEKKCDRDFSGNLPVDEMVGRASRVIAPLWPLRTFVAVHPWAGLEHLTFEEVAERFRDS
RGLDLFPPMAMFHEALRKGELSPRKLEERFGKWLNEEAHSIPRKEAERLCCGLLWHENVPREVMNSLEMKRLAVRMKHAK
TLPDRLPVRCVRTKSAILEAQGEPRFARALDRHMIKWCKLFLDEGQAAWGMPFRESGFYCTWRKLACHDPSLSRAERGRI
KQTPANAEDALKQALFNLNVPRPDMERYLEEHLIALPGWAGMLLWRSQKSGQEYRLLTEYLAIRLSTEWALVAPRLPLQE
TGDADHDAELLPWIGAWLHWGGWTCEQWFGMSPEERFIRLDFARRFAWMVRPRLWLEAWEDTQEEKLRETISATQTNGHA
RRAAAQLLFCIDVRSEPFRRHLEREGPFETFGCAGFFGLPIRTRLPDGHVHAACPAIVEPRHEVREQLPSAGSQTYLWAE
SAKLSVARVFKKMKQGLVTSLLLPEMSGPWLGLYMLMRNAAPERVIGSIHRWRKRSAGKMKTHLTLDLESTCGDSGLPTG
FSLEEKVNYVGSLLRSIGLTSAFSPLVVVCGHKSETANNPYASALDCGACGGAAGGLNARVFAELCNREEVRRALAGQGI
VIPEETVFIAAEHSTTVHELRWLHVPELSPAARDAFALLQDRLRAVTRKVNLEQLAKLPGAGAAGRDPVAEAHRRAADWS
EVRPEWGLAGNYAFVIGRRHLTESCNLEGRVFLHSYDWREDPDGTLLMNIAAGPVTVAQWINLQYYASTVAPHIYGSGNK
ATQTVTAGIGVMQGNGSDLLAGLPWQSVMASDREWFHSPLRLLVVIEAPRPYMVKLLEGNPEFRRKVSNGWLRLVSVDPV
SGIWERWTPRSLGPALTLDDEAAGCAGCACD
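
The protein sequence above corresponds to an in-frame translation of the DNA sequence on the + strand. The sketch below checks this on two short excerpts copied from the record (the first 18 bases and the last 15 bases of the gene) instead of the full sequence. It assumes the standard codon assignments, which bacterial translation table 11 shares; `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Excerpts from the DNA sequence in this record.
first_18_bases = "ATGAATCGCTCCGCGATG"
last_15_bases = "TGTGCGTGTGATTGA"

print(translate(first_18_bases))  # prints: MNRSAM
print(translate(last_15_bases))   # prints: CACD*
```

The two outputs match the start (MNRSAM) and the end (CACD, followed by the stop codon) of the listed protein sequence.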

• Homologs from PAI DB

Gene      GenBank Accn   Product                Virulence or Resistance   PAI or REI   Alignment Type   E-value   Identity (%)
SAR0453   YP_039901.1    hypothetical protein   Not tested                vSaα         Protein          4e-146    42
SAS0411   YP_042537.1    hypothetical protein   Not tested                vSaα         Protein          4e-147    42