Name : Theco_3474 (Theco_3474) Accession : YP_007214513.1 Strain : Thermobacillus composti KWC4 Genome accession: NC_019897 Putative virulence/resistance : Unknown Product : hypothetical protein Function : - COG functional category : - COG ID : - EC number : - Position : 3699595 - 3702330 bp Length : 2736 bp Strand : + Note : PFAM: Uncharacterized protein conserved in bacteria (DUF2309) DNA sequence : ATGAATCGCTCCGCGATGTTGTCCGAACGGGCTGTGGATCATGTGGAAAACCGTGTGGAAAATAATGAAAAAAAATGCGA TCGGGATTTTTCCGGCAACCTGCCGGTGGATGAGATGGTGGGAAGGGCGAGCCGGGTCATCGCTCCGCTGTGGCCGCTTC GCACGTTTGTCGCTGTCCATCCCTGGGCGGGACTGGAGCATCTGACGTTTGAGGAAGTGGCGGAACGATTTCGGGATTCC CGGGGTTTGGACCTGTTTCCCCCGATGGCGATGTTTCATGAAGCGCTGCGAAAAGGGGAGTTAAGTCCACGTAAGCTCGA GGAACGTTTCGGGAAGTGGCTGAATGAGGAGGCGCATTCCATTCCCCGCAAAGAAGCGGAACGGCTGTGCTGCGGACTGC TGTGGCACGAAAACGTTCCCCGCGAGGTCATGAATTCGTTAGAGATGAAACGTCTGGCCGTCCGGATGAAACACGCCAAA ACGCTGCCGGATCGCCTTCCCGTCCGTTGTGTCCGAACCAAAAGCGCGATATTGGAAGCGCAAGGAGAACCCCGCTTTGC GCGGGCGCTGGACCGCCATATGATCAAATGGTGCAAGCTGTTTCTGGACGAAGGGCAAGCCGCGTGGGGGATGCCGTTTC GGGAATCGGGATTCTACTGCACCTGGCGGAAACTGGCGTGCCACGATCCGTCCCTGAGCCGGGCGGAGCGCGGGCGGATC AAACAAACGCCGGCAAACGCGGAAGACGCGTTGAAGCAGGCGCTGTTTAACCTGAATGTTCCGCGGCCGGACATGGAGCG TTATCTGGAAGAACACCTGATCGCGCTGCCGGGCTGGGCGGGGATGTTGCTGTGGCGCTCGCAAAAATCCGGACAGGAAT ACCGCTTGCTGACGGAATATTTGGCGATTCGTCTGTCGACGGAATGGGCGCTGGTCGCCCCCCGTCTTCCCCTTCAGGAG ACCGGTGATGCCGACCATGACGCGGAACTCTTACCCTGGATTGGTGCATGGTTGCATTGGGGAGGATGGACGTGCGAACA ATGGTTCGGGATGTCGCCTGAAGAGCGGTTTATCCGGCTGGATTTCGCCCGCCGTTTCGCCTGGATGGTTCGTCCCAGAC TGTGGCTGGAAGCCTGGGAGGATACACAAGAGGAGAAGTTGCGGGAAACGATATCGGCAACGCAAACAAACGGCCACGCC CGACGGGCGGCCGCGCAGCTGCTCTTCTGCATCGACGTCCGTTCCGAACCTTTCCGCCGGCATCTCGAACGGGAAGGGCC GTTTGAAACATTCGGCTGTGCGGGATTTTTCGGCCTTCCCATCCGGACGCGCCTGCCGGACGGGCACGTCCATGCGGCCT GTCCGGCCATTGTCGAACCCCGCCATGAGGTACGGGAACAGCTTCCATCCGCCGGCAGCCAAACGTATCTTTGGGCGGAA TCCGCGAAACTTTCCGTCGCCCGCGTATTTAAAAAGATGAAACAGGGGCTTGTCACCAGCCTGTTGTTGCCGGAAATGAG 
CGGACCGTGGCTGGGATTGTACATGCTTATGCGGAATGCGGCTCCTGAACGCGTCATCGGTTCCATCCATCGTTGGCGGA AACGGTCTGCCGGTAAAATGAAGACGCATCTGACCCTGGATCTCGAGAGTACTTGCGGGGACTCCGGTTTGCCGACAGGA TTTTCCCTGGAAGAGAAAGTGAACTATGTGGGCAGCTTGTTGAGAAGCATCGGCCTGACGTCCGCATTTTCGCCTCTTGT GGTCGTTTGCGGCCACAAAAGCGAGACGGCGAACAACCCTTACGCATCGGCTCTGGATTGCGGCGCTTGCGGCGGAGCCG CCGGAGGGTTGAACGCCCGGGTATTCGCGGAACTCTGCAACCGGGAAGAGGTGCGGAGGGCTCTGGCCGGGCAAGGCATC GTCATCCCGGAAGAGACGGTTTTTATCGCCGCGGAACATAGCACCACTGTCCATGAGTTGCGCTGGCTGCATGTGCCGGA ACTTTCCCCGGCGGCCCGGGACGCTTTTGCCCTGCTTCAGGACAGGCTCCGGGCGGTAACCCGCAAGGTCAATCTGGAGC AACTGGCCAAATTGCCGGGTGCTGGCGCCGCTGGACGCGATCCGGTTGCCGAGGCGCACCGGCGTGCGGCAGACTGGAGT GAAGTTCGTCCGGAATGGGGGCTGGCGGGGAATTACGCGTTTGTCATCGGCAGGCGACACTTGACCGAATCGTGCAATCT GGAAGGAAGGGTATTTTTGCACAGTTACGACTGGCGGGAAGACCCGGACGGAACTTTGCTGATGAATATTGCGGCCGGTC CGGTGACCGTCGCCCAGTGGATTAACCTGCAATATTACGCTTCGACGGTCGCCCCCCATATCTACGGGAGCGGCAACAAG GCCACCCAGACCGTCACGGCGGGGATCGGTGTGATGCAGGGCAACGGAAGCGATTTGCTTGCGGGGTTGCCGTGGCAGTC GGTCATGGCAAGCGACAGGGAATGGTTTCATTCCCCGCTTCGCTTGCTCGTGGTCATCGAAGCCCCGCGGCCGTATATGG TGAAGCTGCTCGAAGGGAATCCCGAATTCCGCAGAAAAGTGAGCAACGGATGGTTGCGCCTTGTGTCCGTCGACCCGGTG AGCGGAATTTGGGAAAGATGGACGCCACGAAGCCTTGGCCCGGCACTGACGTTGGACGACGAGGCAGCAGGTTGTGCAGG TTGTGCGTGTGATTGA Protein sequence : MNRSAMLSERAVDHVENRVENNEKKCDRDFSGNLPVDEMVGRASRVIAPLWPLRTFVAVHPWAGLEHLTFEEVAERFRDS RGLDLFPPMAMFHEALRKGELSPRKLEERFGKWLNEEAHSIPRKEAERLCCGLLWHENVPREVMNSLEMKRLAVRMKHAK TLPDRLPVRCVRTKSAILEAQGEPRFARALDRHMIKWCKLFLDEGQAAWGMPFRESGFYCTWRKLACHDPSLSRAERGRI KQTPANAEDALKQALFNLNVPRPDMERYLEEHLIALPGWAGMLLWRSQKSGQEYRLLTEYLAIRLSTEWALVAPRLPLQE TGDADHDAELLPWIGAWLHWGGWTCEQWFGMSPEERFIRLDFARRFAWMVRPRLWLEAWEDTQEEKLRETISATQTNGHA RRAAAQLLFCIDVRSEPFRRHLEREGPFETFGCAGFFGLPIRTRLPDGHVHAACPAIVEPRHEVREQLPSAGSQTYLWAE SAKLSVARVFKKMKQGLVTSLLLPEMSGPWLGLYMLMRNAAPERVIGSIHRWRKRSAGKMKTHLTLDLESTCGDSGLPTG FSLEEKVNYVGSLLRSIGLTSAFSPLVVVCGHKSETANNPYASALDCGACGGAAGGLNARVFAELCNREEVRRALAGQGI VIPEETVFIAAEHSTTVHELRWLHVPELSPAARDAFALLQDRLRAVTRKVNLEQLAKLPGAGAAGRDPVAEAHRRAADWS 
EVRPEWGLAGNYAFVIGRRHLTESCNLEGRVFLHSYDWREDPDGTLLMNIAAGPVTVAQWINLQYYASTVAPHIYGSGNK ATQTVTAGIGVMQGNGSDLLAGLPWQSVMASDREWFHSPLRLLVVIEAPRPYMVKLLEGNPEFRRKVSNGWLRLVSVDPV SGIWERWTPRSLGPALTLDDEAAGCAGCACD |
Gene | GenBank Accession | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%) |
SAR0453 | YP_039901.1 | hypothetical protein | Not tested | vSaα | Protein | 4e-146 | 42 |
SAS0411 | YP_042537.1 | hypothetical protein | Not tested | vSaα | Protein | 4e-147 | 42 |
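
The Position and Length fields of the record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the final codon is a stop codon, as the terminal TGA in the DNA sequence suggests. The function names are hypothetical and not part of any database API.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count implied by a coding sequence that ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return length_bp // 3 - 1


# Coordinates taken from the Position field of the record.
length_bp = gene_length(3699595, 3702330)
protein_len = expected_protein_length(length_bp)
print(length_bp, protein_len)
```

With the coordinates from the record, the computed length matches the listed 2736 bp, which corresponds to 912 codons, i.e. 911 residues plus a stop codon.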