Name : Ethha_1801 (Ethha_1801)
Accession : YP_004092057.1
Strain : Ethanoligenens harbinense YUAN-3
Genome accession : NC_014828
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 1937331 - 1940342 bp
Length : 3012 bp
Strand : -
Note : KEGG: rsd:TGRD_039 type I restriction-modification system restriction subunit; PFAM: protein of unknown function DUF450; type III restriction protein res subunit; SMART: DEAD-like helicase

DNA sequence :
ATGGCAATTCAGACAAAAGAGCGCAACTTCGAACAAGAGATAGAATGGTGGCTCACTGAGGGTGCAGTCGAAGCCGACCG
CTACAAGAAGGGCAATCCCGTCGATTTTGACCGCAAACTTGCATTGGATAAAGGGGCGATTCTGGCGTTTATCAAAGACA
CTCAGCCTGACGAATGGCAGGGGCTTTGCAGGCGTCACGGCTCGGAAGTCTCTGCCGAAGCGGAATTCTTCAAACGTCTT
AATTCGGAACTGAACTCTCGCGGCATGATTGACGTGCTTCGCCACGGTGTGGTGGACCTTGGCATCTCCGTGCGGCTTGC
TTACTTTAAGCCGGGCAGCGGCATGAATCAGAGCTTAACGGCTCTCTACGCCAAGAACGTCCTGCAGATAACGCGGCAGG
TCAAGTACAGCCTGCAAAACGAGAACTCTATCGACACCGTTATCTTTCTGAACGGGCTGCCGATAATCACCATAGAGCTC
AAAAATCCGCTCACAGGGCAGACGTATCGAAATGCCATCACGCAGTATGAAAACGATCGTGACCCGCGTGAACTTCTGCT
TGCTTTCAAAAAGCGGGCTATTGTGCATTTTGCCGTGGACACCGAAGAAGTCTGGATGACCACTTGGCTTCGCAAACTTG
ATACCACCTTCATACCTTTCAATAAGGGAACCGAAGACCACGGAGCGGGCAATCCTGTCGCCGAGGGCGGTGATTATCGC
ACGGCGTACTTATGGAAAGAGATACTGCAAAGGGATAGCATTCTGGACATTCTGCACCGCTTCGTGCAGGTCTCGAAAGA
TGACAAGGGCAAGGAAAAATTGATATTTCCGCGTTACCACCAGTTGGACGCAGTCCGTAAATTGGTGGCGGATGCTTACG
CCAACGGGTCGGGCAAGAACTATCTGATTCAGCATTCGGCAGGTTCGGGCAAATCGAACTCCATCGCTTGGCTTGCTCAC
CACTTGGCGAACCTGCATGATATACATGACGAGGTGATATTCCACAGCATCATCGTCATTACTGATCGCCGTGTTCTCGA
CAAACAACTCCAGCGCGATATTTACAACATGGAGCATAAGCCGGGCGTGGTCGTCCTCGTGGATAAGAACTCAAAGCAGC
TAACCACCGCGCTGAACAACGGCGACAAAATCATCGTCTGCACCCTGCAGAAGTTCCCGTTTGTCGATGTGCAGAAGGTA
TCCACTACGGGCAAGCGGTTTGCTATTATCGTGGACGAGGCCCACTCATCTCAAACAGGCGACGCAAGCAAGCGTATGAA
AGAGATTCTGGCGGACATCTCTTTGCAGGGTGACGATGTCGTCGAAAAAAAACTGCATGAGTTTGCAGTAGAAGAAGCAA
AGGCCGAAGCCGAGGAAAAAGACCTTGACGAAGCTATCGCTGATGAAATGGCGGCACATGGTCAGCAGCCAAACCTCTCG
TTCTTTGCTTTTACCGCCACGCCCAAACAAAAGACACTGGAAATATTTGGGCAAACGACAGCCGCTGGCAAGCCGGAGCC
GTTCCACCTCTATAGCATGAGACAGGCCATCGAGGAGCATTTCATTTTCAATGTCCTTGAAAACTATACGACCTACGAAA
CCTACTTCCAAATTGGAAAGAAAATAGCCGACGACCCCGTGTATGGCAAGAATCTGGCGAACAAGGCTCTCGGCAAATAT
ATGAGCCTCCACCCACACAACCTCGCACAGAAGGCGGAGGTCATCATTGAGCATTTCCGCAGTCAAGTGCAGCACCGCAT
CGGCGGGCAGGCGAAAGCTATGCTTGTGACGGGTTCGCGCCTCCACGCCGTGCGCTATTTCTTCGAGTTCCAGAGATACA
TCAAAAAGATGCACTATGACTTGGGCATCTTGGTGGCGTTCTCTGGCACGGTCAAAGACAAGGTGAGCGGCGAAATCAAA
GAGTATACGGAATCTAACCTAAACAAGTTTCCGGATAGCGAGACTGTGGAAAAATTCGACACCGCCGAATATCAGCTTTT
GATTGTCGCCGAAAAGTATCAGACGGGCTTCGACCAGCCGCTTCTGCACACGATGTATGTGGATAAGAAACTGACGGGCA
TTAAGGCTGTTCAGACGATTTCCCGCGTTAATCGCGCGTGTAAGGGTAAGACTGAGACCTTCATCTTGGACTTCGTCAAC
TCACGCGAAGACATCGAAAAGGCGTTTCAGGATTATTATCAGGCGACGGGCGTGGCTGAAACGACAGACCCGAACACCAT
CTACGACATCAAAAATTTCCTTGATCGCTTTATGCTCTACCGTGACAGCGAGATTGAGGCTTTCGCCAAGGTTTTCTTTA
AGGAAACGAAGAACCAGGGAAACATTGACCTTGCGAAGCTGAATGGTTTTATCGACCCCGCTGTCGACCGCTACAATGCC
CTGACCGAGGATCAGGATAAAATGGACTTCAAAGGCGCACTTGCAAAGTTCATCCGCCTGTATGCGTTTCTCACGCACAT
TATCAACCTCGGCGATGAGAATCTACACAAGTTCCATGCTTATGCCAAGTGCTTGCTCCGCAAACTGCCCAAAAGTGATA
CGGAGCGCACACCCGATATTGGCAGCGATGTTATGCTCCAATATTATCGAGTGCAGAAAGTGGCTGAAGGCTCTATTGCT
TTGGCGAACGAGGACGGCATTCTAAAGAGTAAGACCTCCAGCACGGGATTGCCGATTGAAGACGAAAAAGAGGCTTTATC
CGCTATCATCCAAAGCCTAAACGAACGCCTTGGCACGAACTTCACTGAAATGGATAAGGTTTTGGAGCAATTCGTTCAGG
ACATGTCCAATAACCAAGAGATGGTCTTGCGCTCTAAGAACCCGCTCGATCTCTTTAAAATCATTTACGACAATACCATT
ATGGATGTGGTTCTAGGACGCATGGCAAAGAACCAAGAATTCTGCGAGAAGTATCTGGAGGACGAGGAATTCAGACGCGA
GATCGACAAAATATTATTGCCGCTTGTTCACGATCGGTTGTCGAAAATATAG

Protein sequence :
MAIQTKERNFEQEIEWWLTEGAVEADRYKKGNPVDFDRKLALDKGAILAFIKDTQPDEWQGLCRRHGSEVSAEAEFFKRL
NSELNSRGMIDVLRHGVVDLGISVRLAYFKPGSGMNQSLTALYAKNVLQITRQVKYSLQNENSIDTVIFLNGLPIITIEL
KNPLTGQTYRNAITQYENDRDPRELLLAFKKRAIVHFAVDTEEVWMTTWLRKLDTTFIPFNKGTEDHGAGNPVAEGGDYR
TAYLWKEILQRDSILDILHRFVQVSKDDKGKEKLIFPRYHQLDAVRKLVADAYANGSGKNYLIQHSAGSGKSNSIAWLAH
HLANLHDIHDEVIFHSIIVITDRRVLDKQLQRDIYNMEHKPGVVVLVDKNSKQLTTALNNGDKIIVCTLQKFPFVDVQKV
STTGKRFAIIVDEAHSSQTGDASKRMKEILADISLQGDDVVEKKLHEFAVEEAKAEAEEKDLDEAIADEMAAHGQQPNLS
FFAFTATPKQKTLEIFGQTTAAGKPEPFHLYSMRQAIEEHFIFNVLENYTTYETYFQIGKKIADDPVYGKNLANKALGKY
MSLHPHNLAQKAEVIIEHFRSQVQHRIGGQAKAMLVTGSRLHAVRYFFEFQRYIKKMHYDLGILVAFSGTVKDKVSGEIK
EYTESNLNKFPDSETVEKFDTAEYQLLIVAEKYQTGFDQPLLHTMYVDKKLTGIKAVQTISRVNRACKGKTETFILDFVN
SREDIEKAFQDYYQATGVAETTDPNTIYDIKNFLDRFMLYRDSEIEAFAKVFFKETKNQGNIDLAKLNGFIDPAVDRYNA
LTEDQDKMDFKGALAKFIRLYAFLTHIINLGDENLHKFHAYAKCLLRKLPKSDTERTPDIGSDVMLQYYRVQKVAEGSIA
LANEDGILKSKTSSTGLPIEDEKEALSAIIQSLNERLGTNFTEMDKVLEQFVQDMSNNQEMVLRSKNPLDLFKIIYDNTI
MDVVLGRMAKNQEFCEKYLEDEEFRREIDKILLPLVHDRLSKI

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%) |
SAS0025 | YP_042158.1 | type I restriction enzyme protein | Not tested | SCC476 | Protein | 2e-178 | 46 |
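
The Position, Length, DNA sequence, and Protein sequence fields of the record can be cross-checked against each other. The following is a minimal sketch of such a check, not part of the database itself. It assumes that Position uses 1-based inclusive coordinates, that the DNA sequence is shown in coding orientation (it begins with ATG and ends with TAG), and that the standard codon assignments apply. The helper names `gene_length` and `translate` are hypothetical, and only the first eight codons of the record are used.

```python
# Values copied from the Ethha_1801 record; only short prefixes are used here.
START, END = 1937331, 1940342          # Position field (assumed 1-based, inclusive)
DNA_PREFIX = "ATGGCAATTCAGACAAAAGAGCGC"  # first 8 codons of the DNA sequence
PROTEIN_PREFIX = "MAIQTKER"              # first 8 residues of the protein sequence

# Standard genetic code in the usual compact TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end - start + 1


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    print(gene_length(START, END))   # compare with the Length field
    print(translate(DNA_PREFIX))     # compare with the start of the protein sequence
```

The same `translate` helper could be applied to the full DNA sequence to compare against the full protein sequence; a 3012 bp gene corresponds to 1004 codons, the last of which is the stop codon.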