Name : Ethha_1806 (Ethha_1806) Accession : YP_004092062.1 Strain : Ethanoligenens harbinense YUAN-3 Genome accession: NC_014828 Putative virulence/resistance : Virulence Product : hypothetical protein Function : - COG functional category : V : Defense mechanisms COG ID : COG0610 EC number : - Position : 1948645 - 1951740 bp Length : 3096 bp Strand : - Note : KEGG: tex:Teth514_1218 hypothetical protein; PFAM: protein of unknown function DUF450; type III restriction protein res subunit; SMART: DEAD-like helicase DNA sequence : ATGGCTTTTACGGATAAAACCGAGAGGGGATTTGAGACGATCATCGTGAACTGGCTCGTGGAGCAGAATGGCTACGAGCA GGGAACGAATGACGACTACAGTAAGGAATACGCCGTAGACGAAACCCGCCTCTTCCGGTTCCTGAATGATACGCAGCCGA GGGAAATGGCAAAGCTTGGTGTAAATAACAGCGATCAGAAGAAGCGGCAGTTCCTAAACCGCCTTTCCGGAGAGATTGCC AAGCGCGGCATTATTGATGTGTTGCGAAACGGTGTGAAAGCGTATCCGGCTGACCTCATCATGTTCTACTTCACGCCGAC TGAGAACAACGAGAAGTCGAAGCAGATGTTTGAAAAGAATATCTTCAGTGTGACACGGCAGCTTCGCTATTCCATCGACG CTTCAAAGCTCGCACTCGACCTTTGCCTGTTCATCAACGGTCTTCCGGTCGTCACGATTGAGCTCAAGAATCATTTCACT GGTCAGTCGACAGCAGATGCCGTTGAGCAGTATAAGGAAGATCGCGATCCGCGGGACACGCTGTTCTCGTTCAAACGGTG CATGGTGCATTTCGCAGTCGATGACCAGACCGTCATGTTCTGCACGAAGCTTGCTGGTAAAGACAGCTGGTTCCTGCCAT TCAACAAGGGCTATAACGATGGCGCGGGCAATCCGCCGAATCCGGACGGCATCATGACAGACTATCTGTGGAAGGACATT CTGACGAAGTGGAAGCTCTCCCGCATTATCGAGAATTACGCTCAGGTCGTCGTTGACGAAGATCCGGACACGAAGAAGAA AACCGTGAAACAGATCTGGCCACGCTACCATCAGCTGGACTGCGTGGAGAAGCTGCTCACAGATGTGAAACAAAACGGTG TCGGTAAGCGCTATCTCATCCAGCACAGTGCAGGCTCCGGAAAGTCGAATTCTATCGCATGGCTTGCTCATCAGCTGATT GGATTGGAACAAGACGGCCATCCGATGATCGATTCCGTTATCGTCGTCACCGACCGCAGGATTCTGGACAAGCAGATCCG CGATACCATCAAGCAGTTCATGCAGGTGAAAAACACGGTCGTTTGGGCACAGCATTCCGGAGACCTGAAAAAGGCAATCC AGGATGGCAAGCGGATCATTATAACCACGGTTGAGAAGTTCCCGTACATCTCGCAGGAAATCGGTCAGGAACATATCAAT AATCATTTTGCCATTATCATCGATGAGGCACACTCAGGCCAGAGCGGGCGCAATTCCGCGAATATGAATCTGGCGCTTTC CGGCATGGCTTCAGACAACGAAATGGACAATGAAGACAAGATCAATGCGATTGTCGAGGGCCGGAAGCTCGTAAAGACCG CGAGCTATTTTGCGTTCACCGCGACCCCAAAGAACAAGACCGAGGAGGTTTTCGGAACGCCATATGAAGAGGATGGCGAA ATCAAGCACAGGCCTTTCCATGTTTACACCATGAAGCAGGCCATTCAGGAAGGCTTCATCCTTGACGTCTTGAAGAACTA TACGGCGATCGACAGCTGGTACAAGATTGCCAAGAAGGTCGAGGACGATCCGATGTTCGACAAGAAGCGCGCCCAGAAAA AACTGCGTTCCTTTGTCGAGGGGAATCCGGATGTCATCGCCAAGAAGGCTGCCATGATGGTGGATCACTTCCATGAGCAG ATCATCGCTAAGAAGAAGCTGAACGGAAAGTCCCGCGCAATGGTGGTGACCGCGAGCATCCCACGCTGCATCGAGACCTA CTATGCCATCAACAAGTGCCTTGCAGACAGACATAGTCCGTACAAGGCGATTATCGCTTTCTCCGGCGAGTGCAAATACA ACGGACAGGAACCTGCCCTGACGTCGGCAGGACTGAACGGGTTCCCGGACGCCAAGATCCCGAAGGAGTTCAAGAAAGAT CCGTATCGTCTTCTGGTAGTTGCAGACATGTTCCAGACCGGATTTGATGAGCCACTTCTGCAGACCATGTACGTGGATAA GCCGCTTTATGACATTGCGGCAGTGCAGACGCTTTCCCGTCTGAACCGCGCGGCTCCCGGGAAGGACGAAGTCTATGTGC TGGACTTTGCAAACAAGACATCGACGATTCAGGACGCCTTCTCGAAGTTCTACAGGACAACGATACTGTCTGGAGAGACT GATCCGAACAAGCTTTACGACCTGATCACGCTCATGGAGAGCTATCAGGTATATGACGCTGACGATGTAGAGCACGTAGT TGATTTGTTCCTTGGCGGAGCTGAGAGAGACAGGCTTGATCCGTTGCTTGATCCGTGCGTGGCTACCTACAACGAACTCG AAACGGACGATCAGATCAAGTTCAAGAGCGCAGCAAAGTCATTTGTCCGTACATACGGTTTCCTTGGTTCCATTCTTCCC TATGGGAATGTGGACTGGGAAAAGTTGTCGATCTTCCTGAATCTGCTAATACCGAAACTTCCGTCACCTCGTGAGGATGA TTTGTCAGAGGGCATTTTGTCTACGATTGATCTGGACAGTTATCGGAATGAAGCACAGGAGGCCGTGGCCATAAAACTGG AGGACGAGGAGGCCGAGATTGCTCCGGTCCCTGCCGGAAAGGTCGGCCATAACGTTGAGCCGGAACTTGATCCGCTTTCC AAGATCATCATGGATTTCAACGATATGTTCGGAAACATCCAGTGGAATGACGCTGATAACGTACAGCGCCAGATCCTCCA GATTCCGGCGATGGTTTCTCGTGACGAGAAATACCAGAATGCCATGAAGAATTCTGATGAGCAGGAAGCCCGAACAGAAA GCGAGCGCGCCCTGCAGAAGGTCATCTTCTCCATCATGGCGGACAACATGGAGCTCTTTAAGCAGTTTCAGGACAATCCG TCGTTCAAGAAGTGGCTTACGAATATGGTATTCAACATGACGTACAACAAGGAGGGGAAACCGTATGAAGCACCAGACGA TTTGGATTCTCCGAAAGTCGCCAGCGACAATTTCACAAGCTATCGATATCCAACAGAAGCGCCAAGAAGCGGAATGATGG TAGCCGATGGCAAAGTAACGTTCGGTGAAAAGAAGGACAGCGATTCTAAAAAGTAG Protein sequence : MAFTDKTERGFETIIVNWLVEQNGYEQGTNDDYSKEYAVDETRLFRFLNDTQPREMAKLGVNNSDQKKRQFLNRLSGEIA KRGIIDVLRNGVKAYPADLIMFYFTPTENNEKSKQMFEKNIFSVTRQLRYSIDASKLALDLCLFINGLPVVTIELKNHFT GQSTADAVEQYKEDRDPRDTLFSFKRCMVHFAVDDQTVMFCTKLAGKDSWFLPFNKGYNDGAGNPPNPDGIMTDYLWKDI LTKWKLSRIIENYAQVVVDEDPDTKKKTVKQIWPRYHQLDCVEKLLTDVKQNGVGKRYLIQHSAGSGKSNSIAWLAHQLI GLEQDGHPMIDSVIVVTDRRILDKQIRDTIKQFMQVKNTVVWAQHSGDLKKAIQDGKRIIITTVEKFPYISQEIGQEHIN NHFAIIIDEAHSGQSGRNSANMNLALSGMASDNEMDNEDKINAIVEGRKLVKTASYFAFTATPKNKTEEVFGTPYEEDGE IKHRPFHVYTMKQAIQEGFILDVLKNYTAIDSWYKIAKKVEDDPMFDKKRAQKKLRSFVEGNPDVIAKKAAMMVDHFHEQ IIAKKKLNGKSRAMVVTASIPRCIETYYAINKCLADRHSPYKAIIAFSGECKYNGQEPALTSAGLNGFPDAKIPKEFKKD PYRLLVVADMFQTGFDEPLLQTMYVDKPLYDIAAVQTLSRLNRAAPGKDEVYVLDFANKTSTIQDAFSKFYRTTILSGET DPNKLYDLITLMESYQVYDADDVEHVVDLFLGGAERDRLDPLLDPCVATYNELETDDQIKFKSAAKSFVRTYGFLGSILP YGNVDWEKLSIFLNLLIPKLPSPREDDLSEGILSTIDLDSYRNEAQEAVAIKLEDEEAEIAPVPAGKVGHNVEPELDPLS KIIMDFNDMFGNIQWNDADNVQRQILQIPAMVSRDEKYQNAMKNSDEQEARTESERALQKVIFSIMADNMELFKQFQDNP SFKKWLTNMVFNMTYNKEGKPYEAPDDLDSPKVASDNFTSYRYPTEAPRSGMMVADGKVTFGEKKDSDSKK |
Gene | GenBank Accn | Product | Virulance or Resistance | PAI or REI | Alignment Type | E-val | Identity |
VC1765 | NP_231400.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 2e-167 | 43 |
VC0395_A1363 | YP_001217306.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 2e-167 | 43 |
VPI2_0013c | ACA01830.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 1e-166 | 43 |
Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity |
Ethha_1806 | YP_004092062.1 | hypothetical protein | VFG1098 | Protein | 1e-167 | 43 |