Gene Information

Name : Ethha_1806 (Ethha_1806)
Accession : YP_004092062.1
Strain : Ethanoligenens harbinense YUAN-3
Genome accession: NC_014828
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 1948645 - 1951740 bp
Length : 3096 bp
Strand : -
Note : KEGG: tex:Teth514_1218 hypothetical protein; PFAM: protein of unknown function DUF450; type III restriction protein res subunit; SMART: DEAD-like helicase

DNA sequence :
ATGGCTTTTACGGATAAAACCGAGAGGGGATTTGAGACGATCATCGTGAACTGGCTCGTGGAGCAGAATGGCTACGAGCA
GGGAACGAATGACGACTACAGTAAGGAATACGCCGTAGACGAAACCCGCCTCTTCCGGTTCCTGAATGATACGCAGCCGA
GGGAAATGGCAAAGCTTGGTGTAAATAACAGCGATCAGAAGAAGCGGCAGTTCCTAAACCGCCTTTCCGGAGAGATTGCC
AAGCGCGGCATTATTGATGTGTTGCGAAACGGTGTGAAAGCGTATCCGGCTGACCTCATCATGTTCTACTTCACGCCGAC
TGAGAACAACGAGAAGTCGAAGCAGATGTTTGAAAAGAATATCTTCAGTGTGACACGGCAGCTTCGCTATTCCATCGACG
CTTCAAAGCTCGCACTCGACCTTTGCCTGTTCATCAACGGTCTTCCGGTCGTCACGATTGAGCTCAAGAATCATTTCACT
GGTCAGTCGACAGCAGATGCCGTTGAGCAGTATAAGGAAGATCGCGATCCGCGGGACACGCTGTTCTCGTTCAAACGGTG
CATGGTGCATTTCGCAGTCGATGACCAGACCGTCATGTTCTGCACGAAGCTTGCTGGTAAAGACAGCTGGTTCCTGCCAT
TCAACAAGGGCTATAACGATGGCGCGGGCAATCCGCCGAATCCGGACGGCATCATGACAGACTATCTGTGGAAGGACATT
CTGACGAAGTGGAAGCTCTCCCGCATTATCGAGAATTACGCTCAGGTCGTCGTTGACGAAGATCCGGACACGAAGAAGAA
AACCGTGAAACAGATCTGGCCACGCTACCATCAGCTGGACTGCGTGGAGAAGCTGCTCACAGATGTGAAACAAAACGGTG
TCGGTAAGCGCTATCTCATCCAGCACAGTGCAGGCTCCGGAAAGTCGAATTCTATCGCATGGCTTGCTCATCAGCTGATT
GGATTGGAACAAGACGGCCATCCGATGATCGATTCCGTTATCGTCGTCACCGACCGCAGGATTCTGGACAAGCAGATCCG
CGATACCATCAAGCAGTTCATGCAGGTGAAAAACACGGTCGTTTGGGCACAGCATTCCGGAGACCTGAAAAAGGCAATCC
AGGATGGCAAGCGGATCATTATAACCACGGTTGAGAAGTTCCCGTACATCTCGCAGGAAATCGGTCAGGAACATATCAAT
AATCATTTTGCCATTATCATCGATGAGGCACACTCAGGCCAGAGCGGGCGCAATTCCGCGAATATGAATCTGGCGCTTTC
CGGCATGGCTTCAGACAACGAAATGGACAATGAAGACAAGATCAATGCGATTGTCGAGGGCCGGAAGCTCGTAAAGACCG
CGAGCTATTTTGCGTTCACCGCGACCCCAAAGAACAAGACCGAGGAGGTTTTCGGAACGCCATATGAAGAGGATGGCGAA
ATCAAGCACAGGCCTTTCCATGTTTACACCATGAAGCAGGCCATTCAGGAAGGCTTCATCCTTGACGTCTTGAAGAACTA
TACGGCGATCGACAGCTGGTACAAGATTGCCAAGAAGGTCGAGGACGATCCGATGTTCGACAAGAAGCGCGCCCAGAAAA
AACTGCGTTCCTTTGTCGAGGGGAATCCGGATGTCATCGCCAAGAAGGCTGCCATGATGGTGGATCACTTCCATGAGCAG
ATCATCGCTAAGAAGAAGCTGAACGGAAAGTCCCGCGCAATGGTGGTGACCGCGAGCATCCCACGCTGCATCGAGACCTA
CTATGCCATCAACAAGTGCCTTGCAGACAGACATAGTCCGTACAAGGCGATTATCGCTTTCTCCGGCGAGTGCAAATACA
ACGGACAGGAACCTGCCCTGACGTCGGCAGGACTGAACGGGTTCCCGGACGCCAAGATCCCGAAGGAGTTCAAGAAAGAT
CCGTATCGTCTTCTGGTAGTTGCAGACATGTTCCAGACCGGATTTGATGAGCCACTTCTGCAGACCATGTACGTGGATAA
GCCGCTTTATGACATTGCGGCAGTGCAGACGCTTTCCCGTCTGAACCGCGCGGCTCCCGGGAAGGACGAAGTCTATGTGC
TGGACTTTGCAAACAAGACATCGACGATTCAGGACGCCTTCTCGAAGTTCTACAGGACAACGATACTGTCTGGAGAGACT
GATCCGAACAAGCTTTACGACCTGATCACGCTCATGGAGAGCTATCAGGTATATGACGCTGACGATGTAGAGCACGTAGT
TGATTTGTTCCTTGGCGGAGCTGAGAGAGACAGGCTTGATCCGTTGCTTGATCCGTGCGTGGCTACCTACAACGAACTCG
AAACGGACGATCAGATCAAGTTCAAGAGCGCAGCAAAGTCATTTGTCCGTACATACGGTTTCCTTGGTTCCATTCTTCCC
TATGGGAATGTGGACTGGGAAAAGTTGTCGATCTTCCTGAATCTGCTAATACCGAAACTTCCGTCACCTCGTGAGGATGA
TTTGTCAGAGGGCATTTTGTCTACGATTGATCTGGACAGTTATCGGAATGAAGCACAGGAGGCCGTGGCCATAAAACTGG
AGGACGAGGAGGCCGAGATTGCTCCGGTCCCTGCCGGAAAGGTCGGCCATAACGTTGAGCCGGAACTTGATCCGCTTTCC
AAGATCATCATGGATTTCAACGATATGTTCGGAAACATCCAGTGGAATGACGCTGATAACGTACAGCGCCAGATCCTCCA
GATTCCGGCGATGGTTTCTCGTGACGAGAAATACCAGAATGCCATGAAGAATTCTGATGAGCAGGAAGCCCGAACAGAAA
GCGAGCGCGCCCTGCAGAAGGTCATCTTCTCCATCATGGCGGACAACATGGAGCTCTTTAAGCAGTTTCAGGACAATCCG
TCGTTCAAGAAGTGGCTTACGAATATGGTATTCAACATGACGTACAACAAGGAGGGGAAACCGTATGAAGCACCAGACGA
TTTGGATTCTCCGAAAGTCGCCAGCGACAATTTCACAAGCTATCGATATCCAACAGAAGCGCCAAGAAGCGGAATGATGG
TAGCCGATGGCAAAGTAACGTTCGGTGAAAAGAAGGACAGCGATTCTAAAAAGTAG

Protein sequence :
MAFTDKTERGFETIIVNWLVEQNGYEQGTNDDYSKEYAVDETRLFRFLNDTQPREMAKLGVNNSDQKKRQFLNRLSGEIA
KRGIIDVLRNGVKAYPADLIMFYFTPTENNEKSKQMFEKNIFSVTRQLRYSIDASKLALDLCLFINGLPVVTIELKNHFT
GQSTADAVEQYKEDRDPRDTLFSFKRCMVHFAVDDQTVMFCTKLAGKDSWFLPFNKGYNDGAGNPPNPDGIMTDYLWKDI
LTKWKLSRIIENYAQVVVDEDPDTKKKTVKQIWPRYHQLDCVEKLLTDVKQNGVGKRYLIQHSAGSGKSNSIAWLAHQLI
GLEQDGHPMIDSVIVVTDRRILDKQIRDTIKQFMQVKNTVVWAQHSGDLKKAIQDGKRIIITTVEKFPYISQEIGQEHIN
NHFAIIIDEAHSGQSGRNSANMNLALSGMASDNEMDNEDKINAIVEGRKLVKTASYFAFTATPKNKTEEVFGTPYEEDGE
IKHRPFHVYTMKQAIQEGFILDVLKNYTAIDSWYKIAKKVEDDPMFDKKRAQKKLRSFVEGNPDVIAKKAAMMVDHFHEQ
IIAKKKLNGKSRAMVVTASIPRCIETYYAINKCLADRHSPYKAIIAFSGECKYNGQEPALTSAGLNGFPDAKIPKEFKKD
PYRLLVVADMFQTGFDEPLLQTMYVDKPLYDIAAVQTLSRLNRAAPGKDEVYVLDFANKTSTIQDAFSKFYRTTILSGET
DPNKLYDLITLMESYQVYDADDVEHVVDLFLGGAERDRLDPLLDPCVATYNELETDDQIKFKSAAKSFVRTYGFLGSILP
YGNVDWEKLSIFLNLLIPKLPSPREDDLSEGILSTIDLDSYRNEAQEAVAIKLEDEEAEIAPVPAGKVGHNVEPELDPLS
KIIMDFNDMFGNIQWNDADNVQRQILQIPAMVSRDEKYQNAMKNSDEQEARTESERALQKVIFSIMADNMELFKQFQDNP
SFKKWLTNMVFNMTYNKEGKPYEAPDDLDSPKVASDNFTSYRYPTEAPRSGMMVADGKVTFGEKKDSDSKK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
VC1765 NP_231400.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 2e-167 43
VC0395_A1363 YP_001217306.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 2e-167 43
VPI2_0013c ACA01830.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 1e-166 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Ethha_1806 YP_004092062.1 hypothetical protein VFG1098 Protein 1e-167 43