Gene Information

Name : Ethha_1801 (Ethha_1801)
Accession : YP_004092057.1
Strain : Ethanoligenens harbinense YUAN-3
Genome accession: NC_014828
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 1937331 - 1940342 bp
Length : 3012 bp
Strand : -
Note : KEGG: rsd:TGRD_039 type I restriction-modification system restriction subunit; PFAM: protein of unknown function DUF450; type III restriction protein res subunit; SMART: DEAD-like helicase

DNA sequence :
ATGGCAATTCAGACAAAAGAGCGCAACTTCGAACAAGAGATAGAATGGTGGCTCACTGAGGGTGCAGTCGAAGCCGACCG
CTACAAGAAGGGCAATCCCGTCGATTTTGACCGCAAACTTGCATTGGATAAAGGGGCGATTCTGGCGTTTATCAAAGACA
CTCAGCCTGACGAATGGCAGGGGCTTTGCAGGCGTCACGGCTCGGAAGTCTCTGCCGAAGCGGAATTCTTCAAACGTCTT
AATTCGGAACTGAACTCTCGCGGCATGATTGACGTGCTTCGCCACGGTGTGGTGGACCTTGGCATCTCCGTGCGGCTTGC
TTACTTTAAGCCGGGCAGCGGCATGAATCAGAGCTTAACGGCTCTCTACGCCAAGAACGTCCTGCAGATAACGCGGCAGG
TCAAGTACAGCCTGCAAAACGAGAACTCTATCGACACCGTTATCTTTCTGAACGGGCTGCCGATAATCACCATAGAGCTC
AAAAATCCGCTCACAGGGCAGACGTATCGAAATGCCATCACGCAGTATGAAAACGATCGTGACCCGCGTGAACTTCTGCT
TGCTTTCAAAAAGCGGGCTATTGTGCATTTTGCCGTGGACACCGAAGAAGTCTGGATGACCACTTGGCTTCGCAAACTTG
ATACCACCTTCATACCTTTCAATAAGGGAACCGAAGACCACGGAGCGGGCAATCCTGTCGCCGAGGGCGGTGATTATCGC
ACGGCGTACTTATGGAAAGAGATACTGCAAAGGGATAGCATTCTGGACATTCTGCACCGCTTCGTGCAGGTCTCGAAAGA
TGACAAGGGCAAGGAAAAATTGATATTTCCGCGTTACCACCAGTTGGACGCAGTCCGTAAATTGGTGGCGGATGCTTACG
CCAACGGGTCGGGCAAGAACTATCTGATTCAGCATTCGGCAGGTTCGGGCAAATCGAACTCCATCGCTTGGCTTGCTCAC
CACTTGGCGAACCTGCATGATATACATGACGAGGTGATATTCCACAGCATCATCGTCATTACTGATCGCCGTGTTCTCGA
CAAACAACTCCAGCGCGATATTTACAACATGGAGCATAAGCCGGGCGTGGTCGTCCTCGTGGATAAGAACTCAAAGCAGC
TAACCACCGCGCTGAACAACGGCGACAAAATCATCGTCTGCACCCTGCAGAAGTTCCCGTTTGTCGATGTGCAGAAGGTA
TCCACTACGGGCAAGCGGTTTGCTATTATCGTGGACGAGGCCCACTCATCTCAAACAGGCGACGCAAGCAAGCGTATGAA
AGAGATTCTGGCGGACATCTCTTTGCAGGGTGACGATGTCGTCGAAAAAAAACTGCATGAGTTTGCAGTAGAAGAAGCAA
AGGCCGAAGCCGAGGAAAAAGACCTTGACGAAGCTATCGCTGATGAAATGGCGGCACATGGTCAGCAGCCAAACCTCTCG
TTCTTTGCTTTTACCGCCACGCCCAAACAAAAGACACTGGAAATATTTGGGCAAACGACAGCCGCTGGCAAGCCGGAGCC
GTTCCACCTCTATAGCATGAGACAGGCCATCGAGGAGCATTTCATTTTCAATGTCCTTGAAAACTATACGACCTACGAAA
CCTACTTCCAAATTGGAAAGAAAATAGCCGACGACCCCGTGTATGGCAAGAATCTGGCGAACAAGGCTCTCGGCAAATAT
ATGAGCCTCCACCCACACAACCTCGCACAGAAGGCGGAGGTCATCATTGAGCATTTCCGCAGTCAAGTGCAGCACCGCAT
CGGCGGGCAGGCGAAAGCTATGCTTGTGACGGGTTCGCGCCTCCACGCCGTGCGCTATTTCTTCGAGTTCCAGAGATACA
TCAAAAAGATGCACTATGACTTGGGCATCTTGGTGGCGTTCTCTGGCACGGTCAAAGACAAGGTGAGCGGCGAAATCAAA
GAGTATACGGAATCTAACCTAAACAAGTTTCCGGATAGCGAGACTGTGGAAAAATTCGACACCGCCGAATATCAGCTTTT
GATTGTCGCCGAAAAGTATCAGACGGGCTTCGACCAGCCGCTTCTGCACACGATGTATGTGGATAAGAAACTGACGGGCA
TTAAGGCTGTTCAGACGATTTCCCGCGTTAATCGCGCGTGTAAGGGTAAGACTGAGACCTTCATCTTGGACTTCGTCAAC
TCACGCGAAGACATCGAAAAGGCGTTTCAGGATTATTATCAGGCGACGGGCGTGGCTGAAACGACAGACCCGAACACCAT
CTACGACATCAAAAATTTCCTTGATCGCTTTATGCTCTACCGTGACAGCGAGATTGAGGCTTTCGCCAAGGTTTTCTTTA
AGGAAACGAAGAACCAGGGAAACATTGACCTTGCGAAGCTGAATGGTTTTATCGACCCCGCTGTCGACCGCTACAATGCC
CTGACCGAGGATCAGGATAAAATGGACTTCAAAGGCGCACTTGCAAAGTTCATCCGCCTGTATGCGTTTCTCACGCACAT
TATCAACCTCGGCGATGAGAATCTACACAAGTTCCATGCTTATGCCAAGTGCTTGCTCCGCAAACTGCCCAAAAGTGATA
CGGAGCGCACACCCGATATTGGCAGCGATGTTATGCTCCAATATTATCGAGTGCAGAAAGTGGCTGAAGGCTCTATTGCT
TTGGCGAACGAGGACGGCATTCTAAAGAGTAAGACCTCCAGCACGGGATTGCCGATTGAAGACGAAAAAGAGGCTTTATC
CGCTATCATCCAAAGCCTAAACGAACGCCTTGGCACGAACTTCACTGAAATGGATAAGGTTTTGGAGCAATTCGTTCAGG
ACATGTCCAATAACCAAGAGATGGTCTTGCGCTCTAAGAACCCGCTCGATCTCTTTAAAATCATTTACGACAATACCATT
ATGGATGTGGTTCTAGGACGCATGGCAAAGAACCAAGAATTCTGCGAGAAGTATCTGGAGGACGAGGAATTCAGACGCGA
GATCGACAAAATATTATTGCCGCTTGTTCACGATCGGTTGTCGAAAATATAG

Protein sequence :
MAIQTKERNFEQEIEWWLTEGAVEADRYKKGNPVDFDRKLALDKGAILAFIKDTQPDEWQGLCRRHGSEVSAEAEFFKRL
NSELNSRGMIDVLRHGVVDLGISVRLAYFKPGSGMNQSLTALYAKNVLQITRQVKYSLQNENSIDTVIFLNGLPIITIEL
KNPLTGQTYRNAITQYENDRDPRELLLAFKKRAIVHFAVDTEEVWMTTWLRKLDTTFIPFNKGTEDHGAGNPVAEGGDYR
TAYLWKEILQRDSILDILHRFVQVSKDDKGKEKLIFPRYHQLDAVRKLVADAYANGSGKNYLIQHSAGSGKSNSIAWLAH
HLANLHDIHDEVIFHSIIVITDRRVLDKQLQRDIYNMEHKPGVVVLVDKNSKQLTTALNNGDKIIVCTLQKFPFVDVQKV
STTGKRFAIIVDEAHSSQTGDASKRMKEILADISLQGDDVVEKKLHEFAVEEAKAEAEEKDLDEAIADEMAAHGQQPNLS
FFAFTATPKQKTLEIFGQTTAAGKPEPFHLYSMRQAIEEHFIFNVLENYTTYETYFQIGKKIADDPVYGKNLANKALGKY
MSLHPHNLAQKAEVIIEHFRSQVQHRIGGQAKAMLVTGSRLHAVRYFFEFQRYIKKMHYDLGILVAFSGTVKDKVSGEIK
EYTESNLNKFPDSETVEKFDTAEYQLLIVAEKYQTGFDQPLLHTMYVDKKLTGIKAVQTISRVNRACKGKTETFILDFVN
SREDIEKAFQDYYQATGVAETTDPNTIYDIKNFLDRFMLYRDSEIEAFAKVFFKETKNQGNIDLAKLNGFIDPAVDRYNA
LTEDQDKMDFKGALAKFIRLYAFLTHIINLGDENLHKFHAYAKCLLRKLPKSDTERTPDIGSDVMLQYYRVQKVAEGSIA
LANEDGILKSKTSSTGLPIEDEKEALSAIIQSLNERLGTNFTEMDKVLEQFVQDMSNNQEMVLRSKNPLDLFKIIYDNTI
MDVVLGRMAKNQEFCEKYLEDEEFRREIDKILLPLVHDRLSKI

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
SAS0025 YP_042158.1 type I restriction enzyme protein Not tested SCC476 Protein 2e-178 46