Gene Information

Name : Thet_1691 (Thet_1691)
Accession : YP_003904563.1
Strain : Thermoanaerobacter sp. X513
Genome accession: NC_014538
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 1652459 - 1655431 bp
Length : 2973 bp
Strand : +
Note : KEGG: dde:Dde_1859 DEAD/DEAH box helicase-like; PFAM: protein of unknown function DUF450; type III restriction protein res subunit; SMART: DEAD-like helicase

DNA sequence :
ATGACTGCTACTAATACACGAGAAAGTGGTCTTGAATCTTTGATTGTAGATTGGCTTGTAAATCAAAATGGTTATGAACA
AGGCAGCAATGCTGACTATAACCGTGACTATGCTATTGATGAAACACGCCTATTTCGTTTTCTTTCAGCGACGCAGCCAG
ATGAAATGGAGAAACTCGGTGTATTTAAAAGCGACTTAAAAAAGGCTCAGTTTCTAAACCGATTGCGTGGTGAAATAGCA
AAACGCGGAATTATTGATGTACTTCGTAATGGTATTAAGGTTTATCCTGCTGACCTGGTTATGTTTTATCTAACACCAAG
TGAGAGAAACATAAAAGCAAAAGCTCTATTTGAGCAGAATATTTTCAGTGTTACACGACAACTGCAGTATTCAAAAGATG
CGACTCGTCTTGCCCTTGATTTGTGCATTTTTATCAATGGCTTGCCAGTTATAACATGCGAGCTTAAGAATCAACTTACA
AAGCAAAATGTTGATGATGCTGTTTACCAATATAAAACGGATCGTGATCCGAAGGAACTGCTTTTCCAATTTAAACGCTG
TATGGTTCATTTTGCAGTAGATGATGCAAGGGTCAAGTTCAGTACTAAGCTTGATGGTAAAGCTTCCTGGTTCTTGCCAT
TTGACAAAGGGTACAATGATGGAGCCGGCAACCCTCCAAATTCTTCTGGTATAATGACAGATTATTTATGGAAGGACATC
CTTGAAAAGTATATGCTTGCACATATAATCGAAAATTACGCTCAAGTTGTTGAAAAAGTAGACCAGGAAACAAAAAAGAA
AACATATACACAAATTTTCCCACGTTACCATCAACTGTCTGCTGTTGAAAGTCTCCTCGCAGATGTACGACATAATGGTG
TTGGCCAAAGATACTTAATTCAACATAGTGCTGGTAGTGGAAAATCAAATTCTATTGCATGGCTGGCTCATCAACTCGTA
GGACTCGAAAAGAATGGAAAAGCCATCATTGACTCTGTGGTAGTTGTTACAGACCGTGTAATACTTGATAAACAAATTCG
AGATACGATAAAACAATTTATGCAGGTTTCTAGCACTGTAGCATGGGCAGAACACTCTGATGATTTAAGGAAAGCAATCA
ATGGCGGTAAGAAGATTATAATAACTACTGTACATAAGTTCCCTATTATTCTTGATAGTATAGGTTCAGAACACAAAGGG
CGTTCTTTTGCCATAATTATTGACGAGGCCCATTCATCACAGAGCGGTAACATGTCGGCTAAGATGAATATTGTATTATC
GGGTGAAGTTACTGGAGAAGAGGAAGATTTTGAAGATAAAATCAACCGCCTTATGGAAGGGCGCAAAATGCTGAAGAACG
CTAGCTATTTTGCATTTACCGCTACTCCGAAAAACAAAACCCTTGAAATGTTTGGTATCCCATACCAAGACGGAGATGAA
ATTAAACATCGTCCGTTCCATGTATATACAATGAAGCAAGCAATTCAAGAAGGTTTTATTTTAGATGTACTTAAATACTA
TACCCCTGTTGACAGTTATTACAGACTTGCTAAAACCATTGAAGATGATCCTTTATTTGACAAGAAAAAAGCACAAAAGA
AACTCCGTCAATTTGTAGAAAGCAATAAATTTGCCATATCACAAAAAGCAGAAATCATGGTGAACCACTTTCATGATCAG
GTTATTTCAAAAGGAAAAATCGGAGGAAAAGCGCGAGCTATGGTGGTTACGAGTAGTATAGAGCGCTGTATAGAATACTA
TTACGCAATTAATAAATGCCTTGCTGACAGGCGTAGTCCTTATAAAGCTATTATTGCTTTTTCTGGGGAAAAAGAATATG
GTGGTAAAACTTTAACCTCTGCAGCAATTAATGGCTTCCCAGATAATACAATTGAGAAGGTATTCCGTAAGGATCCATAT
CGATTTCTTATAGTGGCTGATATGTTCCAAACGGGTTATGATGAACCACTACTCCATACAATGTACGTTGACAAAATGCT
ATCTGATATAAAGGCGGTTCAGACTCTATCTCGACTGAACCGCTCTCATCCACAGAAACATGATACTTTTGTACTTGATT
TTGCAAATAAAACAGAGACCATTGAAGCAGCATTCTCAAAATATTATCGGACAACTATTCTGTCTAATGAAACTGATCCG
AACAAGCTTTATGATCTCATAGCAATTATGGAATCCCATCAAGTATACGAAAGTGGGCATGTTGATTCGCTTGTTGAACT
GTACTTAAATGGTGCAGAACGTGATAGGTTAGATCCTATCCTCGATGCATGTACTGCTATTTACAAAGAGCTTGATGACG
AAGGGAAAATTGAATTCAAAAGTGCAGCAAAAGCATTTGTTCGAACATACGGCTTTCTTGGAGCTATTCTTCCTTACGGT
AATGCAGAATGGGAAAAGCTATCAATATTTTTAAATTTATTAATACCTAAACTTCCTTCTCCCAAAGAAGATGATTTATC
TCAAGGGATACTAGATTCAATTGATTTAGATAGTTACCGAGTGGAAGCTCGTGATTCTATGTCTCTTGTATTAGATGATG
CTGACGCTGAGATTGGCCCTGTACCTGCTGGTCGTGTAGGTGGCATAGTGGAGCCAGAAATGGATTTACTTTCTAGCATT
CTTTCATCATTTAATGACTTGTTCGGCAATATAGACTGGAACGATGCTGATAATGTTCGCCGCCAAATTCTAGAAATACC
AGGAATGGTTACAAAAGACGAGCGCTATATTAACGCAATGAAAAATTCAGACAAGCAAAATGCGCGTATGGAAAGTGAAC
GTGCCCTTCAGTCGGTTATATTTAGTATAATGGCGGATAATATGGAGTTATTTAAGCAGTTTAATGATAATCCTTCGTTT
AAGAAATGGCTGTCGGATCTTGTTTTCAATTTAACGTATAACCCTGAAGGAAAGCCATTTGAAACTCCTTCCAATGATTC
AAATAACAAATAA

Protein sequence :
MTATNTRESGLESLIVDWLVNQNGYEQGSNADYNRDYAIDETRLFRFLSATQPDEMEKLGVFKSDLKKAQFLNRLRGEIA
KRGIIDVLRNGIKVYPADLVMFYLTPSERNIKAKALFEQNIFSVTRQLQYSKDATRLALDLCIFINGLPVITCELKNQLT
KQNVDDAVYQYKTDRDPKELLFQFKRCMVHFAVDDARVKFSTKLDGKASWFLPFDKGYNDGAGNPPNSSGIMTDYLWKDI
LEKYMLAHIIENYAQVVEKVDQETKKKTYTQIFPRYHQLSAVESLLADVRHNGVGQRYLIQHSAGSGKSNSIAWLAHQLV
GLEKNGKAIIDSVVVVTDRVILDKQIRDTIKQFMQVSSTVAWAEHSDDLRKAINGGKKIIITTVHKFPIILDSIGSEHKG
RSFAIIIDEAHSSQSGNMSAKMNIVLSGEVTGEEEDFEDKINRLMEGRKMLKNASYFAFTATPKNKTLEMFGIPYQDGDE
IKHRPFHVYTMKQAIQEGFILDVLKYYTPVDSYYRLAKTIEDDPLFDKKKAQKKLRQFVESNKFAISQKAEIMVNHFHDQ
VISKGKIGGKARAMVVTSSIERCIEYYYAINKCLADRRSPYKAIIAFSGEKEYGGKTLTSAAINGFPDNTIEKVFRKDPY
RFLIVADMFQTGYDEPLLHTMYVDKMLSDIKAVQTLSRLNRSHPQKHDTFVLDFANKTETIEAAFSKYYRTTILSNETDP
NKLYDLIAIMESHQVYESGHVDSLVELYLNGAERDRLDPILDACTAIYKELDDEGKIEFKSAAKAFVRTYGFLGAILPYG
NAEWEKLSIFLNLLIPKLPSPKEDDLSQGILDSIDLDSYRVEARDSMSLVLDDADAEIGPVPAGRVGGIVEPEMDLLSSI
LSSFNDLFGNIDWNDADNVRRQILEIPGMVTKDERYINAMKNSDKQNARMESERALQSVIFSIMADNMELFKQFNDNPSF
KKWLSDLVFNLTYNPEGKPFETPSNDSNNK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
VC1765 NP_231400.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 9e-164 43
VC0395_A1363 YP_001217306.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 9e-164 43
VPI2_0013c ACA01830.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 9e-163 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Thet_1691 YP_003904563.1 hypothetical protein VFG1098 Protein 4e-164 43