Gene Information

Name : Teth514_1218 (Teth514_1218)
Accession : YP_001662848.1
Strain : Thermoanaerobacter sp. X514
Genome accession: NC_010320
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 1258950 - 1261922 bp
Length : 2973 bp
Strand : -
Note : PFAM: type III restriction enzyme, res subunit; protein of unknown function DUF450; SMART: DEAD-like helicases-like

DNA sequence :
ATGACTGCTACTAATACACGAGAAAGTGGTCTTGAATCTTTGATTGTAGATTGGCTTGTAAATCAAAATGGTTATGAACA
AGGCAGCAATGCTGACTATAACCGTGACTATGCTATTGATGAAACACGCCTATTTCGTTTTCTTTCAGCGACGCAGCCAG
ATGAAATGGAGAAACTCGGTGTATTTAAAAGCGACTTAAAAAAGGCTCAGTTTCTAAACCGATTGCGTGGTGAAATAGCA
AAACGCGGAATTATTGATGTACTTCGTAATGGTATTAAGGTTTATCCTGCTGACCTGGTTATGTTTTATCTAACACCAAG
TGAGAGAAACATAAAAGCAAAAGCTCTATTTGAGCAGAATATTTTCAGTGTTACACGACAACTGCAGTATTCAAAAGATG
CGACTCGTCTTGCCCTTGATTTGTGCATTTTTATCAATGGCTTGCCAGTTATAACATGCGAGCTTAAGAATCAACTTACA
AAGCAAAATGTTGATGATGCTGTTTACCAATATAAAACGGATCGTGATCCGAAGGAACTGCTTTTCCAATTTAAACGCTG
TATGGTTCATTTTGCAGTAGATGATGCAAGGGTCAAGTTCAGTACTAAGCTTGATGGTAAAGCTTCCTGGTTCTTGCCAT
TTGACAAAGGGTACAATGATGGAGCCGGCAACCCTCCAAATTCTTCTGGTATAATGACAGATTATTTATGGAAGGACATC
CTTGAAAAGTATATGCTTGCACATATAATCGAAAATTACGCTCAAGTTGTTGAAAAAGTAGACCAGGAAACAAAAAAGAA
AACATATACACAAATTTTCCCACGTTACCATCAACTGTCTGCTGTTGAAAGTCTCCTCGCAGATGTACGACATAATGGTG
TTGGCCAAAGATACTTAATTCAACATAGTGCTGGTAGTGGAAAATCAAATTCTATTGCATGGCTGGCTCATCAACTCGTA
GGACTCGAAAAGAATGGAAAAGCCATCATTGACTCTGTGGTAGTTGTTACAGACCGTGTAATACTTGATAAACAAATTCG
AGATACGATAAAACAATTTATGCAGGTTTCTAGCACTGTAGCATGGGCAGAACACTCTGATGATTTAAGGAAAGCAATCA
ATGGCGGTAAGAAGATTATAATAACTACTGTACATAAGTTCCCTATTATTCTTGATAGTATAGGTTCAGAACACAAAGGG
CGTTCTTTTGCCATAATTATTGACGAGGCCCATTCATCACAGAGCGGTAACATGTCGGCTAAGATGAATATTGTATTATC
GGGTGAAGTTACTGGAGAAGAGGAAGATTTTGAAGATAAAATCAACCGCCTTATGGAAGGGCGCAAAATGCTGAAGAACG
CTAGCTATTTTGCATTTACCGCTACTCCGAAAAACAAAACCCTTGAAATGTTTGGTATCCCATACCAAGACGGAGATGAA
ATTAAACATCGTCCGTTCCATGTATATACAATGAAGCAAGCAATTCAAGAAGGTTTTATTTTAGATGTACTTAAATACTA
TACCCCTGTTGACAGTTATTACAGACTTGCTAAAACCATTGAAGATGATCCTTTATTTGACAAGAAAAAAGCACAAAAGA
AACTCCGTCAATTTGTAGAAAGCAATAAATTTGCCATATCACAAAAAGCAGAAATCATGGTGAACCACTTTCATGATCAG
GTTATTTCAAAAGGAAAAATCGGAGGAAAAGCGCGAGCTATGGTGGTTACGAGTAGTATAGAGCGCTGTATAGAATACTA
TTACGCAATTAATAAATGCCTTGCTGACAGGCGTAGTCCTTATAAAGCTATTATTGCTTTTTCTGGGGAAAAAGAATATG
GTGGTAAAACTTTAACCTCTGCAGCAATTAATGGCTTCCCAGATAATACAATTGAGAAGGTATTCCGTAAGGATCCATAT
CGATTTCTTATAGTGGCTGATATGTTCCAAACGGGTTATGATGAACCACTACTCCATACAATGTACGTTGACAAAATGCT
ATCTGATATAAAGGCGGTTCAGACTCTATCTCGACTGAACCGCTCTCATCCACAGAAACATGATACTTTTGTACTTGATT
TTGCAAATAAAACAGAGACCATTGAAGCAGCATTCTCAAAATATTATCGGACAACTATTCTGTCTAATGAAACTGATCCG
AACAAGCTTTATGATCTCATAGCAATTATGGAATCCCATCAAGTATACGAAAGTGGGCATGTTGATTCGCTTGTTGAACT
GTACTTAAATGGTGCAGAACGTGATAGGTTAGATCCTATCCTCGATGCATGTACTGCTATTTACAAAGAGCTTGATGACG
AAGGGAAAATTGAATTCAAAAGTGCAGCAAAAGCATTTGTTCGAACATACGGCTTTCTTGGAGCTATTCTTCCTTACGGT
AATGCAGAATGGGAAAAGCTATCAATATTTTTAAATTTATTAATACCTAAACTTCCTTCTCCCAAAGAAGATGATTTATC
TCAAGGGATACTAGATTCAATTGATTTAGATAGTTACCGAGTGGAAGCTCGTGATTCTATGTCTCTTGTATTAGATGATG
CTGACGCTGAGATTGGCCCTGTACCTGCTGGTCGTGTAGGTGGCATAGTGGAGCCAGAAATGGATTTACTTTCTAGCATT
CTTTCATCATTTAATGACTTGTTCGGCAATATAGACTGGAACGATGCTGATAATGTTCGCCGCCAAATTCTAGAAATACC
AGGAATGGTTACAAAAGACGAGCGCTATATTAACGCAATGAAAAATTCAGACAAGCAAAATGCGCGTATGGAAAGTGAAC
GTGCCCTTCAGTCGGTTATATTTAGTATAATGGCGGATAATATGGAGTTATTTAAGCAGTTTAATGATAATCCTTCGTTT
AAGAAATGGCTGTCGGATCTTGTTTTCAATTTAACGTATAACCCTGAAGGAAAGCCATTTGAAACTCCTTCCAATGATTC
AAATAACAAATAA

Protein sequence :
MTATNTRESGLESLIVDWLVNQNGYEQGSNADYNRDYAIDETRLFRFLSATQPDEMEKLGVFKSDLKKAQFLNRLRGEIA
KRGIIDVLRNGIKVYPADLVMFYLTPSERNIKAKALFEQNIFSVTRQLQYSKDATRLALDLCIFINGLPVITCELKNQLT
KQNVDDAVYQYKTDRDPKELLFQFKRCMVHFAVDDARVKFSTKLDGKASWFLPFDKGYNDGAGNPPNSSGIMTDYLWKDI
LEKYMLAHIIENYAQVVEKVDQETKKKTYTQIFPRYHQLSAVESLLADVRHNGVGQRYLIQHSAGSGKSNSIAWLAHQLV
GLEKNGKAIIDSVVVVTDRVILDKQIRDTIKQFMQVSSTVAWAEHSDDLRKAINGGKKIIITTVHKFPIILDSIGSEHKG
RSFAIIIDEAHSSQSGNMSAKMNIVLSGEVTGEEEDFEDKINRLMEGRKMLKNASYFAFTATPKNKTLEMFGIPYQDGDE
IKHRPFHVYTMKQAIQEGFILDVLKYYTPVDSYYRLAKTIEDDPLFDKKKAQKKLRQFVESNKFAISQKAEIMVNHFHDQ
VISKGKIGGKARAMVVTSSIERCIEYYYAINKCLADRRSPYKAIIAFSGEKEYGGKTLTSAAINGFPDNTIEKVFRKDPY
RFLIVADMFQTGYDEPLLHTMYVDKMLSDIKAVQTLSRLNRSHPQKHDTFVLDFANKTETIEAAFSKYYRTTILSNETDP
NKLYDLIAIMESHQVYESGHVDSLVELYLNGAERDRLDPILDACTAIYKELDDEGKIEFKSAAKAFVRTYGFLGAILPYG
NAEWEKLSIFLNLLIPKLPSPKEDDLSQGILDSIDLDSYRVEARDSMSLVLDDADAEIGPVPAGRVGGIVEPEMDLLSSI
LSSFNDLFGNIDWNDADNVRRQILEIPGMVTKDERYINAMKNSDKQNARMESERALQSVIFSIMADNMELFKQFNDNPSF
KKWLSDLVFNLTYNPEGKPFETPSNDSNNK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
VC1765 NP_231400.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 9e-164 43
VC0395_A1363 YP_001217306.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 9e-164 43
VPI2_0013c ACA01830.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 9e-163 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Teth514_1218 YP_001662848.1 hypothetical protein VFG1098 Protein 4e-164 43