Gene Information

Name : Tgr7_0598 (Tgr7_0598)
Accession : YP_002512682.1
Strain : Thioalkalivibrio sulfidophilus HL-EbGR7
Genome accession: NC_011901
Putative virulence/resistance : Unknown
Product : type I site-specific restriction-modification system, R subunit
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 630678 - 633836 bp
Length : 3159 bp
Strand : -
Note : KEGG: pap:PSPA7_0077 type I site-specific restriction-modification system, R subunit

DNA sequence :
ATGGATAAGGCCAGGGAGAATGTGTTTCAGCAGGCCATCGTCACCGATCTGGTGGGGCAGGGCTGGCTGGAAGGGAAGTC
GGAGCACTATGACCGGGTGCTGGCCCTCTACCCGGAGGACCTGGTCGGCTACGTGCGCAACACCCAGCCCGAGGCGCTAG
AAAAGCTCACCAAGTTCTACGGCGATAAGGTGGAGGACAAGCTGCTGCAACGCGCGGCCGAGCAAATGGACAAGCATGGC
GCCCTGCACGTGCTGCGCCACGGCTTCAAGGACCGTGGCGCGAAGATTCGTCTGTGCACCTTCAAGCCCGACCACGGCCT
GAACCCCGAGACCCTGGCCCGCTATGAGGCCAACCGCCTGCGGGTGGTGCAGGAGGTCTCCTACTCGCCCCACGCCCGCG
AAGGCTACAACCCGCGCCTGGACCTGGTCCTGTTCGTCAACGGCGTGCCGGTGGCGACCCTGGAGCTGAAGTCCGAGTTC
AAGCAGGCCATCGACAATGCCAAGTGGCAGTACAAGAAGGACCGCCCGCCCAAAGACCCCAAGACCCGCAAGCCCGAGCC
GCTGCTGGCCTTCGGCAAGCGCGCCCTGGTGCACTTCGCCGTCAGCCAGGAAGAGGTGTGGATGGCCACCAGGCTCGATG
GCATGAAGACCTTCTTCCTGCCCTTTAACAAGGGCTTTGAGGGCGGGGCCGGCAACCCGCCCAATCCGGACGGCTACGCT
ACGGATTATCTCTGGAGGCAGGTCTTTGCGCGCGATGCCTGGCTGGACATCCTGGGCCGCTTCATCCACCTGCAAAAGGA
GGAAAAGGAGGACGGCTTCGGCAAGCGCTACACCAAGGAGAACCTGATCTTCCCGCGCTTCCATCAGTGGGATGCGGTCA
ACCAGCTGGTGACCACCGCCCGCGCCGAGGGGCCGGGCCACAAGTACCTGATCCAGCACAGCGCGGGTTCCGGCAAGTCC
AATTCCATCGCCTGGACCGCCCACCGCTTGGCCTCGCTGCACGACGATCAGGATCAGCGCGTGTTCGACTCGGTCATCGT
CATCACCGACCGCACGGTGCTGGACGACCAGCTGCAGGAGACCATCTACCAGTTCGAGCACGCCGAGGGGGTGGTGTGCC
GCATCAGCCGCGATGAGGGCGAGGGCAGCAAGTCCGCCCAGCTGGCCGAGGCCTTGATGGGGAACACCCGCATCATCATC
GTCACCCTCCAGACATTCCCCTTTGTGCTGGAGGCGATCCAGCAGCAGACCAGCCTCAAGGAACGCCGCTTCGCCGTGAT
CGCCGACGAGGCCCATTCCTCCCAGACCGGCGCCACCGCCCGCAAGCTGCGCCAGGTGCTGATGGCCGACGAGTTGGAAG
AGGATGCCGAGATCAGCGCCGAGGACGTGCTGGACGCCACCCTGGCCGCGCGCAGCCAGGCGCACAACATCAGCTACTTC
GCCTTCACCGCCACGCCCAAGGCCAAGACCCTGCAGCTCTTCGGCCGTCCGCCCGATCCCAACCAGGAGCCGGGGCCGGA
CAACCTGCCTGAAGCCTTTCACGTCTACACCATGCAGCAGGCCATCGAGGAGGGCTTCATCCTCGACGTGCTCAAGCGCT
ACACCACCTACAGCATGGCCTTCCGCCTGGAGCAGAAGCAGGCTGCTGAGGAGGCGGTGGACAAGGGCAAGGCCGCCACC
CGGCTCTACCAGTGGGTGAAGCTGCACCCCTACAACATCGAGCAGAAGGTGCAAGTGATCGTCGAACACTTCCGCCAGCA
CGTGGCCGCCCAGCTCAGCCGCCAGGCCAAGGCCATGGTGGTGACCGACTCGCGAAAGGCTGCCGTGCGCTACAAGCTCG
CCCTGGATAAGTACGTGACCGAGCACGGCTACACCGACGTGCACGCCCTGGTGGCCTTCTCGGGCGACGTGGAGGACAAG
GAGAGCGGGGCCTCGGTCGATGGAGGGGCCTTCAACGAGCGCAACATGAACCCCGGCCTCAAGGGCCGCGACCTGCGCGA
TGCGTTCGACACCGACGAATACCAGGTGATGATCGTCGCCAACAAGTTCCAGACCGGCTTCGACCAGCCCAAGCTCTGCG
CCATGTACGTGGACAAGAAGCTCTCGGGCGTGGACTGCGTGCAGACCCTCTCACGCCTCAACCGCACCTACCCGGGCAAG
GAAGACCCCTTCGTCCTGGATTTCGTCAACAAGCCCGAGGACGTGCTGGCCGCCTTCAAGCCCTACTACCGCACCGCCGA
GCTGGAAGACGTCTCCGACCCCAACCTGATCTACGACCTGCAGGAAAAGCTGGAGAGTGAGCGGATCTTCCGCTGGGAGG
AGGTGGAAGCCTTCGCCGACGCCTTCTTCGACCCCAAGAAGACCCAGGACAAGCTCAACTACCATGTGCGCCCGGCGGTG
GACCGCTACAGGGAACGCTACAAATCCGCCATCCAGGCCCTGCAGGACGCCCAGCGCCAGGAACGCCAGGCCGACATGAG
CGGCGATGAGGTCGGCGCCGCCAACGCCCGCCGCGCCGTCCAGAGCGCCGGCGAGGAAAAGAGCGTGCTCGACCTGTTCC
GCAAGGACCTCGGCAGCTTCGTGCGCTTCTACGAGTTCATCTCCCAGATCGTCGCCCTGGACGACCGTGATCTCGAAAAG
CTCGCTGTCTACGCCCGCCACCTGCGCCCCTTGCTGCGGCAGGCCGAACTCGACCAGCCGCTGGACCTCTCCGGCATCGA
GTTGACCCACTACCGCCTGAAAAAGCAGGGCGAGCACCAGATCAACCTGCGCGATGGCGAGGGGGACTACCGCATCCGCG
GCGAAGGCCCGGGCGGCGGCCAAGCGCACGACCCGGAAAAGGAAGCGCTGGCCGAGATCATCGCCCGGCTCAACGAGCTG
TTCGCGGGCGAGGGTCTGAGCGACGCCGATCGGCTCAACTACCTGCGTACCCTCTCGGACAAGGTGATGGAGAACAAGCC
GGTGCTGGCCCAGCTGAAAAACAATACCCGTGACCAGGTCATGCACGGTGATTTCCCGGCGGCGGTACGCGATGCCGTCA
TGGACAGCCTGACGACCCACCAGGGCATGGCGGGGCAGCTGCTCAGGGATCCGGCGCAACTGCAGCGCTCCGTCGGGGTG
GTGCTCGACTTCATCCTGCACTCGCTGCGCCAGCCCTGA

Protein sequence :
MDKARENVFQQAIVTDLVGQGWLEGKSEHYDRVLALYPEDLVGYVRNTQPEALEKLTKFYGDKVEDKLLQRAAEQMDKHG
ALHVLRHGFKDRGAKIRLCTFKPDHGLNPETLARYEANRLRVVQEVSYSPHAREGYNPRLDLVLFVNGVPVATLELKSEF
KQAIDNAKWQYKKDRPPKDPKTRKPEPLLAFGKRALVHFAVSQEEVWMATRLDGMKTFFLPFNKGFEGGAGNPPNPDGYA
TDYLWRQVFARDAWLDILGRFIHLQKEEKEDGFGKRYTKENLIFPRFHQWDAVNQLVTTARAEGPGHKYLIQHSAGSGKS
NSIAWTAHRLASLHDDQDQRVFDSVIVITDRTVLDDQLQETIYQFEHAEGVVCRISRDEGEGSKSAQLAEALMGNTRIII
VTLQTFPFVLEAIQQQTSLKERRFAVIADEAHSSQTGATARKLRQVLMADELEEDAEISAEDVLDATLAARSQAHNISYF
AFTATPKAKTLQLFGRPPDPNQEPGPDNLPEAFHVYTMQQAIEEGFILDVLKRYTTYSMAFRLEQKQAAEEAVDKGKAAT
RLYQWVKLHPYNIEQKVQVIVEHFRQHVAAQLSRQAKAMVVTDSRKAAVRYKLALDKYVTEHGYTDVHALVAFSGDVEDK
ESGASVDGGAFNERNMNPGLKGRDLRDAFDTDEYQVMIVANKFQTGFDQPKLCAMYVDKKLSGVDCVQTLSRLNRTYPGK
EDPFVLDFVNKPEDVLAAFKPYYRTAELEDVSDPNLIYDLQEKLESERIFRWEEVEAFADAFFDPKKTQDKLNYHVRPAV
DRYRERYKSAIQALQDAQRQERQADMSGDEVGAANARRAVQSAGEEKSVLDLFRKDLGSFVRFYEFISQIVALDDRDLEK
LAVYARHLRPLLRQAELDQPLDLSGIELTHYRLKKQGEHQINLRDGEGDYRIRGEGPGGGQAHDPEKEALAEIIARLNEL
FAGEGLSDADRLNYLRTLSDKVMENKPVLAQLKNNTRDQVMHGDFPAAVRDAVMDSLTTHQGMAGQLLRDPAQLQRSVGV
VLDFILHSLRQP

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
SAS0025 YP_042158.1 type I restriction enzyme protein Not tested SCC476 Protein 4e-148 41