Gene Information

Name : TVNIR_1050 (TVNIR_1050)
Accession : YP_007216215.1
Strain : Thioalkalivibrio nitratireducens DSM 14787
Genome accession: NC_019902
Putative virulence/resistance : Unknown
Product : Type I restriction-modification system, restriction subunit R
Function : -
COG functional category : -
COG ID : -
EC number : 3.1.21.3
Position : 972993 - 976190 bp
Length : 3198 bp
Strand : +
Note : -

DNA sequence :
ATGATGGCGGACAGCAGGGAAGCCCGGTTTCAGCAGGACATCATCGACGCGATGACCGCCGATGGCTGGTTGACCGCTCC
GGCCAGCGGCTATGACGCCGCCGATGCGCTGTACACCGAGGATCTCGTCGCCTACCTGCAGGAGGCGTGGCCGCAGCGCT
GGGAGAAGTTCGCCGCCAAGAACCCGCAGGATCCGCAGAAGGCCCTGGTGCGCGCGGTCACGCGCGAGCTGGAGCGTGGC
GGCACGCTGGAGGTGCTGCGCCACGGGGTGAAGACGCCGGGGGTGAAGCTGGAACTGTGCAGCTTCCGGCCCGACCACGA
GATGAACCCCGAGTCCGTGGCGCGCTACCGTGCCAACCGCCTGCGGGTGGTGCCGGAGGTCTCGTACTCGCCGCATGCGC
GCAGCGGCGAGTACAACCCGCGGCTGGATCTGGTGCTGTTCGTCAACGGCATCCCCACCGCCACGCTGGAGCTGAAGAGC
GAGTTCAAGCAGTCGGTGGAGAACGCGAAGCGCCAGTACCGGCTCGACCGCCCGGTGAAGGATCCGGTCACCCGCAAGCT
GGAACCGCTGCTGACCTTCAGGCGCGGGGCGCTGGTGCACTTTGCGGTCAGCCAGGACGAGGTGGCGATGACCACCCGCC
TGAACGGCAAGGATACGGTCTTTCTGCCGTTCAACCAGGGCACCGAGGACGGCGGCGCCGGCAACCCCACACCACCGGAC
CGGGACAGCTACGCGACCGGCTACCTGTGGCAGCGCGTGTTCCAGCCGGATGCCTGGCTCCGGATCACCGGCCGCTTTCT
GCATTTGGAGCAGAAGACCGAGGAGGACTTCCACGGCAAGCGCCGCAAGAAGGAGGCATTGATCTTCCCGCGCTTCCACC
AGTGGGACGTGGTGAACCGCCTGATCGAGGCGACGCGCGAGGAAGGCGCGGGGCAGCGCTACCTGATCCAGCACAGCGCC
GGTTCCGGCAAGTCCAACTCCATCGCCTGGATCGCGCACCAGCTCGCCGCGCTGTACGACGACGCGGGCGGCAAGCTGTT
CAACTCGGTGGTGGTCGTCACCGACCGCACGGTGCTGGACAGCCAGCTCCAGGACACGATCTACCAGTTCGACCACGCCC
ACGGCGTGGTGCGGCCGATCACCCGCGATGTCGGCAGCCAGAGTAAGTCGGAGCAACTGGCCGGGGCGCTGGCCGAGCAG
ACGCGCATCATTATCGTGACCATCCAGACCTTCCCGGCGCTGTTCGACGCGCTGGACAGGCGCCCTGACCTGGCCGCCGG
GCGCTACGCGGTGATCGCCGACGAGGCGCATTCCTCGCAGACCGGCTCCTCCGCCACCAAGCTGAAAGCGATCCTCGGCA
CCGACATGCCGGAGGACGAGGAGATCAGCGCCGAGGAGCTGCTTGACGCAGCCGTGGCCGCGCGCAAGCCGGCCGAGCGC
ATCAGCTACTACGCCTTTACCGCCACGCCCAAGGCGAAGACGCTGGAGCTGTTCGGCCGCCCGCCGAACCCGGAGCGGCC
GGCATCAAAGGACAACAAGCCCGAGCCGTTCCACCTGTACTCCATGCGTCAGGCGATCGAGGAAGGCTTCATCCTCGACG
TATTGAAGAACTACACTACGTACTCGACCGCCTGGAAGCTGGCGCACCCGCACGGCGACGACCAGGAGGTCGAGTCGCGC
AAGGCGTCGACCAGGATCGCCCGCTGGGTGCGCCTGCACCCGTACAACATCGCGCAAAGGGTCGAGGTGATCGTCGAGCA
CTTCCGCGCCAACGTGCGCCATCTGCTGGACGGCCAGGCGAAGGCGATGGTGGTGACCGGCAGCCGCCAGGAGGCGGTGC
GCTACATGCTGGCGATGCGCGAGTACATTCAGGACCGGGGCTACCCGGGCATCCACGCGCTGGTTGCCTTCTCCGGCACC
GTACTGCCGGACGACGTGATCCCCGAGGAGGTGACCGAGACCAGCGCGCTGTTGAACCCGGGGCTGAAGGGCCGCGACCT
GGCCGAGGCGTTCGATACCGCCGAATTCAACGTGATGATCGTCGCCAACAAGTTCCAGACCGGGTTCGACCAGCCCAGGC
TCTGCGCGATGTACGTGGACAAGAAACTCCAGGGCGTCGACTGCGTGCAGACCCTCTCGCGCCTGAACCGCATCTTTCCG
GGCAAGGAGAATACCTTCGTCCTCGACTTCTTCAACGACGCAGAAGACGTTCTCGCGGCCTTCCGCCCGTACTACAACAA
GGCCGAGCTGGCCGATGTCTCCGACCCCAATGTCGTGTACGACCTGCAGAAGAGCCTCGACGCCGCCGGCATCTACCACT
GGGACGAGGTCGAGAACTTCGCGCGCGCCTTCTTCGATCCAAAGGCCGGCGCCAGCCAACTCAGCTACTACTGCCAGCCG
GCAAAGGAACGCTATACGACCCGGTACAAGGCCGTGCTGGAACGGATCCAGGCCTGGAAGGCCGCGCAGGCGCTCGCCGA
GCGCAACGGCGACAAGGCCGGCCTCCACCGCGCCGAACAGGAGCTGAAGGACGCCCACACCACCCGAGACGAGCTCGACC
TGTTTCGGAAGAACCTGCAGAGCTTCGTGCGCAGCTACGAGTTCCTGTCACAGATCGTCGACTTCGACGACCGCGAGCTG
GAACAGCTCTGCGTGTACGCCCGTGCGCTGCACCCGCTGCTGCGCATCGAGAACCTGGAAGAAGACCCGATCGACGTCGA
CGAACTCGAGCTGACCCACTACCGGCTGAACAAGCGCGCCGAACATCGTCTGAATCTGGCGGAGGAGGGCGGAGACTACA
GCGGGCTGAAGCCCGTCACCGACGTCGGTTCCGGCAAGCCCCACGACCCGGAAAAGAAACGCCTTTCCGAGATCATCGAC
CAGCTCAACGAACTGTTCGGCGCCGAAGTCAGCGACGCAGACAAGCTCCAGTTCGCCAACGGCATCGCCGACCGCATCCG
CCGCGACGACGCCGTGATGGCCCAGGTCGAGAACCACAGCCCGGACCAGGTGATGCATGGCCTGTTCCCGAAACGCGTTA
CCGACACCGTGCTCGACGCGATGACCGACCACGAAAAACTCTCGATGCAGGTGCTGGACAGTCCCGAACATCAGCGCGAC
TTCGCGCTACTGATCCTGCGCCTGCTGACCGCCGGGGCCGGCGAGCAACGCAACCCCGCCACGAGGACCGGCCCCTGA

Protein sequence :
MMADSREARFQQDIIDAMTADGWLTAPASGYDAADALYTEDLVAYLQEAWPQRWEKFAAKNPQDPQKALVRAVTRELERG
GTLEVLRHGVKTPGVKLELCSFRPDHEMNPESVARYRANRLRVVPEVSYSPHARSGEYNPRLDLVLFVNGIPTATLELKS
EFKQSVENAKRQYRLDRPVKDPVTRKLEPLLTFRRGALVHFAVSQDEVAMTTRLNGKDTVFLPFNQGTEDGGAGNPTPPD
RDSYATGYLWQRVFQPDAWLRITGRFLHLEQKTEEDFHGKRRKKEALIFPRFHQWDVVNRLIEATREEGAGQRYLIQHSA
GSGKSNSIAWIAHQLAALYDDAGGKLFNSVVVVTDRTVLDSQLQDTIYQFDHAHGVVRPITRDVGSQSKSEQLAGALAEQ
TRIIIVTIQTFPALFDALDRRPDLAAGRYAVIADEAHSSQTGSSATKLKAILGTDMPEDEEISAEELLDAAVAARKPAER
ISYYAFTATPKAKTLELFGRPPNPERPASKDNKPEPFHLYSMRQAIEEGFILDVLKNYTTYSTAWKLAHPHGDDQEVESR
KASTRIARWVRLHPYNIAQRVEVIVEHFRANVRHLLDGQAKAMVVTGSRQEAVRYMLAMREYIQDRGYPGIHALVAFSGT
VLPDDVIPEEVTETSALLNPGLKGRDLAEAFDTAEFNVMIVANKFQTGFDQPRLCAMYVDKKLQGVDCVQTLSRLNRIFP
GKENTFVLDFFNDAEDVLAAFRPYYNKAELADVSDPNVVYDLQKSLDAAGIYHWDEVENFARAFFDPKAGASQLSYYCQP
AKERYTTRYKAVLERIQAWKAAQALAERNGDKAGLHRAEQELKDAHTTRDELDLFRKNLQSFVRSYEFLSQIVDFDDREL
EQLCVYARALHPLLRIENLEEDPIDVDELELTHYRLNKRAEHRLNLAEEGGDYSGLKPVTDVGSGKPHDPEKKRLSEIID
QLNELFGAEVSDADKLQFANGIADRIRRDDAVMAQVENHSPDQVMHGLFPKRVTDTVLDAMTDHEKLSMQVLDSPEHQRD
FALLILRLLTAGAGEQRNPATRTGP

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
SAS0025 YP_042158.1 type I restriction enzyme protein Not tested SCC476 Protein 1e-144 41