Gene Information

Name : DESAM_20023 (DESAM_20023)
Accession : YP_007324420.1
Strain : Desulfovibrio hydrothermalis DSM 14728
Genome accession: NC_020055
Putative virulence/resistance : Virulence
Product : putative type-1 restriction system, restriction subunit
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 305815 - 308784 bp
Length : 2970 bp
Strand : -
Note : Evidence 3 : Function proposed based on presence of conserved amino acid motif, structural feature or limited homology
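
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual convention for genome annotations, but not stated in this record) and that the final codon is a stop codon; the function names `feature_length` and `protein_length` are illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = feature_length(305815, 308784)
print(length_bp)                  # 2970
print(protein_length(length_bp))  # 989
```

Both values agree with the record: the Length field reads 2970 bp, and the protein sequence listed below contains 989 residues.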

DNA sequence :
ATGAAATTTACAGATACAAGCGAAGCAGGTCTTGAGACCACTATCGTCAAATCATTAATTGAAGACTCCGGTTACAGTGA
GGGAGCCCCTCAGGATTATGACCGTTCCCATGCGGTTGATCTTATTAAACTAACCAATTTTATTGAGGCTACACAGCCGG
AAGTTGCTGATCGCCTTAGCCTTAAAGTAGATTCTCCCACAAGAACAAAATTCCTCCACCGATTACAAGGCGAGATAGCC
AAACGGGGCATCATCGATGTGCTACGCAAAGGGGTGAAGCACGGCCCGGATTCCGTCACGCTCTTTTATGGCAGCCCGAC
AGCAAAGAATGAAAAGGCCAGGGAACTTTTTGATCAGAACATTTTTAGTGTGACGCGGCAACTGCGCTACAGCAACAGTA
ACACACAATTGGCACTTGATATGGGTATCTTTATCAACGGCCTTCCTGTTGCTACATTTGAATTGAAAAACAAACTCACC
AAACAGACCGTTCACGACGCAGTTCAACAATACAAAAATGACCGCGATCCCAAAGAACTGCTTTTTCAGTTTGGCCGTTG
CATGGTTCACTTTGTGGTGGATGACCATGAAGTTCGCATGTGCACTCATCTAAAGGGCAAGGCATCCTGGTTCCTTCCGT
TTAACAAAGGGCACAATGATGGTGCAGGCAACCCGCCAAATCCTGCAGGACTGGCTACGGATTACCTCTGGAAAGAGATC
CTTAACAAAGAGCGTCTTACTGATATCCTTGAAAACTACGCCCAGGTTGTTGAGGAAAAAGACGAAAAGACGGGTAGAAA
AAGATACAAACAGATATTCCCACGCTATCATCAGTTGGATGTGGTGCGTAAATTGCTGTCTGATGTCAAAAAAAATGGTG
TTGGAAAACGCTATCTGATTCAGCATTCGGCAGGAAGTGGAAAATCAAACTCTATTGCATGGCTGGCGCATCAGCTGGTT
GGATTGGAAAAAGACAAAAATCCCATCCTTGACTCAATAATCGTGGTGACCGACCGTCGGGTTCTGGACAAGCAGATCCG
TGACACCATCAGGTCCTTTGCTCAGGTTGGCAACGTGGTTGGTCATGCGGACCGCTCCGGTGACCTGCGCCGTTTTATTA
ATGAAGGCAAGCAGATCATCATAACCACAGTTCAGAAATTTCCTTTTATTCTGGCAGAGATTGGTGACGACCACAGACAA
AATAAGTTTGCCATTATCATTGATGAAGCACACTCCAGCCAGGGTGGACGTACTGCTTCAAAAATGAATATGGCGCTTTC
TGCAGATGTCTCTGATTCAGATGAAGAAGAAAGTACAGAGGATAAGATCAATGAGTTGATGGAAGGTCGCAAGATGCTGA
CCAACGCCAGCTATTTTGCCTTTACTGCCACGCCCAAAAATAAAACACTTGAAATTTTTGGTGAGCCTTCCCCTCAGCCA
GATGGGAAAGTAAAGCATTATCCTTTTCAAAGCTACACGATGAAACAGGCCATTCAGGAAGGGTTTATTCTGGATGTGCT
TAAGAACTATACACCTGTGGAAAGTTTTTACCGTCTTTCAAAAACTGTAGAAGACGATCCTCTTTTTGACACCAAAAAAG
CCCAGAAAAAGCTTCGTAAATATGTGGAATCCAATACTCATGCCATTCGTGAGAAAGCTGAAATCATGGTGGATCATTTC
CATGCACAGGTGATGGGACACCGTAAAGTTGGTGGACAGGCGCGTGTTATGGTTATTACCAGCGGCATCATGCGGGCCAT
CGAATATTTCTATGCCATAAACGACTATTTGCTTAACAAAAAATTACCTTATAAAACCATTGTGGCTTTCTCAGGCGAAC
ATGAATATGGAGGCCAGAAAGTTACAGAAGCCTCATTGAACGGTTTTCCCAGTAACAAGATTGAAGATAAGATTACTGAA
GATCCCTATCGAATTCTTGTTGTTGCCGATAAATTTCTTACCGGATATGACGAACCGTTGATGCATACCATGTATGTAGA
TAAGCCACTTGCAGGTATTAAGGCTGTTCAAACTTTATCGCGTTTGAACCGGGCACACCCAAAAAAACATGACACTTTTA
TTCTGGATTTTTTCAATGATGCTGATGTGATTCAGAAGGCATTTTCTGATTATTACCGCACAACCATTCTCAGTGAAGAA
ACAGACCCCAATAAGCTGCATGATCTTAAATCGGATTTGGATAGCTATCAGGTTTATTCACAGGAACAGATTGACCGATT
GGTGGAACTTTATCTTGGGGGAGCTGAAAGAGAAACTCTTGACCCTGTTCTTGATGCCTGTGTTGCTGTGTACAACAGTG
AACTTGATGAAGATGGGCAAGTTGATTTCAAAGGTAAGGCAAAAGCGTTTGTTCGAACCTACGGTTTTCTTGCTTCAATT
CTTCCCTTTACAAATCCCGATTGGGAAAAACTATCAATCTTTCTGAATTTTCTTATTTCTAAACTTCCCGCCCCAAAAGA
GGAAGACCTTTCAAAAGGAATTCTTGAAACCATCGATATGGATAGTTACCGGGCTGAAGTTCAGGCCAGTATGAATATTA
CTCTAGCAGATGGAGATGCAGAACTTGACGCTGTTCCAACAACTGGCGGCGGCCGAAAGCCAGAACCGGAACTGGATCAG
CTAAGCAATATCATCAAAACCTTTAATGATCTTTTTGGGAACATCGATTGGAAAGACGCAGATAAAATTCGAAAAGTCAT
TGCTGAAGAAATACCTGATAAAGTTGCTGCGGACTCTGCGTATCAGAATGCCATGAAAAATTCTGACAAACAGAATGCCC
GCATCGAACATGACAAGGCACTTGGTCGTGTAATGGTGGAATTAATTGCTGACCATACAGAACTCTTCAAACAGTTCAGC
GACAATCCGTCATTTAAAAAATGGCTGGGGGATACCATTTTTGGCGTAACCTATCAATCGACTGATCAAATCGGAACGGG
GGCGAGATAG

Protein sequence :
MKFTDTSEAGLETTIVKSLIEDSGYSEGAPQDYDRSHAVDLIKLTNFIEATQPEVADRLSLKVDSPTRTKFLHRLQGEIA
KRGIIDVLRKGVKHGPDSVTLFYGSPTAKNEKARELFDQNIFSVTRQLRYSNSNTQLALDMGIFINGLPVATFELKNKLT
KQTVHDAVQQYKNDRDPKELLFQFGRCMVHFVVDDHEVRMCTHLKGKASWFLPFNKGHNDGAGNPPNPAGLATDYLWKEI
LNKERLTDILENYAQVVEEKDEKTGRKRYKQIFPRYHQLDVVRKLLSDVKKNGVGKRYLIQHSAGSGKSNSIAWLAHQLV
GLEKDKNPILDSIIVVTDRRVLDKQIRDTIRSFAQVGNVVGHADRSGDLRRFINEGKQIIITTVQKFPFILAEIGDDHRQ
NKFAIIIDEAHSSQGGRTASKMNMALSADVSDSDEEESTEDKINELMEGRKMLTNASYFAFTATPKNKTLEIFGEPSPQP
DGKVKHYPFQSYTMKQAIQEGFILDVLKNYTPVESFYRLSKTVEDDPLFDTKKAQKKLRKYVESNTHAIREKAEIMVDHF
HAQVMGHRKVGGQARVMVITSGIMRAIEYFYAINDYLLNKKLPYKTIVAFSGEHEYGGQKVTEASLNGFPSNKIEDKITE
DPYRILVVADKFLTGYDEPLMHTMYVDKPLAGIKAVQTLSRLNRAHPKKHDTFILDFFNDADVIQKAFSDYYRTTILSEE
TDPNKLHDLKSDLDSYQVYSQEQIDRLVELYLGGAERETLDPVLDACVAVYNSELDEDGQVDFKGKAKAFVRTYGFLASI
LPFTNPDWEKLSIFLNFLISKLPAPKEEDLSKGILETIDMDSYRAEVQASMNITLADGDAELDAVPTTGGGRKPEPELDQ
LSNIIKTFNDLFGNIDWKDADKIRKVIAEEIPDKVAADSAYQNAMKNSDKQNARIEHDKALGRVMVELIADHTELFKQFS
DNPSFKKWLGDTIFGVTYQSTDQIGTGAR
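
The relationship between the DNA and protein sequences above can be illustrated with a small translation routine. This is a minimal sketch, assuming the DNA is shown as the coding strand in 5' to 3' orientation (it begins with ATG and ends with TAG, which is consistent with that) and that the standard codon assignments apply; the `translate` helper is hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons of the DNA sequence in this record.
print(translate("ATGAAATTTACAGATACAAGC"))  # MKFTDTS
```

The output matches the first seven residues of the protein sequence. Passing the full 2970 bp sequence (with or without line breaks) to the same function would be the way to check the whole translation.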

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
VC1765 | NP_231400.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 9e-175 | 43
VC0395_A1363 | YP_001217306.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 9e-175 | 43
VPI2_0013c | ACA01830.1 | type I restriction enzyme HsdR | Not tested | VPI-2 | Protein | 1e-174 | 43

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-value | Identity (%)
DESAM_20023 | YP_007324420.1 | putative type-1 restriction system, restriction subunit | VFG1098 | Protein | 4e-175 | 43