Gene Information

Name : Clocl_0191 (Clocl_0191)
Accession : YP_005044860.1
Strain : Clostridium clariflavum DSM 19732
Genome accession: NC_016627
Putative virulence/resistance : Virulence
Product : helicase, type I site-specific restriction-modification system restriction subunit
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 196282 - 199266 bp
Length : 2985 bp
Strand : +
Note : PFAM: Type I restriction enzyme R protein N terminus (HSDR_N); Type III restriction enzyme, res subunit

DNA sequence :
TTGGCAAAAACTCCAGTGGAACTTAGTGAAAAGGGTTTTGAGGAATATGTTGAAGAACATCTATTAAATTCAGGATATAT
TAAGGGTAATCCTGATGACTATAACAAAGAATTTGCCATTGATGCTAAACTTCTTATTGATTTTTTAGAAGATACACAGC
CAAAGAAGATGGAGAGATTAAGAGAAATATATAAGGAACAGTATCAGTTTAAATTATTAAGCCGGTTAAATAGGGAATTA
AACAACCGTGGCATGATTGATGTATTAAGACATGGAATTAAAGACTATGGTGTGTATTTAGACCTTGCCTATTTCCGGCC
TGCCAGCAAATTAAATGATGAAATGATAAAACTCTATGAGAAAAATCGAATATCTGTAACAAGGCAGGTACACTACAGCA
CTAAAAACGAAAACAGTATAGATATGCTTATATGTATCAATGGACTTCCTGTTGTTGTTTTAGAACTTAAAAATCCTTTC
ACTGGTCAAACATATCAAGATGCAATCATGCAGTACAAGAAAGACAGAAGCCCTAATGAACTGTTGTTTCAGTTTAAGAA
AAGAGCCATTGTATTCTTTGCAGTAGATACTCAAGAAGCATATATGACTACGAGACTTTCAGGAGATAAAACTTCTTTTC
TTCCATTTAATAAAGGTTGTGATGGAGGAAAAGGAAATCCAGACAATCCTGATGGCTTAAAAACGGATTATCTTTGGGAA
GAGATACTTCAAAAAGATAGCCTGATGGACATATTAAAAAGATTTGTATTTATAGAGACTCAAGAAAAGAAAGACATAGA
CGGTAATACTTATACCTCGGAAACCGTCATATTCCCAAGATACCACCAATTAGATGTAGTAAGGAAACTGGAAGCCGATG
CAAAAAAGAAAGGCGTAGGGACGAACTATCTTGTGCAGCATAGCGCAGGGTCAGGCAAAACCAACTCCATATCATGGCTT
GCCCACAGACTTGCCAATCTTCATGATGATAATGATAACCCTGTATTTGATTCGGTTATTGTTATTACGGATAGAAGAGT
TTTGGACAGGCAGTTGCAGGACAGCATTTACCAGTTGGAGCACAAATATGGAGTTGTTCAAAAAATAGATAAAGACTCCA
ACCAATTGGCTGATGCCTTGAAGAGCGGAACAAGAATAATTATTTCCACTCTGCAAAAGTTCCCCTTTATTATCGAAAAG
GTAGGAGAGTTGGAAAATCGTAAGTATGCTGTTATTATTGATGAGGCCCATTCCAGCAGTGCAGGAGAAAACATGGCTTC
TTTAAGGGAAGTGCTGTCGGTAAGTACCCTTGAAGAAGCGGCAAAACTGGATGAAGAGTTAGAGGGCAAAGAATACGACC
CTGAGGAGGAGATTTTAAAAACAATAAAGAAAAGGGGAAAACAGCCTAATATCAGTTTCTTTGCTTTTACGGCTACGCCT
AAAGCAAAAACTCTTGAGATGTTCGGGACAATAGGACCTGACGGACTGCCCCATCCATTTCATTTATACTCTATGAGGCA
GGCAATTGAAGAAGGCTTTATTTTAGATGTGCTTCAAAACTATGTTACATATGAAACATACTTCAAACTGGCAAAGAAAA
TAGAGGATGACCCAACCTTTGATAGAGCAAAAGCGACTAAAGCAATAACCCGATATGTAAGCCTGCACCCACATAACATT
GCACAAAAGACTGAAATCATGGTGGAACACTTCAGGAGCGTAACAAGGCATAAAATAGGCGGAAGGGCCAAGGCAATGGT
TGTAACCAGTTCCAGGCTCCATGCAGTACGCTATAAGCATGCCTTTGATGAATATATCAAAAAGAAAGGCTACAGAGATA
TGAAAACCCTGGTAGCCTTTTCGGGAACAGTAAAAGATGGCGGTGTAGATTACAAAGAAAGTGACATGAACGGATTTAAG
GAATCAGAACTTCCAGAACGTTTTGCCACTGATGAGTACCAGGTGCTTTTGGTTGCAGAAAAGTACCAGACAGGATTTGA
CCAACCCCTTTTACATACCATGTATGTGGATAAAAAGTTGTCAGGAGTCAAGGCGGTACAGACTTTATCAAGGCTCAATA
GAACCTGCACAGGAAAAGATGATACCTTTATCCTTGACTTTGTAAATAAGGCAGAGGATATTCAGGAAGCCTTCAAGCCC
TATTACCAGGCAACCATTGTGGAAGAAGTGACAGAGCCTAACCTGCTTTATGATATTGAAACTCAACTACATTCTTATGG
TGTATATCTCAAAGAGGAATTGGATAAGTTTGCTTATATTTATTTTAAGCCCAAGGATAAAAAGACTTCCAAAGATAGGG
CAATGTTGAATCACTTGATAGATATAACCGTAGAGAGGTTTAAGAAACTAGAGGAACAGCGAAAGCAAGATTTCAGTAGC
CAAGCGACGAAATACATAAGGCTTTATTCATTCATTCTACAAATCACGCCTTTTGAGGATGTTGAATTGCATAAGTTGTA
TGTATATTTAACATATCTGCTAAAAAAACTTCCAAAGGGAAAAGGCTCCACTGTTCACCTCGCTGATGAAATTGCTTTAG
AGTATTATACTACCAGAAAGACATTTGAAGGGAGTATCTCATTAACTGCTGATGATGAAGTACCAGTTACACCTGTTAAA
TTTGCAGGAACAGGGGTAAAAGAAGAACAGGAAGAATATCTGTCCAGCATTATTGAGCGTCTTAATAAGCGGTTTGGAAC
GGACTTCACAAAAGCGGACCAATTATCGGTAGAGCAGATTAAAGAGGATTTTGCTGCTGATGAGGATTTGGTTCAAAAGG
CTAAGACAAATACTATTGATGACTTTAGACTTGCTTTTGAAAAAGTATTTATTAATAAAGTAATTGACAGGATGGACCAG
AACCAGGCATTCTTTACCCGTGTTTTAGACGATGAACAATTTAAGAATGCACTTATGGAGTATATGCTGGTTGAGACCTA
TGAGAAGTTAAACAGCAGGGCTTAG

Protein sequence :
MAKTPVELSEKGFEEYVEEHLLNSGYIKGNPDDYNKEFAIDAKLLIDFLEDTQPKKMERLREIYKEQYQFKLLSRLNREL
NNRGMIDVLRHGIKDYGVYLDLAYFRPASKLNDEMIKLYEKNRISVTRQVHYSTKNENSIDMLICINGLPVVVLELKNPF
TGQTYQDAIMQYKKDRSPNELLFQFKKRAIVFFAVDTQEAYMTTRLSGDKTSFLPFNKGCDGGKGNPDNPDGLKTDYLWE
EILQKDSLMDILKRFVFIETQEKKDIDGNTYTSETVIFPRYHQLDVVRKLEADAKKKGVGTNYLVQHSAGSGKTNSISWL
AHRLANLHDDNDNPVFDSVIVITDRRVLDRQLQDSIYQLEHKYGVVQKIDKDSNQLADALKSGTRIIISTLQKFPFIIEK
VGELENRKYAVIIDEAHSSSAGENMASLREVLSVSTLEEAAKLDEELEGKEYDPEEEILKTIKKRGKQPNISFFAFTATP
KAKTLEMFGTIGPDGLPHPFHLYSMRQAIEEGFILDVLQNYVTYETYFKLAKKIEDDPTFDRAKATKAITRYVSLHPHNI
AQKTEIMVEHFRSVTRHKIGGRAKAMVVTSSRLHAVRYKHAFDEYIKKKGYRDMKTLVAFSGTVKDGGVDYKESDMNGFK
ESELPERFATDEYQVLLVAEKYQTGFDQPLLHTMYVDKKLSGVKAVQTLSRLNRTCTGKDDTFILDFVNKAEDIQEAFKP
YYQATIVEEVTEPNLLYDIETQLHSYGVYLKEELDKFAYIYFKPKDKKTSKDRAMLNHLIDITVERFKKLEEQRKQDFSS
QATKYIRLYSFILQITPFEDVELHKLYVYLTYLLKKLPKGKGSTVHLADEIALEYYTTRKTFEGSISLTADDEVPVTPVK
FAGTGVKEEQEEYLSSIIERLNKRFGTDFTKADQLSVEQIKEDFAADEDLVQKAKTNTIDDFRLAFEKVFINKVIDRMDQ
NQAFFTRVLDDEQFKNALMEYMLVETYEKLNSRA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
SAS0025 YP_042158.1 type I restriction enzyme protein Not tested SCC476 Protein 0.0 45
VC1765 NP_231400.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 4e-135 42
VC0395_A1363 YP_001217306.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 4e-135 42
VPI2_0013c ACA01830.1 type I restriction enzyme HsdR Not tested VPI-2 Protein 1e-135 42

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
Clocl_0191 YP_005044860.1 helicase, type I site-specific restriction-modification system restriction subunit VFG1098 Protein 2e-135 42