Gene Information

Name : Acav_4671 (Acav_4671)
Accession : YP_004237117.1
Strain : Acidovorax avenae ATCC 19860
Genome accession: NC_015138
Putative virulence/resistance : Unknown
Product : HsdR family type I site-specific deoxyribonuclease
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 5346115 - 5349318 bp
Length : 3204 bp
Strand : -
Note : TIGRFAM: Restriction endonuclease, type I, EcoRI, R subunit; PFAM: Protein of unknown function DUF3387; Restriction endonuclease, type I, R subunit/Type III, Res subunit; Restriction endonuclease, type I, EcoRI, R subunit/Type III, Res subunit, N-terminal
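
The Position and Length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GenBank convention) and that the coding sequence ends in a single stop codon; the function name `feature_length` is hypothetical and used for illustration only.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
start, end = 5346115, 5349318

length_bp = feature_length(start, end)

# A complete coding sequence should be a whole number of codons.
codons, remainder = divmod(length_bp, 3)

# Assumption: exactly one terminal stop codon that is not translated.
residues = codons - 1

print(length_bp, codons, remainder, residues)
```

Under these assumptions the computed length agrees with the Length field (3204 bp), and the residue count can be compared against the protein sequence listed further down.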

DNA sequence :
ATGACAGAAGATCAACTCGAACAGGACGCACTGAGTTGGCTCGTCGAAGTCGGCTACACCCACCTGAGCGGCTACGACAT
CGCCCCCGATGGCCCCGCACCTGAGCGCGACAGCTTCCGCCAGGTGCTGCTGCCCCAGCGCCTGCGCGACGCCATCGCCC
GCCTGAACCCGCACATTCCCCTGGCAGCGCGCGAAGACGCCTTCAAGCAGGTGCAGGACCTGGGCACCCCGGTGCTGCTG
TCGGCCAACCGGCACTTCCACCGGCTGCTGGTGGGCGGTGTGCCTGTGCAGTACCAGCAGGACGGCGAAACCCGGGGCGA
CTTCGTGCGCCTGGTGGACTGGGGCAATGCACAGGCCAATGAATGGCTGGCTGTCAACCAGTTCTCGCTCAAGGGGCCGC
ACCACACGCGCCGCCCGGACATCATCCTGTTCCTCAACGGCCTGCCCGTGGTGCTGCTGGAACTGAAGAACCCGGCGGAT
GAAAACGCCAGCATCTGGAAGGCCTACGAGCAGATCCAGACGTACAAGGCGCAGATCCCCGATGTGTTCCAGTACAACGA
GGTGCTGGTGGTGTCGGACGGCTCCGAGGCGCTGATGGGCTCGCTTTCCAGCAATGCCGAGCGCTTCATGGCCTGGCGCA
CCATCGATGGCGTGGCGCTGGACCCGCTGGGCCAGTTCAACGAGCTGGAAACCCTGGTGCGCGGCGCGCTGGCCCCGGCC
TATGTGCTGGACTATCTGCGCTACTTCGTGCTGTTCGAGGACGATGGCGGCCTGGTCAAGAAGATCGCGGGCTACCACCA
GTTCCATGCGGTGCGTGCGGCCATAGAGCAGGTGGTGGCCGCATCCCGCCCGAGTGGTTCGCACAAGGGCGGGGTGGTGT
GGCACACCCAGGGCAGCGGCAAGAGCATCACCATGACCTGCTTTGCCGCCCGGGTGATGCAAGAGCCGGCCATGCAGAAC
CCCACCATCGTGGTCATCACCGACCGCAATGACCTGGACGGCCAGCTCTTCGGCGTGTTCAGTCTGGCGCAGGACCTGCT
GCGCGAGCAGCCCGTGCAGGTCGAGACCCGGCAAGACCTGCGCGCCAAGCTTGCCAACCGGCCCTCCGGTGGCATCGTGT
TCGCTACCATACAGAAGTTCATGCCGGGCGAAGACGAGGACACCTTCCCGGTGCTCTCGAACCGCAGCAACATCGTCGTG
ATCGCCGACGAAGCGCACCGCACGCAGTACGGGTTCGAGGCGAAGCTCAAGACCATCAGGCGCAAGGCTGGCCAGCCGGA
GGGCGCAGCGACTGCAAGGACCACGGGTGACGCAGCCAGCTCGGCGCTGACCGTCGATTTCGTTCCACCGGCGTATGAGG
TGGAACACAAATACCAGGCCGGCTACGCCCAGCACTTGCGCGACGCCTTGCCCCACGCCACCTTCGTGGCCTTCACCGGC
ACGCCCGTGTCCAGCGAAGACCGCGACACGCGCGCCGTGTTTGGCGATTACATCCATGTCTATGACATGCAGCAGGCCAA
GGAAGACGGCGCCACCGTGGCCATCTACTATGAAAGCCGCCTGGCCAAGCTCAGCCTGAAGCAAGACGAGCTGCCGCACC
TCGACGAAGAGGTGGACGAGCTGGCCGAAGACGAGGAAGAAAGCACCCAGGCCAGGCTCAAGAGCCGCTGGGCCGCCCTG
GAAAAGGTGGTAGGCGCCGAGCCCCGCGTGGCGGCCGTGGCGGCGGATCTGGTCAAGCATTTCGAGGAGCGCAACAAGGC
ACAGAGCGGCAAGGCCATGGTCGTGGCCATGAGCCGCGACATCTGCGTCCACCTGTACGACGAGATCACCAGGCTGCGCC
CCGGCTGGCATGACGCAGACCCTGAAAAGGGCGCCGTCAAGATCGTGATGACCGGCTCGGCCAGCGACAAGGCCTTGCTG
CGTCCGCACATCTACGGCGGCCAGGTCAAGAAGCGGCTGGAGAAGCGCTTCAAGGACCCGGCCGATCCGCTGCGCCTGGT
GATCGTGCGCGACATGTGGCTGACCGGCTTCGACGCCCCTTGCGTGCACACCCTGTATGTGGACAAGCCCATGAAGGGCC
ACAACCTCATGCAGGCCATCGCCCGCGTGAACCGCGTGTTCAAGGACAAGCAGGGCGGCCTGGTGGTCGATTACATCGGC
ATTGGCAACGAACTCAAGGCCGCGATGAAGGAGTACACGGCCAGCAAGGGCCGAGGCAAACCCACGGTCGATGCCCACGA
GGCCTACGCCGTGCTGGAGGAAAAGCTCGACGTGCTCCGCGCCATGCTGCATGGCTTCGACTACAGCGGCTTCCTCACGG
GTGGGCACAAGGTGCTGGCGGGTGCGGCCAACCATGTACTGGGCCTGAAGAGCGAAGGCCGGCGCGATGGCAAAAAGCGT
TTCGCGGATACCGCTCTGGCCATGAGCAAGGCCTTCACCCTGTGCTGCACGCTGGACGAAGCCAAGGCCGTGCGCGAAGA
AGTGGCCTTCATGCAGGCCGTGAAGGTGATCCTGACCAAGAAGGACATCACCCAGCAGAAGAAGACCGACGAACAGCGCG
AGCTGGCCATCCGCCAGATCATCGGCTCGGCGGTGGTGTCCGACCGCGTGGTGGACATCTTCGATGCCGTGGGTCTGGAC
AAGCCCAACATCGGACTGCTGGACGACGATTTCCTGGCGCAGGTGAAGAACCTGCCCGAGCGCAACCTGGCGGTGGAACT
GCTGGAGCGCCTGCTGGAAGGCGAGATCAAGAGCCGTTTCGCCACCAACGTGGTGCAGGAGCGCAAGTTCTCCGAGCTGC
TGGGCAACGTCATCAAGCGCTACCAGAACCGTTCCATCGAAACCGCCCAGGTCATGGAAGAGCTGGTGGAGATGGCCAGG
AAATTCCGCGAGGCTGCATCACGCGGCGAGAGCCTGGGCCTGACCGACGACGAGGTGAAGTTCTACGACGCGCTGATCCT
CAACGAATCGGCGGCGCGGGAACTGAGCGACGAAACGCTGAAAAAAATCGCTCATGAGCTGACGACCAGCCTGCGCCAGA
ACATCAGCGTGGACTGGGCGCACCGCGAGAGCGTGCGGGCCAAGCTGCGGCTGATGGTCAAGCGCATCTTGCGCAAGTAC
AAGTACCCGCCTGACTTGGCTGATGCCGCGGTGGAGCTGGTACTGGAGCAGGCCGAGACGATCGGGGATGAGTGGGTCAA
GTAA

Protein sequence :
MTEDQLEQDALSWLVEVGYTHLSGYDIAPDGPAPERDSFRQVLLPQRLRDAIARLNPHIPLAAREDAFKQVQDLGTPVLL
SANRHFHRLLVGGVPVQYQQDGETRGDFVRLVDWGNAQANEWLAVNQFSLKGPHHTRRPDIILFLNGLPVVLLELKNPAD
ENASIWKAYEQIQTYKAQIPDVFQYNEVLVVSDGSEALMGSLSSNAERFMAWRTIDGVALDPLGQFNELETLVRGALAPA
YVLDYLRYFVLFEDDGGLVKKIAGYHQFHAVRAAIEQVVAASRPSGSHKGGVVWHTQGSGKSITMTCFAARVMQEPAMQN
PTIVVITDRNDLDGQLFGVFSLAQDLLREQPVQVETRQDLRAKLANRPSGGIVFATIQKFMPGEDEDTFPVLSNRSNIVV
IADEAHRTQYGFEAKLKTIRRKAGQPEGAATARTTGDAASSALTVDFVPPAYEVEHKYQAGYAQHLRDALPHATFVAFTG
TPVSSEDRDTRAVFGDYIHVYDMQQAKEDGATVAIYYESRLAKLSLKQDELPHLDEEVDELAEDEEESTQARLKSRWAAL
EKVVGAEPRVAAVAADLVKHFEERNKAQSGKAMVVAMSRDICVHLYDEITRLRPGWHDADPEKGAVKIVMTGSASDKALL
RPHIYGGQVKKRLEKRFKDPADPLRLVIVRDMWLTGFDAPCVHTLYVDKPMKGHNLMQAIARVNRVFKDKQGGLVVDYIG
IGNELKAAMKEYTASKGRGKPTVDAHEAYAVLEEKLDVLRAMLHGFDYSGFLTGGHKVLAGAANHVLGLKSEGRRDGKKR
FADTALAMSKAFTLCCTLDEAKAVREEVAFMQAVKVILTKKDITQQKKTDEQRELAIRQIIGSAVVSDRVVDIFDAVGLD
KPNIGLLDDDFLAQVKNLPERNLAVELLERLLEGEIKSRFATNVVQERKFSELLGNVIKRYQNRSIETAQVMEELVEMAR
KFREAASRGESLGLTDDEVKFYDALILNESAARELSDETLKKIAHELTTSLRQNISVDWAHRESVRAKLRLMVKRILRKY
KYPPDLADAAVELVLEQAETIGDEWVK
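
The relationship between the DNA and protein sequences above can be sketched as a codon-by-codon translation. This is a minimal illustration, assuming the DNA sequence is listed in coding orientation (it begins with ATG even though the Strand field is "-") and that the standard codon assignments apply; the names `CODON_TABLE` and `translate` are hypothetical and do not come from the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases of the DNA sequence in this record.
print(translate("ATGACAGAAGATCAACTCGAACAG"))
# Last 18 bases of the DNA sequence in this record, including the TAA stop.
print(translate("GATGAGTGGGTCAAGTAA"))
```

Running the same function over the full 3204 bp sequence (with line breaks removed) would allow a direct comparison with the protein sequence listed above.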

Homologs from PAI DB

Gene  | GenBank Accn | Product                                                        | Virulence or Resistance | PAI or REI      | Alignment Type | E-val | Identity (%)
api52 | CAF28526.1   | hsdr-like Type I restriction enzyme                            | Not tested              | YAPI            | Protein        | 0.0   | 54
hsdR  | BAD24840.1   | type I restriction-modification system endonuclease homologue  | Not tested              | Type-V SCCmec   | Protein        | 0.0   | 47
hsdR  | YP_251977.1  | type I restriction-modification system restriction subunit     | Not tested              | SCCmec          | Protein        | 0.0   | 46
hsdR  | BAH57699.1   | type I restriction-modification system endonuclease homologue  | Not tested              | Type-VII SCCmec | Protein        | 0.0   | 46