Gene Information

Name : APECO78_16850 (APECO78_16850)
Accession : YP_007382535.1
Strain : Escherichia coli APEC O78
Genome accession: NC_020163
Putative virulence/resistance : Unknown
Product : HsdR family type I site-specific deoxyribonuclease
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 3509784 - 3513023 bp
Length : 3240 bp
Strand : +
Note : COG0610 Type I site-specific restriction-modification system, R (restriction) subunit and related helicases

DNA sequence :
ATGGCAAAGATGACCGAATCCGATATTGAAGTAATGGCAATTGAGCACCTGCAAGGGCTGGGCTATGAGTATGTTTACGG
CCCGGACATTGAACCCAGTGGAATCAATCCGTTACGTAGCTATCAGCAGGTTATCCTTGAAGATAAAGTGCGTACAGCAT
TGCAACGAATTAACCCGCACCTTAGTGAGCAGAAGTGTGAAGAAGCTCTAAAACAGGTGATGCAGATCAGCTCACCTGAC
CTAATGGCAAACAATCTAACCTTTCATCGCCTTTTGACCGAAGGAATCAATATCGAAGTCAGCAAGGATGGTAATACACA
AGGAGAACTGGCCAGCCTGATCGACTTTAACGATCCCACTAATAATGCATTTCTGGTCATTAATCAGCTCACCATCAAAG
AAGGCAACCATACCCGCCGACCTGATCTTATTTTATTCATCAATGGCCTGCCATTAGTCGTTATCGAACTTAAAAATGCC
GCTGACGAAAATGCAACGGTAACAGGTGCCTATAATCAGATTAAAACCTATCAGAACCAAATCCCCGGTCTGTTTAACTA
CAATGCTTTTAATGTGATATCTGACGGGCTGGAAGCCAAAGCGGGGACGGTTTCTGCCGATTTCAGTCGTTATATGACGT
GGAAAACTGCTAACGGTAAAACGCAAGCCACCAGTACCCAACCACAGCTTGAAGTTTTATTACAGGGGTTGCTTAATCCT
GTAACGCTGCTGGATATAATCCGCCACTTTATCGTGTTTGAGGCCAGCAAACATGAAGACAGCAAAGGGATTATCAGTAT
CCGTACTGTTAAAAAAATGGCGGCTTATCACCAGTACTACGCGGTCAATGCAGCAGTTCTTTCCACTATTCGTGCCTCAG
CGGTGAATGCGGACTCCCCCTCTGCCGAAGTGGCACTGCGCCAGCAGGGACGTAACAGTAAAGATCTTGTTAATGCGCAA
AAAACCGGAGATCGCAAAGCGGGCGTAGTCTGGCATACTCAGGGTTCCGGTAAATCGCTTTCGATGGTGTTTTATACCGG
GAAAATTGTGCTGGCGCTGGATAATCCGACAGTTGTTGTGATTACTGACCGTAACGACTTAGATGATCAGCTATTCGGTA
CGTTCTCTTCCGCGACCCAGCTACTTCGTCAGACACCAAAACAGGCCAACAACCGGGAAGAACTCAAAGAATATTTGCGT
GTCGCCTCTGGCGGTGTGGTGTTTACTACTATTCAAAAATTCCAGCCTGATGATGGCAGCAATATCTATGAGTTGTTGTC
AGACAGAACCAATATTGTCGTTATCGCTGATGAAGCACACCGTTCCCAGTACGGTTTCAGCGCCAAAGAAGTTGACGTGA
AAGACAGCGAAGGCAACGTAACAGGTAAACGCACCGTTTACGGCTTTGCCAAATATATGCGTGATGCCTTACCTAATGCG
ACCTATCTCGGCTTTACCGGAACCCCCATAGAAAAAACGGACGTCAACACGCCTGCTGTTTTTGGTAACTATGTTGATAT
CTACGATATCTCGCAGGCCGTTGAAGATGGTGCAACAGTTCGTATCTTTTATGAAAGCCGTCTTGCCAAAATTGCCATCA
GTGATGAAGGTCGTCAGCTTATTGAAGACTTTGATGATGAGTTTAACGAGGACGAACTGACGCTCACGCAAAAAGAACGT
TCTAAATGGGCCAGAATCGAAGGTTTGATTGGCAGTTCAAAACGTATTAAAGCGATTGCGGCGGATATGGTTCTGCACTT
TGAGCAGCGCTTAAAATCCAATGCCGATCATGGTAAGGGCATGATTGTTACCATGTCCCGCCGTATTGCTGCTGAACTAT
ATAAAGAAATCATAGCCTTAAAACCTGAATGGCACAGCGATGATTTAAATGACGGTATAATAAAAGTCGTCATGACCTCT
TCTGCTGCTGACGGGCCAGAAATTGCCAAACACCACACCACAAAAAAAGAACGTCAGGTTCTGGCTAACCGTATGAAGGA
TGACGACGACAAGCTGAAACTGGTGATAGTGCGTGATATGTGGTTAACCGGCTTCGACGCTCCCAGCATGCATACGCTGT
ATATCGACAAACCAATGAAAGGCCACAACCTTATGCAGGCAATTGCCCGTGTGAACCGTGTGTATAAAGATAAGATAGGC
GGTCTGGTTGTTGACTATCTGGGCATTGCCTCTGATTTAAAAGAAGCCCTCTCCTTTTACTCTGATGCAGGTGGACGTGG
AGATCCTGCTGAGGTTCAGGAAGAAGCCGTAACGCTCATGCAGGAAAAGCTGGAAATCCTGGAAGGCATGATGCATGGAT
ACGATTACAAAGCTTACTTTGCCGCAACTACCTCACAACGCCTGACAATTATTCTTGAATCAGAAAACCATATTTTAGGG
CTGGATAACGGTAAAGGTAAAATGCGTTTCCTCGCTGCGGTTGCAGCCTTATCGCAGGCATTTGCATTGGCGACACCGCA
CGATAAAGCAATGGAAGCGGCACCCGAAGTAGCATTCTTCCAGGCAGTAAAAGCCAGACTGAATAAATTTACTGAAAACT
CAGACGGATCAGAAGAAGAACACAATGACAGTCTCGAAGTTCGGGTAAAACAGACTATCGATCAGGCTCTGGTTACCGAT
AAAGTTGTTGATATTTTTGACGCTGCTGGGATACAAAAACCTGATATTTCCGTTCTTTCTGAAGAATTCCTTCAGGAAAT
GAAGGATTACCAACACAGAAATATTGCTTTGGAAACGCTTAAAAAACTGCTTTCTGACGAGATCAAGGTTCGCTCGAATC
AAAGTATCACCCAAGGCAAAAAACTGATTGATATGCTGACCTCTGCAATCAATGGCTACCAGAACAAGGTACTGACCGCA
GCGGAGGTAATTGATGAGCTAATCAAGCTTGCCAAGACTATCCAGGAATCTGACAGCCTTGCCAGCCAGTTAAACCTCAG
CGCTTATGAATATGCCTTCTATTCTGCTGTTGCAGATAACGACAGTGCTCGCGAGTTAATGGAAAAGGAAAAACTACGAG
AACTGGCAGTCGTACTTACGGAGGCTATCCGCAACAATGTCAGTCTTGACTGGACAGTGAAAGAAGCAGCAAGAGCAAAA
ATTCGCGTGGTGGTAAAACGTCTGCTCAAAAAATATGGTTATCCGCCTGATATGTCATTGCTCGCCACAGAGACTGTTTT
GAAGCAGGCTGAACTTTTAGCTGGAGAATTAGGAAAATGA

Protein sequence :
MAKMTESDIEVMAIEHLQGLGYEYVYGPDIEPSGINPLRSYQQVILEDKVRTALQRINPHLSEQKCEEALKQVMQISSPD
LMANNLTFHRLLTEGINIEVSKDGNTQGELASLIDFNDPTNNAFLVINQLTIKEGNHTRRPDLILFINGLPLVVIELKNA
ADENATVTGAYNQIKTYQNQIPGLFNYNAFNVISDGLEAKAGTVSADFSRYMTWKTANGKTQATSTQPQLEVLLQGLLNP
VTLLDIIRHFIVFEASKHEDSKGIISIRTVKKMAAYHQYYAVNAAVLSTIRASAVNADSPSAEVALRQQGRNSKDLVNAQ
KTGDRKAGVVWHTQGSGKSLSMVFYTGKIVLALDNPTVVVITDRNDLDDQLFGTFSSATQLLRQTPKQANNREELKEYLR
VASGGVVFTTIQKFQPDDGSNIYELLSDRTNIVVIADEAHRSQYGFSAKEVDVKDSEGNVTGKRTVYGFAKYMRDALPNA
TYLGFTGTPIEKTDVNTPAVFGNYVDIYDISQAVEDGATVRIFYESRLAKIAISDEGRQLIEDFDDEFNEDELTLTQKER
SKWARIEGLIGSSKRIKAIAADMVLHFEQRLKSNADHGKGMIVTMSRRIAAELYKEIIALKPEWHSDDLNDGIIKVVMTS
SAADGPEIAKHHTTKKERQVLANRMKDDDDKLKLVIVRDMWLTGFDAPSMHTLYIDKPMKGHNLMQAIARVNRVYKDKIG
GLVVDYLGIASDLKEALSFYSDAGGRGDPAEVQEEAVTLMQEKLEILEGMMHGYDYKAYFAATTSQRLTIILESENHILG
LDNGKGKMRFLAAVAALSQAFALATPHDKAMEAAPEVAFFQAVKARLNKFTENSDGSEEEHNDSLEVRVKQTIDQALVTD
KVVDIFDAAGIQKPDISVLSEEFLQEMKDYQHRNIALETLKKLLSDEIKVRSNQSITQGKKLIDMLTSAINGYQNKVLTA
AEVIDELIKLAKTIQESDSLASQLNLSAYEYAFYSAVADNDSARELMEKEKLRELAVVLTEAIRNNVSLDWTVKEAARAK
IRVVVKRLLKKYGYPPDMSLLATETVLKQAELLAGELGK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 47
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 46
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 46
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 46