Gene Information

Name : EcE24377A_0289 (EcE24377A_0289)
Accession : YP_001461443.1
Strain : Escherichia coli E24377A
Genome accession: NC_009801
Putative virulence/resistance : Unknown
Product : HsdR family type I site-specific deoxyribonuclease
Function : -
COG functional category : V : Defense mechanisms
COG ID : COG0610
EC number : -
Position : 311933 - 315067 bp
Length : 3135 bp
Strand : +
Note : identified by similarity to GB:AAB70710.1; match to protein family HMM PF04313; match to protein family HMM PF04851; match to protein family HMM TIGR00348

DNA sequence :
ATGCTGAGCGAAGACGATTTAGAACAGCAAAGCCTGCAATGGTTTGCTGAACTGGGCTGGGAAGTATTGCACGGGCCGGA
TATTGCGCCAGATGGCAACAATCCGCTGCGTGCGTCGTTCCACGATGTGTTTTTGCGCCCGGTGCTACTGGAGCAGTTGC
AAAAGATTAACCCACATCTCCCGGTTGCCGTGCTGGATGAGGTGATACTGCGTATCGCCCATGCGCAAAGCCCGGATCTG
GTCGTCAGTAACAAAGCCTTCCATCACCTGCTGCTCGACGGCGTGCCGGTGCAGTACAAGCAGGATGACAAAGTTATCCA
CGACAAAGCGCTACTGATGGATTTTAACCACCCAAATAATAACCACTTTACGGTGGTGAATCAGGTGGCCATCCAGGGAA
CGAAGCAGGTACGTCGCCCGGACGTTATCTGCTATATCAACGGATTGCCAGTAGTAGTGATTGAGCTGAAAAGCCCGATT
GATGCTAATGCCGATATCTGGGCGGCATTTAATCAGTTGCAGACCTATAAAAACGAACTCAGCGATCTGTTCATCTGCAA
CGAAGCGCTGGTGGTCAGCGATGGCTACAACGCCCGTATTGGCTCTCTCACCGCGAACGAAGAGCGCTTCTTGCCGTGGA
AAACGCTGTCTAATGAAGACGACAAACCGCTGTTTGAATGGCAGCTTGAAACGGTAGTAAAAGGGTTCTTTAACCGCGAA
CTGCTGCTCGACTACATTCGTTACTTCATCCTGTTTGAAAGCGACGGCAAACGACTGATTAAAAAGATTGCCGCTTACCA
CCAATTCCACGCAGTACGTGAAGCGGTGACGGCGACGATTGTGGCCTCTACCGGTAAACACTTGCCGCTGCGCAGCAACA
TCACGCCAGGCAGTAAAAAAGCCGGTGTGGTGTGGCATACGCAGGGTTCCGGTAAGAGTATCTCGATGTGTTGTTACGCC
GGAAAACTGCTACAACAAGCGGAAATGAACAACCCGACCATCGTGGTGGTGACCGACCGTAACGATCTCGACGGCCAACT
GTATGCCACCTTCTGCCAGGCACAGGATTTGCTTAAGCAGGAACCGTTACAGGCAAACGATCGCGACCAGCTCCGCGAGA
TGCTCAATGTCCGTGAATCAGGCGGGATTATTTTTACCACCGTACAAAAATTTGCCCCGCTTGATGGCGAACAGACTCAC
CCGGCGCTAAACCTGCGCAGCAATATCGTCGTCATTTCCGATGAAGCGCACCGCAGCCAGTATGGTCTTAGCGCCACGCT
GAACCGGGAGACTGGCGCTTATAAATACGGTTACGCCAAACATATGCGCGATGCGTTACCCAATGCGTCGTTTATGGGCT
TTACCGGAACACCGGTTTCTTCCGAAGATAAAGACACCCGCGCGGTGTTTGGTGATTACGTCTCTATCTACGATATACAG
GATGCGGTGGAAGATGGCGCAACCGTGCCTATCTACTATGAATCGCGCCTGGCAAAACTCGACCTCAACCACGAAGAGCT
GGAAACGCTGTCTAATCAGGTGGATGAGCTGGTCGAAGATGAAGAGACCGATCAGAAAGAGAAAACCAAAAGTGACTGGA
GCCGTCTGGAAAAACTGGTTGGTTCTGAACCTCGTATCAATGAGGTGGCCGCCGATCTGGTTCAGCATTTTGAGGCACGT
AACGCCACAATGAATGGCAAAGCAATGATTGTTGCCATGAGCCGTGAGATCTGCGTGAAGCTGTATGATGCGCTGGTGGC
TCTACGCCCGGAATGGCACAGTGATGACGTCGAGAAAGGTGAAATCAAAATCATCATGACCGGCTCAGCCTCCGATAATA
AATTCCTTCAGCCGCACATCTACAATAAGCAAACCAAAAAACGCCTTGAAGCGCGCTTTAAAGATCTCAACGACCCGTTG
AAACTGGTGATTGTGCGCGATATGTGGCTTACCGGGTTTGATGCGCCATGTTGTCATACCATGTATATCGACAAACCGAT
GCGCGGGCATAATCTGATGCAGGCCATTGCGCGCGTCAACCGCGTCTTCAAAGATAAACCGGGCGGTTTAGTGGTGGATT
ACATCGGTATCGCCAATGAACTGAAACAGGCGCTGAAAACTTATACCGATTCAAAAGGTAAAGGACAGACGACAGTCGAT
GCTCATGAAGCGTTCTCCATCCTGCTTGAAAAGCTGGATGTGATTCATGGAATGTTTGCCAAAACACCAACCGCCGCCGG
GTTTGATTACACCGGTTTTGCAGAGGTGCCCCAACGGTTTTTACTCAAAGCCGCAGATTATGTGCTGGGCCTTGATGACG
GTAAGAAGCGCTTTTTCGATGTCGTGCTGGCGATGAACAAAGCCTGGTCGCTGTGCAGTACGTTAGATGAAGCTAAACCC
TTGCAAAAAGAGATCGCGTTTTTGTCGGCGGTGAAAGTGGCGATTATCAAGCTGACGACAACCGACAAAAAATTCAGTCA
GTCAGAGAAAAATACGCTACTCGGTAAAATCCTCGATAACGCCATTATTGCGACGGGCGTGGATGATGTGTTTGCGCTGG
CGGGTCTGGATAAGCCGAATATTGGATTGTTGTCAGACGAGTTTCTGGAAGAAGTGCGCGAATTGCCGCAGCGTAATCTG
GCAGTCGAGTTGCTGGAGAAACTGCTGAACGACGGCATTCATGCCCGCACCAAAAACAACGTGGTGCAGGAGAAGAAATA
CTCAGATCGCCTGAAAGCCGTGCTGCTCAAATACAATAACCGCGCCATTGAAACTGCGCAGGTGATTGAAGAACTGATCC
AGATGGCAAAAGAGTTTCAGGAAGCGATGGCGCGTGATGAAGCGCTGGGGCTAAACCCGGACGAAATCGCGTTCTACGAT
GCGCTGGCAGAAAACGAAAGTGCGGTACGGGAGCTGGGTGATGACGTCCTTAAGAAACTGGCTATCGAAGTCACGTTAAA
ATTGCGCCAGTCCACAACCGTAGACTGGCAGGTGCGAGAAAGCGTGCGTGCGCGGTTACGTATTCTGGTGCGTCAGACGC
TGCGTAAGTACAAGTATCCGCCAGATAAAACACCTTATGCAGTTGAACTGATACTGAAGCAGGCTGAAGTGGTGTCGAAC
AGCTGGACGGTATAG

Protein sequence :
MLSEDDLEQQSLQWFAELGWEVLHGPDIAPDGNNPLRASFHDVFLRPVLLEQLQKINPHLPVAVLDEVILRIAHAQSPDL
VVSNKAFHHLLLDGVPVQYKQDDKVIHDKALLMDFNHPNNNHFTVVNQVAIQGTKQVRRPDVICYINGLPVVVIELKSPI
DANADIWAAFNQLQTYKNELSDLFICNEALVVSDGYNARIGSLTANEERFLPWKTLSNEDDKPLFEWQLETVVKGFFNRE
LLLDYIRYFILFESDGKRLIKKIAAYHQFHAVREAVTATIVASTGKHLPLRSNITPGSKKAGVVWHTQGSGKSISMCCYA
GKLLQQAEMNNPTIVVVTDRNDLDGQLYATFCQAQDLLKQEPLQANDRDQLREMLNVRESGGIIFTTVQKFAPLDGEQTH
PALNLRSNIVVISDEAHRSQYGLSATLNRETGAYKYGYAKHMRDALPNASFMGFTGTPVSSEDKDTRAVFGDYVSIYDIQ
DAVEDGATVPIYYESRLAKLDLNHEELETLSNQVDELVEDEETDQKEKTKSDWSRLEKLVGSEPRINEVAADLVQHFEAR
NATMNGKAMIVAMSREICVKLYDALVALRPEWHSDDVEKGEIKIIMTGSASDNKFLQPHIYNKQTKKRLEARFKDLNDPL
KLVIVRDMWLTGFDAPCCHTMYIDKPMRGHNLMQAIARVNRVFKDKPGGLVVDYIGIANELKQALKTYTDSKGKGQTTVD
AHEAFSILLEKLDVIHGMFAKTPTAAGFDYTGFAEVPQRFLLKAADYVLGLDDGKKRFFDVVLAMNKAWSLCSTLDEAKP
LQKEIAFLSAVKVAIIKLTTTDKKFSQSEKNTLLGKILDNAIIATGVDDVFALAGLDKPNIGLLSDEFLEEVRELPQRNL
AVELLEKLLNDGIHARTKNNVVQEKKYSDRLKAVLLKYNNRAIETAQVIEELIQMAKEFQEAMARDEALGLNPDEIAFYD
ALAENESAVRELGDDVLKKLAIEVTLKLRQSTTVDWQVRESVRARLRILVRQTLRKYKYPPDKTPYAVELILKQAEVVSN
SWTV

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 89
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 47
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 47
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 46