Gene Information

Name : Ngar_c29150 (Ngar_c29150)
Accession : YP_006863491.1
Strain : Candidatus Nitrososphaera gargensis enrichment culture Ga9.2
Genome accession: NC_018719
Putative virulence/resistance : Unknown
Product : HsdR family type I site-specific deoxyribonuclease
Function : -
COG functional category : -
COG ID : -
EC number : 3.1.21.3
Position : 2273829 - 2276933 bp
Length : 3105 bp
Strand : +
Note : -

DNA sequence :
ATGCCAAAAGCGATAAGCGAAGCAGACGTTGAGGAAAACGTCTTGGCCATTTTAGAAAGTATGGGCTACAAAATTATCAG
AGGCGACAATGAAGATTGTCTTCCCGGTGGCTCGTCAGCACTAAGGGCTGATTACAAGGATGTTGTCCTTGTCGATAGGC
TCACTGACTCCCTAAGGAAAATCAACCCATCGGCTCCAATTGATGCATTAGACCAAGCAATCAAGCAAGTCCTAAGAAGC
GAAAGCCAAAAGCTGATAGCAAACAATGAGAGTTTTCACAAAATGCTAGTCGATGGAATAGATATTCCAGTGCAGACACT
AGAGGGAGAAACCTACAAAAAGATATGGCTCTTTGATTTTGAGGATCCAGAGAACAACGAGTTTTTGGCAGTCAACCAGT
TCACAGTCGTCGAAAACAATATCGAGCGTAGACCGGATGTCATACTCTTTGTCAACGGAATACCGTTGCTGGTTATTGAA
CTAAAGAACTTGGCTGACGAAAATGCCACTATATGGACAGCCTACGATCAACTTCAGACATACAAGGAACAACTTCCTTC
GCTATTCAAGTATAATGAAATTCTAGTCATAAGCGACGGTATTGAAGCAAGGGCTGGAACTTTAACATCTGAACGCGAAA
GATTCGCTCAATGGAAGACCATTGACGGCGGAGCGCCAAGAAAAGGATTGACTGAAATCGAAGTCCTTATCAGAGGCATG
TGCAACAAGCAAAGGTTCCTAGACATAGTTAGAAACTTTATCGTCTTTGAAAAAGACAAGAGCGTTAGCAAAAAGCTGGC
GGCATACCATCAGTACTGGGCTGTAAACAAGGCCTTAGAATCTACGATCAAGGCTAGGAAAGGGAATAAAAAAGCAGGCA
TTGTCTGGCATACCCAAGGCTCTGGAAAGTCGCTAACCATGGTCTTTTATACTGGCAAGCTGGTAAGGGAGCTTGACAAT
CCTACGGTTGTTGTTCTAACAGACAGAAATGACCTAGATGATCAGCTTTTTGGCACTTTTAGCAGATGCCAGGACATCAT
ACGACAGGAGCCGCAGCAGGCCAACTCAAGAAAAGAGTTGCAAGACTTGCTCAAGGTTTCATCGGGCGGCATTGTCTTTA
CCACCATACAGAAATTCTTGCCAGAGGAAGACAACAGAGAGAAGTATCCCGTATTGTCAGAAAGAGATAACATTGTTGTT
ATTGCCGACGAAGCACACAGAAGCCAGTATGGCTTTGCTGCAAAAATACTGAACAAGGATGACAAGACTCTGATAACTTA
TGGCTATGCAAAATACCTCAGGGACGCCTTGCCCAATGCTTCGTTCATAGGCTTTACAGGAACACCTATAGAAAAGGCAG
ACAGATCGACTCCGGCTGTCTTTGGCAAGTATGTTGACACTTATGATATAGAACAGGCTGTAAATGATGGAGCCACTGTA
AGAATATACTATGAAAGCAGACTTGCCAAACTAGAGCTCAAGCCTGAGGAAAGGCCGAAGATAGACAGCGAGTTTGAAGA
AGTGACAGAAGGCGAAGAAGTCGAAGGCAAGGAAAAACTGAAGAGCAAATGGGCAAGGGTCGAAAAGGTAGCAGGCGCAC
CAATGAGGATAAAGAGGATTGCAAAGGACATTGTCGACCACTTTGAAAAGAGGACATCTGTGCTTGAAGGAAAAGGCATG
ATAGTCTGCATGTCTAGAAGAATTTGTGTTGAACTTTACAACGAGATAGTAGAGCTTCGGCCCGAATGGAAGAATAGCGA
TGATGAAAAGGGAGCAATCAAAGTGGTCATGACTGGCTCTGCTTCAGACCCAAAGGAATGGCAAGAGCACATTAGAAACA
AAATCCGTCGAAAGAGGATTGGAGATAACTTCAAGGACCCAAAACATGAGCTGAGGCTAATCATTGTAAGGGACATGTTT
CTGACCGGCTACGACGCGCCGTCGCTTCATACAATGTATCTTGACAAACCAATGAAAGGCCATACATTGATGCAGGCCAT
AGCAAGGGTTAACCGCGTCTATCCGGGAAAAGAGGGTGGACTAATTGTTGATTACATGGGGGTTGGAGCAGAATTAAAGA
AAGCATTGATGGACTATACTGCCAGTGGTGGCAAGGGCAAGCCGGCCTTTGACCAAGAAAAGGCGGTAATGATGATGGTT
GAAAAATACGAAGTTGTAAAAGACATGTTCCACGGCTTTAACTACAGAAAGTTCTTTGAGCTCAAACCAAGTGAAAGGAT
TTCCTTTATTCCTCAGGCAATGGAGCATATCCTCAAAGAGCCTGGCAAAAAGGAAAGATACGCCAGAGAAGTTACAGCGC
TTCTAAAGGCATTCTCACTGGCAGTTCCACATGACAGGGCAATGAAGATAAAAGAAGAAGTAGGGTTGTTCCAGGCTATA
AAATCCGCAATTGCAAAGACAACTGAAACCGGAAAGGAAAGCCAGGAGGAAAAATTTGACAGCGCAATAAAGCAGATTCT
TTCAAAGGCTGTGATATCGGATAGAATCATAGATATTTTTGAAGCAGCAGGCATACAAAAGCCAGAGCTATCTATCCTGT
CAGATGGTTTTCTTGCTGAAGTAAAAGATATGCCACAAAAGAACTTGGCTTTTGAAGCTCTCAAAAAGCTGCTTAATGAC
GAAATCAGGTTCATGTCCAAGAGAAACCTTGTGCAGGCAAAATCGTTCATGGAAATGCTGGACAAGACCATAAAGAAATA
CACCAACAGAAACGTTGAGGCAGCGCAGGTAATTGAAGAGTTGATCGAGTTGGCAAAGAAAGTCAGAGCAGAAAAGAACA
GAGCCAAGGAGCAGAACATGAGCGAAGACGAGCTGGCATTCTATGATGCGCTCGAAGTAAACGATAGTGCAGTAAAGATT
CTTGGGGATGAAACATTGAGAAAGATTGCTGTAGAGTTGACGCAGATGATACGCAACAGCGTAACAATTGACTGGACGCA
GAGGGAGAGCGTGCAGGCTGCTATACGTCTGAACGTCAAAAAGATTTTGAGGAAATACGGCTATCCGCCGGACAAGGAAA
AGAAGGCTACAGAAACAGTGTTGCGTCAGGCTGAATTAGTCGCCAAAAACTGGGTATCAGGCTGA

Protein sequence :
MPKAISEADVEENVLAILESMGYKIIRGDNEDCLPGGSSALRADYKDVVLVDRLTDSLRKINPSAPIDALDQAIKQVLRS
ESQKLIANNESFHKMLVDGIDIPVQTLEGETYKKIWLFDFEDPENNEFLAVNQFTVVENNIERRPDVILFVNGIPLLVIE
LKNLADENATIWTAYDQLQTYKEQLPSLFKYNEILVISDGIEARAGTLTSERERFAQWKTIDGGAPRKGLTEIEVLIRGM
CNKQRFLDIVRNFIVFEKDKSVSKKLAAYHQYWAVNKALESTIKARKGNKKAGIVWHTQGSGKSLTMVFYTGKLVRELDN
PTVVVLTDRNDLDDQLFGTFSRCQDIIRQEPQQANSRKELQDLLKVSSGGIVFTTIQKFLPEEDNREKYPVLSERDNIVV
IADEAHRSQYGFAAKILNKDDKTLITYGYAKYLRDALPNASFIGFTGTPIEKADRSTPAVFGKYVDTYDIEQAVNDGATV
RIYYESRLAKLELKPEERPKIDSEFEEVTEGEEVEGKEKLKSKWARVEKVAGAPMRIKRIAKDIVDHFEKRTSVLEGKGM
IVCMSRRICVELYNEIVELRPEWKNSDDEKGAIKVVMTGSASDPKEWQEHIRNKIRRKRIGDNFKDPKHELRLIIVRDMF
LTGYDAPSLHTMYLDKPMKGHTLMQAIARVNRVYPGKEGGLIVDYMGVGAELKKALMDYTASGGKGKPAFDQEKAVMMMV
EKYEVVKDMFHGFNYRKFFELKPSERISFIPQAMEHILKEPGKKERYAREVTALLKAFSLAVPHDRAMKIKEEVGLFQAI
KSAIAKTTETGKESQEEKFDSAIKQILSKAVISDRIIDIFEAAGIQKPELSILSDGFLAEVKDMPQKNLAFEALKKLLND
EIRFMSKRNLVQAKSFMEMLDKTIKKYTNRNVEAAQVIEELIELAKKVRAEKNRAKEQNMSEDELAFYDALEVNDSAVKI
LGDETLRKIAVELTQMIRNSVTIDWTQRESVQAAIRLNVKKILRKYGYPPDKEKKATETVLRQAELVAKNWVSG

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
api52 CAF28526.1 hsdr-like Type I restriction enzyme Not tested YAPI Protein 0.0 48
hsdR YP_251977.1 type I restriction-modification system restriction subunit Not tested SCCmec Protein 0.0 48
hsdR BAH57699.1 type I restriction-modification system endonuclease homologue Not tested Type-VII SCCmec Protein 0.0 48
hsdR BAD24840.1 type I restriction-modification system endonuclease homologue Not tested Type-V SCCmec Protein 0.0 48