Gene Information

Name : HPAKL117_02240 (HPAKL117_02240)
Accession : YP_007017000.1
Strain : Helicobacter pylori Aklavik117
Genome accession: NC_019560
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 456358 - 464403 bp
Length : 8046 bp
Strand : -
Note : COG0827 Adenine-specific DNA methylase

DNA sequence :
ATGGCATATAAACCCAGCAAAAAGAAACTTCAAACTTTAAGAGAAGAACCTAATTTATTCAGCATTTTAGATGATGGCGA
TGTAATCAATACTAGCTCTTATAAACAAGAGCCAGAGCCACAAATAAAAGAACCACAATCTAGCGCTACACAAAATCCTA
ACAAAGAACAAGAAACGCCTAATTATGTCGTAAAAACTCAAATCAATACAGCAAGAATGATTTCTAGAAATCCTATTGAA
TGGGCAAGGTATTTAAGTTTTGAAAGACGAGTGCATAAGGATAATAGTAAAGAAGATGTCAATTTCTTTGCTAATGGTGA
GATAAAAGAAAGTTCTCGTGTTTATGAAGCGAATGAGGAAGGGTTTGAAAGACGCATAACTAAAAGATACGATCTGATTG
ATACCACAAGGAATAAAGAATTTTTTTCAAAAGAAATTGAAATTTTAACCTACACAAACAGCTTAGAAGAATTGAAAGAG
CAAGGTTTAGAAATCCAATTGACACACCACCATGAAACGCATAAGAAAACCTTAGAAAATGGCAATGAAATCGCTAAAGA
ATACGACTATCTTAAGGATATTTACCAAGAAGTAGAAAGAACAAGAGATGGTGAATTGGTAAGAGAAATAATCCCTAGTA
TTTCTAGCACTGAGTATTTTAAGCTCTATAACAAACTGCCTTTTGAATCAATAAACAATGAAAATACCAAATTGAATACT
AATAACACTATTAAAAATACAATAGAGACTAATACTTCTATTGGTAACAATATTATTCAAAATAATGATAATAATAATCC
CAATTTAAGCATTGCTGATTTAGAATTAGAGCAACAGAATTTAGGAGAACAAAATGGAAAAGAAAGAACAAATCGCGCAG
ATGAGCCGAATAGAACTAGAGCAGGAATTCCGCAAGAAATTCACCGCAGAAGCGAACATGGAGGATTGCAAGAGGGAGTG
GAGCGATCGAGTGATGAAGAACTTTTACACCAAGACTCTAGTTTATTTATTGAGCCTAGAGAGCAGGGAGGAACAAGAGG
AGTTTATAGATCTAGCGACCAACAGGCAGTTTCAGAAAAATCCCATAGAGAGAGAGATAGACTACATGAACATGTATCTA
GAGGAGATGGAGTATCAGCAAGAGCAGATAGCAATGGAGCATCAAGTCAAGCAAGCCGAATGGAAAATGGAGCAAGAAGC
GAAGAAAAGGGGGATAATCCCAGCGATGAGAGAGGAGTATCACAGACACCGCAATCCCCATCTCATCAACAAAATAGCTC
CAGAGATTTGGGGCTTTCTCTCTCTAGAGAACAGCCAGGACAGACTGGACGCTTACGCCTTTTTAATCATGGACAGATGG
GCTCATTATTTCCCACAGACCATGAAAACCAAAGATCAAAGAACGATAATGAGCTTGATAGAAGCAGTGATAGAGCAAAC
GAAAATGGAGACAAAAGCCCTAGACAGAATGGAAGCGCAAATCAAGAGAGCGCAAGGAGTGAGCGATATGGAATTGCTCA
AGGAAGCTCAAGTCAATCAGTATTACCACTTGCTCAAAGCCGATTACATCATGCAGGACTCAGCGCACCAAATGGACTTG
GAAACCTTGAAGAAGACAGGGATCAAGAGAGAGGACTTTTATCAAATTTAGACCATTTAGAAAGCCTGCTTAACGCTATT
AGAAACAACACCATAGCGAGTGAGCCTGACTTTAGATCTAGACTATTAGAAGCCATTCAAAACAACGAGCCTTTAAAAGA
TAGCATTGTAGGGATGCAGCTCCTTAAAGATCCTATGACTAAAGTCTTTTATGACAAATTCCAATTAAAAATCAGCCCTA
AAAAAGTCTTAGAGATTTTAGAAAATCGCATTAAAAAATCCATTGAAACAGCGAATGAAACGCTAAACGCATTCAATGTG
TTGGATAGTCAAGCTATTGATTTAAACGCTATTTCTAATAGTGTAGGATTAAATCCCACACAAGAGAGTGTAATAACAGA
CAATAGCGTAGAGTTAAATAACGCTCAAGAACAAACCGCGCAAGAGCAAGACACACAAGAAAACGCGCAAACCACGATAA
AACAAGAAACACCAACCGCACCAGCCATCCCCCTTAATCCTAAAATAGATTTTAAACCGAGCGAAGAAGTTTTAATCAAG
GGAGCTAAAACTCGCTACAAGGCTAACATAAAAGCCATTGAGCTTTTAAAAGAATTGCAAGCCAAACAAGAGATCTTAAA
AGGCGATTATTACGCCACTCAAGAAGAGCAAGAAATCCTAGCGCAATTTAGCGGATGGGGTGGGTTAGAAAGCTACTTTA
AAAAAGATCAACGCCCTGAAGAATTTAAGGAATTAAACGCCCTACTCACTAAAGATGAATTCAGAAGAGCTTATTTGAGC
GCAAGAGACGCTTACTACACCCCTAAATTAGTTATTGATAGCATTTATCATGGATTAGATCAATTAGGGTTTAATAACGA
CAACCATCAAAAAGAAATCTTTGAACCCAGTTTAGGCACAGGCAAATTCATCGCTCATGCGCCAAGCGATAAGAATTACC
GCTTTATGGGAACAGAATTAGATCCTATTAGCGCTAATATTTCTAAATTCCTTTACCCTAATCAAGTCATCAAAAACACC
GCTTTAGAAAACCATCAATTCTATCAAGAATACGATGCGTTTGTGGGCAATCCTCCTTATGGCAGTCATAAAATCTATAG
TTCCAATGACAAAGAATTGAGTAACGAGAGCGTCCATAATTACTTTTTAGGGAAATCTATCAAAGAATTGAAAGATGATG
GTATAGGAGCTTTTGTGGTGAGTTCTTGGTTTATGGATGGTAAAAACCCTAAAATGAGAGAACACATCGCGCAAAACGCC
ACTTTTTTAGGAGCGATAAGATTGCCTAATAGCGTGTTTAAAGCAACAGGAGCTGAAGTGAGTAGCGACATTGTGTTTTT
TAAAAAAGGCGTTGATGAAGCAACCAATCAAAGCTTCACTAAAGCTATGCCTTATTATGACAAGATCATTGATAGCTTGA
ATGATGACACCCTTTTTGCCTTGCAAAACAACCGCTTTGATAGTTTTACTCCTAGCGATCAACTTAAGATTGTCAATGCG
ATTGCAAGCCATTTTGGTTTCCAACAAGAAAAATTGCAACGCTGGTATGAAAAAATAGACACCGCTAACTTTGGCTACAA
AGAACAAGACTATAAGATCATCAAAGGCTTCATTGATAAAGTTGGCGAGAATAATATCAATCTCAACGAACAAACCTTGA
ATGAATATTTTATCCACCACCCTGAAAACATTCTAGGGCATTTGAGTTTGGAAAAAACCCGCTATAGCTTTGAAATAAAT
GGCGAACAAATTTACAAATACGAGTTGCAAGCTTTAGAGGATAAAAGCTTGGATTTATCCCAAGCTCTTAATCAAGCGAT
AGAAAAATTGCCTAAAGGCGTCTATCAATACCATAAGACTACCCTTAAAACAGACGCGCTCATCATTGATGCCAATAACG
AACGCTATCAAGAAATTCAAAAGCTTATCAAAAATTTAGAAAGGGGGGAATTAGTCAAGTGGGATAATCTTTATTTCCAA
CTGGAACAAAATAATGAAATGGGCATCTTTTTAAAACCCACTAAAATCAACTCTAAAGTCCAAGATTCACGACTAAAAGC
CTATTTTAAAATTAAAGACGCTTTGAATGATTTAACGAGTGCGGAATTAAGCCCCTTAAGCTCTGATTTGGAGCTAGAAA
GTAAAAGAGTTAGGCTCAATCTTGTTTATGATGAATTTGTCAAGAAATTTGGCTATCTCAATGAGAATAAAAATCGTAAG
GACATCAAACAAGATTTGTATGGCGCTAAAGTCTTAGGATTAGAAAAAGACTTTGAAAAAGAAATCACCCCTAGAAGCGC
TAAAATGCAAAACATAGAGCCAAGGCAAGCTCAAGCTAAAAAAGCTCAAATCTTTTTTGAAAGGACTTTAAACCCCAAAA
AAGAACTTATTATCACTAACGCTAAAGAGGCATTAATTGCAAGCATCAATCAAAAAGGGTGTTTGGACTTGCATTTCATT
AGAGATCATTTCACAACCCAAAGCTTAGAAACCACGATTAAAGAACTTTTAGAGCAAAAACTTATTTATAAAGACCACAA
GGATAATGGCGACTATGTTTTAGCGAACGATTATTTGAGCGGCAATGTGAAAAGAAAACTCAAAGAAGTTAAAGAGGCTA
TCAATCAAGGCGTGGAAGGATTAGAGATTAATTTGAAAGATTTAGAGCTGATTATCCCTAAAGATTTGAAAGCCACTGAA
ATCATGGCTAATATCAACAGCCCTTGGATACCCACTCAGTATTTAGAAGAGTTTTTAATAGAATTGAGCGCTAACCATTA
TGAAAAGCAATACGGCGATAAAATGACAGATTACCAACTAGGCAATCTCAAAGAAGACATCAAAGTAGAACACCTAAGCG
GTGCTTATGAAGTTTTTGCTAGAAACAATGAATTAAACGAGCTTTATGGCATCAGGCATAAAGACAAGCCGCATTCTTAT
AAAGTGCCTTTTGAAAGCCTTTTAAATAAAGTCTTAAACAACAAGGATTTGAGCGTTAAATACGCCCAAGTTGATCCCAA
TGACCCTAAAAAAGAAATCTTTATCACTGATGAAGAGCAAAGCAATCTCGCCAGACAAAAAGCAGAAGAATTGAAAGAAG
CTTTTAAAGACTGGATTTACAAGGATTATGCAAGAAGAACCCATTTAGAGCAAATCTATAATGACACTTTCAACAACTCT
GTTTTAAAAACCTATGATGGCTCGCAATTAGAGTTAGAGGGCTTTAACCACCATATCAGCTTGCGCCCCCACCAAAAGAA
CGCTATTTTTAGAACCATCCAAGACAGGGCGGTGTGTTTAGACCATCAGGTTGGAGCAGGCAAGACTTTGTGCGCTATAG
CCAGTTGCATGGAACAAAAACGCATGGGATTAGTGAATAAAACGCTCATTGCCGTGCCTAACCATTTAACCAAGCAATGG
GGCGATGAATTTTATAAGGCTTACCCTAACGCTAATGTGTTAGTTGTTGATAGCAAGGACATCACTGAAAAAGAAAGAGA
ACTTTTATTCAATCAAATCGCTAACAACAATTATGACGCTGTGATTATCGCGCACACCCATTTGGAATTATTGTCTAACC
CTAGAGGAATCATAGAAGAATTGAAAGAAGAAGAGCTAGTGAATGCCGAAAAAAACTTTGAAAGGCAAGAACTGGCTTAT
AAAAATAACCCTAGAGAAACTAAAAAACCCAATGAAAGAGCCTTTAAAAACAAACTGGATAAAATCCGCGCTCAATACGA
TGCGATTTTAGAAAAACAAGGCTCTCATATTGATATTAGTCAAATGGGGATTGACAATTTGATTGTGGATGAAGCCCACT
TATTCAAAAATCTAGCCTTTGAAACTTCTATGGAAAAAATTGCAGGGCTTGGTAACCAACAAGGCTCTAATCGCGCTAGA
GATTTGTTTATTAAAACGCGCTACTTGCATCAAAACAATAAGAAAATCATGTTTTTAACCGGCACGCCTATAGCTAATTC
CTTGAGTGAAATGTATCACTTGCAACGCTACCTGACCCCTGATGTGTTGAAAGAAAGAGGGTTAGAATTCTTTGATGATT
GGGCTAAGACTTATGGGGAAGTGGTGAATGATTTTGAATTAGACACTTCCGCTCAAAGTTATAAAATGGTTAATCGCTTT
TCTAAATTTAGCGATGTGCAAGGCTTAAGCACCATGTATAGAGCTTTTGCGGATATTGTCTCCAATGATGATATTTTAAA
GCATAACCCCCACTTTGTGCCTAAAGTGTATGGGGATAAACCTATCAATGTGGTGGTGAAAAGAAGCGAAGAAGTGGCTC
AATTTATTGGCGTGGCTGATGAAAATGGCAAATATAATGAAGGCCCTATCATTGATAGGATGCAAAAATGCGAGAGCAAG
AAAAGCAAAAAAGGGCAAGACAATATCCTTTCTTGCACCACAGACGCTAGAAAAGTGGCTTTGGATTACCGCTTGATTGA
CCCTAACGCTAAAGTAGAAAAAGAATTTTCTAAAAGCTATGCTATGGCAAAAAATATCTATGAGAATTATTTAGAAACTA
ATGCCACTAAAGGCACACAACTTGGTTTCATAGGGCTATCCACACCCAAAACCCATAGCCAAAAAGTGAGTTTAGAAGCG
CTAGATAACGCTCATGAGATAGAAAATAAAAATCCCCTAGATGAAGCTCAAGAACTTTTAGAGAGCTTGTCTAGTTATGA
TGAAAATGGCAATCTTATCGCTCCTAGTAAGAAAGAATTAGAGAACGAACTCAAAGAGAAAGAGGCTAAAAGCGTCAATT
TAGATGAAGAGATAGCTAAAGGCTGTAAGTTTGATGTTTATAGCGATGTTTTAAGGCATTTAGTCCAAATGGGTATCCCA
CAAAATGAAATCGCTTTCATCCATGACGCTAAAACTGAAGAGCAAAAACAGGATTTGTTTAAAAATATTAATCGTGGCGA
AGTCAGGGTATTATTGGGCAGTCCCGCTAAAATGGGCGTAGGCACTAATGTGCAAGAGAGATTAGTCGCTATGCATGAAT
TAGATTGCCCATGGAGACCTGATGAATTGTTGCAAATGGAAGGGCGTGGGATAAGACAAGGCAATATTTTGCACCAAAAC
GATCCTGAAAATTTTAGAATGAAAATCTATCGCTACGCCACTGAAAAAACTTATGATAGCCGTATGTGGCAAATCATAGA
GACTAAATCTAAAGGCATAGAGCAATTTAGAAACGCGCACAAATTAGGCTTGAATGAATTAGAAGACTTTAATATGGGGA
GTTCTAATGCGAGTGAGATGAAAGCAGAAGCGACAGGTAATCCCTTGATTATTGAAGAAGTCAAATTGAGGGCTGAAATC
AAAAACGAAGAAGCAAAATACAAAGCTTTCAATAAAGAAAATTATTTCAATGAAGAGAATTTGAAAAACAACTCTTCTAA
ATTGGATTATCTTAAGCAGGAATTGAAAGATTTAGAAACGCTTCAAAGCGCTGTAATGATCCCCACTCATACAGAGATCA
AGCTCTATGATTTGAAAAATGAAGAGAGTAAGGATTATGAGCTTATCAAAGTTAAAGAAGTAGAGCCTTTAAAAGAAAAC
GCCTCTATGAGTGAAGAATTAACGCACAAAAAACTCAAAGAACAAAACAAGCAAATAGCCGAACAAAATAAAGAAAAGCT
AGATGCTATTAAAAAGCAATTTGCAAGCAATTTGAACGACTTGTTTTTCAATGAGGAAAGAGATTGTAAGCTTTTAGAAT
ACAAGGGCTTTGTGGTGAACGCTTATAAAACTAAGTATCAAGTGGAGTTTAGTTTAAACCCTAAAGACAATCCAAATATT
GCCTATAGCCCTAGCAACATGGTTTATAAAAACGATACTGCCAACATGTTTAGCTCTTATAATTTCTGCGGCGAGATTAA
ATTTGATGGGTTTTTAAAAAGATTGGATAACGCTATCACTAAACTCCCTGAAAAAATCAAGGAATTAGAAAACTCCATTA
AAATCACTCAAGAAAATATCGCTAAATACACAAGATTAGTGGAACAAAAACCTTCTTACCCACGACTAGAATACTTGCAA
GCTTTAAAATGGGATCATAAAACTCTAATAGATGATTTAGCTAAAATGAGCAAAGACAGAGATTATAAGCCTGTATTCAA
CCCTAAATCTCAAGAAGTCTTAGAGAAAATGAACGCTGAAAAAAGAGCGAGTTTAGAGAATGAGAGTAAAGAAATGACTG
AAATTAAAAACAGCAATAAAGAGCAAGAGATTAAGAGAGATATAAAGAGCAATGATGAAGTAAGACAACATATAGAGCAA
GTGATTGAGAAAGAAATAGAAAAAGGCACTGAAAATATTTCTTCTAGTGAGCTTATAACCACTAACAATATTGATTACTA
CGAGAACGAAGAAGTAGAAATCATTAAATCAAGGGGTAGAAGGTGA

Protein sequence :
MAYKPSKKKLQTLREEPNLFSILDDGDVINTSSYKQEPEPQIKEPQSSATQNPNKEQETPNYVVKTQINTARMISRNPIE
WARYLSFERRVHKDNSKEDVNFFANGEIKESSRVYEANEEGFERRITKRYDLIDTTRNKEFFSKEIEILTYTNSLEELKE
QGLEIQLTHHHETHKKTLENGNEIAKEYDYLKDIYQEVERTRDGELVREIIPSISSTEYFKLYNKLPFESINNENTKLNT
NNTIKNTIETNTSIGNNIIQNNDNNNPNLSIADLELEQQNLGEQNGKERTNRADEPNRTRAGIPQEIHRRSEHGGLQEGV
ERSSDEELLHQDSSLFIEPREQGGTRGVYRSSDQQAVSEKSHRERDRLHEHVSRGDGVSARADSNGASSQASRMENGARS
EEKGDNPSDERGVSQTPQSPSHQQNSSRDLGLSLSREQPGQTGRLRLFNHGQMGSLFPTDHENQRSKNDNELDRSSDRAN
ENGDKSPRQNGSANQESARSERYGIAQGSSSQSVLPLAQSRLHHAGLSAPNGLGNLEEDRDQERGLLSNLDHLESLLNAI
RNNTIASEPDFRSRLLEAIQNNEPLKDSIVGMQLLKDPMTKVFYDKFQLKISPKKVLEILENRIKKSIETANETLNAFNV
LDSQAIDLNAISNSVGLNPTQESVITDNSVELNNAQEQTAQEQDTQENAQTTIKQETPTAPAIPLNPKIDFKPSEEVLIK
GAKTRYKANIKAIELLKELQAKQEILKGDYYATQEEQEILAQFSGWGGLESYFKKDQRPEEFKELNALLTKDEFRRAYLS
ARDAYYTPKLVIDSIYHGLDQLGFNNDNHQKEIFEPSLGTGKFIAHAPSDKNYRFMGTELDPISANISKFLYPNQVIKNT
ALENHQFYQEYDAFVGNPPYGSHKIYSSNDKELSNESVHNYFLGKSIKELKDDGIGAFVVSSWFMDGKNPKMREHIAQNA
TFLGAIRLPNSVFKATGAEVSSDIVFFKKGVDEATNQSFTKAMPYYDKIIDSLNDDTLFALQNNRFDSFTPSDQLKIVNA
IASHFGFQQEKLQRWYEKIDTANFGYKEQDYKIIKGFIDKVGENNINLNEQTLNEYFIHHPENILGHLSLEKTRYSFEIN
GEQIYKYELQALEDKSLDLSQALNQAIEKLPKGVYQYHKTTLKTDALIIDANNERYQEIQKLIKNLERGELVKWDNLYFQ
LEQNNEMGIFLKPTKINSKVQDSRLKAYFKIKDALNDLTSAELSPLSSDLELESKRVRLNLVYDEFVKKFGYLNENKNRK
DIKQDLYGAKVLGLEKDFEKEITPRSAKMQNIEPRQAQAKKAQIFFERTLNPKKELIITNAKEALIASINQKGCLDLHFI
RDHFTTQSLETTIKELLEQKLIYKDHKDNGDYVLANDYLSGNVKRKLKEVKEAINQGVEGLEINLKDLELIIPKDLKATE
IMANINSPWIPTQYLEEFLIELSANHYEKQYGDKMTDYQLGNLKEDIKVEHLSGAYEVFARNNELNELYGIRHKDKPHSY
KVPFESLLNKVLNNKDLSVKYAQVDPNDPKKEIFITDEEQSNLARQKAEELKEAFKDWIYKDYARRTHLEQIYNDTFNNS
VLKTYDGSQLELEGFNHHISLRPHQKNAIFRTIQDRAVCLDHQVGAGKTLCAIASCMEQKRMGLVNKTLIAVPNHLTKQW
GDEFYKAYPNANVLVVDSKDITEKERELLFNQIANNNYDAVIIAHTHLELLSNPRGIIEELKEEELVNAEKNFERQELAY
KNNPRETKKPNERAFKNKLDKIRAQYDAILEKQGSHIDISQMGIDNLIVDEAHLFKNLAFETSMEKIAGLGNQQGSNRAR
DLFIKTRYLHQNNKKIMFLTGTPIANSLSEMYHLQRYLTPDVLKERGLEFFDDWAKTYGEVVNDFELDTSAQSYKMVNRF
SKFSDVQGLSTMYRAFADIVSNDDILKHNPHFVPKVYGDKPINVVVKRSEEVAQFIGVADENGKYNEGPIIDRMQKCESK
KSKKGQDNILSCTTDARKVALDYRLIDPNAKVEKEFSKSYAMAKNIYENYLETNATKGTQLGFIGLSTPKTHSQKVSLEA
LDNAHEIENKNPLDEAQELLESLSSYDENGNLIAPSKKELENELKEKEAKSVNLDEEIAKGCKFDVYSDVLRHLVQMGIP
QNEIAFIHDAKTEEQKQDLFKNINRGEVRVLLGSPAKMGVGTNVQERLVAMHELDCPWRPDELLQMEGRGIRQGNILHQN
DPENFRMKIYRYATEKTYDSRMWQIIETKSKGIEQFRNAHKLGLNELEDFNMGSSNASEMKAEATGNPLIIEEVKLRAEI
KNEEAKYKAFNKENYFNEENLKNNSSKLDYLKQELKDLETLQSAVMIPTHTEIKLYDLKNEESKDYELIKVKEVEPLKEN
ASMSEELTHKKLKEQNKQIAEQNKEKLDAIKKQFASNLNDLFFNEERDCKLLEYKGFVVNAYKTKYQVEFSLNPKDNPNI
AYSPSNMVYKNDTANMFSSYNFCGEIKFDGFLKRLDNAITKLPEKIKELENSIKITQENIAKYTRLVEQKPSYPRLEYLQ
ALKWDHKTLIDDLAKMSKDRDYKPVFNPKSQEVLEKMNAEKRASLENESKEMTEIKNSNKEQEIKRDIKSNDEVRQHIEQ
VIEKEIEKGTENISSSELITTNNIDYYENEEVEIIKSRGRR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
HPP12_0447 YP_002301083.1 DNA methylase Not tested cag PAI Protein 0.0 93