PAI Gene Information


Name : cagA
Accession : BAD51747.1
PAI name : cag PAI
PAI accession : AB190937
Strain : Helicobacter pylori 2017
Virulence or Resistance : Virulence
Product : cytotoxin associated protein A
Function : -
Note : -
Homologs in the searched genomes : 48 hits (47 protein-level, 1 DNA-level)
Publication :
    -Yamazaki,S., Yamakawa,A. and Azuma,T., "Direct Submission", Submitted (22-SEP-2004) Takeshi Azuma, University of Fukui, Faculty of Medical Sciences, Second Department of Internal Medicine; Matsuoka-cho, Yoshida-gun, Fukui 910-1193, Japan (E-mail:azuma@fmsrsa.fukui-med.ac.jp, Tel:81-776-61-8351, Fax:81-776-61-8110).

    -Yamazaki,S., Yamakawa,A., Okuda,T., Ohtani,M., Suto,H., Ito,Y., Yamazaki,Y., Keida,Y., Higashi,H., Hatakeyama,M. and Azuma,T., "Distinct diversity of vacA, cagA, and cagE genes of Helicobacter pylori associated with peptic ulcer in Japan", J. Clin. Microbiol. 43 (8), 3906-3916 (2005) PUBMED 16081930.


DNA sequence :
ATGACTAACGAAACTATTGACCAACAACCACAAACCGAAGTGGCTTTTAACCCGCAGCAATTTATTAATAATCTTCAGGT
AGCTTTTCTTAAGCTTGATAACGCTGTCGCTTCATTTGATCCTGATCAAAAACCAATTGTTGATAAGAATGATAGGGATA
ACAGGCAAGCTTTTGATGGAATCTCGCAATTAAGGGAAGAATACTCCAATAAAGCGATCAAAAATCCTACCAAAAAGAAT
CAGTATTTTTCAGACTTTATCAATAAGAGCAATGATCTAATCAACAAAGACAATCTCATTGATGTAGAATCTTCCACAAA
GAGCTTTCAGAAATTTGGGGATCAGCGTTACCGAATTTTCACAAGTTGGGTGTCCCATCAAAACGATCCGTCTAAAATCA
ACACCCGATCGATCCGAAATTTTATGGAAAATATCATACAACCCCCTATCCCTGATGACAAAGAAAAAGCAGAGTTTTTG
AAATCTGCCAAACAATCTTTTGCAGGAATCATTATAGGGAATCAAATCCGAACGGATCAAAAGTTCATGGGCGTGTTTGA
TGAATCCTTGAAAGAAAGGCAAGAAGCAGAAAAAAATGGAGGGCCTACTGGTGGGGATTGGTTGGATATTTTTCTCTCGT
TTATATTTGACAAAAAACAATCTTCTGATGTCAAAGAAGCAATCAATCAAGAGCCAGTTCCCCATGTCCAACCAGATATA
GCCACTACCACCACCGACATACAAGGCTTACCGCCTGAAGCTAGGGATTTGCTTGATGAAAGGGGTAATTTTTCTAAATT
CACTCTTGGCGATATGGAAATGTTAGATGTTGAGGGCGTCGCCGACATTGATCCTAATTACAAGTTCAATCAATTATTGA
TTCACAATAACGTTCTGTCTTCTGTGTTAATAGGGAGTCATAATGGCATAGAACCTGAAAAAGTTTCATTATTGTATGCG
GGCAATGGTGGTTTTGGAGCCAAACACGATTGGAACGCCACCGTTGGTTATAAAGACCAACAAGGTAACAATGTGGCTAC
AATAATTAATGTGCATATGAAAAACGGCAGTGGCTTAGTCATAGCAGGTGGTGAGAAAGGGATTAATAATCCTAGTTTTT
ATCTCTACAAAGAAGACCAACTCACAGGCTCACAACGAGCATTGAGTCAAGAAGAGATCCGAAACAAAGTAGATTTCATG
GAATTTCTTGCACAAAATAATGCTAAATTAGACAACTTGAGCGAGAAAGAGAAAGAAAAATTCCGAACTGAGATTAAGGA
TTTCCAAAAAGACTCTAAGGCTTATTTAGACGCCCTAGGGAATGATCGTATTGCTTTTGTTTCTAAAAAAGACCCAAAAC
ATTCAGCTTTAATTACTGAGTTTGGTAATGGGGATTTTAGCTACACTCTCAAAGATTATGGGAAAAAAGCAGATAAAGCT
TTAGATAGGGAGAAAAATGTCACTCTTCAAGGTAGCCTAAAACATGATGGCGTGATGTTTGTTGATTATTCTAATTTCAA
ATACACCAACGCCTCCAAGAGTCCCAATAAGGGTGTAGGCGTTACGAATGGCGTTTCCCATTTAGAAGCAGGCTTTAACA
AGGTAGCTGTCTTTAATTTGCCTGATTTAAATAATCTCGCTATCACTAGTTTCGTAAGGCGGAATTTAGAGGATAAACTA
GTCGCTAAAGGATTGTCCCCACAAGAAGCTAATAAGCTCATCAAAGATTTTTTGAGCAGCAACAAAGAATTGGTTGGAAA
AGCTTTAAACTTCAATAAAGCTGTAGCTGAAGCTAAAAACACAGGCAACTATGATGAAGTGAAAAAAGCTCAGAAAGATC
TTGAAAAATCTCTAAGGAAACGAGAGCATTTAGAGAAAGAAGTAGAGAAAAAATTGGAGAGCAAAAGCGGTAACAAAAAC
AAAATGGAAGCAAAATCTCAAGCTAACAGCCAAAAAGATGGGATTTTTATGTTGATCAATAAAGAGGCTAATAGAGACGC
AAGAGCAATCGCTTACGCTCAGAATCTTAAAGGCATCAAAAGGGAATTGTCTGATAAACTTGAAAATGTCAACAAGAATT
TGAAAGACTTTAGTAAATCTTTTGATGAATTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGCAGAAGAAACACTAAAA
GCCCTTAAAGGCTCGGTGAAAGATTTAGGTATCAATCCAGAATGGATTTCAAAAGTTGAAAATCTTAATGCAGCTTTGAA
TGACTTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGTAACGCAAGCAAAAAGCGACCTTGAAAATTCCATTAAAGATG
CGATCTTCAATCAAAAGATAACGGATAAAGTTGATGATCTCAATCAAGCGGTATCAGTGGCTAAAGCAACGGGTGATTTC
AGTAGGGTAGAGCAAGCGTTAGCCGATCTCAAAAACTTCTCAAAGGAGCAATTGGCTCAACAAGCTCAAAAAAATGAAAG
TCTCAATGCTGGAAAAAAATCTGAAATATACCAATCCGTTAAGAATGGTGTGAACGGAACCCTAGTCGGTAATGGATTAT
CTGGAATAGAGGCCACAGCTCTCGCCAAAAATTTTTCGGATATCAAGAAAGAATTGAATGAGAAATTTAAAAATTTCAAT
AACAATAATAATGGTCTCAAAAACAGCACAGAACCCATTTATGCTAAAGTTAATAAAAAGAAAACAGGACAAGTAGCTAG
CCCTGAAGAACCCATTTATACTCAAGTTGCTAAAAAGGTAACTCAAAAAATTGACCAACTCAATCAAGCAGCAAGTGGTT
TGGGTGGTGTAGGGCAAGCGGGCTTCCCTTTGAAAAGGCATGATAAAGTTGATGATCTCAGTAAGGTAGGGCGATCGGTT
AGCCCTGAACCCATTTATGCTACGATTGATGATCTCGGCGGACCTTTCCCTTTGAAAAGGCATGATAAAGTTGATGATCT
CAGTAAGGTAGGGCGATCGGTTAGCCCTGAACCCATTTATGCTACGATTGATGATCTCGGCGGACCTTTCCCTTTGAAAA
GGCATGATAAAGTTGATGATCTCAGTAAGGTAGGGCTTTCAAGGAATCAAGAATTGGCTCAGAAAATTGACAATCTCAGT
CAAGCGGTGTCAGAAGCTAAAGCAGGTTTCTTTGGCAATCTAGAGCAAACGATAGACAAGCTCAAAGATTCTACAAAACA
CAATCCCATGAATCTATGGGCTGAAAGTGCAAAAAAAGTGCCTGCTAGTTTGTCAGCGAAACTAGACAATTACGCTACTA
ACAGCCACACACGCATTAATAGCAATGTCCAAAATGGAGCAATCAATGAAAAAGCGACCGGCATGCTAACGCAAAAAAAC
CCTGAGTGGCTCAAGCTCGTGAATGATAAGATAGTTGCACATAATGTGGGAAGCGTTCCTTTGTCAGAGTATGATAAAAT
TGGCTTCAACCAGAAGAATATGAAAGATTATTCTGATTCGTTCAAGTTTTCCACCAAGTTGAACAATGCTGTAAAAGACG
TTAAGTCTGGCTTTACGCAATTTTTAGCCAATGCATTTTCTACAGGATATTACTGCTTGGCGGGGGAAAATGCGGAGCAT
GGAATCAAAAATGTTAATACCAAAGGTGGTTTCCAAAAATCTTAA

Protein sequence :
MTNETIDQQPQTEVAFNPQQFINNLQVAFLKLDNAVASFDPDQKPIVDKNDRDNRQAFDGISQLREEYSNKAIKNPTKKN
QYFSDFINKSNDLINKDNLIDVESSTKSFQKFGDQRYRIFTSWVSHQNDPSKINTRSIRNFMENIIQPPIPDDKEKAEFL
KSAKQSFAGIIIGNQIRTDQKFMGVFDESLKERQEAEKNGGPTGGDWLDIFLSFIFDKKQSSDVKEAINQEPVPHVQPDI
ATTTTDIQGLPPEARDLLDERGNFSKFTLGDMEMLDVEGVADIDPNYKFNQLLIHNNVLSSVLIGSHNGIEPEKVSLLYA
GNGGFGAKHDWNATVGYKDQQGNNVATIINVHMKNGSGLVIAGGEKGINNPSFYLYKEDQLTGSQRALSQEEIRNKVDFM
EFLAQNNAKLDNLSEKEKEKFRTEIKDFQKDSKAYLDALGNDRIAFVSKKDPKHSALITEFGNGDFSYTLKDYGKKADKA
LDREKNVTLQGSLKHDGVMFVDYSNFKYTNASKSPNKGVGVTNGVSHLEAGFNKVAVFNLPDLNNLAITSFVRRNLEDKL
VAKGLSPQEANKLIKDFLSSNKELVGKALNFNKAVAEAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVEKKLESKSGNKN
KMEAKSQANSQKDGIFMLINKEANRDARAIAYAQNLKGIKRELSDKLENVNKNLKDFSKSFDEFKNGKNKDFSKAEETLK
ALKGSVKDLGINPEWISKVENLNAALNDFKNGKNKDFSKVTQAKSDLENSIKDAIFNQKITDKVDDLNQAVSVAKATGDF
SRVEQALADLKNFSKEQLAQQAQKNESLNAGKKSEIYQSVKNGVNGTLVGNGLSGIEATALAKNFSDIKKELNEKFKNFN
NNNNGLKNSTEPIYAKVNKKKTGQVASPEEPIYTQVAKKVTQKIDQLNQAASGLGGVGQAGFPLKRHDKVDDLSKVGRSV
SPEPIYATIDDLGGPFPLKRHDKVDDLSKVGRSVSPEPIYATIDDLGGPFPLKRHDKVDDLSKVGLSRNQELAQKIDNLS
QAVSEAKAGFFGNLEQTIDKLKDSTKHNPMNLWAESAKKVPASLSAKLDNYATNSHTRINSNVQNGAINEKATGMLTQKN
PEWLKLVNDKIVAHNVGSVPLSEYDKIGFNQKNMKDYSDSFKFSTKLNNAVKDVKSGFTQFLANAFSTGYYCLAGENAEH
GIKNVNTKGGFQKS
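
Consistency check (illustrative sketch) :
The protein sequence above should be the direct translation of the DNA sequence. The following is a minimal Python sketch of how that could be checked. It is not part of the original record. It assumes the standard codon assignments, which the bacterial translation table also uses for amino acids. The names translate, CODON_TABLE and dna_start are hypothetical and chosen only for illustration. Only the first 48 bases of the DNA sequence are used here.

```python
from itertools import product

# Standard codon assignments, listed in the conventional T, C, A, G order
# for the first, second and third base. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped sequence
    lines of the record can be pasted in as they are. Codons that are not
    in the table (for example those containing N) are reported as "X".
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 48 bases of the DNA sequence in this record.
dna_start = "ATGACTAACGAAACTATTGACCAACAACCACAAACCGAAGTGGCTTTT"
print(translate(dna_start))  # MTNETIDQQPQTEVAF
```

Passing the full DNA sequence to translate and comparing the result with the full protein sequence (with line breaks removed) extends the same check to the whole gene.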