PAI Gene Information


Name : cagA (HP0547)
Accession : AAR03970.1
PAI name : cag PAI
PAI accession : AY330644
Strain : Helicobacter pylori 2017
Virulence or Resistance: Virulence
Product : CagA
Function : -
Note : Cag26; cag26; jhp0495; cytotoxicity associated immunodominant antigen
Homologs in the searched genomes :   49 hits    ( 47 protein-level,   2 DNA-level )  
Publication :
    -Blomstergren,A., Lundin,A., Nilsson,C., Engstrand,L. and Lundeberg,J., "Comparative analysis of the complete cag pathogenicity island sequence in four Helicobacter pylori isolates", Gene 328, 85-93 (2004) PUBMED 15019987.

    -Blomstergren,A., Lundin,A., Nilsson,C., Engstrand,L. and Lundeberg,J., "Direct Submission", Submitted (26-JUN-2003) Biotechnology, Royal Institute of Technology, Alba Nova University Centre, Roslagstullsbacken 21, S-106 91 Stockholm, Sweden.


DNA sequence :
ATGACTAACGAAACCATTAACCAACAACCACAAACCGAAGCGGCTTTTAACCCGCAGCAATTTATCAATAATCTTCAAGT
GGCTTTTCTTAAAGTTGATAACGCTGTCGCTTCATACGATCCTGATCAAAAACCAATCGTTGATAAGAACGATAGGGATA
ACAGGCAAGCTTTTGATGGAATCTCGCAATTAAGGGAAGAATACTCCAATAAAGCGATCAAAAATCCTACCAAAAAGAAT
CAGTATTTTTCAGACTTTATCAATAAGAGCAATGATCTAATCAACAAAGACAATCTCATTGATGTAGAATCTTCCACAAA
GAGCTTTCAGAAATTTGGGGATCAGCGTTACCGAATTTTCACAAGTTGGGTGTCCCATCAAAACGATCCGTCTAAAATCA
ACACCCAATCGATCCAAAATTTTATGGAAAATATCATACAACCCCCTATCCCTGATGACAAAGAAAAAGCAGAGTTTTTG
AAATCTGCCAAACAATCTTTTGCAGGAATCATTATAGGGAATCAAATCCGAACGGATCAAAAGTTCATGGGCGTGTTTGA
TGAATCCTTGAAAGAAAGGCAAGAAGCAGAAAAAAATGGAGAGCCTACTGGTGGGGATTGGTTGGATATTTTTTTATCAT
TTATATTTGACAAAAAACAATCTTCTGATGTCAAAGAAGCAATCAATCAAGAACCAGTTCCCCATGTCCAACCAGATATA
GCCACTACCACCACCGACATACAAGGCTTACCGCCTGAATCTAGGGATTTGCTTGATGAAAGGGGTAATTTTTCTAAATT
CACTCTTGGCGATATGGAAATGTTAGATGTTGAGGGAGTCGCTGACATTGATCCCAATTACAAGTTCAATCAATTATTGA
TTCACAATAACGCTCTGTCTTCTGTGTTAATGGGGAGTCATAATGGCATAGAACCTGAAAAAGTTTCATTGTTGTATGCG
GGCAATGGTGGTTTTGGAGCCAAGCACGATTGGAACGCCACCGTTGGTTATAAAGACCAACAAGGTAACAATGTGGCTAC
AATAATTAATGTGCATATGAAAAACGGCAGTGGCTTGGTCATAGCAGGTGGTGAGAAAGGGATTAATAACCCTAGTTTTT
ATCTCTACAAAGAAGACCAACTCACAGGCTCACAACGAGCATTGAGTCAAGAAGAGATCCAAAACAAAGTGGATTTCATG
GAATTTCTTGCACAAAACAATGCTAAATTAGACAACTTGAGCGAGAAAGAGAAAGAAAAATTCCGAACTGAGATTAAGGA
TTTCCAAAAAGACTCTAAGCCTTATTTAGACGCCCTAGGGAATGATCGTATTGCTTTTGTTTCTAAAAAAGACACAAAAC
ATTCAGCTTTAATTACTGAGTTTAATAAGGGGGATTTGAGCTACACTCTCAAAGATTATGGGAAAAAAGCAGATAAAGCT
TTAGATAGGGAGAAAAATGTTACTCTTCAAGGTAGCCTAAAACATGATGGCGTGATGTTTGTTGATTATTCTAATTTCAA
ATACACCAACGCCTCCAAGAATCCCAATAAGGGTGTAGGCGCTACGAATGGCGTTTCCCATTTAGACGCAGGCTTTGACA
AGGTAGCTGTCTTTAATTTGCCTGATTTAAATAATCTCGCTATCACTAGTTTCGTAAGGCGGAATTTAGAGGATAAACTA
ACCACTAAAGGATTGTCCCTACAAGAAGCTAATAAGCTTATCAAAGATTTTTTGAGCAGCAACAAAGAATTGGTTGGAAA
AGCTTTAAGCTTCAATAAAGCTGTAGCTGACGCTAAAAACACAGGCAACTATGATGAAGTGAAACGAGCTCAGAAAGATC
TTGAAAAATCTCTAAAGAAACGAGAGCATTTAGAGAAAGAAGTAGCGAAAAAATTGGAGAGCAAAAGCGGCAACAAAAAT
AAAATGGAAGCAAAAGCTCAAGCTAACAGCCAAAAAGATGAGATTTTTGCGTTGATCAATAAAGAGGCTAATAGAGACGC
AAGAGCAATCGCTTACGCTCAGAATCTTAAAGGCATCAGAAGGGAATTGTCTGATAAACTTGAAAATATCAACAAGGATT
TGAAAGACTTTGATAAATCTTTTGATGAATTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGCAGAAGAAACACTAAAA
GCCCTTAAAGGCTCGGTGAAAGATTTAGGTATCAATCCAGAATGGATTTCAAAAGTTGAAAACCTTAATGCAGCTTTGAA
TGACTTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGTAACGCAAGCAAAAAGCGACCTTGAAAATTCCGTTAAAGATG
TGATCATCAATCAAAAGGTAACGGATAAAGTTGACAATCTCAATCAAGCGGTATCAATAGCTAAAGCAATAGGCGATTTC
AGTGGGGTAGAGCAAGCGTTAGCCGATCTCAAGAATTTCTCAAAGGAGCAATTGGCCCAACAAGCTCAAAAAAATGAAGA
TTTCAATACTGGAAAAAAATCTGAAATATACCAATCCGTTAAGAATGGTGTGAATGGAACCCTAGTCGGTAATGGATTAT
CTGGAATAGAGGCCACAGCTCTCGCCAAAAATTTTTCGGATATCAAGAAAGAATTGAATGAGAAATTTAAAAATTTCAAT
AACAATAACAATAATGGACTCGAAAACAGCACAGAACCCATTTATGCTAAAGTTAATAAAAAGAAAACAGGGCAAGCAGC
TAGCCCTGAAGAACCCATTTATGCTCAAGTTGCTAAAAAGGTGAATGCAAAAATTGACCGACTCAATCAAATAGCAAGTG
GTTTGGGTGGTGTAGGGCAAGCAGTGGGCTTCCCTTTGAAAAGGCATGATAAAGTTGATGATCTCAGTAAGGTAGGGCGA
TCGGTTAGCCCTGAACCCATTTATGCTACGATTGATGATCTCGGCGGACCTTTCCCTTTGACAAGGCATTCTAAAGTTGA
TGATCTCAGTAAGGTAGGGCTTTCAAGGGATCAAAAATTGGCTCAGAAAATTGACAATCTCAATCAAGCGGTATCAGAAG
CTAAAGCAGGTTTTTTTGGCAACCTAGAGCAAACGATAGACAATCTCAAAGATTCTACAAAAAAGAATGTTGTGAATCTA
TGGGTTGAAAGTGCAAAAAAAGTACCTGCTAGTTTGTCAGCGAAACTAGATAATTATGCTACTAACAGCCACACACGCAT
TAATAGCAATATCCAAAATGGAGCAATCAACGAAAAAGCAACCGGCATGCTAACGCAAAAAAACCCTGAGTGGCTTAAGC
TCGTGAATGATAAGATAGTTGCGCATAATGTAGGAAGCGTTCCTTTGTCAGAGTATGATAAAATTGGCTTCAACCAGAAG
AATATGAAAGATTATTCTGATTCGTTCAAGTTTTCCGCCAAGTTGAACAATGCTGCAAAAGACATTAAGTCTGGCTTTAC
GCAATTTTTAACCAATGCATTTTCTACAGGATATTACTGCTTGGAGCGGGAAAATGCGGAGCATGGAATCAAAAATGTTA
ATACAAAAGGTGGTTTCCAAAAATCTTAA

Protein sequence :
MTNETINQQPQTEAAFNPQQFINNLQVAFLKVDNAVASYDPDQKPIVDKNDRDNRQAFDGISQLREEYSNKAIKNPTKKN
QYFSDFINKSNDLINKDNLIDVESSTKSFQKFGDQRYRIFTSWVSHQNDPSKINTQSIQNFMENIIQPPIPDDKEKAEFL
KSAKQSFAGIIIGNQIRTDQKFMGVFDESLKERQEAEKNGEPTGGDWLDIFLSFIFDKKQSSDVKEAINQEPVPHVQPDI
ATTTTDIQGLPPESRDLLDERGNFSKFTLGDMEMLDVEGVADIDPNYKFNQLLIHNNALSSVLMGSHNGIEPEKVSLLYA
GNGGFGAKHDWNATVGYKDQQGNNVATIINVHMKNGSGLVIAGGEKGINNPSFYLYKEDQLTGSQRALSQEEIQNKVDFM
EFLAQNNAKLDNLSEKEKEKFRTEIKDFQKDSKPYLDALGNDRIAFVSKKDTKHSALITEFNKGDLSYTLKDYGKKADKA
LDREKNVTLQGSLKHDGVMFVDYSNFKYTNASKNPNKGVGATNGVSHLDAGFDKVAVFNLPDLNNLAITSFVRRNLEDKL
TTKGLSLQEANKLIKDFLSSNKELVGKALSFNKAVADAKNTGNYDEVKRAQKDLEKSLKKREHLEKEVAKKLESKSGNKN
KMEAKAQANSQKDEIFALINKEANRDARAIAYAQNLKGIRRELSDKLENINKDLKDFDKSFDEFKNGKNKDFSKAEETLK
ALKGSVKDLGINPEWISKVENLNAALNDFKNGKNKDFSKVTQAKSDLENSVKDVIINQKVTDKVDNLNQAVSIAKAIGDF
SGVEQALADLKNFSKEQLAQQAQKNEDFNTGKKSEIYQSVKNGVNGTLVGNGLSGIEATALAKNFSDIKKELNEKFKNFN
NNNNNGLENSTEPIYAKVNKKKTGQAASPEEPIYAQVAKKVNAKIDRLNQIASGLGGVGQAVGFPLKRHDKVDDLSKVGR
SVSPEPIYATIDDLGGPFPLTRHSKVDDLSKVGLSRDQKLAQKIDNLNQAVSEAKAGFFGNLEQTIDNLKDSTKKNVVNL
WVESAKKVPASLSAKLDNYATNSHTRINSNIQNGAINEKATGMLTQKNPEWLKLVNDKIVAHNVGSVPLSEYDKIGFNQK
NMKDYSDSFKFSAKLNNAAKDIKSGFTQFLTNAFSTGYYCLERENAEHGIKNVNTKGGFQKS