PAI Gene Information


Name : cagA (HP0547)
Accession : AAR03939.1
PAI name : cag PAI
PAI accession : AY330642
Strain : Helicobacter pylori 2017
Virulence or Resistance: Virulence
Product : CagA
Function : -
Note : Cag26; cag26; jhp0495; cytotoxicity associated immunodominant antigen
Homologs in the searched genomes :   48 hits    ( 47 protein-level,   1 DNA-level )  
Publication :
    -Blomstergren,A., Lundin,A., Nilsson,C., Engstrand,L. and Lundeberg,J., "Comparative analysis of the complete cag pathogenicity island sequence in four Helicobacter pylori isolates", Gene 328, 85-93 (2004) PUBMED 15019987.

    -Blomstergren,A., Lundin,A., Nilsson,C., Engstrand,L. and Lundeberg,J., "Direct Submission", Submitted (26-JUN-2003) Biotechnology, Royal Institute of Technology, Alba Nova University Centre, Roslagstullsbacken 21, S-106 91 Stockholm, Sweden.


DNA sequence :
ATGACTAACGAAACCATTAACCAACAACCACAAACCGAAGCGGCTTTTAACCCGCAGCAACTTATCAATAATCTTCAAGT
GGCTTTTCTTAAAGTTGATAACGCTGTCGCTTCATTTGATCCTGATCAAAAACCAATCGTTGATAAGAATGATAGGGATA
ATAGGCAAGCTTTTGAGAAAATCTCGCAGCTAAGGGAGGAATTCGCTAATAAAGCGATCAAAAATCCTACCAAAAAGAAT
CAGTATTTTTCAAGCTTTATCAGTAAGAGCAATGATTTAATCAACAAAGACAGTCTCATTGATACAGGTTCTTCCATAAA
GAGCTTTCAGAAATTTGGGACTCAGCGTTACCAAATTTTTATGAATTGGGTGTCCCATCAAAAAGATCCATCTAAAATCA
ACACCCAAAAAATCCGAGGTTTTATGGAAAATATCATACAACCCCCTATCTCTGATGATAAAGAGAAAGCGGAGTTTTTG
AGGTCTGCCAAACAAGCTTTTGCAGGAATTATCATAGGAAACCAAATCCGATCGGATCAAAAATTCATGGGCGTGTTTGA
TGAATCTTTGAAAGAGAGGCAAGAAGCAGAAAAAAATGGAGAGCCTAATGGAGATCCTACTGGTGGGGATTGGTTGGATA
TTTTTTTATCATTTGTGTTTAACAAAAAACAATCTTCCGATCTCAAAGAAACGCTCAATCAAGAACCAGTTCCTCATGTC
CAACCAGATGTAGCCACTACCACCACTGACATACAAAGCTTACCGCCTGAAGCTAGGGATTTGCTTGATGAAAGGGGTAA
TTTTTCTAAATTCACTCTTGGCGATATGAACATGTTAGATGTTGAGGGAGTCGCTGACATTGATCCTAATTACAAGTTCA
ACCAATTATTGATCCACAATAACGCTCTGTCTTCTGTGTTAATGGGGAGTCATAATGGCATAGAACCTGAAAAAGTTTCA
TTGTTGTATGGAAACAATGGTGGTCCTGAAGCTAGGCATGATTGGAACGCTACCGTTGGTTATAAAAACCAACAAGGCGA
CAATGTGGCTACACTCATTAATGTGCATATGAAAAATGGCAGTGGGTTAGTCATAGCAGGTGGTGAGAAAGGGGTTAACA
ACCCTAGTTTTTATCTCTACAAAGAAGACCAGCTCACAGGCTTGAAACAAGCATTGAGTCAAGAAGAGATCCAAAACAAA
GTAGATTTCATGGAATTTCTTGCACAAAATAATGCTAAACTAGACAACTTGAGCGTGAAAGAGAAAGAAAAATTCCAAAA
TGAGATTGAAGATTTTCAAAAAGACTCTAAGGCTTATTTAGACGCCCTAGGGAATGATCACATTGCTTTTGTTTCTAAAA
AAGATCAAAAACATTTAGCTTTAATTACTGAATTTGGTAATGGGGATTTGAGCTACACTCTCAAAGATTATGGGAAAAAA
CAAGATAAAGCTTTAGATAGGGAGATAAAAACCACTCTTCAAGGTAACCTAAAACATGATGGCGTGATGTTTGTTAATTA
TTCTAATTTCAAATACACCAACGCCTCCAAGAGTCCTGATAAGGGTGTGGGTGCTACGAATGGCGTTTCCCATTTAGAAG
CAAATCTTAGCAAGGTAGCTGTCTTTAATTTGCCTAATTTAAATAATCTCGCTATCACTAGTTATATAAGGCGAGACTTA
GAAGATAAACTGTGGGCTAAAGGATTGTCCCCACAAGAAACTAATAAGCTCATCAAAGACTTTTTGAACAGCGACAAGGA
ATTGGTTGAAAAAGTTTCAAATCTCAATAAAGCTGTAGCTGAAGCTAAAAACACAGGCAATTATGACGAAGTGAAAAAAG
CTCAGAAAGATCTTGAAAAATCTCTAAGGAAACGAGAGCATTTAGAGAAAGAAGTAGTGAAAAAATTGGAGAACAGAAAC
GACAACAAAAATAGAATGGAAGCAAAAGCTCAAGCTAACAGCCAAAAAGATAAGATTTTTGCAATTATCAATGAAGAGGC
TGGTAAGGAAGCAAGAGCGGCCGCTTGCGTTCAGAATTTTAAAGGCATCAGAAGGGAATTGTCTGATAAGCTTGAAAACA
TCAACAAGAATTTGAAAGACTTTGATAAATCTTTTGATGAATTTAAAAATGGCAAAAATAAGGATTTCAGCAAGACAGAA
GAAACGCTAAAAGCCCTTAAAAGCTCAGTGAAAGATTTAGGTATCAATCCAGAATGGATTTCAAAAGTTGAAAACCTTAA
TACAGCTTTGAATGAATTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGTAACGCAAGCAAAAAGCGACCTTGAAAATT
CAATTAAAGATGTGATCATCAATCAAAAGATAACGGATAAAGTTGACAATCTCAATCAAGCTGTATCAATAGCTAAGGCA
ACAGGCGATTTCAGTGGGGTAGAGCAAATGCTAGCCGATCTCAAGAATTTCTCAAAGGAGCAATTGGCTCAACAAGCTCA
AAAAAATGAAGATTTCAATACTGGAAAAAATTCTGAACTATACCAATCCGTTAAGAATGGTGTAAATAAAACCCTAGTCG
GTAATGGGTTATCTGGAATAGAGGCCACAGCTCTCGCCAAAAAATTTTCGGATATCGAGAAAGAATTGAATGAGAAATTT
AAAAATTTCAACAACAATAACAATAATGGACTCAAAAACGAACCCATTTATGCTAAAGTTAATAAAAAGAAAACAGGACA
AGTAGCTAGCCCTGAAGAACCCATTTATACTCAAGTTGCTAAAAAGGTAAATGCAAAAATTGACCGACTCAATCAAATAG
CAAGTGGTTTGGGTGGTGTAGGGCAAGCAACGGGCTTCCCTTTGAAAAGGCATGATAAAGTTGATGATCTCAGTAAGGTA
GGGCTTTCAGCTAACCATGAACCCATTTACGCTACGATTGATGATCTCGGCGGACCTTTCCCTTTGAAAAGGCATGATAA
AGTTGATGATCTCAGTAAGGTAGGGCTTTCAGCTAACCATGAACCCATTTACGCTACGATTGATGATCTCGGCGGACCTT
TCCCTTTGAAAAGGCATGATAAAGTTGATGATCTCAGTAAGGTAGGGCTTTCAGCTAACCATGAACCCATTTACGCTACG
ATTGATGATCTCGGCGGACCTTTCCCTTTGAAAAGGCATGATAAAGTTGATGATCTCAGTAAGGTAGGGCTTTCAGCTAA
CCATGAACCCATTTACGCTACGATTGATGATCTCGGCGGACCTTTCCCTTTGAAAAGGCATGATAAAGTTGATGATCTCA
GTAAGGTAGGGCTTTCAGCTAACCATGAACCCATTTACGCTACGATTGATGATCTCGGCGGACCTTTCCCTTTGAAAAGG
CATGATAAAGTTGATGATCTCAGTAAGGTAGGGCTTTCAAGGAATCAAGAATTGGCTCAGAAAATTGACAATCTCAATCA
AGCGGTATCAGAAGCTAAAACATGTCATTTTGACAACCTAGATCAAATGATAGACGAGCTCAAAGATTCTGCAAAAAAGA
ATGTTATGAATCTATATGTTGAAAGTGCAAAAAAAGTGCCTACTAGTTTGTCAGCGAAATTGGACAATTACGCTACTAAC
AGCCACGCAGGCATTAATAGCAATGTCAAAAATGGAACAATCAATGAAAAAGCGACCGGCATGCTAACGCAAAAAAACCC
TGAGTGGCTCAAGCTCGTGAATGATAAGATAGTTGCACATAATGTGGGAAGCGCTCCTTTGTCAGCGTATGATAAAATTG
GATTCAACCAAAAGAATATGAAAGATTATTCTGATTCGTTCAAGTTTTCCACCAGGTTGAGCAATGCCGTAAAAGACATT
AAGTCTGGCTTTGTACAATTTTTAACCAATGCATTTTCTACAGGATCTTACAGCTTGATGAAAGCAAATGTGGAACATGG
AGTCAAAAATACTAATACGAAAGGTGGTTTCCAAAAATCTTAA

Protein sequence :
MTNETINQQPQTEAAFNPQQLINNLQVAFLKVDNAVASFDPDQKPIVDKNDRDNRQAFEKISQLREEFANKAIKNPTKKN
QYFSSFISKSNDLINKDSLIDTGSSIKSFQKFGTQRYQIFMNWVSHQKDPSKINTQKIRGFMENIIQPPISDDKEKAEFL
RSAKQAFAGIIIGNQIRSDQKFMGVFDESLKERQEAEKNGEPNGDPTGGDWLDIFLSFVFNKKQSSDLKETLNQEPVPHV
QPDVATTTTDIQSLPPEARDLLDERGNFSKFTLGDMNMLDVEGVADIDPNYKFNQLLIHNNALSSVLMGSHNGIEPEKVS
LLYGNNGGPEARHDWNATVGYKNQQGDNVATLINVHMKNGSGLVIAGGEKGVNNPSFYLYKEDQLTGLKQALSQEEIQNK
VDFMEFLAQNNAKLDNLSVKEKEKFQNEIEDFQKDSKAYLDALGNDHIAFVSKKDQKHLALITEFGNGDLSYTLKDYGKK
QDKALDREIKTTLQGNLKHDGVMFVNYSNFKYTNASKSPDKGVGATNGVSHLEANLSKVAVFNLPNLNNLAITSYIRRDL
EDKLWAKGLSPQETNKLIKDFLNSDKELVEKVSNLNKAVAEAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVVKKLENRN
DNKNRMEAKAQANSQKDKIFAIINEEAGKEARAAACVQNFKGIRRELSDKLENINKNLKDFDKSFDEFKNGKNKDFSKTE
ETLKALKSSVKDLGINPEWISKVENLNTALNEFKNGKNKDFSKVTQAKSDLENSIKDVIINQKITDKVDNLNQAVSIAKA
TGDFSGVEQMLADLKNFSKEQLAQQAQKNEDFNTGKNSELYQSVKNGVNKTLVGNGLSGIEATALAKKFSDIEKELNEKF
KNFNNNNNNGLKNEPIYAKVNKKKTGQVASPEEPIYTQVAKKVNAKIDRLNQIASGLGGVGQATGFPLKRHDKVDDLSKV
GLSANHEPIYATIDDLGGPFPLKRHDKVDDLSKVGLSANHEPIYATIDDLGGPFPLKRHDKVDDLSKVGLSANHEPIYAT
IDDLGGPFPLKRHDKVDDLSKVGLSANHEPIYATIDDLGGPFPLKRHDKVDDLSKVGLSANHEPIYATIDDLGGPFPLKR
HDKVDDLSKVGLSRNQELAQKIDNLNQAVSEAKTCHFDNLDQMIDELKDSAKKNVMNLYVESAKKVPTSLSAKLDNYATN
SHAGINSNVKNGTINEKATGMLTQKNPEWLKLVNDKIVAHNVGSAPLSAYDKIGFNQKNMKDYSDSFKFSTRLSNAVKDI
KSGFVQFLTNAFSTGSYSLMKANVEHGVKNTNTKGGFQKS