PAI Gene Information


Name : cagA (jhp0495)
Accession : NP_223213.1
PAI name : cag PAI
PAI accession : NC_000921_P1
Strain : Helicobacter pylori 2017
Virulence or Resistance: Virulence
Product : cag island protein, cytotoxicity associated immunodominant antigen
Function : -
Note : similar to H. pylori 26695 gene HP0547
Homologs in the searched genomes :   49 hits    ( 47 protein-level,   2 DNA-level )  
Publication :
    -Alm,R.A., Ling,L.-S.L., Moir,D.T., King,B.L., Brown,E.D., Doig,P.C., Smith,D.R., Noonan,B., Guild,B.C., deJonge,B.L., Carmel,G., Tummino,P.J., Caruso,A., Uria-Nickelsen,M., Mills,D.M., Ives,C., Gibson,R., Merberg,D., Mills,S.D., Jiang,Q., Taylor,D.E., Vov, "Genomic-sequence comparison of two unrelated isolates of the human gastric pathogen Helicobacter pylori", Nature 397 (6715), 176-180 (1999) PUBMED 9923682.

    -Alm,R.A., Ling,L.-S.L., Moir,D.T., King,B.L., Brown,E.D., Doig,P.C., Smith,D.R., Noonan,B., Guild,B.C., deJonge,B.L., Carmel,G., Tummino,P.J., Caruso,A., Uria-Nickelsen,M., Mills,D.M., Ives,C., Gibson,R., Merberg,D., Mills,S.D., Jiang,Q., Taylor,D.E., Vov, "Direct Submission", Submitted (13-SEP-2001) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -King,B.L., Alm,R.A. and Trust,T.J., "Direct Submission", Submitted (12-JAN-1999) Astra Research Center Boston, 128 Sidney Street, Cambridge, MA 02139, USA.

    -Merrell,D.S., Thompson,L.J., Kim,C.C., Mitchell,H., Tompkins,L.S., Lee,A. and Falkow,S., "Growth phase-dependent response of Helicobacter pylori to iron starvation", Infect. Immun. 71 (11), 6510-6525 (2003) PUBMED 14573673.


DNA sequence :
ATGACTAACGAAGCCATTAACCAACAACCACAAACCGAAGCGGCTTTTAACCCGCAGCAATTTATCAATAATCTTCAAGT
GGCTTTTATTAAAGTTGATAATGTTGTCGCTTCATTTGATCCTAATCAAAAACCAATCGTTGATAAGAATGATAGGGATA
ATAGGCAAGCTTTTGAGAAAATCTCGCAGCTAAGGGAGGAATTCGCTAATAAAGCGATCAAAAATCCTACCAAAAAGAAT
CAGTATTTTTCAAGCTTTATCAGTAAGAGCAATGATTTAATCGACAAAGACAATCTCATTGATACAGGTTCTTCCATAAA
GAGCTTTCAGAAATTTGGGACTCAGCGTTACCAAATTTTTATGAATTGGGTGTCCCATCAAAACGATCCGTCTAAAATCA
ACACCCAAAAAATCCGAGGTTTTATGGAAAATATCATACAACCCCCTATCTCTGATGATAAAGAGAAAGCGGAGTTTTTG
AGGTCTGCCAAACAAGCTTTTGCAGGAATTATCATAGGAAACCAAATCCGATCGGATCAAAAATTCATGGGCGTGTTTGA
TGAATCTTTGAAAGAGAGGCAAGAAGCAGAAAAAAATGGAGAGCCTAATGGAGATCCTACTGGTGGGGATTGGCTTGATA
TTTTTTTATCATTTGTGTTTAACAAAAAACAATCTTCCGATCTCAAAGAAACGCTCAATCAAGAACCAGTTCCTCATGTC
CAACCAGATGTAGCCACTACCACCACTGACATACAAAGCTTACCGCCTGAAGCTAGGGATTTGCTTGATGAAAGGGGTAA
TTTTTCTAAATTCACTCTTGGCGATATGAACATGTTAGATGTTGAGGGAGTCGCTGACATTGATCCTAATTACAAGTTCA
ACCAATTATTGATCCACAATAACGCTCTGTCTTCTGTGTTAATGGGGAGTCATAATGGCATAGAACCTGAAAAAGTTTCA
TTGTTGTATGGAAACAATGGTGGTCCTGAAGCTAGGCATGATTGGAACGCCACCGTTGGTTATAAAAACCAACGAGGCGA
CAATGTGGCTACACTCATTAATGTGCATATGAAAAATGGCAGTGGGTTAGTCATAGCAGGTGGTGAGAAAGGGATTAACA
ACCCTAGTTTTTATCTCTACAAAGAAGACCAACTCACAGGCTCACAACGAGCATTGAGTCAAGAAGAGATCCAAAACAAA
GTGGATTTCATGGAATTTCTTGCACAAAATAATGCTAAATTAGACAACTTGAGCAAGAAAGAGAAAGAAAAATTCCAAAA
TGAGATTGAAGATTTTCAAAAAGACTCTAAGGCTTATTTAGACGCCCTAGGGAATGATCACATTGCTTTTGTTTCTAAAA
AAGACAAAAAACATTTAGCTTTAGTTGCTGAGTTTGGTAATGGGGAATTGAGCTACACTCTCAAAGATTATGGGAAAAAA
GCAGATAAAGCTTTAGATAGGGAGGCAAAAACCACTCTTCAAGGTAGCCTAAAACATGATGGCGTGATGTTTGTTGATTA
TTCTAATTTCAAATACACCAACGCCTCCAAGAGTCCTGATAAGGGTGTGGGTGCTACGAATGGCGTTTCCCATTTAGAAG
CAGGCTTTAGCAAGGTAGCTGTCTTTAATTTGCCTAATTTAAATAATCTCGCTATCACTAGTGTCGTAAGGCAGGATTTA
GAGGATAAACTAATCGCTAAAGGATTGTCCCCACAAGAAGCTAATAAGCTTGTCAAAGATTTTTTGAGCAGCAACAAAGA
ATTGGTTGGAAAAGCTTTAAACTTCAATAAAGCTGTAGCTGAAGCTAAAAACACAGGCAACTATGACGAGGTGAAACAAG
CTCAGAAAGATCTTGAAAAATCTCTAAAGAAACGAGAGCGTTTGGAGAAAGATGTAGCGAAAAATTTGGAGAGCAAAAGC
GGCAACAAAAATAAAATGGAAGCAAAATCTCAAGCTAACAGCCAAAAAGATGAGATTTTTGCGTTGATCAATAAAGAGGC
TAATAGGGATGCAAGAGCAATCGCTTACGCTCAGAATCTTAAAGGCATCAAAAGGGAATTGTCTGATAAACTTGAAAATA
TCAACAAGGATTTGAAAGACTTTAGTAAATCTTTTGATGAATTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGCAGAA
GAAACACTAAAAGCCCTTAAAGGCTCGGTGAAAGATTTAGGTATCAATCCAGAATGGATTTCAAAAGTTGAAAACCTTAA
TGCAGCTTTGAATGAATTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGTAACGCAAGCAAAAAGCGACCTTGAAAATT
CCATTAAAGATGTGATCATCAATCAAAAGATAACGGATAAAGTTGATAATCTCAATCAAGCGGTATCAGTGGCTAAAGCA
ACGGGTGATTTCAGTGGGGTAGAGCAAGCGTTAGCCGATCTCAAAAATTTCTCAAAGGAGCAATTGGCTCAACAAGCTCA
AAAAAATGAAGATTTCAATACTGGAAAAAATTCTGCACTATACCAATCCGTTAAGAATGGTGTAAACGGAACCCTAGTCG
GTAATGGGTTATCTAAAGCAGAAGCCACAACTCTTTCTAAAAACTTTTCGGACATCAAGAAAGAGTTGAATGCAAAACTT
GGAAATTTCAATAACAATAACAATAATGGACTCGAAAACAGCACAGAACCCATTTATACTCAAGTTGCTAAAAAGGTAAA
AGCAAAAATTGACCGACTCGATCAAATAGCAAGTGGTTTGGGTGATGTAGGGCAAGCAGCGAGCTTCCTTTTGAAAAGGC
ATGATAAAGTTGATGATCTCAGTAAGGTAGGGCTTTCAGCTAACCATGAACCCATTTACGCTACGATTGATGATCTCGGC
GGACCTTTCCCTTTGAAAAGGCATGATAAAGTTGATGATCTCAGTAAGGTAGGGCTTTCAAGGGAGCAAAAATTGACTCA
GAAAATTGACAATCTCAACCAGGCGGTATCAGAAGCTAAAGCAAGTCATTTTGACAACCTAGATCAAATGATAGACAAGC
TCAAAGATTCTACAAAAAAGAATGTTGTGAATCTATATGTTGAAAGTGCAAAAAAAGTGCCTACTAGTTTGTCAGCGAAA
TTGGACAATTACGCTACTAACAGCCACACACGCATTAATAGCAATGTCAAAAATGGAACAATCAATGAAAAAGCGACCGG
CATGCTAACGCAAAAAAATTCTGAGTGGCTCAAGCTCGTGAATGATAAGATAGTTGCGCATAATGTGGGAAGTGCTCCTT
TGTCAGCGTATGATAAAATTGGATTCAACCAAAAGAATATGAAAGATTATTCTGATTCGTTCAAGTTTTCCACCAGGTTG
AGCAATGCCGTAAAAGACATTAAGTCTGGCTTTGTGCAATTTTTAACCAATATATTTTCTATGGGATCTTACAGCTTGAT
GAAAGCAAGTGTGGAACATGGAGTCAAAAATACTAATACAAAAGGTGGTTTCCAAAAATCTTAA

Protein sequence :
MTNEAINQQPQTEAAFNPQQFINNLQVAFIKVDNVVASFDPNQKPIVDKNDRDNRQAFEKISQLREEFANKAIKNPTKKN
QYFSSFISKSNDLIDKDNLIDTGSSIKSFQKFGTQRYQIFMNWVSHQNDPSKINTQKIRGFMENIIQPPISDDKEKAEFL
RSAKQAFAGIIIGNQIRSDQKFMGVFDESLKERQEAEKNGEPNGDPTGGDWLDIFLSFVFNKKQSSDLKETLNQEPVPHV
QPDVATTTTDIQSLPPEARDLLDERGNFSKFTLGDMNMLDVEGVADIDPNYKFNQLLIHNNALSSVLMGSHNGIEPEKVS
LLYGNNGGPEARHDWNATVGYKNQRGDNVATLINVHMKNGSGLVIAGGEKGINNPSFYLYKEDQLTGSQRALSQEEIQNK
VDFMEFLAQNNAKLDNLSKKEKEKFQNEIEDFQKDSKAYLDALGNDHIAFVSKKDKKHLALVAEFGNGELSYTLKDYGKK
ADKALDREAKTTLQGSLKHDGVMFVDYSNFKYTNASKSPDKGVGATNGVSHLEAGFSKVAVFNLPNLNNLAITSVVRQDL
EDKLIAKGLSPQEANKLVKDFLSSNKELVGKALNFNKAVAEAKNTGNYDEVKQAQKDLEKSLKKRERLEKDVAKNLESKS
GNKNKMEAKSQANSQKDEIFALINKEANRDARAIAYAQNLKGIKRELSDKLENINKDLKDFSKSFDEFKNGKNKDFSKAE
ETLKALKGSVKDLGINPEWISKVENLNAALNEFKNGKNKDFSKVTQAKSDLENSIKDVIINQKITDKVDNLNQAVSVAKA
TGDFSGVEQALADLKNFSKEQLAQQAQKNEDFNTGKNSALYQSVKNGVNGTLVGNGLSKAEATTLSKNFSDIKKELNAKL
GNFNNNNNNGLENSTEPIYTQVAKKVKAKIDRLDQIASGLGDVGQAASFLLKRHDKVDDLSKVGLSANHEPIYATIDDLG
GPFPLKRHDKVDDLSKVGLSREQKLTQKIDNLNQAVSEAKASHFDNLDQMIDKLKDSTKKNVVNLYVESAKKVPTSLSAK
LDNYATNSHTRINSNVKNGTINEKATGMLTQKNSEWLKLVNDKIVAHNVGSAPLSAYDKIGFNQKNMKDYSDSFKFSTRL
SNAVKDIKSGFVQFLTNIFSMGSYSLMKASVEHGVKNTNTKGGFQKS