PAI Gene Information


Name : cagA
Accession : AAC44706.1
PAI name : cag PAI
PAI accession : AF282853
Strain : Helicobacter pylori 2017
Virulence or Resistance: Virulence
Product : CagA
Function : -
Note : similar to cytotoxin associated immunodominant antigen encoded by GenBank Accession Number X70039; HP547; cag26; JHP495
Homologs in the searched genomes :   48 hits    ( 47 protein-level,   1 DNA-level )  
Publication :
    -Censini,S., Lange,C., Xiang,Z., Crabtree,J.E., Ghiara,P., Borodovsky,M., Rappuoli,R. and Covacci,A., "cag, a pathogenicity island of Helicobacter pylori, encodes type I-specific and disease-associated virulence factors", Proc. Natl. Acad. Sci. U.S.A. 93 (25), 14648-14653 (1996) PUBMED 8962108.

    -Censini,S., Rappuoli,R. and Covacci,A., "Direct Submission", Submitted (17-MAR-2000) Molecular Biology, Chiron-Biocine, Via Fiorentina 1, Siena, SI 53100, Italy REMARK Sequence update by submitter.

    -Censini,S., Rappuoli,R., Lange,C. and Covacci,A., "Direct Submission", Submitted (06-JUN-1996) Molecular Biology, Chiron-Biocine, Via Fiorentina 1, Siena, SI 53100, Italy.

    -Covacci,A. and Rappuoli,R., "Tyrosine-phosphorylated bacterial proteins: Trojan horses for the host cell", J. Exp. Med. 191 (4), 587-592 (2000) PUBMED 10684850.


DNA sequence :
ATGACTAACGAAACCATTGACCAACAACCACAAACCGAAGCGGCTTTTAACCCGCAGCAATTTATCAATAATCTTCAAGT
AGCTTTTCTTAAAGTTGATAACGCTGTCGCTTCATACGATCCTGATCAAAAACCAATCGTTGATAAGAACGATAGGGATA
ACAGGCAAGCTTTTGAAGGAATCTCGCAATTAAGGGAAGAATACTCCAATAAAGCGATCAAAAATCCTACCAAAAAGAAT
CAGTATTTTTCAGACTTTATCAATAAGAGCAATGATTTAATCAACAAAGACAATCTCATTGATGTAGAATCTTCCACAAA
GAGCTTTCAGAAATTTGGGGATCAGCGTTACCGAATTTTCACAAGTTGGGTGTCCCATCAAAACGATCCGTCTAAAATCA
ACACCCGATCGATCCGAAATTTTATGGAAAATATCATACAACCCCCTATCCTTGATGATAAAGAGAAAGCGGAGTTTTTG
AAATCTGCCAAACAATCTTTTGCAGGAATCATTATAGGGAATCAAATCCGAACGGATCAAAAGTTCATGGGCGTGTTTGA
TGAGTCCTTGAAAGAAAGGCAAGAAGCAGAAAAAAATGGAGAGCCTACTGGTGGGGATTGGTTGGATATTTTTCTCTCAT
TTATATTTGACAAAAAACAATCTTCTGATGTCAAAGAAGCAATCAATCAAGAACCAGTTCCCCATGTCCAACCAGATATA
GCCACTACCACCACCGACATACAAGGCTTACCGCCTGAAGCTAGAGATTTACTTGATGAAAGGGGTAATTTTTCTAAATT
CACTCTTGGCGATATGGAAATGTTAGATGTTGAGGGAGTCGCTGACATTGATCCCAATTACAAGTTCAATCAATTATTGA
TTCACAATAACGCTCTGTCTTCTGTGTTAATGGGGAGTCATAATGGCATAGAACCTGAAAAAGTTTCATTGTTGTATGGG
GGCAATGGTGGTCCTGGAGCTAGGCATGATTGGAACGCCACCGTTGGTTATAAAGACCAACAAGGCAACAATGTGGCTAC
AATAATTAATGTGCATATGAAAAACGGCAGTGGCTTAGTCATAGCAGGTGGTGAGAAAGGGATTAACAACCCTAGTTTTT
ATCTCTACAAAGAAGACCAACTCACAGGCTCACAACGAGCATTAAGTCAAGAAGAGATCCAAAACAAAATAGATTTCATG
GAATTTCTTGCACAAAATAATGCTAAATTAGACAACTTGAGCGAGAAAGAGAAGGAAAAATTCCGAACTGAGATTAAAGA
TTTCCAAAAAGACTCTAAGGCTTATTTAGACGCCCTAGGGAATGATCGTATTGCTTTTGTTTCTAAAAAAGACACAAAAC
ATTCAGCTTTAATTACTGAGTTTGGTAATGGGGATTTGAGCTACACTCTCAAAGATTATGGGAAAAAAGCAGATAAAGCT
TTAGATAGGGAGAAAAATGTTACTCTTCAAGGTAGCCTAAAACATGATGGCGTGATGTTTGTTGATTATTCTAATTTCAA
ATACACCAACGCCTCCAAGAATCCCAATAAGGGTGTAGGCGTTACGAATGGCGTTTCCCATTTAGAAGTAGGCTTTAACA
AGGTAGCTATCTTTAATTTGCCTGATTTAAATAATCTCGCTATCACTAGTTTCGTAAGGCGGAATTTAGAGGATAAACTA
ACCACTAAAGGATTGTCCCCACAAGAAGCTAATAAGCTTATCAAAGATTTTTTGAGCAGCAACAAAGAATTGGTTGGAAA
AACTTTAAACTTCAATAAAGCTGTAGCTGACGCTAAAAACACAGGCAATTATGATGAAGTGAAAAAAGCTCAGAAAGATC
TTGAAAAATCTCTAAGGAAACGAGAGCATTTAGAGAAAGAAGTAGAGAAAAAATTGGAGAGCAAAAGCGGCAACAAAAAT
AAAATGGAAGCAAAAGCTCAAGCTAACAGCCAAAAAGATGAGATTTTTGCGTTGATCAATAAAGAGGCTAATAGAGACGC
AAGAGCAATCGCTTACGCTCAGAATCTTAAAGGCATCAAAAGGGAATTGTCTGATAAACTTGAAAATGTCAACAAGAATT
TGAAAGACTTTGATAAATCTTTTGATGAATTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGCAGAAGAAACACTAAAA
GCCCTTAAAGGTTCGGTGAAAGATTTAGGTATCAATCCAGAATGGATTTCAAAAGTTGAAAACCTTAATGCAGCTTTGAA
TGAATTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGTAACGCAAGCAAAAAGCGACCTTGAAAATTCCGTTAAAGATG
TGATCATCAATCAAAAGGTAACGGATAAAGTTGATAATCTCAATCAAGCGGTATCAGTGGCTAAAGCAACGGGTGATTTC
AGTAGGGTAGAGCAAGCGTTAGCCGATCTCAAAAATTTCTCAAAGGAGCAATTGGCCCAACAAGCTCAAAAAAATGAAAG
TCTCAATGCTAGAAAAAAATCTGAAATATATCAATCCGTTAAGAATGGTGTGAATGGAACCCTAGTCGGTAATGGGTTAT
CTCAAGCAGAAGCCACAACTCTTTCTAAAAACTTTTCGGACATCAAGAAAGAGTTGAATGCAAAACTTGGAAATTTCAAT
AACAATAACAATAATGGACTCAAAAACGAACCCATTTATGCTAAAGTTAATAAAAAGAAAGCAGGGCAAGCAGCTAGCCT
TGAAGAACCCATTTACGCTCAAGTTGCTAAAAAGGTAAATGCAAAAATTGACCGACTCAATCAAATAGCAAGTGGTTTGG
GTGTTGTAGGGCAAGCAGCGGGCTTCCCTTTGAAAAGGCATGATAAAGTTGATGATCTCAGTAAGGTAGGGCTTTCAAGG
AATCAAGAATTGGCTCAGAAAATTGACAATCTCAATCAAGCGGTATCAGAAGCTAAAGCAGGTTTTTTTGGCAATCTAGA
GCAAACGATAGACAAGCTCAAAGATTCTACAAAACACAATCCCATGAATCTATGGGTTGAAAGTGCAAAAAAAGTACCTG
CTAGTTTGTCAGCGAAACTAGACAATTACGCTACTAACAGCCACATACGCATTAATAGCAATATCAAAAATGGAGCAATC
AATGAAAAAGCGACCGGCATGCTAACGCAAAAAAACCCTGAGTGGCTCAAGCTCGTGAATGATAAGATAGTTGCGCATAA
TGTAGGAAGCGTTCCTTTGTCAGAGTATGATAAAATTGGCTTCAACCAGAAGAATATGAAAGATTATTCTGATTCGTTCA
AGTTTTCCACCAAGTTGAACAATGCTGTAAAAGACACTAATTCTGGCTTTACGCAATTTTTAACCAATGCATTTTCTACA
GCATCTTATTACTGCTTGGCGAGAGAAAATGCGGAGCATGGAATCAAGAACGTTAATACAAAAGGTGGTTTCCAAAAATC
TTAA

Protein sequence :
MTNETIDQQPQTEAAFNPQQFINNLQVAFLKVDNAVASYDPDQKPIVDKNDRDNRQAFEGISQLREEYSNKAIKNPTKKN
QYFSDFINKSNDLINKDNLIDVESSTKSFQKFGDQRYRIFTSWVSHQNDPSKINTRSIRNFMENIIQPPILDDKEKAEFL
KSAKQSFAGIIIGNQIRTDQKFMGVFDESLKERQEAEKNGEPTGGDWLDIFLSFIFDKKQSSDVKEAINQEPVPHVQPDI
ATTTTDIQGLPPEARDLLDERGNFSKFTLGDMEMLDVEGVADIDPNYKFNQLLIHNNALSSVLMGSHNGIEPEKVSLLYG
GNGGPGARHDWNATVGYKDQQGNNVATIINVHMKNGSGLVIAGGEKGINNPSFYLYKEDQLTGSQRALSQEEIQNKIDFM
EFLAQNNAKLDNLSEKEKEKFRTEIKDFQKDSKAYLDALGNDRIAFVSKKDTKHSALITEFGNGDLSYTLKDYGKKADKA
LDREKNVTLQGSLKHDGVMFVDYSNFKYTNASKNPNKGVGVTNGVSHLEVGFNKVAIFNLPDLNNLAITSFVRRNLEDKL
TTKGLSPQEANKLIKDFLSSNKELVGKTLNFNKAVADAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVEKKLESKSGNKN
KMEAKAQANSQKDEIFALINKEANRDARAIAYAQNLKGIKRELSDKLENVNKNLKDFDKSFDEFKNGKNKDFSKAEETLK
ALKGSVKDLGINPEWISKVENLNAALNEFKNGKNKDFSKVTQAKSDLENSVKDVIINQKVTDKVDNLNQAVSVAKATGDF
SRVEQALADLKNFSKEQLAQQAQKNESLNARKKSEIYQSVKNGVNGTLVGNGLSQAEATTLSKNFSDIKKELNAKLGNFN
NNNNNGLKNEPIYAKVNKKKAGQAASLEEPIYAQVAKKVNAKIDRLNQIASGLGVVGQAAGFPLKRHDKVDDLSKVGLSR
NQELAQKIDNLNQAVSEAKAGFFGNLEQTIDKLKDSTKHNPMNLWVESAKKVPASLSAKLDNYATNSHIRINSNIKNGAI
NEKATGMLTQKNPEWLKLVNDKIVAHNVGSVPLSEYDKIGFNQKNMKDYSDSFKFSTKLNNAVKDTNSGFTQFLTNAFST
ASYYCLARENAEHGIKNVNTKGGFQKS