PAI Gene Information


Name : cagA
Accession : BAC10421.1
PAI name : cag PAI
PAI accession : AB090075
Strain : Helicobacter pylori 2017
Virulence or Resistance : Virulence
Product : CagA
Function : -
Note : -
Homologs in the searched genomes : 49 hits (47 protein-level, 2 DNA-level)
Publication :
    -Higashi,H., Tsutsumi,R., Fujita,A., Yamazaki,S., Asaka,M., Azuma,T. and Hatakeyama,M., "Biological activity of the Helicobacter pylori virulence factor CagA is determined by variation in the tyrosine phosphorylation sites", Proc. Natl. Acad. Sci. U.S.A. 99 (22), 14428-14433 (2002) PUBMED 12391297.

    -Yamakawa,A., Yamazaki,S. and Azuma,T., "Direct Submission", Submitted (19-AUG-2002) Takeshi Azuma, Fukui Medical University, Second Department of Internal Medicine; Matsuoka-cho, Yoshida-gun, Fukui 910-1193, Japan (E-mail:azuma@fmsrsa.fukui-med.ac.jp, Tel:81-776-61-8351, Fax:81-776-61-8110).


DNA sequence :
ATGACTAACGAAACTATTGATCAAACAACAACACCAGACCAAACGGGTTTTGTTCCGCAACGATTTATCAATAATCTTCA
AGTAGCTTTTATCAAAGTTGATAACGCTGTCGCTTCATTTGATCCTGATCAAAAATCAATCGTTGATAAGAATGATAAGG
ATAACAGGCAAGCTTTTGAGAAAATCTCGCAACTAAGGGAAGAATACGCCAATAAAGCGATCAAAAATCCTGCCAAAAAG
AATCAGTATTTTTCAGACTTTATCAATAAGAGCAATGATTTGATCAACAAAGACAATCTCATTGCTGTAGATTCTTCCGT
AGAGAGCTTTCGGAAATTTGGGGATCAGCGTTACCAAATTTTTACGAGTTGGGTGTCCCTTCAAAAAGATCCGTCTAAAA
TCAACACCCAACAAATCCGAAATTTTATGGAAAATGTCATACAACCCCCTATCTCTGATGATAAAGAAAAAGCGGAGTTT
TTGAGGTCTGCCAAACAATCTTTTGCAGGAATTATCATAGGGAACCAAATCCGATCGGATGAAAAATTCATGGGCGTGTT
TGATGAATCTTTGAAAGCAAGGCAAGAAGCAGAAAAAAATGCAGAGCCTGCTGGTGGGGATTGGCTTGATATTTTTTTAT
CATTTGTATTTAACAAAAAACAATCTTCCGATCTCAAAGAAACGCTCAATCAAGAGCCAAGGCCTGATTTTGAACAAAAT
TTAGCCACTACCACCACCGACATACAAGGCTTACCGCCTGAAGCTAGAGATTTGCTTGATGAAAGGGGTAATTTTTTTAA
ATTCACTCTTGGTGATGTGGAGATGTTGGATGTTGAGGGAGTCGCTGACAAGGATCCCAATTACAAGTTCAATCAATTAT
TGATCCACAATAACGCTTTATCTTCTATGCTAATGGGGAGTCATAGTAACATAGAACCTGAAAAAGTTTCATTATTGTAT
GGGGATAATGGTGGTCCTGAAGCTAGGCATGATTGGAACGCCACCGTTGGTTATAAAGACCAACAGGGCAACAATGTGGC
CACACTCATTAATGCGCATCTTAATAACGACAGTGGGTTAATCATAGCGGGTAATGAGGATGGGATTAAAAATCCTAGCT
TCTACCTCTACAAAGAAGATCAACTCACAGGCTTGAAACAAGCAATGAGTCAAGAAGAGATCCAAAACAAAGTGGATTTC
ATGGAATTTCTTGCACAAAACAATGCTAAATTAGACAACTTGAGCGAGAAAGAGAAAGAAAAATTCCAAGCTGAGATTGA
AAATTTCCAAAAAGACCGTAAGGCTTATTTGGACGCTCTAGGGAATGATCACATTGCTTTTGTTTCTAAAAAAGACCCAA
AACATTTAGCTTTGGTTACTGAGTTTGGTAATGGGGAAGTGAGCTATACTCTCAAAGATTATGGGAAAAAACAAGATAAA
GCTTTAGATGGGGAGACAAAAACCACTCTTCAAGGTAACCTAAAATATGATGGCGTGATGTTTGTTGATTATTCTAATTT
CAAATACACCAACGCCTCCAAGAGTCCTGATAAGGGTGTAGGCGCTACGAATGGCGTTTCCCATTTGGAAGCGAATTTTA
GCAAGGTAGCTGTCTTTAATTTGCCTAATTTAAATAATCTCGCTATCACTAATTATATAAGGCGAGATTTAGAAGATAAA
TTGTGGGCTAAAGGATTGTCCTCACAAGAAGCTAATAAGCTCATCAAAGACTTTTTGAACAGCAACAAAGAAATGGGGGG
AAAAGTGTTAAACTTCAATAAAGCTGTAGCTGAAGCTAAAAACACAGGCAATTATGATGAAGTGAAAAAAGCTCAGAAAG
ATCTTGAAAAATCTCTAAGGAAACGAGAGCATTTAGAGAAAGAAGTAGCGAAAAAATTGGAGAGCAGAAACGACAACAAA
AATAGAATGGAAGCAAAAGCTCAAGCTAACAGCCAAAAAGATAAGATTTTTGCGTTGATCAATCAAGAGGCTAGTAAGGA
AGCAAGAGCGGCCGTTTTCGATCCGAATCTTAAAGGCATCAGGAGCGAATTGTCTGATAAACTTGAAAATATCAACAAGA
ATTTGAAAGACTTTGGCAAATCTTTTGATGAACTCAAAAATGGCAAAAATAATGATTTCAGCAAGGCAGAAGAAACGCTA
AAAGCCCTTAAAGACTCGGTGAAAGATTTAGGCATCAATCCAGAATGGATTTCAAAAATTGAAAACCTTAATGTAGCTTT
GAATGATTTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGTAACACAAGCAAAAAGCGACCTTGAAAATTCCATTAAGG
ATGTGATCATTAATCAAAAGATAACGGATAAAGTTGACAATCTCAATCAGGCTGTATCAGAGACTAAATTAACAGGCGAT
TTCAGTAAGGTAGAGCAAGCCCTAGCCGAACTCAAAAGCTTGTCATTGGATCTTGGAAAAAATTCTGATCTACAAAAATC
CGTTAAAAATGGTGTAAATGGAACCCTAGTCGGTAATGGGTTGTCTAAAACAGAAGCCACAACGCTCACCAAAAATTTTT
CGGACATCAGAAAAGAATTGAACGAGAAGTTATTTGGAAATTCCAATAACAATAATAATGGACTCAAAAATAGTGCAGAG
CCTATTTACGCTAAAGTTAATAAAAAGAAAACAGGACAAGCAACTAGCCCTGAAGAGCCCATTTATGCTCAAGTTGCTAA
AAAAGTGAGTGCAAAAATTGACCAACTCAACGAATCTACATCAGCAATAAATAGAAAAATTGACCGGATTAACAAAATTG
CATCAGCAGGTAAAGGAGTGGGCGGTTTCAGTGGAGCAGGGCGATCAGCTAGCCCTGAACCCATTTACGCTACAATTGAT
TTTGATGAGGCAAATCAAGCAGGCTTCCCTTTGAGAAGAAGTGCTGCAGTTAATGATCTCAGTAAAGTAGGACTTTCAAG
GGAACAAGAATTGACTCGTAGAATTGGCGATCTCAATCAGGCGGTATCAGAAGCTAAAACAGGTCATTTTGACAACCTAG
AACAAAAGATAGATGAACTCAAAGATTCTACGAAAAAGAATGCTTTGAAGCTATGGGTTGAAAGCGCGAAACAAGTGCCT
ACTGGTTTGCAAGCGAAATTGGACAATTACGCTACTAACAGCCACACACGCATTAATAGCAATGTCCACAATGGAGCAAT
CAATGAAAAAGCGACTGGCATGCTAACGCAAAAAAACCCTGAGTGGCTCAAGCTCGTGAATGATAAGATAGTTGCACATA
ATGTGGGAAGTGCTCATTTGTCAGAGTATGATAAAATTGGATTCAACCAAAAGAATATGAAAGATTATTCTGATTCGTTC
AAGTTTTCCACCAAGTTGAACAATGCCGTAAAAGACATTAAGTCTAGCTTTGTGCAATTTTTAACCAATACATTTTCTAC
AGGATCTTACAGCTTGACGAAAGCAAATGTGGAACATGGAGTCAAAAATACTACAAAAGGTGGTTTCCAAAAATCTTAA

Protein sequence :
MTNETIDQTTTPDQTGFVPQRFINNLQVAFIKVDNAVASFDPDQKSIVDKNDKDNRQAFEKISQLREEYANKAIKNPAKK
NQYFSDFINKSNDLINKDNLIAVDSSVESFRKFGDQRYQIFTSWVSLQKDPSKINTQQIRNFMENVIQPPISDDKEKAEF
LRSAKQSFAGIIIGNQIRSDEKFMGVFDESLKARQEAEKNAEPAGGDWLDIFLSFVFNKKQSSDLKETLNQEPRPDFEQN
LATTTTDIQGLPPEARDLLDERGNFFKFTLGDVEMLDVEGVADKDPNYKFNQLLIHNNALSSMLMGSHSNIEPEKVSLLY
GDNGGPEARHDWNATVGYKDQQGNNVATLINAHLNNDSGLIIAGNEDGIKNPSFYLYKEDQLTGLKQAMSQEEIQNKVDF
MEFLAQNNAKLDNLSEKEKEKFQAEIENFQKDRKAYLDALGNDHIAFVSKKDPKHLALVTEFGNGEVSYTLKDYGKKQDK
ALDGETKTTLQGNLKYDGVMFVDYSNFKYTNASKSPDKGVGATNGVSHLEANFSKVAVFNLPNLNNLAITNYIRRDLEDK
LWAKGLSSQEANKLIKDFLNSNKEMGGKVLNFNKAVAEAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVAKKLESRNDNK
NRMEAKAQANSQKDKIFALINQEASKEARAAVFDPNLKGIRSELSDKLENINKNLKDFGKSFDELKNGKNNDFSKAEETL
KALKDSVKDLGINPEWISKIENLNVALNDFKNGKNKDFSKVTQAKSDLENSIKDVIINQKITDKVDNLNQAVSETKLTGD
FSKVEQALAELKSLSLDLGKNSDLQKSVKNGVNGTLVGNGLSKTEATTLTKNFSDIRKELNEKLFGNSNNNNNGLKNSAE
PIYAKVNKKKTGQATSPEEPIYAQVAKKVSAKIDQLNESTSAINRKIDRINKIASAGKGVGGFSGAGRSASPEPIYATID
FDEANQAGFPLRRSAAVNDLSKVGLSREQELTRRIGDLNQAVSEAKTGHFDNLEQKIDELKDSTKKNALKLWVESAKQVP
TGLQAKLDNYATNSHTRINSNVHNGAINEKATGMLTQKNPEWLKLVNDKIVAHNVGSAHLSEYDKIGFNQKNMKDYSDSF
KFSTKLNNAVKDIKSSFVQFLTNTFSTGSYSLTKANVEHGVKNTTKGGFQKS