PAI Gene Information


Name : cagA
Accession : AAF17597.1
PAI name : cag PAI
PAI accession : AF202972
Strain : Helicobacter pylori 2017
Virulence or Resistance: Virulence
Product : CagA
Function : -
Note : -
Homologs in the searched genomes :   48 hits    ( 47 protein-level,   1 DNA-level )  
Publication :
    -Asahi,M., Azuma,T., Ito,S., Ito,Y., Suto,H., Nagai,Y., Tsubokawa,M., Tohyama,Y., Maeda,S., Omata,M., Suzuki,T. and Sasakawa,C., "Helicobacter pylori CagA protein can be tyrosine phosphorylated in gastric epithelial cells", J. Exp. Med. 191 (4), 593-602 (2000) PUBMED 10684851.

    -Ito,Y. and Azuma,T., "Direct Submission", Submitted (09-NOV-1999) Second Department of Internal Medicine, Fukui Medical University, Matsuoka-cho, Yoshida-gun, Fukui 910-1193, Japan.


DNA sequence :
ATGACTAACGAAACTATTGATCAAACAACAACACCAGACCAAACGGGTTTTGTTCCGCAACGATTTATCAATAATCTTCA
AGTAGCTTTTATCAAAGTTGATAACGCTGTCGCTTCATTTGATCCTGATCAAAAACCAATCGTTGATAAGAATGATAAGG
ATAACAGGCAAGCTTATGAGAAAATCTCGCAACTAAGGGAAGAATACGCCAATAAAGCGATCAAAAATCCTGCCAAAAAG
AATCAGTATTTTTCAGACTTTATCAATAAGAGCAATGATTTGATCAACAAAGACAATCTCATTGCTGTAGATTCTTCCGT
AGAGAGCTTTCGGAAATTTGGGGATCAGCGTTACCAAATTTTTACGAGTTGGGTGTCCCTTCAAAAAGATCCGTCTAAAA
TCAACACCCAACAAATCCGAAATTTTATGGAAAATGTCATAAAACCCCCTATCTCTGATGATAAAGAAAAAGCGGAGTTT
TTGAGGTCTGCCAAACAATCTTTTGCAGGAATTATCATAGGGAACCAAATCCGATCGGATGAAAAATTCATGGGCGTGTT
TGATGAATCTTTGAAAGCAAGGCAAGAAGCAGAAAAAAATGCAGAGCCTGCTGGTGGGGATTGGCTTGATATTTTTTTAT
CATTTGTATTTAACAAAAAACAATCTTCCGATCTCAAAGAAACGCTCAATCAAGAGCCAAGGCCTGATTTTGAACAAAAT
TTAGCCACTACCACCACCGACATACAAGGCTTACCGCCTGAAGCTAGAGATTTGCTTGATGAAAGGGGTAATTTTTTTAA
ATTCACTCTTGGTGATGTGGAGATGTTGGATGTTGAGGGAGTCGCTGACAAGGATCCCAATTACAAGTTCAATCAATTAT
TGATCCACAATAACGCTTTATCTTCTATGCTAATGGGGAGTCATAGTAACATAGAACCTGAAAAGGTTTCATTATTGTAT
GGGGATAATGGTGGCCCTGAAGCTAGGCATGATTGGAACGCTACCGTTGGTTATAAAAACCAACAAGGCAACAATGTGGC
CACACTCATTAATGCGCATCTTAATAACGGCAGCGGGTTAATCATAGCGGGTAATGAGGATGGGATTAAAAACCCTAGCT
TCTATCTCTATAAAGAAGATCAACTCACAGGTTTGAAACAAGCGTTGAGTCAAGAAGAGATCCAAAACAAAGTGGATTTC
ATGGAATTTCTCGCACAAAACAATGCTAAATTAGACAACTTGAGCGAGAAAGAGAAAGAAAAATTCCAAACTGAGATTGA
AAATTTCCAAAAAGACCGTAAGGCTTATTTGGACGCTCTAGGGAATGATCACATTGCTTTTGTTTCTAAAAAAGACCCAA
AACATTTAGCTTTGGTTACTGAGTTTGGTAATGGGGAATTGAGCTATACTCTCAAAGATTATGGGAAAAAACAAGATAAA
GCTTTAGATGGGGAGACAAAAACCACTCTTCAAGGTAGCCTAAAATATGATGGCGTGATGTTTGTCAATTATTCTAATTT
CAAATACACCAACGCCTCCAAGAGTCCTAATAAGGGTTTAGGCACTACGAATGGCGTTTCCCATTTGGAAGCGAATTTTA
GCAAAGTAGCTGTCTTTAATTTGCCTAATTTAAATAATCTCGCTATCACTAATTATATAAGGCGAGATTTAGAAGATAAA
CTGTGGGCTAAAGGATTGTCCCCACAAGAAGCTAATAAGCTCATCAAAGACTTTTTAAACAGCAACAAAGAAATGGTTGG
AAAAGTTTCAAACTTCAATAAAGCTGTAGCTGAAGCTAAAAACACAGGCAATTATGATGAAGTGAAAAAAGCTCAGAAAG
ATCTTGAAAAATCTCTAAGGAAACGAGAGCATTTAGAGAAAGAAGTAGCGAAAAAATTGGAGAGCAGAAACGACAACAAA
AATAGAATGGAAGCAAAAGCTCAAGCTAACAGCCAAAAAGATAAGATTTTTGCACTTATCAGTCAAGAGGCTAGTAAGGA
AGCAAGAGTGGCCACTTTCGATCCGTATCTTAAAGGCGTCAGGAGCGAATTGTCTGATAAACTTGAAAATATCAACAAGA
ATTTGAAAGACTTTGGCAAATCTTTTGATGAACTCAAAAGTGGCAAAAATAATGATTTCAGCAAGGCAGAAGAAACGCTA
AAAGCCCTTAAAGACTCGGTGAAAGATTTAGGCATCAATCCAGAATGGATTTCAAAAATTGAAAACCTTAATGCAGCTTT
GAATGATTTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGTAACACAAGCAAAAAGCGACCTTGAAAATTCCATTAAGG
ATGTGATCATTAATCAAAAGATAACGGATAAAGTTGACAATCTCAATCAGGCTGTATCAGAGATTAAATTAACAGGCGAT
TTCAGTAAGGTAGAGCAAGCCCTAGCCGAACTCAAAAACTTGTCATTGGATCTTGGAAAAAATTCTGATCTACAAAAATC
CGTTAAAAATGGTGTAAATGGAACCCTAGTCAGTAATGGGTTGTCTAAAACAGAAGCCACAACGCTCACCAAAAATTTTT
CGGACATCAGGAAAGAATTGAACGAGAAATTATTTGGAAATTCCAATAACAATAATAATGGACTCAAAAACAACACAGAG
CCTATTTATGCTCAAGTTAATAAAAAGAAAACAGGACAAGCAACTAGCCCTGAAGAGCCCATTTACGCTCAAGTTGCTAA
AAAGGTGAGTGCAAAAATTGACCAACTCAACGAAGCTACATCAGCAATAAATAGAAAAATTGACCGGATTAACAAAATTG
CATCAGCAGGTAAAGGAGTGGGCGGTTTCAGTGGAGCAGGGCGATCAGCTAGCCCTGAACCCATTTACGCTACAATTGAT
TTTGATGAGGCAAATCAAGCAGGCTTCCCTTTGAGAAGAAGTGCTGCAGTTAATGATCTCAGTAAAGTAGGGCTTTCAAG
GGAACAAGAATTGACTCGTAGAATTGGCGATCTCAGTCAGGCAGTGTCAGAAGCTAAAACAGGTCATTTTGGCAACCTAG
AACAAAAGATAGATGAACTCAAAGATTCTACAAAAAAGAATGCTTTGAAGCTATGGGTTGAAAGTGCGAAACAAGTGCCT
ACTAGTTTGCAAGCGAAATTGGACAATTACGCTACTAACAGCCACACACGCATTAATAGCAATGTCCAAAGTGGAACAAT
CAATGAAAAAGCGACCGGCATGCTAACGCAAAAAAACCCTGAGTGGCTCAAGCTCGTGAATGATAAGATAGTTGCGCATA
ATGTGGGAAGTGCTCCTTTGTCAGCGTATGATAAAATTGGATTCAACCAAAAGAATATGAAAGATTATTCTGACTCGTTC
AAGTTTTCCACCAAGTTGAACAATGCCGTAAAAGACATTAAGTCTAGCTTTGTGCAATTTTTAACCAATACATTTTCTAC
AGGATCTTACAGCTTGATGAAAGCAAATGTGGAACATGGAGTCAAAAATACTAATACAAAAGGTGGTTTCCAAAAATCTT
AA

Protein sequence :
MTNETIDQTTTPDQTGFVPQRFINNLQVAFIKVDNAVASFDPDQKPIVDKNDKDNRQAYEKISQLREEYANKAIKNPAKK
NQYFSDFINKSNDLINKDNLIAVDSSVESFRKFGDQRYQIFTSWVSLQKDPSKINTQQIRNFMENVIKPPISDDKEKAEF
LRSAKQSFAGIIIGNQIRSDEKFMGVFDESLKARQEAEKNAEPAGGDWLDIFLSFVFNKKQSSDLKETLNQEPRPDFEQN
LATTTTDIQGLPPEARDLLDERGNFFKFTLGDVEMLDVEGVADKDPNYKFNQLLIHNNALSSMLMGSHSNIEPEKVSLLY
GDNGGPEARHDWNATVGYKNQQGNNVATLINAHLNNGSGLIIAGNEDGIKNPSFYLYKEDQLTGLKQALSQEEIQNKVDF
MEFLAQNNAKLDNLSEKEKEKFQTEIENFQKDRKAYLDALGNDHIAFVSKKDPKHLALVTEFGNGELSYTLKDYGKKQDK
ALDGETKTTLQGSLKYDGVMFVNYSNFKYTNASKSPNKGLGTTNGVSHLEANFSKVAVFNLPNLNNLAITNYIRRDLEDK
LWAKGLSPQEANKLIKDFLNSNKEMVGKVSNFNKAVAEAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVAKKLESRNDNK
NRMEAKAQANSQKDKIFALISQEASKEARVATFDPYLKGVRSELSDKLENINKNLKDFGKSFDELKSGKNNDFSKAEETL
KALKDSVKDLGINPEWISKIENLNAALNDFKNGKNKDFSKVTQAKSDLENSIKDVIINQKITDKVDNLNQAVSEIKLTGD
FSKVEQALAELKNLSLDLGKNSDLQKSVKNGVNGTLVSNGLSKTEATTLTKNFSDIRKELNEKLFGNSNNNNNGLKNNTE
PIYAQVNKKKTGQATSPEEPIYAQVAKKVSAKIDQLNEATSAINRKIDRINKIASAGKGVGGFSGAGRSASPEPIYATID
FDEANQAGFPLRRSAAVNDLSKVGLSREQELTRRIGDLSQAVSEAKTGHFGNLEQKIDELKDSTKKNALKLWVESAKQVP
TSLQAKLDNYATNSHTRINSNVQSGTINEKATGMLTQKNPEWLKLVNDKIVAHNVGSAPLSAYDKIGFNQKNMKDYSDSF
KFSTKLNNAVKDIKSSFVQFLTNTFSTGSYSLMKANVEHGVKNTNTKGGFQKS