PAI Gene Information


Name : cagA (HPF16_0811)
Accession : YP_005779046.1
PAI name : cag PAI
PAI accession : NC_017368_P1
Strain : Helicobacter pylori 2017
Virulence or Resistance: Virulence
Product : cag pathogenicity island protein
Function : -
Note : -
Homologs in the searched genomes :   48 hits    ( 47 protein-level,   1 DNA-level )  
Publication :
    -Furuta,Y., Kawai,M., Yahara,K., Takahashi,N., Handa,N., Tsuru,T., Oshima,K., Yoshida,M., Azuma,T., Hattori,M., Uchiyama,I. and Kobayashi,I., "Birth and death of genes linked to chromosomal inversion", Proc. Natl. Acad. Sci. U.S.A. 108 (4), 1501-1506 (2011) PUBMED 21212362 REMARK DOI:10.1073/pnas.1012579108.

    -Kawai,M., Furuta,Y., Yahara,K., Tsuru,T., Oshima,K., Handa,N., Takahashi,N., Yoshida,M., Azuma,T., Hattori,M., Uchiyama,I. and Kobayashi,I., "Evolution in an oncogenic bacterial species with extreme genome plasticity: Helicobacter pylori East Asian genomes", BMC Microbiol. 11, 104 (2011) PUBMED 21575176 REMARK Publication Status: Online-Only.

    -Kawai,M., Furuta,Y., Yahara,K., Tsuru,T., Oshima,K., Handa,N., Takahashi,N., Yoshida,M., Azuma,T., Hattori,M., Uchiyama,I. and Kobayashi,I., "Direct Submission", Submitted (05-APR-2012) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Kawai,M., Furuta,Y., Yahara,K., Tsuru,T., Takahashi,N., Handa,N., Oshima,K., Hattori,M., Yoshida,M., Azuma,T., Uchiyama,I. and Kobayashi,I., "Direct Submission", Submitted (22-JUL-2010) Contact:Masahira Hattori The University of Tokyo, Center for Omics and Bioinformatics; Kashiwanoha 5-1-5, Kashiwa, Chiba 277-8561, Japan.


DNA sequence :
ATGACTAATGAAACCATTGATCAAACAACAACACCAGACCAAACGGGTTTTGTTCCGCAACGATTTATCAATAATCTTCA
AGTAGCTTTTATCAAAGTTGATAACGCTGTCGCTTCATTTGATCCTGATCAAAAATCAATCGTTGATAAGAATGATAAGG
ATAACAGGCAAGCTTTTGAGAAAATCTCGCAACTAAGGGAAGAATACGCCAATAAAGCGATCAAAAATCCTGCCAAAAAG
AATCAGTATTTTTCAGACTTTATCAATAAGAGCAATGATTTGATCAACAAAGACAATCTCATTGCTGTAGATTCTTCCGT
AGAGAGCTTTCGGAAATTTGGGGATCAGCGTTACCAAATTTTTACGAGTTGGGTGTCCCTTCAAAAAGATCCGTCTAAAA
TCAACACCCAACAAATCCGAAATTTTATGGAAAATGTCATACAACCCCCTATCTCTGATGATAAAGAAAAAGCGGAGTTT
TTGAGGTCTGCCAAACAATCTTTTGCAGGAATTATCATAGGGAACCAAATCCGATCGGATGAAAAATTCATGGGCGTGTT
TGATGAATCTTTGAAAGCAAGGCAAGAAGCAGAAAAAAATGCAGAGCCTGCTGGTGGGGATTGGCTTGATATTTTTTTAT
CATTTGTATTTAACAAAAAACAATCTTCCGATCTCAAAGAAACGCTCAATCAAGAGCCAAGGCCTGATTTTGAACAAAAT
TTAGCCACTACCACCACCGACATACAAGGCTTACCGCCTGAAGCTAGAGATTTGCTTGATGAAAGGGGTAATTTTTTTAA
ATTCACTCTTGGTGATGTGGAGATGTTGGATGTTGAGGGAGTCGCTGACAAGGATCCCAATTACAAGTTCAATCAATTAT
TGATCCACAATAACGCTTTATCTTCTATGCTAATGGGGAGTCATAGTAACATAGAACCTGAAAAAGTTTCATTATTGTAT
GGGGATAATGGTGGTCCTGAAGCTAGGCATGATTGGAACGCCACCGTTGGTTATAAAGACCAACAGGGCAACAATGTGGC
CACACTCATTAATGCGCATCTTAATAACGACAGTGGGTTAATCATAGCGGGTAATGAGGATGGGATTAAAAATCCTAGCT
TCTACCTCTACAAAGAAGATCAACTCACAGGCTTGAAACAAGCAATGAGTCAAGAAGAGATCCAAAACAAAGTGGATTTC
ATGGAATTTCTTGCACAAAACAATGCTAAATTAGACAACTTGAGCGAGAAAGAGAAAGAAAAATTCCAAGCTGAGATTGA
AAATTTCCAAAAAGACCGTAAGGCTTATTTGGACGCTCTAGGGAATGATCACATTGCTTTTGTTTCTAAAAAAGACCCAA
AACATTTAGCTTTGGTTACTGAGTTTGGTAATGGGGAAGTGAGCTATACTCTCAAAGATTATGGGAAAAAACAAGATAAA
GCTTTAGATGGGGAGACAAAAACCACTCTTCAAGGTAACCTAAAATATGATGGCGTGATGTTTGTTGATTATTCTAATTT
CAAATACACCAACGCCTCCAAGAGTCCTGATAAGGGTGTAGGCGCTACGAATGGCGTTTCCCATTTGGAAGCGAATTTTA
GCAAGGTAGCTGTCTTTAATTTGCCTAATTTAAATAATCTCGCTATCACTAATTATATAAGGCGAGATTTAGAAGATAAA
TTGTGGGCTAAAGGATTGTCCTCACAAGAAGCTAATAAGCTCATCAAAGACTTTTTGAACAGCAACAAAGAAATGGTTGG
AAAAGTTTTAAACTTCAATAAAGCTGTAGCTGAAGCTAAAAACACAGGCAATTATGATGAAGTGAAAAAAGCTCAGAAAG
ATCTTGAAAAATCTCTAAGGAAACGAGAGCATTTAGAGAAAGAAGTAGCGAAAAAATTGGAGAGCAGAAACGACAACAAA
AATAGAATGGAAGCAAAAGCTCAAGCTAACAGCCAAAAAGATAAGATTTTTGCGTTGATCAATCAAGAGGCTAGTAAGGA
AGCAAGAGCGGCCGTTTTCGATCCGAATCTTAAAGGCATCAGGAGCGAATTGTCTGATAAACTTGAAAATATCAACAAGA
ATTTGAAAGACTTTGGCAAATCTTTTGATGAACTCAAAAATGGCAAAAATAATGATTTCAGCAAGGCAGAAGAAACGCTA
AAAGCCCTTAAAGACTCGGTGAAAGATTTAGGCATCAATCCAGAATGGATTTCAAAAATTGAAAACCTTAATGTAGCTTT
GAATGATTTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGTAACACAAGCAAAAAGCGACCTTGAAAATTCCATTAAGG
ATGTGATCATTAATCAAAAGATAACGGATAAAGTTGACAATCTCAATCAGGCTGTATCAGAGACTAAATTAACAGGCGAT
TTCAGTAAGGTAGAGCAAGCCCTAGCCGAACTCAAAAGCTTGTCATTGGATCTTGGAAAAAATTCTGATCTACAAAAATC
CGTTAAAAATGGTGTAAATGGAACCCTAGTCGGTAATGGGTTGTCTAAAACAGAAGCCACAACGCTCACCAAAAATTTTT
CGGACATCAGAAAAGAATTGAACGAGAAGTTATTTGGAAATTCCAATAACAATAATAATGGACTCAAAAATAGTGCAGAG
CCTATTTACGCTAAAGTTAATAAAAAGAAAACAGGACAAGCAACTAGCCCTGAAGAGCCCATTTATGCTCAAGTTGCTAA
AAAAGTGAGTGCAAAAATTGACCAACTCAACGAATCTACATCAGCAATAAATAGAAAAATTGACCGGATTAACAAAATTG
CATCAGCAGGTAAAGGAGTGGGCGGTTTCAGTGGAGCAGGGCGATCAGCTAGCCCTGAACCCATTTACGCTACAATTGAT
TTTGATGAGGCAAATCAAGCAGGCTTCCCTTTGAGAAGAAGTGCTGCAGTTAATGATCTCAGTAAAGTAGGACTTTCAAG
GGAACAAGAATTGACTCGTAGAATTGGCGATCTCAATCAGGCGGTATCAGAAGCTAAAACAGGTCATTTTGACAACCTAG
AACAAAAGATAGATGAACTCAAAGATTCTACGAAAAAGAATGCTTTGAAGCTATGGGTTGAAAGCGCGAAACAAGTGCCT
ACTGGTTTGCAAGCGAAATTGGACAATTACGCTACTAACAGCCACACACGCATTAATAGCAATGTCCACAATGGAGCAAT
CAATGAAAAAGCGACTGGCATGCTAACGCAAAAAAACCCTGAGTGGCTCAAGCTCGTGAATGATAAGATAGTTGCACATA
ATGTGGGAAGTGCTCATTTGTCAGAGTATGATAAAATTGGATTCAACCAAAAGAATATGAAAGATTATTCTGATTCGTTC
AAGTTTTCCACCAAGTTGAACAATGCCGTAAAAGACATTAAGTCTAGCTTTGTGCAATTTTTAACCAATACATTTTCTAC
AGGATCTTACAGCTTGACGAAAGCAAATGTGGAACATGGAGTCAAAAATACTACAAAAAGTGGTTTCCAGAAATCTTAA

Protein sequence :
MTNETIDQTTTPDQTGFVPQRFINNLQVAFIKVDNAVASFDPDQKSIVDKNDKDNRQAFEKISQLREEYANKAIKNPAKK
NQYFSDFINKSNDLINKDNLIAVDSSVESFRKFGDQRYQIFTSWVSLQKDPSKINTQQIRNFMENVIQPPISDDKEKAEF
LRSAKQSFAGIIIGNQIRSDEKFMGVFDESLKARQEAEKNAEPAGGDWLDIFLSFVFNKKQSSDLKETLNQEPRPDFEQN
LATTTTDIQGLPPEARDLLDERGNFFKFTLGDVEMLDVEGVADKDPNYKFNQLLIHNNALSSMLMGSHSNIEPEKVSLLY
GDNGGPEARHDWNATVGYKDQQGNNVATLINAHLNNDSGLIIAGNEDGIKNPSFYLYKEDQLTGLKQAMSQEEIQNKVDF
MEFLAQNNAKLDNLSEKEKEKFQAEIENFQKDRKAYLDALGNDHIAFVSKKDPKHLALVTEFGNGEVSYTLKDYGKKQDK
ALDGETKTTLQGNLKYDGVMFVDYSNFKYTNASKSPDKGVGATNGVSHLEANFSKVAVFNLPNLNNLAITNYIRRDLEDK
LWAKGLSSQEANKLIKDFLNSNKEMVGKVLNFNKAVAEAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVAKKLESRNDNK
NRMEAKAQANSQKDKIFALINQEASKEARAAVFDPNLKGIRSELSDKLENINKNLKDFGKSFDELKNGKNNDFSKAEETL
KALKDSVKDLGINPEWISKIENLNVALNDFKNGKNKDFSKVTQAKSDLENSIKDVIINQKITDKVDNLNQAVSETKLTGD
FSKVEQALAELKSLSLDLGKNSDLQKSVKNGVNGTLVGNGLSKTEATTLTKNFSDIRKELNEKLFGNSNNNNNGLKNSAE
PIYAKVNKKKTGQATSPEEPIYAQVAKKVSAKIDQLNESTSAINRKIDRINKIASAGKGVGGFSGAGRSASPEPIYATID
FDEANQAGFPLRRSAAVNDLSKVGLSREQELTRRIGDLNQAVSEAKTGHFDNLEQKIDELKDSTKKNALKLWVESAKQVP
TGLQAKLDNYATNSHTRINSNVHNGAINEKATGMLTQKNPEWLKLVNDKIVAHNVGSAHLSEYDKIGFNQKNMKDYSDSF
KFSTKLNNAVKDIKSSFVQFLTNTFSTGSYSLTKANVEHGVKNTTKSGFQKS