PAI Gene Information


Name : cagA (HPF30_0779)
Accession : YP_005774524.1
PAI name : cag PAI
PAI accession : NC_017365_P1
Strain : Helicobacter pylori 2017
Virulence or Resistance: Virulence
Product : cag pathogenicity island protein
Function : -
Note : -
Homologs in the searched genomes :   48 hits    ( 47 protein-level,   1 DNA-level )  
Publication :
    -Furuta,Y., Kawai,M., Yahara,K., Takahashi,N., Handa,N., Tsuru,T., Oshima,K., Yoshida,M., Azuma,T., Hattori,M., Uchiyama,I. and Kobayashi,I., "Birth and death of genes linked to chromosomal inversion", Proc. Natl. Acad. Sci. U.S.A. 108 (4), 1501-1506 (2011) PUBMED 21212362 REMARK DOI:10.1073/pnas.1012579108.

    -Kawai,M., Furuta,Y., Yahara,K., Tsuru,T., Oshima,K., Handa,N., Takahashi,N., Yoshida,M., Azuma,T., Hattori,M., Uchiyama,I. and Kobayashi,I., "Evolution in an oncogenic bacterial species with extreme genome plasticity: Helicobacter pylori East Asian genomes", BMC Microbiol. 11, 104 (2011) PUBMED 21575176 REMARK Publication Status: Online-Only.

    -Kawai,M., Furuta,Y., Yahara,K., Tsuru,T., Oshima,K., Handa,N., Takahashi,N., Yoshida,M., Azuma,T., Hattori,M., Uchiyama,I. and Kobayashi,I., "Direct Submission", Submitted (05-APR-2012) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.

    -Kawai,M., Furuta,Y., Yahara,K., Tsuru,T., Takahashi,N., Handa,N., Oshima,K., Hattori,M., Yoshida,M., Azuma,T., Uchiyama,I. and Kobayashi,I., "Direct Submission", Submitted (22-JUL-2010) Contact:Masahira Hattori The University of Tokyo, Center for Omics and Bioinformatics; Kashiwanoha 5-1-5, Kashiwa, Chiba 277-8561, Japan.


DNA sequence :
ATGACTAATGAAACCATTGATCAAACAATAACACCAGATCAAACACCAAACCAAACGGATTTTGTTCCGCAACGATTTAT
CAATAATCTTCAAGTAGCTTTTATCAAAGTTGATAACGCTGTCGCTTCATTTGATCCCGATCAAAAACCAATCGTTGATA
AGAATGATAGGGATAACAGGCAAGCTTTTGAGAAAATCTCGCAACTAAGGGAAGAATACGCCAATAAAGCGATCAAAAAT
CCCACCAAAAAGAATCAGTATTTTTCAGACTTTATCAATAAGAGCAATGATTTGATCAACAAAGACAATCTCATTGCTGT
AGATTCTTCCGTAGAGAGCTTTCGGAAATTTGGGGATCAGCGTTACCAAATTTTTACGAGTTGGGTGTCCCTTCAAAAAG
ATCCGTCTAAAATCAACACCCAACAAATCCGAAATTTTATGGAAAATATCATACAACCCCCTATCTCTGATGATAAGGAA
AAAGCGGAGTTTTTGAGGTCTGCCAAACAATCTTTTGCAGGAATTATCATAGGGAACCAAATCCGATCGGATGAAAAATT
CATGGGCGTGTTTGATGAATCTTTGAAAGCAAGGCAAGAAGCAGAAAAAAATGCAGAGCCTGCTGGTGGGGATTGGCTTG
ATATTTTTTTATCATTTGTGTTTAACAAAAAACAATCTTCCGATCTCAAAGAAACGCTCAATCAAGAGCCAAGGCCTGAT
TTTGAACAAAATTTAGCCACTACCACCACCGACATACAAGGCTTACCGCCTGAATCTAGGGATTTGCTTGATGAAAGGGG
TAATTTTTTTAAATTCACTCTTGGTGATATGGAAATGTTGGATGTTGAGGGAGTCGCTGACAAGGATCCCAATTACAAGT
TCAATCAATTATTGATCCACAATAACGCTCTATCTTCTGTGCTAATGGGGGGTCATAGCAACATAGAACCTGAAAAAGTT
TCATTATTGTATGGAGATAATGGTGGTCCTGAAGCTAGGCATGATTGGAACGCCACCGTTGGTTACAAAAACCAACAAGG
CAACAATGTGGCCACACTCATTAATGCGCACCTTCGCAACGGCAGTGGGTTAGTCATAGTGGGTAATGAGGATGGGATTA
AAAACCCTAGCTTCTATCTCTATAAAGAAGATCAACTCACAGGTTTGAAACAAGCGTTGAGTCAAGAAGAGATCCAAAAC
AAAGTGGATTTCATGGAATTTCTTGCACAAAACAATGCTAAATTAGACAACTTGAGCGAGAAAGAGAAAGAAAAATTCCA
AACTGAGATTGAAGATTTCCAAAAAGACCGTAAGGCTTATTTAGACGCCCTAGGGAATGATCGCATCGCTTTTGTTTCTA
AAAAAGACCCAAAACATTTAGCTTTGGTTACTGAGTTTGGTAATGGGGAAGTGAGCTATACCCTCAAAGATTATGGGAAA
AAACAAGATAAAGCTTTAGATGGGGAGACAAAAACCACTCTTCAAGGTAGCCTAAAATATGATGGCGTGATGTTTGTCAA
TTATTCTAATTTCAAATACACCAACGCCTCCAAGAGTCCTGATAAGGGCGTGGGCACTACGAATGGCGTTTCCCATTTGG
AAGCAAATTTTAGCAAGGTAGCTGTCTTTAATTTACCTAATTTAAATAATCTCGCTATCACTAATTATATAAGGCGAGAT
TTAGAAGATAAATTGTTGGCTAAAGGATTGTCCCCACAAGAAGCTAATAAGCTCATCAAAGACTTTTTGAACAGCAACAA
AGAAATGGTTGGAAAAGTTTCAAACTTCAATAAAGCTGTAGCTGAAGCTAAAAACACAGGCAATTATGATGAAGTGAAAA
AAGCTCAGAAAGATCTTGAAAAATCTCTAAGGAAACGAGAGCATTTAGAGAAAGAAGTAGCGAAAAAATTGGAGAGCAGA
AACGACAACAAAAATAGAATGGAAGCAAAAGCTCAAGCTAACAGCCAAAAAGATAAGATTTTTGCACTTATCAATCAAGA
GGCTAGTAAGGAAGCAAGAGCGGCCGCTTTCGATCCGAATCTTAAAGGCATCAGGAGCGAATTGTCTGATAAACTTGAAA
ATATCAACAAGAATTTGAAAGACTTTGGCAAATCTGTTGATGAACTCAAAAATGGCAAAAATAATGATTTCAGCAAGGCA
GAAGAAACGCTAAAAGCCCTTAAAGACTCGGTGAAAGATTTAGGCATCAATCCAGAATGGATTTCAAAAATTGAAAACCT
TAATGCAGCTTTGAATGATTTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGTAACACAAGCAAAAAGCGACCTTGAAA
ATTCCATTAAGGATGTGATCATTAATCAAAAGATAACGGATAAAGTTGACAATCTCAATCAGGCTGTATCAGAGACTAAA
TTAACAGGCGATTTCAGTAAGGTAGAGCAAGCCCTAGCCGAACTCAAAAGCTTGTCATTGGATCTTGGAAAAAATTCTGA
TCTACAAAAATCCGTTAAAAATGGTGTAAATGGAACCCTAGTCGGTAATGGGTTATCTAAAACAGAAGCCACAACGCTCA
CCAAAAATTTTTCGGACATCAGGAAAGAATTGAACGAGAAATTATTTGGAAATTCCAATAACAATAATAATGGACTCAAA
AACGAACCCATTTATGCTCAAGTTAATAAAAAGAAAGCAGGACAAGTAGCTAGCCCTGAAGAGCCCATTTACGCTCAAGT
TGCTAAAAAGGTGAGTGCAAAAATTGACCAACTCAACGAAGCTACATCAGCAATAAATAGAAAAATTGACCGGATTAACA
AAATTGCATCAGCAGGTAAAGGAGTGGGCGGTTTCAGTGGAGCAGGGCAATCAGCTAGCCCTGAACCCATTTACGCTACA
ATTGATTTTGATGATGCAAATCAAGCAGGCTTCCCTCTTAGGAGAAGTGCTGCAGTTAATGATCTCAGTAAAGTAGGGCT
TTCAAGGGAACAAGAATTGACTCGTAGAATTGGCGATCTCAATCAGGCGGTATCAGAAGCTAAAACAGGTCGTTTTGACA
ACCTAGAACAAAAGATAGATGAACTCAAAGATTCTACGAAAAAGAATGCTTTGAAGCTATGGGTTGAAAGCGCGAAACAA
GTGCCTATTGGTTTGCAAGCGAAATTGGACAATTACGCTACTAACAGCCACACACGCATTAATAGCAATGTCCAAAGTGG
AACAATCAATGAAAAAGCGACCGGCATGCTAACGCAAAGAAACCCTGAGTGGCTCAAGCTCGTGAATGATAAGATAGTTG
CACATAATGTGGGAAGTGCTCATTTGTCAGAGTATGATAAAATTGGATTCAACCAAAAGAATATGAAAGATTATTCTGAT
TCGTTCAAGTTTTCCACCAAGTTGAACAATGCCGTAAAAGACATTAAGTCTAGCTTTGTGCAATTTTTAACCAACACATT
TTCTACAGGATCTTACAGCTTGATGAAAGCAAATGTGGAACATGGAGTCAAAAATACTACAAAAAGTGGTTTCCAGAAAT
CTTAA

Protein sequence :
MTNETIDQTITPDQTPNQTDFVPQRFINNLQVAFIKVDNAVASFDPDQKPIVDKNDRDNRQAFEKISQLREEYANKAIKN
PTKKNQYFSDFINKSNDLINKDNLIAVDSSVESFRKFGDQRYQIFTSWVSLQKDPSKINTQQIRNFMENIIQPPISDDKE
KAEFLRSAKQSFAGIIIGNQIRSDEKFMGVFDESLKARQEAEKNAEPAGGDWLDIFLSFVFNKKQSSDLKETLNQEPRPD
FEQNLATTTTDIQGLPPESRDLLDERGNFFKFTLGDMEMLDVEGVADKDPNYKFNQLLIHNNALSSVLMGGHSNIEPEKV
SLLYGDNGGPEARHDWNATVGYKNQQGNNVATLINAHLRNGSGLVIVGNEDGIKNPSFYLYKEDQLTGLKQALSQEEIQN
KVDFMEFLAQNNAKLDNLSEKEKEKFQTEIEDFQKDRKAYLDALGNDRIAFVSKKDPKHLALVTEFGNGEVSYTLKDYGK
KQDKALDGETKTTLQGSLKYDGVMFVNYSNFKYTNASKSPDKGVGTTNGVSHLEANFSKVAVFNLPNLNNLAITNYIRRD
LEDKLLAKGLSPQEANKLIKDFLNSNKEMVGKVSNFNKAVAEAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVAKKLESR
NDNKNRMEAKAQANSQKDKIFALINQEASKEARAAAFDPNLKGIRSELSDKLENINKNLKDFGKSVDELKNGKNNDFSKA
EETLKALKDSVKDLGINPEWISKIENLNAALNDFKNGKNKDFSKVTQAKSDLENSIKDVIINQKITDKVDNLNQAVSETK
LTGDFSKVEQALAELKSLSLDLGKNSDLQKSVKNGVNGTLVGNGLSKTEATTLTKNFSDIRKELNEKLFGNSNNNNNGLK
NEPIYAQVNKKKAGQVASPEEPIYAQVAKKVSAKIDQLNEATSAINRKIDRINKIASAGKGVGGFSGAGQSASPEPIYAT
IDFDDANQAGFPLRRSAAVNDLSKVGLSREQELTRRIGDLNQAVSEAKTGRFDNLEQKIDELKDSTKKNALKLWVESAKQ
VPIGLQAKLDNYATNSHTRINSNVQSGTINEKATGMLTQRNPEWLKLVNDKIVAHNVGSAHLSEYDKIGFNQKNMKDYSD
SFKFSTKLNNAVKDIKSSFVQFLTNTFSTGSYSLMKANVEHGVKNTTKSGFQKS