PAI Gene Information


Name : cagA
Accession : BAD51766.1
PAI name : cag PAI
PAI accession : AB190956
Strain : Helicobacter pylori 2017
Virulence or Resistance: Virulence
Product : cytotoxin associated protein A
Function : -
Note : -
Homologs in the searched genomes :   49 hits    ( 47 protein-level,   2 DNA-level )  
Publication :
    -Yamazaki,S., Yamakawa,A. and Azuma,T., "Direct Submission", Submitted (22-SEP-2004) Takeshi Azuma, University of Fukui, Faculty of Medical Sciences, Second depertment of Internal Medicine; Matsuoka-cho, Yoshida-gun, Fukui 910-1193, Japan (E-mail:azuma@fmsrsa.fukui-med.ac.jp, Tel:81-776-61-8351, Fax:81-776-61-8110).

    -Yamazaki,S., Yamakawa,A., Ito,Y., Higashi,H., Hatakeyama,M. and Azuma,T., "Analysis of the vacA, cagA, and cagE full- length genes in Helicobacter pylori in Japan", Unpublished.


DNA sequence :
ATGACTAACGAAACTATTGATCAAACAAGAACACCAGATCAAACACAAAGCCAAACAGCTTTTGATCCGCAACGATTTAT
CAATAATCTTCAAGTGGCTTTTATTAAAGTTGATAATGTTGTCGCTTCATTTGATCCTAATCAAAAACCAATCGTTGATA
AGAATGATAGGGATAATAGGCAAGCTTTTGAGAAAATCTCGCAGCTAAGGGAGGAATTCGCTAATAAAGCGATCAAAAAT
CCTGCCAAAAAGAATCAGTATTTTTCAAGCTTTATCAGTAAGAGCAGTGATTTAATCAACAAAGACACTCTCATTGATAC
AGGTTCTTCCATAAAGAGCTTTCAGAAATTTGGGACTCAGCGTTACCAAATTTTTATGAATTGGGTGTCCCATCAAAAAG
ATCCATCTAAAATCAACACCCAAAGAATCCGAAGTTTTATTGAAAATATCATACAACCCCCTATCTCTGATGATAAAGAA
AAAGCGGAGTTTTTGAGGTCTGCCAAACAAGCTTTTGCAGGAATTATCATAGGAAACCAGATCCAATCAGATCAAAAATT
CATGGGCGTGTTTGATGAATCTTTGAAAGAAAGGCAAGAAGCAGAAAAAAATGGAGGGTCTACTGGAGAGCCTATTGGTG
GGGATTGGCTTGATATTTTTTTATCATTTGTGTTTAACAAAAAACAATCTTCCGATCTCAAAGAAACGCTCGATCAAGAA
CCAGTTCCTCATGTCCAACCAGATATAGCCACTACCACCACCGACATACAAGGCTTACCGCCTGAATCTAGGGATTTGCT
TGATGAAAGGGGTAATTTTTCTAAATTCACTCTTGGTGATATGGAAATGTTAGATGTTGAGGGTGTCGCTGACATTGATC
CTAATTACAAGTTCAACCAATTATTGATTCACAATAACGCTCTGTCTTCTGTGTTAATGGGGAGTCATAATGGCGTAGAA
CCTGAAAAAGTTTCATTATTGTATGGGGGCAATGGTGGTCCTGAAGCTAGGCATGATTGGAACGCCACCGTTGGTTATAA
AAACCAACAAGGCAACAATGTGGCTACACTCATTAATGTGCATATGAAAAACGGCAGTGGGTTAGTCATAGCAGGTGGTG
AGAAAGGGGTTAACAACCCTAGTTTTTATCTCTACAAAGAAGACCAGCTCACAGGCTTGAAACAAGCATTGAGTCAAAAA
GAGATCCAAAACAAAGTGGATTTCATGGAATTTCTTGCAAAAAACAACGCTAGATTAGATAACTTGAGCGAGAAAGAGAA
AGAAAAATTCCAAACTGAGATTGAAGATTTCCAAAAAAACCCTAAGGCTTATTTAGACGCCCTAGGGAATGATCATATTG
CTTTTGTTTCTAAAAAAGACAAAAAACATTTAGCTTTAGTTACTGAGTTTGGTAATGGGGAATTGAGCTACACTCTCAAA
GATTATGGGAAAAAACAAGATAAAGCTTTAGATAGGGAGACAAAAACCACTCTTCAAGGTAACCTAAAACATGATGGCGT
GATGTTTGTTAATTATTCTAATTTCAAATACACCAACGCCTCCAAGAGTCCTGATAAGGGTGTGGGTGCTACGAATGGCG
TTTCCCATTTGGAAGCAAATCTTAGTAAGGTAGCTGTCTTTAATTTGCCTAATTTAAATAATCTCGCTATCACTAGTTAT
ATAAGGCGAGACTTAGAAGATAAACTGTGTGCTAAAGGATTGTCCCCACAAGAAGTTAATAAGCTCATCAAAGACTTTTT
GAACAGCAACAAAGAATTGGTTGAAAAAGCTTTAAACTTCAATAAAACTGTAGCTGAAGCTAAAAACACAGGCAATTATG
ACGAAGTGAAAAAAGCTCAGAAAGATCTTGAAAAATCTCTAAGGAAACGAGAGCATTTAGAGAAAGAAGTAACGAAAAAA
TTGGAGAGAAAAAGCGATAACAAAAATAGAATGGAAGCAAAAGCTCAAGCTAACAGCCAAAAAGATAAGATTTTTGCAAT
TATCAATGAAGAGGCTGGTAAGGAAGCAAGAGTAGCCGCTTGCGTTCAGAATCTTAAAGGCATCAGAATGGAATTGTCTG
ATAAGCTTGAAAACATCAACAAGAATTTGAAAGACTTTGATAAATCTTTTGATGAATTCAAAAATGGCAAAAATAAGGAT
TTCAGCAAGACAGAAGAAACGCTAAAAGCCCTTAAAGACTCGGTGAAAGATTTAGGTATCAATCCAGAATGGATTTCAAA
AGTTGAAAACCTTAATACAGCTTTGAATGAATTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGTAACGCAAGCAAAAA
GCGACCTTGAAAATTCCATTAAAGATGTGATCATCAATCAAAAGATAACGGATAAAGTTGACAATCTCAATCAAGCTGTA
TCAATAGCTAAAGCAACGGGTGATTTCAGTAGGGTAGAGCAAGCGCTAGCCGATCTCAAGAATTTCTCAAAAGGACAATT
GACTCAACAAGCTCAAAAAAATGAAGATTTCAATACTGGAAAAAATTCTAAACTATACCAATCCGTTAAGAATGGTGTAA
ATGGAACCCTAGTCGGTAATGGGTTATCTGGAATAGAGGCCACAGCTCTCGCCAAAAAATTTTCGGATATCAAGAAAGAA
TTGAATGAGAAATTTAAAAATTTCAACAACAATAACAATAATGGACTCAAAAACGAACCCATTTATGCTGAAGTTAATAA
AAAGAAAACAGGACAAGTAGCTAGCCCTGAAGAACCCATTTATACTCAAGTTGCTAAAAAGGTAAAAGCAAAAATTGACC
GACTCGATCAAATAGCAAGTGGTTTGGGTGGTGTAGGGCAAGCGGGGTTCTCTTTGAAAGGGCATACTAAAGTTGGTGAT
CTCAGTAAGGTAGGGCTTTCAGCTAACCATGAACCCATTTACGCTACGATTGATGATCTCGGCGGACCTTTCCCTTTGAA
AAAGCATGATAAAGTTGGTGATCTCAGTAAGGTAGGGCTTTCAAGGGAGCAAGAATTGAAACAAAAGATTGACAATCTCA
ATCAGGCGGTATCAGAAGCTAAAGCATGTCATTTTGGCAACCTAGATCAAATGATAGACAAGCTCAAAGATTCTACAAAA
AAGAATGTTATGAATCTATATGTTGAAAGTGCAAAAAAAGTGCCTACTAGTTTGTCAGCGAAATTGGACAATTACGCTAC
TAACAGCCACACACGCATTAATAGCAATGTCAAAAATGGAACAATCAATGAAAAAGAGACTAGCATGTTAATGCGAAAAA
ACCCTGAGTGGCTTAAGCTCGTGAATGATAAGATAGTTGCGCATAATGTGGGAAGTGCTCCTTTGTCAGCGTATGATAAA
ATTGGATTCAATCAAAAGAATATGAAAGATTATTCTGATTCGTTCAAGTTTTCCACCAGGTTGAGCAATGCCGTAAAAGA
CATTAAGTCTGGCTTTGTGCAATTTTTAACCAATATATTTTCTATGGGATCTTACAGCTTGATGAAAGCAAGTGTGGAAC
ATGGAGTCAAAAATACTAATACAAAAGGTGGTTTCCAAAAATCTTAA

Protein sequence :
MTNETIDQTRTPDQTQSQTAFDPQRFINNLQVAFIKVDNVVASFDPNQKPIVDKNDRDNRQAFEKISQLREEFANKAIKN
PAKKNQYFSSFISKSSDLINKDTLIDTGSSIKSFQKFGTQRYQIFMNWVSHQKDPSKINTQRIRSFIENIIQPPISDDKE
KAEFLRSAKQAFAGIIIGNQIQSDQKFMGVFDESLKERQEAEKNGGSTGEPIGGDWLDIFLSFVFNKKQSSDLKETLDQE
PVPHVQPDIATTTTDIQGLPPESRDLLDERGNFSKFTLGDMEMLDVEGVADIDPNYKFNQLLIHNNALSSVLMGSHNGVE
PEKVSLLYGGNGGPEARHDWNATVGYKNQQGNNVATLINVHMKNGSGLVIAGGEKGVNNPSFYLYKEDQLTGLKQALSQK
EIQNKVDFMEFLAKNNARLDNLSEKEKEKFQTEIEDFQKNPKAYLDALGNDHIAFVSKKDKKHLALVTEFGNGELSYTLK
DYGKKQDKALDRETKTTLQGNLKHDGVMFVNYSNFKYTNASKSPDKGVGATNGVSHLEANLSKVAVFNLPNLNNLAITSY
IRRDLEDKLCAKGLSPQEVNKLIKDFLNSNKELVEKALNFNKTVAEAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVTKK
LERKSDNKNRMEAKAQANSQKDKIFAIINEEAGKEARVAACVQNLKGIRMELSDKLENINKNLKDFDKSFDEFKNGKNKD
FSKTEETLKALKDSVKDLGINPEWISKVENLNTALNEFKNGKNKDFSKVTQAKSDLENSIKDVIINQKITDKVDNLNQAV
SIAKATGDFSRVEQALADLKNFSKGQLTQQAQKNEDFNTGKNSKLYQSVKNGVNGTLVGNGLSGIEATALAKKFSDIKKE
LNEKFKNFNNNNNNGLKNEPIYAEVNKKKTGQVASPEEPIYTQVAKKVKAKIDRLDQIASGLGGVGQAGFSLKGHTKVGD
LSKVGLSANHEPIYATIDDLGGPFPLKKHDKVGDLSKVGLSREQELKQKIDNLNQAVSEAKACHFGNLDQMIDKLKDSTK
KNVMNLYVESAKKVPTSLSAKLDNYATNSHTRINSNVKNGTINEKETSMLMRKNPEWLKLVNDKIVAHNVGSAPLSAYDK
IGFNQKNMKDYSDSFKFSTRLSNAVKDIKSGFVQFLTNIFSMGSYSLMKASVEHGVKNTNTKGGFQKS