Name : cagA
Accession : BAD51744.1
PAI name : cag PAI
PAI accession : AB190934
Strain : Helicobacter pylori 2017
Virulence or Resistance : Virulence
Product : cytotoxin associated protein A
Function : -
Note : -
Homologs in the searched genomes : 48 hits (47 protein-level, 1 DNA-level)
Publication :
-Yamazaki,S., Yamakawa,A. and Azuma,T., "Direct Submission", Submitted (22-SEP-2004) Takeshi Azuma, University of Fukui, Faculty of Medical Sciences, Second Department of Internal Medicine; Matsuoka-cho, Yoshida-gun, Fukui 910-1193, Japan (E-mail:azuma@fmsrsa.fukui-med.ac.jp, Tel:81-776-61-8351, Fax:81-776-61-8110).
-Yamazaki,S., Yamakawa,A., Okuda,T., Ohtani,M., Suto,H., Ito,Y., Yamazaki,Y., Keida,Y., Higashi,H., Hatakeyama,M. and Azuma,T., "Distinct diversity of vacA, cagA, and cagE genes of Helicobacter pylori associated with peptic ulcer in Japan", J. Clin. Microbiol. 43 (8), 3906-3916 (2005) PUBMED 16081930.
DNA sequence :
ATGACTAACGAAACTATTGATCAAACAATAACACCAGATCAAACAGATTTTGTTCCGCAACGATTTATCAATAATCTTCA
AGTAGCTTTTATCAAAGTTGATAGTGCTGTCGCTTCATTTGATCCCGATCAAAAACCAATCGTTGATAAGAATGATAGGG
ATAACAGGCAAGCTTTTGAGAAAATCTCGCAACTAAGGGAAGAATACGCCAATAAAGCGATCAAAAATCCTGCCAAAAAG
AATCAGTATTTTTCAGACTTTATCAATAAGAGCAATGATTTGATCAACAAAGACAATCTCATTGCTGTAGATTCTTCCGT
AGAGAGCTTTCGGAAATTTGGGGATCAGCGTTACCAAATTTTTACGAGTTGGGTGTCCCTTCAAAAAGATCCGTCTAAAA
TCAACACCCAACAAATCCGATATTTTATGGAAAATATCATACAACCCCCTATCTCTGATGATAAGGAAAAAGCAGAGTTT
TTGAGGTCTGCCAAACAATCTTTTGCAGGAATTATCATAGGGAACCAAATCCGATCGGATGAAAAATTCATGGGCGTGTT
TGATGAATCTTTGAAAGCAAGGCAAGAAGCAGAAAAAAATGCAGAGCCTGCTGGTGGGGATTGGCTTGATATTTTTTTAT
CATTTGTGTTTAACAAAAAACAATCTTCCGATCTCAAAGAAACGCTCAATCAAGAGCCAAGGCCTGATTTTGAACAAAAT
TTAGCCACTACCACCACCGACATACAAGGCTTACCGCCTGAATCTAGAGATTTGCTTGATGAAAGGGGTAATTTTTCTAA
ATTCACTCTTGGTGATATGGAAATGTTGGATGTTGAGGGAGTCGCTGACAAGGATCCCAATTACAAGTTCAATCAATTAT
TGATCCACAATAACGCTCTATCTTCTGTGCTAATGGGGGGTCATAGCAACATAGAACCTGAAAAAGTTTCGTTATTGTAT
GGGGATAATGGTGGTCCTGAAGCTAGGCATGATTGGAACGCCACCGTTGGTTATAAAGACCAACAAGGCAGTAATGTGGC
CACACTCATTAATGCACATCTTAATAACGGCAGCGGGTTAATCATAGCGGGTAATGAAAATGGGATTAAAAACCCTAGCT
TCTATCTCTATAAAGAAGATCAACTCACAGGTTTGAAACAAGCATTGAGTCAAGAAGAGATCCAAAACAAAGTGGATTTC
ATGGAATTTCTCGTGCGAAACAACGCTAAATTAGATAACTTGAGCAAGAAAGAGAAAGAAAAATTCCAAACTGAGATTGA
AAATTTCCAAAAAAACCCTAAGGCTTATTTGGACGCTCTGGGGAATGATCACATTGCTTTTGTTTCTAAAAAAGACCCAA
AACATTTAGCTTTGGTTACTGAGTTTGGTAATGGGGAAGTGAGCTATACTCTCAAAGATTATGGGAAAAAACAAGATAAA
GCTTTAGATGGGGAGACAAAAACCACTCTTCAAGGTAGCCTAAAATATGATGGCGTGATGTTTGTCAATTATTCCAATTT
CAAATACACCAACGCCTCCAAGAGTCCTGATAAGGGCGTGGGCACTACGAATGGTGTTTCCCGTTTGGAAGCAAATTTTA
GCAAGGTAGCTGTCTCTAATTTGCCTAATTTAAATAATCTCGCTATCACTAATTATATAAGGCGAGATTTAGAAGCTAAC
TTGTGGGCTAAAGGATTGTCCCCACAAGAAGCTAATAAGCTCCTCAAAGGCTTTTTGAACAGCAACAAAGAAATGGTTGG
AAAAGTTTCAAACTTCAATAAAGCTGTAGCTGAAGCTAAAAACACAGGCAATTATGATGAAGTGAAAAAAGCTCAGAAAG
ATCTTGAAAAATCTCTAAGGAAACGAGAGCATTTAGAGAAAGAAGTAGCGAAAAAATTGGAGAGCAGAAACGACAACAAA
AATAGAATGGAAGCAAAAGCTCAAGCTAACAGCCAAAAAGATAAGATTTTTGCACTTATCAATCAAGAGGCTAGTAAGGA
AGCAAGAGCGGCCGCTTTCGATCCGAATCTTAAAGGCATCAGGAGCGAATTGTCTGATAAACTTGCAAATATCAACAAGA
ATTTGAAAGACTTTGGCAAATCTTTTGATGAACTCAAAAATGGCAAAAATAATGATTTCAGCAAGGCAGAAGAAACGCTA
AAAGCCCTTAAAGACTCGGTGAAAGATTTAGGCATCAATCCAGAATGGATTTCAAAAATTGAAAACCTTAATGCAGCTTT
GAATGATTTCAAAAATGGCAAAAATAAGGATTTCAGTAAGGTAACACAAGCAAAAAGCGACCTTGAAAATTCCATTAAGG
ATGTGATCATTAATCAAAAGATAACGGATAAAGTTGACAATCTCAATCAGGCTGTATCAGAGACTAAATTAACAGGCGAT
TTCAGTAAGGTAGAGCAAGCCCTAGCCGAACTCAAAAACTTGTCATTGGATCTTGGAAAAAATTCTGATCTACAAAAATC
CGTTAAAAATGGTGTAAATGGAACCCTAGTCGGTAATGGGTTGTCTAAAACAGAAGCCACAACGCTCACCAAAAATTTTT
CGGACATCAGGAAAGAATTGAACGAGAAATTATTTGGAAATTCCAATAACAATAATAATGGACTCAAAAACAACACAAAT
CCTATTTATGCTCAAGTCAATAAAAAGAAAACAGGACAAGCAGCTAGCCCTGAAGAGCCCATTTACGCTCAAGTTGCTAA
AAAGGTGAGTGCAAAAATTGACCAACTCAACGAAGCTACATCAGCAATAAATAGAAAAATTGACCGGATTAACAAAATTG
CATCAGCAGGTAAAGGAGTGGGCGGTTTCAGTGGAGCAGGACAAGCAACTAGCCCTGAACCCATTTACGCTACAATTGAT
TTTGATGAGGCAAATCAAGCAGGCTTCCCTTTGAGAAGAAGTGCTGCAGTTAATGACCTCAGTAAAGTAGGGCTTTCAAG
GGAGCAAGAATTGACTCGTAGAATTGGCGATCTCAATCAGGCGGTATCAGAAGCTAAAACAGGTTATTTTGACAACCTAG
AACAAAAGATAGATGAACTCAAAGATTCTACAAAAAAGAATGCTTTGAAGTTATTGGTTGAAAGCGCGAAACAAGTGCCT
ACTAGTTTGTCAGCGAAATTGGACAATTACGCTACTAACAGCCACACACGCATTAATAGCAATGTCCAAGATGGAACAAT
CAATGAAAAAGCGACCGGTGTGCTAACGCAAAAAAACCCTGAGTGGCTCAAGCTCGTGAATGATAAGATAGTTGCACATA
ATGTGGGAAGTGCTCATTTGTCAGAGTATGATAAAATTGGATTCAACCAAAAGAATATGAAAGATTATTCTGATTCGTTC
AAGTTTTCCACCAAGTTGAACAACGCCGTAAAAGACATTAAGTCTAGCTTTGTGCAATTTTTAACCAATACATTTTCTAC
AGGATCTTACAGCTTGATGAAAGCAAATGTGGAACATGGAGTCAAAAATACTACAAAAGGTGGTTTCCAAAAATCTTAA
Protein sequence :
MTNETIDQTITPDQTDFVPQRFINNLQVAFIKVDSAVASFDPDQKPIVDKNDRDNRQAFEKISQLREEYANKAIKNPAKK
NQYFSDFINKSNDLINKDNLIAVDSSVESFRKFGDQRYQIFTSWVSLQKDPSKINTQQIRYFMENIIQPPISDDKEKAEF
LRSAKQSFAGIIIGNQIRSDEKFMGVFDESLKARQEAEKNAEPAGGDWLDIFLSFVFNKKQSSDLKETLNQEPRPDFEQN
LATTTTDIQGLPPESRDLLDERGNFSKFTLGDMEMLDVEGVADKDPNYKFNQLLIHNNALSSVLMGGHSNIEPEKVSLLY
GDNGGPEARHDWNATVGYKDQQGSNVATLINAHLNNGSGLIIAGNENGIKNPSFYLYKEDQLTGLKQALSQEEIQNKVDF
MEFLVRNNAKLDNLSKKEKEKFQTEIENFQKNPKAYLDALGNDHIAFVSKKDPKHLALVTEFGNGEVSYTLKDYGKKQDK
ALDGETKTTLQGSLKYDGVMFVNYSNFKYTNASKSPDKGVGTTNGVSRLEANFSKVAVSNLPNLNNLAITNYIRRDLEAN
LWAKGLSPQEANKLLKGFLNSNKEMVGKVSNFNKAVAEAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVAKKLESRNDNK
NRMEAKAQANSQKDKIFALINQEASKEARAAAFDPNLKGIRSELSDKLANINKNLKDFGKSFDELKNGKNNDFSKAEETL
KALKDSVKDLGINPEWISKIENLNAALNDFKNGKNKDFSKVTQAKSDLENSIKDVIINQKITDKVDNLNQAVSETKLTGD
FSKVEQALAELKNLSLDLGKNSDLQKSVKNGVNGTLVGNGLSKTEATTLTKNFSDIRKELNEKLFGNSNNNNNGLKNNTN
PIYAQVNKKKTGQAASPEEPIYAQVAKKVSAKIDQLNEATSAINRKIDRINKIASAGKGVGGFSGAGQATSPEPIYATID
FDEANQAGFPLRRSAAVNDLSKVGLSREQELTRRIGDLNQAVSEAKTGYFDNLEQKIDELKDSTKKNALKLLVESAKQVP
TSLSAKLDNYATNSHTRINSNVQDGTINEKATGVLTQKNPEWLKLVNDKIVAHNVGSAHLSEYDKIGFNQKNMKDYSDSF
KFSTKLNNAVKDIKSSFVQFLTNTFSTGSYSLMKANVEHGVKNTTKGGFQKS