Name : cagA (HP0547)
Accession : AAR03909.1
PAI name : cag PAI
PAI accession : AY330639
Strain : Helicobacter pylori 2017
Virulence or Resistance: Virulence
Product : CagA
Function : -
Note : Cag26; cag26; jhp0495; cytotoxicity associated immunodominant antigen
Homologs in the searched genomes : 48 hits ( 47 protein-level, 1 DNA-level )
Publication :
-Blomstergren,A., Lundin,A., Nilsson,C., Engstrand,L. and Lundeberg,J., "Comparative analysis of the complete cag pathogenicity island sequence in four Helicobacter pylori isolates", Gene 328, 85-93 (2004) PUBMED 15019987.
-Blomstergren,A., Lundin,A., Nilsson,C., Engstrand,L. and Lundeberg,J., "Direct Submission", Submitted (26-JUN-2003) Biotechnology, Royal Institute of Technology, Alba Nova University Centre, Roslagstullsbacken 21, S-106 91 Stockholm, Sweden.
DNA sequence : | |
ATGACTAACGAAACCATTAACCAACAACCACAAACCGAAGCGGCTTTTAACCCGCAGCAATTTATCAATAATCTTCAAGT
GGCTTTTCTTAAAGTTGATAACGCTGTCGCTTCATACGATCCTGATCAAAAACCAATCGTTGATAAGAACGATAGGGATA
ACAGGCAAGCTTTTGATGGAATCTCGCAATTAAGGGAAGAATACTCCAATAAAGCGATCAAAAATCCTACCAAAAAGAAT
CAGTATTTTTCAGACTTTATCAATAAGAGCAATGATCTAATCAACAAAGACAATCTCATTGATGTAGAATCTTCCACAAA
GAGCTTTCAGAAATTTGGGGATCAGCGTTACCGAATTTTCACAAGTTGGGTGTCCCATCAAAACGATCCGTCTAAAATCA
ACACCCGATCGATCCGAAATTTTATGGAAAATATCATACAACCCCCTATCCCTGATGACAAAGAAAAAGCAGAGTTTTTG
AAATCTGCCAAACAATCTTTTGCAGGAATCATTATAGGGAATCAAATCCGAACGGATCAAAAGTTCATGGGCGTGTTTGA
TGAATCCTTGAAAGAAAGGCAAGAAGCAGAAAAAAATGGAGAGCCTACTGGTGGGGATTGGTTGGATATTTTTTTATCAT
TTGTGTTCAACAAAAAACAATCTTCCGATCTCAAAGAAACGCTCAATCAAGAACCAGTTCCTCATGTCCAACCAGATGTA
GCCACTACCACCACCGACATACAAGGCTTACCGCCTGAATCTAGGGATTTGCTTGATGAAAGGGGTAATTTTTCTAAATT
CACTCTTGGCGATATGGAAATGTTAGATGTTGAGGGAGTCGCTGACATTGATCCCAATTACAAGTTCAACCAATTATTGA
TTCACAATAACGCTCTGTCTTCTGTGTTAATGGGGAGTCATAATGGCATAGAACCTGAAAAAGTTTCATTATTGTATGGA
AACAATGGTGGCCCTGAAGCTAGGCATGATTGGAACGCCACCGTTGGTTATAAAAACCAACAAGGCGACAATGTGGCTAC
ACTCATTAATGTGCATATGAAAAACGGCAGTGGGTTAGTCATAGCAGGTGGTGAGAAAGGGATTAACAACCCTAGTTTTT
ACCTCTACAAAGAAGACCAACTCACAGGCTCACAACGAGCATTGAGTCAAAAAGAGATCCAAAACAAAGTAGATTTCATG
GAATTTCTTGCACGAAACAATGCTAAATTAGACAACTTGAGCGTGAAAGAGAAAGAAAAATTCCAAAATGAGATTGAAGA
TTTTCAAAAAGACTCTAAAGCTTATTTAGACGCCCTAGGGAATGATCGTATTGCTTTTGTTTCTAAAAAAGACACAAAAC
ATTCAGCTTTAATTACTGAGTTTGGTAATGGGGATTTGAGCTACACTCTCAAAGATTATGGGAAAAAAGCAGATAAAGCT
TTAGATAGGGAGAAAAATGTTACTCTTCAAGGTAGCCTAAAACATGATGGCGTGATGTTTATTGATTATTCTAATTTCAA
ATACACCAACGCCTCCAAGAATCCCAATAAGGGTGTAGGCACTACGAATGGCGTTTCCCATTTAGAAGCAGGCTTTAACA
AGGTAGCTGTCTTTAATTTGCCTAGTTCAAATGAACTCACTATCACTGGTTTTGCAAAGCGGAATTTAGAGGATAAACTA
GCCGCTAAAGGATTTTCCCCACAAGAAGCTAATAAGTTCATCAAAGACTTTTTGAGTAGCAACAAGGAATTGGTTGGAAA
AGCTTTAAATTTCAATAAAGTTGTAGCTGAAGCTAAAAACACAGGCAACTATGATGAAGTGAAAAAAGCTCAGAAAGATC
TTGAAAAATCTCTAAGGAAACGAGAGCATTTGGAGAAAGAAGTAGCGAAAAATTTGGAGAGCAAAAGCGGCAACAAAAAC
AAAATGGAAGCGAAATCTCAAGCTAACAGCCAAAAAGATGAGATTTTTGCGTTGATCAATAAAGAGGCTAATAGGGACGC
AAGAGCAATCGCTTACGCTCAGAATCTTAAAGGCATCAGAAGTGAATTGTCTGGTAAGCTTGAAAATGTCAACAAGAATT
TGAAAGACTTTAGTAAATCTTTTGATGAATTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGCAGAAGAAACGCTAAAA
GCCCTTAAAGGCTCGGTGAAAGATTTAGGTATCAATCCAGAATGGATTTCAAAAGTTGAAAACCTTAATGCAGCTTTGAA
TGAATTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGTAACGCAAGCAAAAAGCGACCTTGAAAATTCCATTAAAGATG
TGATCATCAATCAAAAGATAACGGATAAAGTTAACAATCTCAGTTCGGCTGTATCAGTGGCTAAAGCAACGGGCGATTTC
AGTAGGGTAGAGCAAGTGCTAGCCGGTCTCAAAAATTTCTCAAAGGAGCAATTGGCTCAACAAGCTCAAAAAAATGAAGA
TTTCAATACTGGAAAAAAATCTGAAATATACCAATCCGTTAAGAATGGTGTGAATGGAACCCTAGTCGGTAATGGGTTAT
CTCAAGCAGAAGCCATAACTCTTTCTAAAAACTTTTCGGACATCAAGAAAGAGTTGAATGCAAAATTGGGGAATTTCAAT
AACAATAACAATAATGGACTCAAAAACGAACCCATTTATGCTAAAGTTAATAAAAAGAAAGCAGGACAAGCAGCTAGCCC
TGAAGAACCCATTTACGCTCAAGTTGCTAAAAAGGTAAATGCAAAAATTGACCGACTCAATCAAATAGCAAGTGGTTTGG
GTGGTGTAGGGCAAGCAGCGGGCTTCCCTTTGAAAAGGCATGATAAAGTTGATGATCTCAGTAAGGTAGGGCTTTCAGCT
AACCCTGAACCCATTTACGCTACGATTGATGAGCTCGACGGACCTTTCCCTTTGAAAAGGCATGATAAAGTTGATGATCT
CAGTAAGGTAGGGCTTTCAAGGAATCAAGAATTGACTCAGAAAATTGACAATCTCAATCAAGCGGTATCAGAAGCTAAAG
CAGGTTTTTTTGGCAATCTAGAGCAAACGATAGACAAGCTCAAAGATTCTACAAAACACAATGTCGTGAATCTATGGGTT
GAAAGTGCAAAAAAAGTGCCTGCTAGTTTGTCAGCGAAATTGGACAATTACGCTACTAACAGCCACACACGCATTAATAG
CAATGTCAAAAATGGAACAATCAATGAAAAAGTGACCGGCATGCTAACGCAAAAAAACCCTGAGTGGCTCAAGCTCGTGA
ATGATAAGATAGTTGCACATAATGTGGGAAGCGCTCCTTTGTCAGAGTATGATAAAATTGGCTTCAACCAGAAAAATATG
AAAGATTATTCTGATTCATTCAAGTTTTCCACCAGGTTGAGCAATGCTGTAAAAGACATTAAGTCTGGCTTTGTGCAATT
TTTAACCAATACATTTTCTACAGCATCTTATTACTACTTGGCAGGAGAAAATGCGGAGCATGGAATCAAAAATGCTAATA
CAAAAGGTGGTTTCCAAAAATCTTAA
|
Protein sequence : | |
MTNETINQQPQTEAAFNPQQFINNLQVAFLKVDNAVASYDPDQKPIVDKNDRDNRQAFDGISQLREEYSNKAIKNPTKKN
QYFSDFINKSNDLINKDNLIDVESSTKSFQKFGDQRYRIFTSWVSHQNDPSKINTRSIRNFMENIIQPPIPDDKEKAEFL
KSAKQSFAGIIIGNQIRTDQKFMGVFDESLKERQEAEKNGEPTGGDWLDIFLSFVFNKKQSSDLKETLNQEPVPHVQPDV
ATTTTDIQGLPPESRDLLDERGNFSKFTLGDMEMLDVEGVADIDPNYKFNQLLIHNNALSSVLMGSHNGIEPEKVSLLYG
NNGGPEARHDWNATVGYKNQQGDNVATLINVHMKNGSGLVIAGGEKGINNPSFYLYKEDQLTGSQRALSQKEIQNKVDFM
EFLARNNAKLDNLSVKEKEKFQNEIEDFQKDSKAYLDALGNDRIAFVSKKDTKHSALITEFGNGDLSYTLKDYGKKADKA
LDREKNVTLQGSLKHDGVMFIDYSNFKYTNASKNPNKGVGTTNGVSHLEAGFNKVAVFNLPSSNELTITGFAKRNLEDKL
AAKGFSPQEANKFIKDFLSSNKELVGKALNFNKVVAEAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVAKNLESKSGNKN
KMEAKSQANSQKDEIFALINKEANRDARAIAYAQNLKGIRSELSGKLENVNKNLKDFSKSFDEFKNGKNKDFSKAEETLK
ALKGSVKDLGINPEWISKVENLNAALNEFKNGKNKDFSKVTQAKSDLENSIKDVIINQKITDKVNNLSSAVSVAKATGDF
SRVEQVLAGLKNFSKEQLAQQAQKNEDFNTGKKSEIYQSVKNGVNGTLVGNGLSQAEAITLSKNFSDIKKELNAKLGNFN
NNNNNGLKNEPIYAKVNKKKAGQAASPEEPIYAQVAKKVNAKIDRLNQIASGLGGVGQAAGFPLKRHDKVDDLSKVGLSA
NPEPIYATIDELDGPFPLKRHDKVDDLSKVGLSRNQELTQKIDNLNQAVSEAKAGFFGNLEQTIDKLKDSTKHNVVNLWV
ESAKKVPASLSAKLDNYATNSHTRINSNVKNGTINEKVTGMLTQKNPEWLKLVNDKIVAHNVGSAPLSEYDKIGFNQKNM
KDYSDSFKFSTRLSNAVKDIKSGFVQFLTNTFSTASYYYLAGENAEHGIKNANTKGGFQKS
|
|