PAI Gene Information


Name : cagA
Accession : AGC69806.1
PAI name : cag PAI
PAI accession : JQ685154
Strain : Helicobacter pylori 2017
Virulence or Resistance : Virulence
Product : cag pathogenicity island protein A
Function : -
Note : -
Homologs in the searched genomes : 49 hits (47 protein-level, 2 DNA-level)
Publication :
    -Barrozo,R.M., Cooke,C.L., Hansen,L.M., Lam,A.M., Gaddy,J.A., Johnson,E.M., Cariaga,T.A., Suarez,G., Peek,R.M. Jr., Cover,T.L. and Solnick,J.V., "Functional Plasticity in the Type IV Secretion System of Helicobacter pylori", PLoS Pathog. 9 (2), E1003189 (2013) PUBMED 23468628.

    -Hansen,L.M., "Direct Submission", Submitted (17-FEB-2012) Center for Comparative Medicine, University of California, Davis, County Road 98 & Hutchison Drive, Davis, CA 95616, USA.


DNA sequence :
ATGACTAACGAAACCATTAACCAACAACCACAAACCGAAGCGGCTTTTAACCCGCAGCAATTTATCAATAATCTTCAAGT
GGCTTTTATTAAAGTTGATAACGCTGTCGCTTCATTTGATCCTGATCAAAAACCAATCGTTGATAAGAATGATAGGGATA
ACAGGCAAGCTTTTGAGAAAATCTCGCAGCTAAGGGAGGAATTCGCTAATAAAGCGATCAAAAATCCTACCAAAAAGAAT
CAGTATTTTTCAAACTTTATCAGTAAGAGCAGTGATTTAATCAACAAAGACGGTCTCATTGATACAGGTTCTTCCATAAA
AAGCTTTCAGAAATTTGGGACTCAGTGTTACCAAATTTTTATGAATTGGGTGTCCCATCAAAAAGATCCATCTCAAATCA
ACACCCAAAAAATCCGAGGTTTTATGGAAAATATCATACAACCCCCTATCTCTGATGATAAAGAGAAAGCGGAGTTTTTG
AGGTCTGCCAAACAAGCTTTTGCAGGAATTATCATAGGAAACCAAATCCGATCGGATCAAAAATTCATGGGCGTGTTTGA
TGAATCTTTGAAAGAGAGGCAAGAAGCAGAAAAAAATGGAGAGCCTAATGGAGATCCTACTGGTGGGGATTGGCTTGATA
TTTTTTTATCATTTGTGTTTAACAAAAAACAATCTTCCGATCTCAAAGAAACGCTCAATCAAGAACCAGTTCCTCATGTC
CAACCAGATGTAGCCACTACCACCACTGACATACAAAGCTTACCGCCTGAATCTAGAGATTTACTTGATGAAAGGGGTAA
TTTTTCTAAATTCACTCTTGGCGATATGGAAATGTTAGATGTTGAGGGAGTCGCTGACATTGATCCCAATTACAAGTTCA
ACCAATTATTGATTCACAATAACGCTCTGTCTTCTGTGTTAATGGGGAGTCATGATGGCATAGAACCTGAAAAAGTTTCA
TTATTGTATGGAAACAATGGTGGCCCTGAAGCTAGGCATGATTGGAACGCCACCGTTGGTCATAAAAACCAACAAGGCAA
CAATGTGGCTACACTCATTAATGTGCATATGAAAAACGGCAGTGGGTTAGTCATAGCAGGTGGTGAGAAAGGGGTTAACA
ACCCTAGTTTTTATCTCTACAAAGAAGATCAGCTCACAGGCTTGAAACAAGCATTGAGTCAAAAAGAGATCCAAAACAAA
GTAGATTTCATGGAATTTCTTGCACAAAACAATGCTAAATTAGACAACTTGAGCAAGAAAGAGAAAGAAAAATTCCAAAA
TGAGATTGAAGATTTTCAAAAAGACTCTAAGGCTTATTTAGACGCCCTAGGGAATGATCACATTGCTTTTGTTTCTAAAA
AAGACAAAAAACATTTAGCTTTAGTTACTGAGTTTGGTAATGGGGATTTGAGCTACACTCTTAAAGATTATGGGAAAAAA
CAAGATAAAGCTTTAGATAGGGAGATAAAAACCACTCTTCAAGGTAACCTAAAACATGATGGCGTGATGTTTGTTAATTA
TTCTAATTTCAAATACACCAACGCCTCCAAGAGTCCTGATAAGGGTGTGGGTGCTACGAATGGCGTTTCCCATTTGGAAG
CAAATCTTAGCAAGGTAGCTGTCTTTAATTTGCCTAATTTAAATAATCTCGCTATCACTAGTTATATAAGGCGAGACTTA
GAAGAGAAACTGGGGGCTAAAGGATTGTCCCTACAAGAAGCTAATAAGCTCATCAAAGACTTTTTGAACAGCAACAAGGA
ATTGGTTGGAAAAGTTTTAAACCTCAATAAAGCTGTAGCTGAAGCTAAAAACACAGGCAATTATGATGAAGTGAAAAAAG
CTCAGAAAGATCTTGAAAAATCTATAAGGAAACGAGAGCATTTAGAGAAAGAAGTAGTGAAAAAATTGGAGAACAGAAAC
GACAACAAAAATAGAATGGAAGCAAAAGCTCAAGCTAACAGCCAAAAAGATAAGATTTTTGCAATTATCAATAAAGAGGC
TAGTAAGGAAGCAAGAGCGACCGCTTGCGTTCAGAAGTTTAAAGGCATCAAAATAGAATTGTTTGATAAGTTTGAAAACA
TCAACAAGAATTTGAAAGACTTTGATAAATCTTTTGATGACTTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGCAGAA
GAAACGCTAAAAGCCCTTAAAGGCTCGGTGAAGGATTTAGGCATCAATCCAGAATGGATTTCAAAAGTTGAAAACCTTAA
TACAGCTTTGAATGACTTCAAAAATGGCAAAAATAAGGATTTCAGCAAGGTAACGCAAGCAAAAAGCGACCTTGAAAATT
CCATTAAAGATGTGATCATCAATCAAAAGATAACGGATAAAGTTGACAATCTCAATCAAGCGGTATCAGTGGCTAAAGAA
ACAGGCGATTTCAGTGGGGTAGAGCAAGCGCTAGCCGATCTCAAGAATTTCTCAAAAGGACAATTGGCTCAACAAGCTCA
AAAAAATGAAGATTTCAATACTGGAAAAAATTCTGAACTATACCAATCCGTTAAGAATGGTGTAAATGGAACCCTAGTCG
GTAATGGGTTATCTGGAATAGAGGCCACAGCTCTCACCAAAAATTTTTCGGATATCAAGAAAGAATTGAATGAGAAATTT
AAAAATTTCAATAACAATAATAATGGTCTCAAAAACAGCGGAGAACCCATTTATGCTCAGGTTAATAAAAAGAAAACAGG
ACAAGTAGCTAGCCCTGAGGAACCCATTTATACTCAAGTTGCTAAAAAGGTAAAAGCAAAAATTGACCAATTCAATCAAG
TAGCAAGTGGTTTGGGTGGTGTAGGGCAAGCGGGATTCTCTTTGAAAGGGCATACTAAAGTTGATGATCTCAGTAAGGTA
GGGCGATCAGTTAGCCCTGAACCCATTTACGCTACGATTGATGATCTCGGCGGACCTTTCCCTTTGAAAAGGCATGATAA
AGTTGATGATCTCAGTAAGGTAGGGCTTTCAAGGAATCAAGAATTGGCTCAGAAAATTGACAATCTCAATCAGGCGGTAT
CAGAAGCTAAAACATGTCATTTTGACAACCTAGATCAAATGATAGACAAGCTCAAAGATTCTACAAAAAAGAATGTTACG
AATCTATATATTGAAAGTGCAAAAAAAGTGCCTACTAGTTTGTCAGCGAAATTGGACAATTATGCTATTAACAGCCACAT
ACGCATTAATAGCAATGTCAAAAATGGAACAATCAATGAAAAAGTGACCGGCATGCTAACGCAAAAAAACCCTGAGTGGC
TCAAGCTCGTGAATGATAAGATAGTTGCACATAATGTGGGAAGCGTTCCTTTGTCAGAGTATGATAAAATTGGCTTCAAC
CAGAAAAATATGAAAGATTATTCTGATTCATTCAAGTTTTCCACCAGGTTGAGCAATGCTGTAAAAGACATTAAGTCTGG
CTTTGTGCAATTTTTAACCAATACATTTTCTATGGGATCTTACAGCTTGATGAAAGCAAGTGTGGAACATGGAGTCAAAA
ATACTAATACAAAAGGTGGTTTCCAAAAATCTTAA

Protein sequence :
MTNETINQQPQTEAAFNPQQFINNLQVAFIKVDNAVASFDPDQKPIVDKNDRDNRQAFEKISQLREEFANKAIKNPTKKN
QYFSNFISKSSDLINKDGLIDTGSSIKSFQKFGTQCYQIFMNWVSHQKDPSQINTQKIRGFMENIIQPPISDDKEKAEFL
RSAKQAFAGIIIGNQIRSDQKFMGVFDESLKERQEAEKNGEPNGDPTGGDWLDIFLSFVFNKKQSSDLKETLNQEPVPHV
QPDVATTTTDIQSLPPESRDLLDERGNFSKFTLGDMEMLDVEGVADIDPNYKFNQLLIHNNALSSVLMGSHDGIEPEKVS
LLYGNNGGPEARHDWNATVGHKNQQGNNVATLINVHMKNGSGLVIAGGEKGVNNPSFYLYKEDQLTGLKQALSQKEIQNK
VDFMEFLAQNNAKLDNLSKKEKEKFQNEIEDFQKDSKAYLDALGNDHIAFVSKKDKKHLALVTEFGNGDLSYTLKDYGKK
QDKALDREIKTTLQGNLKHDGVMFVNYSNFKYTNASKSPDKGVGATNGVSHLEANLSKVAVFNLPNLNNLAITSYIRRDL
EEKLGAKGLSLQEANKLIKDFLNSNKELVGKVLNLNKAVAEAKNTGNYDEVKKAQKDLEKSIRKREHLEKEVVKKLENRN
DNKNRMEAKAQANSQKDKIFAIINKEASKEARATACVQKFKGIKIELFDKFENINKNLKDFDKSFDDFKNGKNKDFSKAE
ETLKALKGSVKDLGINPEWISKVENLNTALNDFKNGKNKDFSKVTQAKSDLENSIKDVIINQKITDKVDNLNQAVSVAKE
TGDFSGVEQALADLKNFSKGQLAQQAQKNEDFNTGKNSELYQSVKNGVNGTLVGNGLSGIEATALTKNFSDIKKELNEKF
KNFNNNNNGLKNSGEPIYAQVNKKKTGQVASPEEPIYTQVAKKVKAKIDQFNQVASGLGGVGQAGFSLKGHTKVDDLSKV
GRSVSPEPIYATIDDLGGPFPLKRHDKVDDLSKVGLSRNQELAQKIDNLNQAVSEAKTCHFDNLDQMIDKLKDSTKKNVT
NLYIESAKKVPTSLSAKLDNYAINSHIRINSNVKNGTINEKVTGMLTQKNPEWLKLVNDKIVAHNVGSVPLSEYDKIGFN
QKNMKDYSDSFKFSTRLSNAVKDIKSGFVQFLTNTFSMGSYSLMKASVEHGVKNTNTKGGFQKS
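
Consistency check (illustrative sketch) :
The DNA and protein sequences above can be cross-checked by translating the coding sequence and comparing the result with the listed protein. The following is a minimal sketch, not part of the original record. It assumes the standard codon assignments (the record does not state a translation table; the bacterial table, NCBI table 11, uses the same codon-to-amino-acid assignments as the standard table). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration.

```python
# Build the standard codon table in the conventional T, C, A, G order.
# Assumption: standard codon assignments apply to this gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped sequence
    block from the record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the coding sequence
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the DNA sequence listed in this record.
print(translate("ATGACTAACGAAACCATTAAC"))
```

Passing the full DNA sequence block to `translate` and comparing the result with the protein sequence block (after removing line breaks) would confirm whether the two entries agree.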