PAI Gene Information


Name : cagE
Accession : AAF80209.1
PAI name : cag PAI
PAI accession : AF282853
Strain : Helicobacter pylori 2017
Virulence or Resistance: Virulence
Product : CagE
Function : -
Note : similar to PtlC, VirB4, TraB, and TrbE; HP544; cag23; JHP492
Homologs in the searched genomes :   43 hits    ( 43 protein-level )  
Publication :
    -Censini,S., Lange,C., Xiang,Z., Crabtree,J.E., Ghiara,P., Borodovsky,M., Rappuoli,R. and Covacci,A., "cag, a pathogenicity island of Helicobacter pylori, encodes type I-specific and disease-associated virulence factors", Proc. Natl. Acad. Sci. U.S.A. 93 (25), 14648-14653 (1996) PUBMED 8962108.

    -Censini,S., Rappuoli,R. and Covacci,A., "Direct Submission", Submitted (17-MAR-2000) Molecular Biology, Chiron-Biocine, Via Fiorentina 1, Siena, SI 53100, Italy REMARK Sequence update by submitter.

    -Censini,S., Rappuoli,R., Lange,C. and Covacci,A., "Direct Submission", Submitted (06-JUN-1996) Molecular Biology, Chiron-Biocine, Via Fiorentina 1, Siena, SI 53100, Italy.

    -Covacci,A. and Rappuoli,R., "Tyrosine-phosphorylated bacterial proteins: Trojan horses for the host cell", J. Exp. Med. 191 (4), 587-592 (2000) PUBMED 10684850.


DNA sequence :
GTGGCAAGCAAGCAGGCTGATGAACAAAAAAAGCTAATTATAGAGCAAGAGGTTCAAAAGCGGCAGTTTCAAAAAATAGA
AGAACTTAAAGCAGACATGCAAAAGGGTGTCAATCCCTTTTTTAAAGTCTTGTTTGATGGGGGGAATAGGTTGTTTGGTT
TCCCTGAAACTTTTATTTATTCCTCTATATTTATATTGTTTGTAACCATTGTATTATCTGTTATTCTTTTTCAAGCCTAT
GAACCTGTTTTGATTGTAGCGATTGTTATTGTGCTTGTAGCTCTTGGATTCAAGAAAGATTATAGGCTTTATCAAAGAAT
GGAGCGAGCGATGAAATTTAAAAAACCTTTTTTGTTTAAGGGCGTGAAAAACAAAGCGTTCATGAGCATTTTTTCCATGA
AGCCTAGTAAAGAAATGGCTAATGACATCCACTTAAATCCAAACAGAGAAGACAGACTTGTGAGCGCTGCAAACTCCTAT
CTAGCAAATAACTATGAATGTTTTTTAGATGATGGGGTGATCCTTACTAACAACTATTCTCTTTTAGGCACAATCAAATT
GGGGGGCATTGATTTTTTAACCACTTCCAAAAAAGATCTCATAGAGTTACACGCTTCTATTTATAGCGTTTTTAGGAATT
TTGTTACCCCTGAATTCAAATTCTATTTTCACACTATTAAAAAGAAAATCGTTATTGATGAAACCAATAGGGACTATGGT
CTTATTTTTTCTAATGATTTCATGCGAGCCTATAATGAGAAGCAAAAGAGAGAAAGTTTTTATGATATTAGTTTTTTTCT
GACCATAGAGCAAGATTTATTAGACACTCTCAATGAACCCGTTATGAATAAAAAGCATTTTGCAGACAATAATTTTGAAG
AGTTTCAAAGGATTATTAGAGCCAAGCTTGAAAACTTCAAGGATAGGATAGAGCTCATAGAAGAGCTGTTGAGCAAATAC
CACCCCACTAGATTAAAAGAATACACTAAAGATGGCGTTATTTACTCCAAACAATGCGAATTTTACAATTTTCTTGTGGG
AATGAATGAAGCCCCTTTTATTTGCAACCGAAAAGACTTGTATCTCAAGGAAAAAATGCATGGTGGGGTGAAAGAAGTTT
ATTTTGCCAATAAGCATGGAAAAATCTTAAATGACGATTTGAGTGAAAAATATTTTAGCGCTATTGAGATTAGTGAATAC
GCCCCTAAATCACAAAGCGATTTGTTTGATAAAATCAACGCCCTAGACAGCGAATTTATTTTCATGCATGCTTATTCGCC
TAAAAACTCACAGGTTTTAAAGGACAAACTGGCTTTCACCTCTAGAAGAATTATTATTAGTGGAGGCTCTAAAGAACAGG
GCATGACTTTAGGTTGCTTGAGCGAATTAGTGGGTAATGGTGATATTACGCTAGGCAGTTATGGTAATTCTTTAGTGCTG
TTTGCTGATAGCTTTGAAAAAATGAAACAAAGCGTTAAGGAATGCGTCTCTAGTCTTAACGCTAAAGGTTTTTTAGCCAA
CGCAGCGACTTTCTCTATGGAAAATTACTTTTTTGCCAAACATTGCTCTTTTATCACGCTTCCTTTTATTTTTGATGTAA
CTTCTAATAATTTTGCTGATTTCATCGCTATGAGGGCTATGAGTTTTGATGGCAATCAAGAGAATAACGCTTGGGGCAAT
AGTGTGATGACGCTAAAAAGCGAGATCAATTCGCCTTTTTATCTGAACTTCCACATGCCCACTGATTTTGGTTCAGCTTC
AGCAGGACACACTTTGATACTTGGCTCAACCGGTTCAGGTAAGACAGTGTTTATGTCAATGACCTTGAACGCTATGGGAC
AATTTGCCTATAATTTTCCTGCTAATGTCAGCAAAGACAAGCAAAAGCTCACTATGGTTTATATGGATAAAGATTATGGC
GCTTATGGGAATATTGTCGCAATGGGTGGGGAGTATGTCAAGATTGAGCTAGGGACAGATACAGGATTAAATCCTTTTGC
TTGGGCGGCTTGTGTGCAAAAAACAAATGCAACAATGGAGCAAAAACAAACAGCTATTTCTGTTGTCAAAGAGCTTGTGA
AAAACTTAGCAACTAAAAGCGATGAAAAAGATGAAAATGGCAACAGCATCTCTTTTAGCCTAGCAGATTCTAATACGCTT
GCAGCGGCAGTAACCAACCTTATCACAGGAAATATGAATCTAGATTATCCCATCACTCAACTTATTAATGCTTTCGGGAA
AGACCACAATGATCCTAATGGGCTTGTCGCGCGATTAGCGCCTTTTTGCAAATCAACCAATGGTGAATTTCAATGGCTTT
TTGATAATAAAGCAACAGATCGCTTAGATTTTTCAAAAACGATTATTGGCGTTGATGGGTCAAGTTTCTTAGACAATAAT
GATGTTTCGCCCTTTATTTGTTTTTACCTTTTCGCTCGTATCCAAGAGGCAATGGATGGGCGTAGATTTGTCTTAGATAT
TGATGAAGCCTGGAAATATTTAGGCGATCCAAAGGTCGCTTATTTTGTAAGAGACATGCTAAAAACTGCAAGGAAAAGAA
ACGCTATTGTCAGACTTGCGACTCAAAGCATCACTGATCTTTTGGCTTGCCCTATTGCTGATACGATTAGAGAACAATGC
CCTACAAAGATTTTTTTGAGAAACGATGGGGGCAATCTTTCTGATTACCAAAGATTGGCTAATGTTACAGAAAAAGAATT
TGAAATCATCACTAAGGGACTAGATAGGAAAATCCTCTACAAACAGGATGGAAGCCCTAGCGTTATCGCTAGTTTTAATT
TGAGAGGCATTCCTAAAGAATATTTGAAAATTTTATCCACAGATACTGTATTTGTCAAAGAAATTGATAAGATTATCCAA
AACCATAGTATCATAGATAAATATCAGGCCTTGAGACAAATGTATCAACAAATAGAGGAGTATTAA

Protein sequence :
MASKQADEQKKLIIEQEVQKRQFQKIEELKADMQKGVNPFFKVLFDGGNRLFGFPETFIYSSIFILFVTIVLSVILFQAY
EPVLIVAIVIVLVALGFKKDYRLYQRMERAMKFKKPFLFKGVKNKAFMSIFSMKPSKEMANDIHLNPNREDRLVSAANSY
LANNYECFLDDGVILTNNYSLLGTIKLGGIDFLTTSKKDLIELHASIYSVFRNFVTPEFKFYFHTIKKKIVIDETNRDYG
LIFSNDFMRAYNEKQKRESFYDISFFLTIEQDLLDTLNEPVMNKKHFADNNFEEFQRIIRAKLENFKDRIELIEELLSKY
HPTRLKEYTKDGVIYSKQCEFYNFLVGMNEAPFICNRKDLYLKEKMHGGVKEVYFANKHGKILNDDLSEKYFSAIEISEY
APKSQSDLFDKINALDSEFIFMHAYSPKNSQVLKDKLAFTSRRIIISGGSKEQGMTLGCLSELVGNGDITLGSYGNSLVL
FADSFEKMKQSVKECVSSLNAKGFLANAATFSMENYFFAKHCSFITLPFIFDVTSNNFADFIAMRAMSFDGNQENNAWGN
SVMTLKSEINSPFYLNFHMPTDFGSASAGHTLILGSTGSGKTVFMSMTLNAMGQFAYNFPANVSKDKQKLTMVYMDKDYG
AYGNIVAMGGEYVKIELGTDTGLNPFAWAACVQKTNATMEQKQTAISVVKELVKNLATKSDEKDENGNSISFSLADSNTL
AAAVTNLITGNMNLDYPITQLINAFGKDHNDPNGLVARLAPFCKSTNGEFQWLFDNKATDRLDFSKTIIGVDGSSFLDNN
DVSPFICFYLFARIQEAMDGRRFVLDIDEAWKYLGDPKVAYFVRDMLKTARKRNAIVRLATQSITDLLACPIADTIREQC
PTKIFLRNDGGNLSDYQRLANVTEKEFEIITKGLDRKILYKQDGSPSVIASFNLRGIPKEYLKILSTDTVFVKEIDKIIQ
NHSIIDKYQALRQMYQQIEEY