PAI Gene Information


Name : cagE (HPB8_699)
Accession : YP_003728720.1
PAI name : cag PAI
PAI accession : NC_014256_P1
Strain : Helicobacter pylori 2017
Virulence or Resistance: Virulence
Product : cag pathogenicity island protein E
Function : -
Note : CAG pathogenicity island protein 23 (Protein picB), CagE, TrbE, VirB component of type IV transporter system, CagE, TrbE, VirB family, component of type IV transporter system, VirB4_CagE: type IV secretion/conjugal transfer ATPase, VirB4 family
Homologs in the searched genomes :   43 hits    ( 43 protein-level )  
Publication :
    -Blom,J., "Direct Submission", Submitted (30-NOV-2009) Blom J., Cebitec, Bielefeld University, Universitaetsstr. 25, 33615, GERMANY.

    -Farnbacher,M., Jahns,T., Willrodt,D., Daniel,R., Haas,R., Goesmann,A., Kurtz,S. and Rieder,G., "Sequencing, annotation, and comparative genome analysis of the gerbil-adapted Helicobacter pylori strain B8", BMC Genomics 11, 335 (2010) PUBMED 20507619 REMARK Publication Status: Online-Only.

    -Farnbacher,M., Jahns,T., Willrodt,D., Daniel,R., Haas,R., Goesmann,A., Kurtz,S. and Rieder,G., "Direct Submission", Submitted (18-JUN-2010) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.


DNA sequence :
GTGGCAAGCAAACAAGCTGACGAACAAAAAAAGCTAGTCATAGAGCAAGAGGTTCAAAAGCGCCAATTTCAAAAAATAGA
AGAACTTAAAGCAGACATGCAAAAGGGTGTCAATCCCTTTTTTAAAGTCTTGTTTGATGGGGGGAATAGGTTGTTTGGTT
TCCCTGAAACTTTTATTTATTCCTCTATATTTATATTGTTTGTAACAATTGTATTATCTGTTATTCTTTTTCAAGCCTAT
GAACCTGTTTTGATTGTAGCGATTGTTATTGTGCTTGTAGCTCTTGGATTCAAGAAAGATTACAGGCTTTATCAAAGAAT
GGAGCGAGCGATGAAATTTAAAAAACCTTTTTTGTTTAAGGGCGTGAAAAACAAAGCGTTCATGAGCATTTTTTCCATGA
AGCCTAGTAAAGAAATGGCTAATGACATCCACTTAAATCCAAACAGAGAAGACAGACTTGTGAGCGCTGCAAACTCCTAT
CTAGCGAATAACTATGAATGTTTTTTAGATGATGGGGTGATCCTTACTAACAACTATTCTCTTTTAGGCACAATCAAATT
GGGGGGCATTGATTTTTTAACCACTTCAAAAAAAGATCTCATAGAGTTACACGCTTCTATTTATAGCGTTTTTAGGAATT
TTGTTACCCCTGAATTCAAATTTTATTTTCACACTATTAAAAAGAAAATCGTTATTGATGAAACCAATAGGGACTATGGT
CTTATTTTTTCTAATGATTTTATGCGAGCCTATAATGAGAAGCAAAAGAGAGAAAGTTTTTATGATATTAGTTTTTATCT
CACCATAGAGCAAGATTTATTAGACACTCTCAATGAACCCGTTATGAATAAAAAGCATTTTGCAGACAATAATTTTGAAG
AGTTTCAAAGGATTATTAGAGCCAAGCTTGAAAACTTCAAGGATAGGATAGAGCTCATAGAAGAGCTATTGAGTAAATAC
CACCCCATTAGATTAAAAGAATACACTAAAGATGGCGTTATTTACTCCAAACAATGCGAGTTTTATAATTTCCTTGTGGG
AATGAATGAAGCCCCTTTTATTTGCAACAGAAAAGACTTGTATCTCAAGGAAAAAATGCATGGTGGGGTGAAAGAAGTTT
ATTTTGCCAATAAGCATGGAAAAATCTTAAATGACGATTTGAGTGAAAAATATTTTAGCGCTATTGAGATTAGTGAATAC
GCCCCTAAATCACAGAGCGATTTGTTTGATAAAATCAACGCTCTAGACAGCGAATTTATCTTTATGCATGCTTATTCGCC
TAAAAACTCACAGGTTTTAAAGGACAAACTAGCTTTCACTTCTAGAAGAATTATTATTAGTGGAGGCTCTAAAGAGCAGG
GCATGACTTTAGGTTGCTTGAGCGAATTAGTGGGTAATGGCGATATTACGCTAGGCAGTTATGGTAATTCTTTAGTGTTG
TTTGCTGATAGCTTTGAAAAAATGAAACAAAGCGTTAAGGAATGTGTCTCTAGTCTTAACGCTAAAGGTTTTTTAGCCAA
CGCGGCGACTTTCTCTATGGAAAATTACTTTTTTGCCAAACATTGCTCTTTTATCACGCTTCCTTTTATTTTTGATGTAA
CTTCTAATAATTTTGCTGATTTTATTGCGATGAGGGCTATGAGCTTTGATGGCAATCAGGAGAATAACGCTTGGGGCAAT
AGCGTCATGACACTAAAAAGCGAGATCAATTCGCCTTTTTATCTGAACTTCCACATGCCCACTGATTTTGGTTCAGCTTC
AGCAGGACACACTTTGATACTTGGATCAACCGGTTCAGGTAAGACAGTGTTTATGTCAATGACCTTGAACGCTATGGGGC
AATTTGTTCACAATTTTCCTGCTAATGTCAGCAAAGACAAGCAAAAGCTCACTATGGTCTATATGGATAAAGATTATGGC
GCTTATGGGAATATTGTCGCAATGGGTGGGGAGTATGTCAAAATTGAGCTAGGGACAGATACAGGATTAAATCCTTTTGC
TTGGGCAGCTTGTGTGCAAAAAACAAATGCAACAATGGAGCAAAAACAAACGGCTATTTCTGTTGTCAAAGAGCTTGTGA
AAAACTTAGCAACCAAAAGCGATGAAAAAGATGAAAATGGCAACAGCATCTCTTTTAGCCTAGCAGATTCTAATACGCTT
GCAGCGGCAGTAACCAACCTTATCACAGGAGATATGAACCTAGATTATCCCATCACTCAACTTATTAATGCTTTCGGGAA
AGACCACAATGATCCTAATGGGCTTGTCGCGCGATTAGCACCTTTTTGCAAATCAACCAATGGTGAATTTCAATGGCTTT
TTGATAATAAAGCAACGGATCGCTTAGATTTTTCAAAAACGATTATTGGCGTTGATGGGTCAAGTTTCTTAGACAATAAT
GATGTTTCGCCCTTTATTTGTTTTTACCTTTTCGCTCGTATCCAAGAGGCAATGGATGGGCGTAGATTTGTCTTAGATAT
TGATGAAGCTTGGAAATATTTAGGCGATCCAAAGGTCGCTTATTTTGTGAGAGACATGCTAAAAACTGCAAGGAAAAGAA
ACGCTATTGTCAGACTTGCGACTCAAAGCATCACTGATCTTTTGGCTTGCCCTATTGCTGATACGATTAGAGAACAATGC
CCTACAAAGATTTTTTTGAGAAACGATGGGGGCAATCTTTCTGATTACCAAAGATTAGCTAATGTTACAGAAAAAGAATT
TGAAATCATCACTAAGGGACTAGACAGGAAAATTCTCTACAAACAAGATGGAAGCCCTAGCGTTATCGCTAGTTTTAATT
TGAGAGGCATTCCTAAAGAATATTTGAAAATTTTATCCACAGATACTGTATTTGTCAAAGAAATTGACAAGATTATCCAA
AACCATAGTATCATAGATAAATATCAGGCCTTGAGGCAAATGTATCAACAAATAGAGGAGTATTAA

Protein sequence :
MASKQADEQKKLVIEQEVQKRQFQKIEELKADMQKGVNPFFKVLFDGGNRLFGFPETFIYSSIFILFVTIVLSVILFQAY
EPVLIVAIVIVLVALGFKKDYRLYQRMERAMKFKKPFLFKGVKNKAFMSIFSMKPSKEMANDIHLNPNREDRLVSAANSY
LANNYECFLDDGVILTNNYSLLGTIKLGGIDFLTTSKKDLIELHASIYSVFRNFVTPEFKFYFHTIKKKIVIDETNRDYG
LIFSNDFMRAYNEKQKRESFYDISFYLTIEQDLLDTLNEPVMNKKHFADNNFEEFQRIIRAKLENFKDRIELIEELLSKY
HPIRLKEYTKDGVIYSKQCEFYNFLVGMNEAPFICNRKDLYLKEKMHGGVKEVYFANKHGKILNDDLSEKYFSAIEISEY
APKSQSDLFDKINALDSEFIFMHAYSPKNSQVLKDKLAFTSRRIIISGGSKEQGMTLGCLSELVGNGDITLGSYGNSLVL
FADSFEKMKQSVKECVSSLNAKGFLANAATFSMENYFFAKHCSFITLPFIFDVTSNNFADFIAMRAMSFDGNQENNAWGN
SVMTLKSEINSPFYLNFHMPTDFGSASAGHTLILGSTGSGKTVFMSMTLNAMGQFVHNFPANVSKDKQKLTMVYMDKDYG
AYGNIVAMGGEYVKIELGTDTGLNPFAWAACVQKTNATMEQKQTAISVVKELVKNLATKSDEKDENGNSISFSLADSNTL
AAAVTNLITGDMNLDYPITQLINAFGKDHNDPNGLVARLAPFCKSTNGEFQWLFDNKATDRLDFSKTIIGVDGSSFLDNN
DVSPFICFYLFARIQEAMDGRRFVLDIDEAWKYLGDPKVAYFVRDMLKTARKRNAIVRLATQSITDLLACPIADTIREQC
PTKIFLRNDGGNLSDYQRLANVTEKEFEIITKGLDRKILYKQDGSPSVIASFNLRGIPKEYLKILSTDTVFVKEIDKIIQ
NHSIIDKYQALRQMYQQIEEY