PAI Gene Information


Name : cag23 (HP0544)
Accession : AAR03906.1
PAI name : cag PAI
PAI accession : AY330639
Strain : Helicobacter pylori 2017
Virulence or Resistance: Virulence
Product : Cag23
Function : -
Note : cagE; jhp0492; virB4-like protein
Homologs in the searched genomes :   43 hits    ( 43 protein-level )  
Publication :
    -Blomstergren,A., Lundin,A., Nilsson,C., Engstrand,L. and Lundeberg,J., "Comparative analysis of the complete cag pathogenicity island sequence in four Helicobacter pylori isolates", Gene 328, 85-93 (2004) PUBMED 15019987.

    -Blomstergren,A., Lundin,A., Nilsson,C., Engstrand,L. and Lundeberg,J., "Direct Submission", Submitted (26-JUN-2003) Biotechnology, Royal Institute of Technology, Alba Nova University Centre, Roslagstullsbacken 21, S-106 91 Stockholm, Sweden.


DNA sequence :
GTGTTTGTGGCAAGCAAACAAGCTGACGAACAAAAAAAGCTAATCATAGAGCAAGAGGTTCAAAAGCGGCAGTTTCAAAA
AATAGAAGAGCTTAAAGCAGACATGCAAAAGGGTGTCAATCCCTTTTTTAAAGTCTTGTTTGATGGGGGGAATAGGTTGT
TTGGTTTCCCTGAAACTTTTATTTATTCCTCTATATTTATATTGTTTGTAACAATTGTATTATCTGTTATTCTTTTTCAA
GCCTATGAACCTGTTTTGATTGTAGCGATTGTTATTGTGCTTGTAGCTCTTGGATTCAAGAAAGATTATAGGCTTTATCA
AAGAATGGAGCGAGCGATGAAATTTAAAAAACCTTTTTTGTTTAAGGGCGTGAAAAACAAAGCGTTCATGAGCATTTTTT
CCATGAAGCCTAGTAAAGAAATGGCTAATGACATCCACTTAAATCCAAACAGAGAAGACAGACTTGTGAGTGCTGCAAAC
TCCTATCTAGCGAATAACTATGAATGTTTTTTAGATGATGGGGTGATCCTTACTAACAACTATTCTCTTTTAGGCACAAT
CAAATTGGGGGGCATTGATTTTTTAACCACTTCCAAAAAAGATCTCATAGAGTTACACGCTTCTATTTATAGCGTTTTTA
GGAATTTTGTTACCCCTGAATTCAAATTTTATTTCCACACTGTTAAAAAGAAAATCGTTATTGATGAAACCAATAGGGAC
TATGGTCTTATTTTTTCTAATGATTTTATGCGAGCCTATAATGAGAAGCAAAAAAGAGAAAGTTTTTATGACATTAGTTT
TTTTCTCACCATAGAGCAAGATTTATTAGACACTCTCAATGAACCCGTTATGAATAAAAAGCATTTTGCAGACAATCATT
TTGAAGAGTTTCAAAGGATTATTAGAGCCAAGCTTGAAAACTTCAAGGATAGGATAGAGCTCATAGAAGAGCTGTTGAGT
AAATACCACCCCACTAGATTGAAAGAATACACCAAAGATGGCGTTATTTACTCCAAGCAATGCGAGTTTTACAATTTTTT
GGTGGGAATGAATGAGGCCCCTTTTATTTGCAACAGAAAAGACTTGTATCTCAAGGAAAAAATGCATGGTGGGGTGAAAG
AAGTTTATTTTGCCAATAAGCATGGAAAAATCTTAAATGACGATTTGAGTGAAAAATATTTTAGCGCTATTGAGATCAGT
GAATACGCCCCTAAATCACAGAGCGATTTGTTTGATAAAATCAACGCTCTAGACAGCGAATTCATCTTTATGCATGCTTA
TTCGCCTAAAAACTCACAGGTTTTAAAGGACAAACTGGCTTTCACCTCTAGAAGAATTATTATTAGTGGAGGTTCTAAAG
AGCAGGGCATGACTTTAGGTTGCTTGAGCGAATTAGTGGGTAATGGTGATATTACGCTAGGTAGTTATGGTAATTCTTTA
GTGCTGTTTGCTGATAGCTTTGAAAAAATGAAACAAAGCGTTAAGGAATGCGTCTCTAGTCTTAACGCTAAAGGTTTTTT
AGCCAACGCAGCGACTTTCTCTATGGAAAATTACTTTTTTGCCAAACATTGCTCTTTTATCACGCTTCCTTTTATTTTTG
ATGTAACTTCTAATAATTTTGCTGATTTCATCGCTATGAGGGCTATGAGTTTTGATGGCAATCAAGAGAATAACGCTTGG
GGCAATAGTGTGATGACGCTAAAAAGCGAGATCAATTCGCCTTTTTATTTGAACTTCCACATGCCTACTGATTTTGGTTC
AGCTTCAGCAGGACACACTTTGATACTTGGCTCAACCGGTTCAGGTAAGACAGTGTTTATGTCAATGACCTTGAACGCTA
TGGGACAATTTGCTCACAATTTTCCTGCTAATGTCAGCAAAGACAAGCAAAAGCTCACTATGGTCTATATGGATAAAGAT
TATGGCGCTTATGGGAATATTGTCGCAATGGGTGGGGAGTATGTCAAGATTGAGCTAGGGACAGATACAGGATTAAATCC
TTTTGCTTGGGCGGCTTGTGTGCAAAAAACAAATGCAACAATGGAGCAAAAACAAACAGCTATTTCTGTTGTCAAAGAGC
TTGTGAAAAACTTAGCGACCAAAAGCGATGAAAAAGATGAAAATGGCAACAGCATCTCTTTTAGCCTAGCAGATTCTAAT
ACGCTTGCAGCGGCAGTAACCAACCTTATCACAGGAAATATGAACCTAGATTATCCCATCACTCAACTTATTAATGCTTT
CGGAAAAGACCACAATGATCCTAATGGGCTTGTCGCGCGATTAGCGCCTTTTTGCAAATCAACCAATGGTGAATTTCAAT
GGCTTTTTGATAATAAAGCAACGGATCGCTTAGATTTTTCAAAAACGATTATTGGCGTTGATGGGTCAAGTTTCTTAGAC
AATAATGATGTTTCGCCCTTTATTTGTTTTTACCTTTTCGCTCGTATCCAAGAGGCAATGGATGGGCGTAGATTTGTCTT
AGATATTGATGAAGCTTGGAAATATTTAGGCGATCCAAAGGTCGCTTATTTTGTAAGAGACATGCTAAAAACTGCAAGGA
AAAGAAACGCTATTGTCAGGCTTGCGACTCAAAGTATCACTGATCTTTTGGCTTGCCCTATTGCTGATACGATTAGAGAA
CAGTGCCCTACAAAGATTTTTTTGAGAAACGATGGGGGCAATCTTTCTGATTACCAAAGATTGGCTAATGTTACAGAAAA
AGAATTTGAAATCATCACTAAGGGACTAGATAGGAAAATTCTCTACAAACAAGATGGAAGCCCTAGCGTTATCGCTAGTT
TTAATTTGAGAGGCATTCCTAAAGAATATTTGAAAATTTTATCCACAGATACTGTATTTGTCAAAGAAATTGACAAGATT
ATCCAAAACCATAGTATCATAGATAAATATCAGGCCTTGAGGCAAATGTATCAACAAATAGAGGAGTATTAA

Protein sequence :
MFVASKQADEQKKLIIEQEVQKRQFQKIEELKADMQKGVNPFFKVLFDGGNRLFGFPETFIYSSIFILFVTIVLSVILFQ
AYEPVLIVAIVIVLVALGFKKDYRLYQRMERAMKFKKPFLFKGVKNKAFMSIFSMKPSKEMANDIHLNPNREDRLVSAAN
SYLANNYECFLDDGVILTNNYSLLGTIKLGGIDFLTTSKKDLIELHASIYSVFRNFVTPEFKFYFHTVKKKIVIDETNRD
YGLIFSNDFMRAYNEKQKRESFYDISFFLTIEQDLLDTLNEPVMNKKHFADNHFEEFQRIIRAKLENFKDRIELIEELLS
KYHPTRLKEYTKDGVIYSKQCEFYNFLVGMNEAPFICNRKDLYLKEKMHGGVKEVYFANKHGKILNDDLSEKYFSAIEIS
EYAPKSQSDLFDKINALDSEFIFMHAYSPKNSQVLKDKLAFTSRRIIISGGSKEQGMTLGCLSELVGNGDITLGSYGNSL
VLFADSFEKMKQSVKECVSSLNAKGFLANAATFSMENYFFAKHCSFITLPFIFDVTSNNFADFIAMRAMSFDGNQENNAW
GNSVMTLKSEINSPFYLNFHMPTDFGSASAGHTLILGSTGSGKTVFMSMTLNAMGQFAHNFPANVSKDKQKLTMVYMDKD
YGAYGNIVAMGGEYVKIELGTDTGLNPFAWAACVQKTNATMEQKQTAISVVKELVKNLATKSDEKDENGNSISFSLADSN
TLAAAVTNLITGNMNLDYPITQLINAFGKDHNDPNGLVARLAPFCKSTNGEFQWLFDNKATDRLDFSKTIIGVDGSSFLD
NNDVSPFICFYLFARIQEAMDGRRFVLDIDEAWKYLGDPKVAYFVRDMLKTARKRNAIVRLATQSITDLLACPIADTIRE
QCPTKIFLRNDGGNLSDYQRLANVTEKEFEIITKGLDRKILYKQDGSPSVIASFNLRGIPKEYLKILSTDTVFVKEIDKI
IQNHSIIDKYQALRQMYQQIEEY