PAI Gene Information


Name : HP0527
Accession : BAD14026.1
PAI name : cag PAI
PAI accession : AB120425
Strain : Helicobacter pylori 2017
Virulence or Resistance: Virulence
Product : cag pathogenicity island protein
Function : -
Note : similar to virB10, H.pylori 26695 gene HP0527:cag7, J99 gene JHP0476, NCTC11638 gene ORF13andORF14:cag-Y
Homologs in the searched genomes :   32 hits    ( 31 protein-level,   1 DNA-level )  
Publication :
    -Azuma,T., Yamakawa,A. and Yamazaki,S., "Direct Submission", Submitted (12-SEP-2003) Takeshi Azuma, University of Fukui Faculty of Medical Sciences, General Medicine Second Department of Internal Medicine; Shimoaizuki 23-3, Matsuoka-cho, Yoshida-gun, Fukui 910-1193, Japan (E-mail:azuma@fmsrsa.fukui-med.ac.jp, Tel:8.

    -Azuma,T., Yamakawa,A., Yamazaki,S., Ohtani,M., Ito,Y., Muramatsu,A., Suto,H., Yamazaki,Y., Keida,Y., Higashi,H. and Hatakeyama,M., "Distinct diversity of the cag pathogenicity island among Helicobacter pylori strains in Japan", J. Clin. Microbiol. 42 (6), 2508-2517 (2004) PUBMED 15184428.


DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATCTATCCAATGAAGAAGC
GACAGAAGTCAATCATTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCCTTATCTTAACAACCCCACAGAAA
CTCAAACCCATTTTGATGGAGACAAGCTAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCT
AGCAATAGCAGTCTAGCAGACAAGTTGTTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAA
GAATTTAGATGAAGAAACCCAAGAACTGAACGAAGAAGACGATCAAGAAAATAATGAGTATCAAGAAGAAACTCAAATGG
ACTTAATTGATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATCTATCCAATGAAGAAGCAACAGAGGTC
AATCATTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCCTTATCTTGACAACCCCACAGAAACTAAAACCCA
AGAAACTAAAACCCATTTTGATGGAGACAAGTCAGAAGAAATAACTAACGACTCTAACGATCAAGAGATTATCAAAGGAA
GCAAAAAGAAATACATTATTGGTGGCATTGTAGTCGCTGTTCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCAC
TACTTCATGCCTTTGGAAGATAAAAGCTCTCGTTTTAGCAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAG
GCAAGAGTATAACCGATTGCTGAAAGAACGGAATGAAAAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATC
CCAATAGAACCTTATACAACTATTTGAATATTGCAGAAATTGAGGACAAAAACCCGTTGAGAGCCTTTTATGAATGTATT
AGTAATGGCGGCAACTATGAAGAATGTTTGAAGCTTATCAAAGACAAAAAACTTCAAGATCAAATGAAAAAGACTCTAGA
GGCTTATAATGACTGCATCAAAAATGCCAAAACTGAAGAAGAAAGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACC
TAAAAAAAAGCTTACTGAACCAACAAAAAGTTCAAGTAGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGA
AACGAGTGCCTAAAACTCATAAATGACCCTGAGGTTAGAGAGAAATTCCGTAAGGAATTAGAGCTTCAAAAAGAGCTTCA
AGAGTATAAGGATTGTATCAAAAACGCCAAAACTGAAGCTGAGAAAAACAAATGCTTGAAAGGCTTGTCTAAAGAAGCTA
TAGAAAGATTGAAACAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCTTGAAAAAT
ATTCCCCAAGACTTACAAAAAGAACTACTAGCTGATATGAGCGTCAAGGCTTACAAGGACTGCGTATCAAGAGCTAGGAA
TGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAGGAAAAAGTTAGAACAACAAGCGCTAGATTGTT
TGAAAAACGCTAAAACCGAAGCTGATAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCC
AAAGAGAGCCTGAAAGCTTATAAAGACTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACT
CACCCCTGAAGCGAGGAAACTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTATCTCAAGCCA
AAAACGAAGCTGAGAAAAAAGAATGCGAAAAATTACTCACCCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGC
GTTAAGGCTTATTTGGACTGCGTATCTCAAGCCAAAAACGAAGCTGAGAAAAAAGAATGCGAAAAATTACTCACCCCTGA
AGCGAAAAAACTTTTAGAACAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGATAAAAAAAGGTGTGTCA
AAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCCTGAAAGCTTATAAAGACTGCGTATCAAGAGCT
AGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAACTTTTAGAAGAATCTAGAAAAAG
CGTTAAGGCTTACTTGGACTGCGTATCTCAAGCCAAAAACGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACCCCTG
AAGCGAGGAAACTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTATCTCAAGCCAAAAACGAA
GCTGAGAAAAAAGAATGCGAAAAATTACTCACCCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGC
TTATTTGGACTGCGTATCTCAAGCCAAAAACGAAGCTGAGAAAAAAGAATGCGAAAAATTACTCACCCCTGAAGCGAAAA
AACTTTTAGAACAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGATAAAAAAAGGTGTGTCAAAGATCTC
CCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCCTGAAAGCTTATAAAGACTGCGTATCAAGAGCTAGGAATGA
AAAAGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAACTTTTAGAAGAATCTAGAAAAAGCGTTAAGG
CTTACTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAA
AAACTTTTAGAAGAAGCTAAAGAAAGCCTTAAAGCTTATAAGGACTGTCTCTCTCAAGCTAGAAATGAAACTGAAAGGAG
AGCTTGCGAGAAATTACTCACCCCTGAAGCGAAAAAACTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAA
CCGAAGCTGATAAAAAAAGGTGCATCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGTGTTAAG
GCTTATTTGGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAG
GAAACTTTTAGAAGAAGCCAAAGAGAGTCTGAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGA
GAGCTTGCGAGAAATTACTCACTCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTG
GACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAGTTCTT
AGCGAAGCAAGTGCTAAATTGTTTGGAAAAAGCTGGAAATGAAGAAGAAAGGAAAGCATGTCTTAAAAATCTCCCTAAAG
ACTTACAGGAAAATGTTTTAGCTAAAGAGAGTCTTAAGGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAA
AGGAGAGCTTGCGAGAAACTACTCACCCCTGAAGCGAGGAAACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTA
TTTGGACTGCGTATCAAGAGCTAGGAATGAAAAAGAAAAACAGGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAGT
TCTTAGCGAAAGAACTCCAACAAAAAGATAAAGCGATCAAAGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCGGCT
ATCATGAAGTGTTTGGATGGTTTGAGCGATGAAGAGAAGCTCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGA
TTGTTTGAAAACGGCTAGGACCGATGAAGAAAAAAGGAAATGCCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAA
ATAAAAGGACACAAAGCAAACAAAATCAATTGAGTAAAACAGAAAGGTTGCATCAAGCAAGCGAGTGCTTGGATAACTTA
GATGACCCTACCGATCAACAAGCCATAGAGCAATGTTTAGAGGGCTTGAGCGATAGTGAAAGGGCGCTAATTCTAGGAAT
TAAACGACAAGCTGATGAAGTGGATCTGATTTATAGCGATCTAAGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAG
GTTATCCGTTGTTGCCAATGGATTTCAAAAATGGCGGCGATATTGCCACTATTAACGCCACTAATGTTGATGCGGACAAA
ATAGCTAGCGATAATCCTATTTATGCTTCCATAGAGCCTGACATTACCAAACAATACGAAACAGAAAAAACCATTAAGGA
TAAGAATTTAGAAGCTAAATTGGCTAAGGCTTTAGGTGGCAATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCA
CAGCAGAATCTAAAGTAGAAAGCAATAAGATAGACAAAGATGTCGCAGAAACTGCTAAGAATATCAGTGAAATCGCTCTT
AAGAACAAAAAAGAAAAGAATGGGGAATTTGTAGATGAAAATGGTAATCCCATTGATGACAAAAAGAAAGCAGAAAAACA
AGATGAAACAAGCCCTGTCAAACAGGCCTTTATAGGCAAGAGTGATCCCACATTTGTTTTAGCGCAATACACCCCTATTG
AAATCACTCTGACTTCTAAAGTAGATGCCACTCTCACAGGTATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATG
AACGGCACTATGATCTTACTAGACAAAGGCACTAAGGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACACCCATTAT
GACACGCTTAATGATAGTCTTTACTAAAGCCATTACGCCTGATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAG
GCATGTTGGGTGAAGCAGGGGTAGATGGCTATGTGAATAATCACTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGC
GTGGTTAATAGCTTCTTGCAAACTGCGCCTATCATAGCTCTAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAG
GACACCTGAATTTAATTACGCTTTGGGTCAAGCTATCAATGGTAGTATGCAAAGTTCAGCTCAGATGTCTAATCAAATTC
TAGGGCAACTGATGAATATCCCCCCAAGTTTTTACAAAAACGAGGGTGATAGTATTAAGATTCTCACAATGGACGATATT
GATTTTAGCGGCGTGTATGATGTTAAAATTACCAACAAATCTGTGGTAGATGAAATTATCAAACAAAGCACCAAAACCTT
GTCTAGAGAACATGAAGAAATCACCACAAGCCCCAAAGGTGGCAATTAA

Protein sequence :
MNEENDKLETSKKTQQHSPQDLSNEEATEVNHFEDSSKESKESSDPYLNNPTETQTHFDGDKLEETQTQMDSGGNETSES
SNSSLADKLFKKARKLVDNKRPFTQQKNLDEETQELNEEDDQENNEYQEETQMDLIDDETSKKTQQHSPQDLSNEEATEV
NHFEDSSKESKESSDPYLDNPTETKTQETKTHFDGDKSEEITNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFH
YFMPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECI
SNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEER
NECLKLINDPEVREKFRKELELQKELQEYKDCIKNAKTEAEKNKCLKGLSKEAIERLKQQALDCLKNAKTDEERNECLKN
IPQDLQKELLADMSVKAYKDCVSRARNEKEKKECEKLLTPEARKKLEQQALDCLKNAKTEADKKRCVKDLPKDLQKKVLA
KESLKAYKDCVSQAKTEAEKKECEKLLTPEARKLLEEEAKESVKAYLDCVSQAKNEAEKKECEKLLTPEAKKKLEEAKKS
VKAYLDCVSQAKNEAEKKECEKLLTPEAKKLLEQQALDCLKNAKTEADKKRCVKDLPKDLQKKVLAKESLKAYKDCVSRA
RNEKEKKECEKLLTPEAKKLLEESRKSVKAYLDCVSQAKNEAEKKECEKLLTPEARKLLEEEAKESVKAYLDCVSQAKNE
AEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKNEAEKKECEKLLTPEAKKLLEQQALDCLKNAKTEADKKRCVKDL
PKDLQKKVLAKESLKAYKDCVSRARNEKEKKECEKLLTPEAKKLLEESRKSVKAYLDCVSQAKTEAEKKECEKLLTPEAK
KLLEEAKESLKAYKDCLSQARNETERRACEKLLTPEAKKLLEQQALDCLKNAKTEADKKRCIKDLPKDLQKKVLAKESVK
AYLDCVSRARNEKEKKECEKLLTPEARKLLEEAKESLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYL
DCVSRARNEKEKKECEKLLTPEARKFLAKQVLNCLEKAGNEEERKACLKNLPKDLQENVLAKESLKAYKDCLSQARNEEE
RRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAA
IMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQEIQNKRTQSKQNQLSKTERLHQASECLDNL
DDPTDQQAIEQCLEGLSDSERALILGIKRQADEVDLIYSDLRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADK
IASDNPIYASIEPDITKQYETEKTIKDKNLEAKLAKALGGNKKDDDKEKSKKSTAESKVESNKIDKDVAETAKNISEIAL
KNKKEKNGEFVDENGNPIDDKKKAEKQDETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNM
NGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIAS
VVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDI
DFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGGN