PAI Gene Information


Name : HP0527
Accession : BAD13779.1
PAI name : cag PAI
PAI accession : AB120416
Strain : Helicobacter pylori 2017
Virulence or Resistance : Virulence
Product : cag pathogenicity island protein
Function : -
Note : similar to virB10, H. pylori 26695 gene HP0527:cag7, J99 gene JHP0476, NCTC11638 gene ORF13 and ORF14:cag-Y
Homologs in the searched genomes : 24 hits (23 protein-level, 1 DNA-level)
Publication :
    -Azuma,T., Yamakawa,A. and Yamazaki,S., "Direct Submission", Submitted (12-SEP-2003) Takeshi Azuma, University of Fukui Faculty of Medical Sciences, General Medicine Second Department of Internal Medicine; Shimoaizuki 23-3, Matsuoka-cho, Yoshida-gun, Fukui 910-1193, Japan (E-mail:azuma@fmsrsa.fukui-med.ac.jp, Tel:8.

    -Azuma,T., Yamakawa,A., Yamazaki,S., Ohtani,M., Ito,Y., Muramatsu,A., Suto,H., Yamazaki,Y., Keida,Y., Higashi,H. and Hatakeyama,M., "Distinct diversity of the cag pathogenicity island among Helicobacter pylori strains in Japan", J. Clin. Microbiol. 42 (6), 2508-2517 (2004) PUBMED 15184428.


DNA sequence :
ATGAATGAAGAAAACGATAAATTTGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGC
AACAGAAGCCAATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGAACATCATCTTGACAACCCTACAGAAA
CTAAAACCAATTTTGATGAATACGAGTCAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCT
AGCAATGGCAGTCTAGCAGACAAGTTATTCAAAAAAGCTAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAA
GAATTTAGATGAAGAAATCCAAGAACCGAACGAAGAAGACGATCAGGAAAATAATGGGTATCAAGAAGAAACTCAAATGG
ACTTAATTGATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCAACAGAAGCC
AATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGAACATCATCTTGACAACCCTACAGAAACTAAAACCAA
TTTTGATGAATACGAGTCAGAAGAAATAACTAACGATTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACA
TTATTGGTGGCATTGTAGTCGCTGTTCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTCATACCTTTG
GAAGATAAAAGCTCTCGTTTTAGCAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCG
ATTGCTGAAAGAACGGAATGAAAAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTAT
ACAACTATTTGAATATTGCAGAAATTGAGGACAAAAACCCATTGAGGGCCTTTTATGAATGTATTAGTAATGGTGGCAAC
TATGAAGAATGTTTGAAGCTTATCAAAGACAAAAAACTTCAAGATCAAATGAAAAAGACTTTAGAGGCTTATAATGACTG
CATCAAAAATGCCAAAACTGAAGAAGAAAGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTAC
TGAACCAACAAAAAGTTCAAGTGGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCCTAAAA
CTCATAAATGACCCTGAGATTAGAGAGAAATTCCGTAAGGAATTAGGGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTG
TATCAAAAACGCCAAAACAGAAGCTGAGAAAAACGAATGTTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAAC
AGCAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCTTGAAAAATATTCCCCAAGACTTG
CAAAAAGAACTACTAGCTGATATGAGCGTCAAGGCTTACAAGGATTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAA
AGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACT
GCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACCCCTGAAGCGAAAAAGCTTTTAGAA
GAAGCTAAAAAGAGTCTTAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAA
ATTACTCACCCCTGAAGCGAAAAAGCTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAAC
GAAACGAGTGCTTGAAAAATATTCCCCAAGACTTGCAAAAGGAACTACTAGCTGATATGAGCGTCAAGGCTTACAAGGAT
TGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGA
AGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCGTTTCACGAGCTAGGAATGGAAAAGAGAAACAAGAATGCGAGA
AATTACTCACCCCTGAAGCGAAAAAGCTTTTAGAAGAAGCTAAAAAGAGTCTTAAAGCTTATAAAGACTGCGTTTCAAGA
GCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACCCCTGAAGCGAAAAAGCTTTTAGAGCAACAAGCGCT
AGATTGTCTGAAAAACGCTAAAACCGAAGCTGAGAAAAAGAGGTGTGTCAAAGATCTTCCTAAAGACTTGCAGAAAAAGG
TTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAG
AAATTGCTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATGAAGACTGCGTTTCAAG
AGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACCCCTGAAGCGAAAAAGCTTTTAGAGCAACAAGCGC
TAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAGAGGTGTGTCAAAGATCTTCCTAAAGACTTGCAGAAAAAG
GTTTTAGCTAAAGAGAGCGTTAAGGCTTACTTGGATTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGA
GAAATTGCTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCCTCTCTC
AAGCTAGAAATGAAGAAGAAAGGAGAGCTTGTGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAGCAAGAAGTT
AAGAAGAGCGTTAAGGCTTACTTGGATTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACT
CACCCCTGAAGCGAAAAAGCTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAGA
GGTGTGTCAAAGATCTTCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTT
TCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAAGC
TAAAGAGAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGTGAGAAATTAC
TCACGCCTGAAGCGAAAAAACTTTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTACTTGGATTGCGTTTCAAGAGCT
AGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGGAAATTTTTAGCGAAGCAAGTGCTAAA
TTGTTTGGAAAAAGCTAGAAATGAAGAAGAAAGAAAAGCATGTCTTAAAAATATCCCTAAAGACTTACAGAAAAATGTTT
TAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGTGAGAAA
TTACTCACCCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACTGCGTATCAAG
AGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGAAAATTTTTAGCGAAAGAACTCC
AACAAAAAGATAAAGCGATCAAAGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCGGCTATTATGAAGTGTTTGGAT
GGTTTGAGCGATGAAGAGAAGCTCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAG
GACCGATGAAGAAAAAAGGAAATGTCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAGCA
AACAAAATCAATTGAGTAAAACAGAAAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGATCAA
GAAGCCATAGAGCAATGTTTAGAGGGCTTGAGCGATAGTGAAAGGGCGCTAATTCTAGGAATTAAACGACAAGCTGATGA
AGTGGATCTGATTTATAGCGATCTAAGAAACCGCAAAACCTTTGATAATATGGCGGCTAAAGGTTATCCATTGTTGCCAA
TGGATTTCAAAAATGGCGGCGATATTGCCACTATTAACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATAATCCT
ATTTATGCTTCCATAGAGCCTGACATTACTAAGCAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAAGCTAA
ATTAGCTAAGGCTTTAGGTGGCAATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACAGCAGAAGCTAGAGTAG
AAAGCAATAAGATAGACAAAGATGTCGCAGAAACTGCCAAAAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAG
AGTGGGGAATTTGTAGATGAAAATGGTAATCCCATTGATGACAAAAAGAAAACAGAAACACAAGATGAAACAAGCCCTGT
CAAACAGGCCTTTATAGGCAAGAGTGATCCCACATTTGTTTTAGCGCAATACACCCCTATTGAAATCACTCTGACTTCTA
AAGTAGATGCCACTCTCACAGGTATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCACTATGATCTTA
CTAGACAAAGGCACTAAGGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACACCCATTATGACACGCTTAATGATAGT
CTTTACTAAAGCCATTACGCCTGATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGCATGTTGGGTGAAGCAG
GGGTAGATGGCTATGTGAATAACCACTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGTTTCTTG
CAAACTGCGCCTATCATAGCTCTAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAATTA
CGCTTTGGGTCAAGCTATCAATGGTAGTATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATA
TCCCCCCAAGTTTTTACAAAAATGAGGGCGATAGTATTAAGATTCTCACAATGGACGATATTGATTTTAGTGGCGTGTAT
GATGTTAAAATTACCAACAAATCTGTGGTAGATGAAATTATCAAACAAAGCACTAAAACTTTGTCTAGAGAGCATGAAGA
AATCACCACAAGCCCCAAAGGTGGCAATTAA

Protein sequence :
MNEENDKFETSKKTQQHSPQDLSNEEATEANHFEDSSKESKESSEHHLDNPTETKTNFDEYESEETQTQMDSGGNETSES
SNGSLADKLFKKARKLVDNKRPFTQQKNLDEEIQEPNEEDDQENNGYQEETQMDLIDDETSKKTQQHSPQDLSNEEATEA
NHFEDSSKESKESSEHHLDNPTETKTNFDEYESEEITNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFIPL
EDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGN
YEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLK
LINDPEIREKFRKELGLQKELQEYKDCIKNAKTEAEKNECLKGLSKEAIERLKQQALDCLKNAKTDEERNECLKNIPQDL
QKELLADMSVKAYKDCVSKARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCVSRARNEKEKQECEKLLTPEAKKLLE
EAKKSLKAYKDCVSRARNEKEKQECEKLLTPEAKKLLEQQALDCLKNAKTDEERNECLKNIPQDLQKELLADMSVKAYKD
CVSKARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCVSRARNGKEKQECEKLLTPEAKKLLEEAKKSLKAYKDCVSR
ARNEKEKQECEKLLTPEAKKLLEQQALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVKAYLDCVSRARNEKEKKECE
KLLTPEAKKLLEEAKESLKAYEDCVSRARNEKEKQECEKLLTPEAKKLLEQQALDCLKNAKTEAEKKRCVKDLPKDLQKK
VLAKESVKAYLDCVSRARNEKEKQECEKLLTPEAKKLLEEAKESLKAYKDCLSQARNEEERRACEKLLTPEAKKLLEQEV
KKSVKAYLDCVSRARNEKEKQECEKLLTPEAKKLLEQQALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVKAYLDCV
SRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCLSQARNEEERRACEKLLTPEAKKLLEQEVKKSVKAYLDCVSRA
RNEKEKQECEKLLTPEARKFLAKQVLNCLEKARNEEERKACLKNIPKDLQKNVLAKESLKAYKDCLSQARNEEERRACEK
LLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCLD
GLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQEIQNKRTQSKQNQLSKTERLHQASECLDNLDDPTDQ
EAIEQCLEGLSDSERALILGIKRQADEVDLIYSDLRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNP
IYASIEPDITKQYETEKTIKDKNLEAKLAKALGGNKKDDDKEKSKKSTAEARVESNKIDKDVAETAKNISEIALKNKKEK
SGEFVDENGNPIDDKKKTETQDETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMIL
LDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFL
QTAPIIALDKLIGLGKGRSERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVY
DVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGGN
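
Consistency check :
The protein sequence can be compared against a translation of the DNA sequence. The following is a minimal Python sketch, assuming the standard codon table applies to this gene (an assumption, not stated in the record). The function name translate is hypothetical, and the two short fragments in the usage lines are copied from the start and end of the DNA sequence above.

```python
# Minimal sketch: translate a DNA coding sequence into protein.
# Assumption: the standard codon table is appropriate for this gene.

BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence block can be pasted in directly. Codons that are not in
    the table (for example, ones containing N) are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")
        if residue == "*":  # stop codon ends the translation
            break
        protein.append(residue)
    return "".join(protein)


# First 12 codons of the DNA sequence in this record.
print(translate("ATGAATGAAGAAAACGATAAATTTGAAACTTCTAAA"))
# Last 5 codons of the DNA sequence, including the TAA stop codon.
print(translate("AAAGGTGGCAATTAA"))
```

To check the whole entry, pass the full DNA sequence block to translate and compare the result with the protein sequence after removing its line breaks.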