PAI Gene Information


Name : HP0547
Accession : BAD13826.1
PAI name : cag PAI
PAI accession : AB120417
Strain : Helicobacter pylori 2017
Virulence or Resistance: Virulence
Product : cag pathogenicity island protein
Function : -
Note : similar to H.pylori 26695 gene HP0547:cag26, J99 gene JHP0495, NCTC11638 gene cag-A
Homologs in the searched genomes :   46 hits    ( 45 protein-level,   1 DNA-level )  
Publication :
    -Azuma,T., Yamakawa,A. and Yamazaki,S., "Direct Submission", Submitted (12-SEP-2003) Takeshi Azuma, University of Fukui Faculty of Medical Sciences, General Medicine Second Department of Internal Medicine; Shimoaizuki 23-3, Matsuoka-cho, Yoshida-gun, Fukui 910-1193, Japan (E-mail:azuma@fmsrsa.fukui-med.ac.jp, Tel:8.

    -Azuma,T., Yamakawa,A., Yamazaki,S., Ohtani,M., Ito,Y., Muramatsu,A., Suto,H., Yamazaki,Y., Keida,Y., Higashi,H. and Hatakeyama,M., "Distinct diversity of the cag pathogenicity island among Helicobacter pylori strains in Japan", J. Clin. Microbiol. 42 (6), 2508-2517 (2004) PUBMED 15184428.


DNA sequence :
ATGACTAACGAAACTATTGATCAAACAACAACACTGGATCAAACACCAAACCAAACGGATTTTGTTCCGCAACGATTTAT
CAATAATCTTCAAGTAGCTTTTCTTAAAGTTGATAACGCTGTCGCTTCATTTGATCCTGATCAAAAACCAATCGTTGATA
AGAATGATAGGGATAACAGGCAAGCTTTTGAAAAAATCTCGCAACTAAGGAAAGAATACGCTAATAAAGCGATCAAAAAT
CCCACCAAAAAGAATCAATATTTTTCAGACTTTATCAATAAGAGCAATGATTTGATCAACAAAGACAATCTCATTGCTGT
AGATTCTTCCGTAGAGAGCTTTCGGAAATTTGGGGATCAGCGTTACCAAATTTTTACGAGTTGGGTGTCCCTTCAAAAAG
ATCCGTCTGAAATCAACACCCAACAAATCCGAAATTTTATGGAAAATATCATACAACCCCCTATCTCTGATGATAAGGAA
AAAGCGGAGTTTTTGAGGTCTGCCAAACAATCTTTTGCAGGAATTATCATAGGGAACCAAATCCGATCGGATGAAAAATT
CATGGGCGTGTTTGATGAATCTTTGAAAGCAAGGCAAGAAGCAGAAAAAAATGCAGAGCCTGCTGGTGGGGATTGGCTTG
ATATTTTTTTATCATTTGTGTTTAACAAAAAACAATCTTCCGATCTCAAAGAAACGCTCAATCAAGAGCCAAGGCCTGAT
TTTGAACAAAATTTAGCCACTACCACCACCGACATACAAGGCTTACCGCCTGAAGCTAGAGATTTGCTTGATGAAAGGGG
TAATTTTTTTAAATTCACTCTTGGTGATGTAGAGATGTTGGATGTTGAGGGAGTCGCTGACAAGGATCCCAATTACAAGT
TCAATCAATTATTGATCCACAATAACGCTCTGTCTTCTGTGCTAATGGGGGGTCATAGTAACATAGAACCTGAAAAAGTT
TCATTATTGTATGGGGATAATGGTGGTCCTGAAGCTAGGCATGATTGGAACGCCACCGTTGGTTATAAAGACCAACAAGG
CAACAATGTGGCCACACTCATCAATGCGCATCTTAATAACGGCAGCGGGTTAATCATAGTGGGTAATGAGGATGGGATTA
AAAACCCTAGCTTCTATCTCTACAAAACAGACCAACTCACAGGCTTGAAACAAGCGTTGAGTCAAGAAGAGATCCAAAAC
AAAGTGGATTTCATGGAATTTCTTGCAAAAAACAACGCTAAATTAGACAACTTGAGCGAGAAAGAGAAAGAAAAATTCCA
AACTGAGATTGAAAATTTCCAAAAAGACCGTAAGGCTTATTTAGACGCTCTAGGGAATGATCACATTGCTTTTGTTTCTA
AAAAAGACCCAAAACATTTAGCTTTGGTTACTGAGTTTGGTAATGAGGAAGTGAGCTATACTCTCAAAGATTATGGGAAA
AAACAAGATAAAGCTTTAGATGGGGAGACAAAAACCACTCTTCAAGGTAGCCTAAAATATGATGGCGTGATGTTTGTTGA
TTATTCCAATTTCAAATACACCAACGCCTCCAAGAGTCCTAATAAGGGTTTAGGCGCTACGAATGGCGTTTCCCATTTGG
AAGCAAATTTTAGCAAGGTAGCTGTCTTTAATTTGCCTAATTTAAATAATCTTGCTATCACTAATTATATAAGGCGAGAT
TTAGAAGATAAATTGTGGGCTAAAGGATTGTCCCCACAAGAAGCTAATAAGCTCATCAAAGACTTTTTGAACAGCAACAA
AGAAATGGTTGGAAAAGTTTTAAACTTCAATAAAGCTGTAGCTGAAGCTAAAAACACAGGCAATTATGACGAAGTGAAAA
AAGCTCAGAAAGATCTTGAAAAATCTCTAAGGAAACGAGAGCATTTAGAGAAAGAAGTAGCGAAAAAATTGGAGAGCAGA
AACGACAACAAAAATAGAATGGAAGCAAAAGCTCAAGCTAACAGCCAAAAAGATAAGATTTTTGCGTTGATCAATACAGA
GGCTAGTAAGGAAGCAAGAGTGGCCGCTTTCGATCCGAATCTTAAAAGCACCAGGAGCGAATTGTCTGATAAACTTGAAA
ATATCAACAAGAATTTGAAAGACTTTGGCAAATCTTTTGATGAACTCAAAAATGGCAAAAATAATGATTTCAGCAAGGCA
GAAGAAACGCTAAAAGCCCTTAAAGACTCGGTGAAAGATTTAGGCATCAATCCAGAATGGATTTCAAAAATTGAAAACCT
TAATGCAGCTTTGAATGATTTCAAAAATGGCAAAAATAAGGATTTCAGTAAGGTAACACAAGCAAAAAGCGACCTTGAAA
ATTCCATTAAGGATGTGATCATCAATCAAAAGATAACGGATAAAGTTGACAATCTCAATCAGGCTGTATCAGAGACTAAA
TTAACAGGCGATTTCAGTAAGGTAGAGCAAGCCCTAGCCGAACTCAAAAGCTTGTCATTGGATCTTGGAAAAAATTCTGA
TCTACAAAAATCCGTTAAAAATGGTGTAAATGGAACCCTGGTCGGTAATGGGTTATCTAAAACAGAAGCCACAACGCTCA
CCAAAAATTTTTCGGACATCAGGAAAGAATTGAACGAGAAGTTATTTGGAAATTCCAATAACAATAATAATGGGCTCAAA
AACAGCACAGAGCCCATTTACGCTCAAGTTGCTAAAAAGGTGAGTGCAAAAATTGACCAACTCAACGAAGCTACATCAGC
AATAAATAGAAAAATTGACCGGATTAACAAAATTGCATCAGTTGGAAAAAATTCTGATCTACAAAAATCCGTTAAAAATG
GTGTAAATGGAACCCTGGTCGGTAATGGGTTATCTAAAACAGAAGCCACAACGCTCACCAAAAATTTTTCGGACATCAGG
AAAGAATTGAACGAGAAGTTATTTGGAAATTCCAATAACAATAATAATGGGCTCAAAAACAGCACAGAGCCCATTTACGC
TAAAGTTAATAAAAAGAAAACAGGACAAGTAGCTAGCCCTGAAGAGCCCATTTACGCTCAAGTTGCTAAAAAGGTGAGTG
CAAAAATTGACCAACTCAACGAAGCTACATCAGCAATAAATAGAAAAATTGACCGGATTAACAAAATTGCATCAGCAGGT
AAAGGAGTGGGCGGTTTCAGTGGAGCAGGGCGATCAGCTAGCCCTGAACCCATTTACGCTACAATTGATTTTGATGAGGC
AAATCAAGCAGGCTTCCCTTTGAGAAGAAGTACTGGAGTTAATGACCTCAGTAAAGTAGGGCTTTCAAGGAAACAAGAAT
TGACTCGTAGAATTGGCGATCTCAATCAGGCGGTATCAGAAGCTAAAACAGGTTATTTTGACAACCTAGAACAAAAGATA
GATGAACTCAAAGATTCTACGAAAAAGAATGCTTTGAAGTTATTTGTTGAAAGTGCGAAACAAGTGCCTACTGGTTTGCA
AGCGAAATTGGACAATTACGCTACTAACAGCCACACACGCATTAATAGCAATGTCCAAAGTGGAGCAATCAATGAAAAGG
CGACCGGCATGCTGACGCAAAAAAACCCTGAGTGGCTCAAGCTCGTGAATGATAAGATAGTTGCACATAATGTGGGAAGC
ACTCATTTGTCAGAGTATGATAAAATTGGATTCAACCAAAAGAATATGAAAGATTATTCTGATTCGTTCAAGTTTTCCAC
CAAGTTGAACAATGCCGTAAAAGACATTAAGTCTAGCTTTGTGCAATTTTTAACCAATACATTTTCTACAGGATCTTACA
GCTTGATGAAAGCAAATGTGGAACATGGAGTCAAAAGTACTACAAAAGGTGGTTTCCAAAAATCTTAA

Protein sequence :
MTNETIDQTTTLDQTPNQTDFVPQRFINNLQVAFLKVDNAVASFDPDQKPIVDKNDRDNRQAFEKISQLRKEYANKAIKN
PTKKNQYFSDFINKSNDLINKDNLIAVDSSVESFRKFGDQRYQIFTSWVSLQKDPSEINTQQIRNFMENIIQPPISDDKE
KAEFLRSAKQSFAGIIIGNQIRSDEKFMGVFDESLKARQEAEKNAEPAGGDWLDIFLSFVFNKKQSSDLKETLNQEPRPD
FEQNLATTTTDIQGLPPEARDLLDERGNFFKFTLGDVEMLDVEGVADKDPNYKFNQLLIHNNALSSVLMGGHSNIEPEKV
SLLYGDNGGPEARHDWNATVGYKDQQGNNVATLINAHLNNGSGLIIVGNEDGIKNPSFYLYKTDQLTGLKQALSQEEIQN
KVDFMEFLAKNNAKLDNLSEKEKEKFQTEIENFQKDRKAYLDALGNDHIAFVSKKDPKHLALVTEFGNEEVSYTLKDYGK
KQDKALDGETKTTLQGSLKYDGVMFVDYSNFKYTNASKSPNKGLGATNGVSHLEANFSKVAVFNLPNLNNLAITNYIRRD
LEDKLWAKGLSPQEANKLIKDFLNSNKEMVGKVLNFNKAVAEAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVAKKLESR
NDNKNRMEAKAQANSQKDKIFALINTEASKEARVAAFDPNLKSTRSELSDKLENINKNLKDFGKSFDELKNGKNNDFSKA
EETLKALKDSVKDLGINPEWISKIENLNAALNDFKNGKNKDFSKVTQAKSDLENSIKDVIINQKITDKVDNLNQAVSETK
LTGDFSKVEQALAELKSLSLDLGKNSDLQKSVKNGVNGTLVGNGLSKTEATTLTKNFSDIRKELNEKLFGNSNNNNNGLK
NSTEPIYAQVAKKVSAKIDQLNEATSAINRKIDRINKIASVGKNSDLQKSVKNGVNGTLVGNGLSKTEATTLTKNFSDIR
KELNEKLFGNSNNNNNGLKNSTEPIYAKVNKKKTGQVASPEEPIYAQVAKKVSAKIDQLNEATSAINRKIDRINKIASAG
KGVGGFSGAGRSASPEPIYATIDFDEANQAGFPLRRSTGVNDLSKVGLSRKQELTRRIGDLNQAVSEAKTGYFDNLEQKI
DELKDSTKKNALKLFVESAKQVPTGLQAKLDNYATNSHTRINSNVQSGAINEKATGMLTQKNPEWLKLVNDKIVAHNVGS
THLSEYDKIGFNQKNMKDYSDSFKFSTKLNNAVKDIKSSFVQFLTNTFSTGSYSLMKANVEHGVKSTTKGGFQKS