PAI Gene Information


Name : cag5 (HP0524)
Accession : AAR03916.1
PAI name : cag PAI
PAI accession : AY330640
Strain : Helicobacter pylori 2017
Virulence or Resistance : Virulence
Product : Cag5
Function : -
Note : cag-beta; jhp0473; orf10; virD4-like protein
Homologs in the searched genomes : 45 hits (43 protein-level, 2 DNA-level)
Publication :
    -Blomstergren,A., Lundin,A., Nilsson,C., Engstrand,L. and Lundeberg,J., "Comparative analysis of the complete cag pathogenicity island sequence in four Helicobacter pylori isolates", Gene 328, 85-93 (2004) PUBMED 15019987.

    -Blomstergren,A., Lundin,A., Nilsson,C., Engstrand,L. and Lundeberg,J., "Direct Submission", Submitted (26-JUN-2003) Biotechnology, Royal Institute of Technology, Alba Nova University Centre, Roslagstullsbacken 21, S-106 91 Stockholm, Sweden.


DNA sequence :
ATGGAAGACTTTTTGTATAACACCTTATATTTCATAGAGGATTATAAGTTGGTTGTTATTTCTAGTTTCATAGGGTTAAT
AGCGTTATTTTTTCTTTACAAATTCATAAAAGCTCAAAAAAAGGCTTTTAAAGATAAAGCTAACCAACCTCAAAAGAAAA
AAAGCTTTAAAGAAATCATTATAGATGGGCTGAAAGAAAGAGTTAAAACCTTTGGCTTTTGGTTGCAAGCTATACTATTA
CTATCCTATTCTTTTATCACATCAGGGTTATTTTTCTTGATTCTCTTAGGTAATTTTTATGATGATAATCGATCGCCTGA
GAGTGATGATGATCTTTTTGATATATGGGTCTATGCGATACAAGATTTTCCTAATTACTATTTTAAGGCACTCACTTTTA
GCTCACTCAAGATTTATGGGTTCAATATATCCTTAGTCGTATATAGTTCTATTTTATGCTCTTATATCTTCATTACCTTT
TTTGTGTGGTTCTTAAAATACTTAACTCGGACTAGAGATATAGGAGCGAATAAAAAAGTTGATGATCTCTTTGGTAGTGC
GAGTTGGGAAACTGAAGAGAAAATGATCAAAGCCAAGCTCATCACGCCCAACAATAAAAAACGCGCCTTTGACAAACGAG
AGGTGATTGTAGGCAGGCGTGGCTTGGGGGATTTTATCGCTTACGCAGGGCAGGCGTTCATTGGCTTGATTGCTCCTACT
AGAAGCGGTAAGGGGGTGGGTTTCATCATGCCCAATATGATCAATTATCCTCAAAATATCGTTGTGTTTGACCCTAAAGC
TGACACTATGGAGACTTGCGGAAAAATCAGAGAAAAACGCTTCAACCAAAAAGTGTTCATCTATGAACCTTTCTCCTTAA
AAACACACCGATTTAATCCTTTCGCTTATGTGGATTTTGGTAATGATGTGGTTTTGACTGAAGACATACTCTCTCAAATT
GACACACGCCTAAAAGGGCATGGGATGGTGGCTAGTGGAGGGGATTTTTCCACTCAAATCTTTGGATTGGCTAAGCTCGT
GTTCCCTGAAAGACCTAATGAAAAAGATCCTTTTTTTAGCAATCAAGCGCGAAATCTTTTTGTCATCAATTGCAATATTT
ATAGGGATCTCATGTGGACTAAAAAGGGGCTTGAGTTTGTCAAAAGAAAAAAAATCATCATGCCTGAAACCCCCACGATG
TTTTTCATAGGTTCTATGGCAAGCGGGATCAACTTGATTGATGAAGACACAAACATGGAAAAAGTCGTGTCTTTAATGGA
ATTTTTTGGAGGTGAAGAAGATAAGAGTGGCGATAATCTAAGAGCGCTTAGTCCTACTACTAGAAACATGTGGAATAGCT
TCAAGACAATGGGTGGTGCTAAAGAAACTTATAGCTCTGTTCAAGGGGTCTATACATCAGCGTTTGCGCCTTACAATAAC
GCCATGATTAGGAATTTCACGAGCGCTAATGATTTTGATTTCAGGCGTTTAAGGATCGATGCAGTGAGTATTGGCGTGAT
CGCTAATCCTAAAGAAAGCACTATTGTTGGGCCGATACTAGAGCTGTTTTTCAATGTAATGATTTATAGCAATTTGATTC
TGCCAATCCATGATCCACAATGCAAAAGAAGTTGCTTGATGCTTATGGATGAATTCACGCTCTGTGGCTATTTAGAGACC
TTTGTTAAAGCGGTAGGGATTATGGCAGAATACAACATGCGCCCTGCTTTTGTGTTTCAAAGTAAGGCGCAACTAGAGAA
TGACCCCCCACTTGGTTATGGTAGGAATGGCGCTAAGACTATTTTAGACAACCTTTCTTTGAATATGTATTATGGGATTA
ACAACGATAACTACTATGAACATTTTGAAAAACTTTCTAAGGTATTAGGGAAATACACAAGGCAAGACGTGAGCCGAAGC
ATTGATGATAATACAGGTAAGACCAACACTTCTATCAGCAACAAAGAGCGGTTTTTGATGACCCCTGATGAATTGATGAC
TATGGGCGATGAGCTTATCATTCTAGAGAATACGCTCAAACCCATCAAATGCCACAAGGCGCTTTACTATGATGATCCAT
TCTTCACCGATGAACTCATTAAGGTAAGTCCAAGCTTGAGCAAGAAATACAAATTGGGGAAAGTGCCTAATCAAGCAACT
TTCTATGATGATTTGCAAGCCGCTAAAACTAGAGGTGAATTGAGCTATGACAAGTCTTTAGTGCCTGTGGGTTCAAGCGA
ATTGTGA

Protein sequence :
MEDFLYNTLYFIEDYKLVVISSFIGLIALFFLYKFIKAQKKAFKDKANQPQKKKSFKEIIIDGLKERVKTFGFWLQAILL
LSYSFITSGLFFLILLGNFYDDNRSPESDDDLFDIWVYAIQDFPNYYFKALTFSSLKIYGFNISLVVYSSILCSYIFITF
FVWFLKYLTRTRDIGANKKVDDLFGSASWETEEKMIKAKLITPNNKKRAFDKREVIVGRRGLGDFIAYAGQAFIGLIAPT
RSGKGVGFIMPNMINYPQNIVVFDPKADTMETCGKIREKRFNQKVFIYEPFSLKTHRFNPFAYVDFGNDVVLTEDILSQI
DTRLKGHGMVASGGDFSTQIFGLAKLVFPERPNEKDPFFSNQARNLFVINCNIYRDLMWTKKGLEFVKRKKIIMPETPTM
FFIGSMASGINLIDEDTNMEKVVSLMEFFGGEEDKSGDNLRALSPTTRNMWNSFKTMGGAKETYSSVQGVYTSAFAPYNN
AMIRNFTSANDFDFRRLRIDAVSIGVIANPKESTIVGPILELFFNVMIYSNLILPIHDPQCKRSCLMLMDEFTLCGYLET
FVKAVGIMAEYNMRPAFVFQSKAQLENDPPLGYGRNGAKTILDNLSLNMYYGINNDNYYEHFEKLSKVLGKYTRQDVSRS
IDDNTGKTNTSISNKERFLMTPDELMTMGDELIILENTLKPIKCHKALYYDDPFFTDELIKVSPSLSKKYKLGKVPNQAT
FYDDLQAAKTRGELSYDKSLVPVGSSEL