PAI Gene Information


Name : cag8 (HP0528)
Accession : AAR03891.1
PAI name : cag PAI
PAI accession : AY330639
Strain : Helicobacter pylori 2017
Virulence or Resistance: Virulence
Product : Cag8
Function : -
Note : cag-X; jhp0477; orf15; virB9-like protein
Homologs in the searched genomes :   44 hits    ( 44 protein-level )  
Publication :
    -Blomstergren,A., Lundin,A., Nilsson,C., Engstrand,L. and Lundeberg,J., "Comparative analysis of the complete cag pathogenicity island sequence in four Helicobacter pylori isolates", Gene 328, 85-93 (2004) PUBMED 15019987.

    -Blomstergren,A., Lundin,A., Nilsson,C., Engstrand,L. and Lundeberg,J., "Direct Submission", Submitted (26-JUN-2003) Biotechnology, Royal Institute of Technology, Alba Nova University Centre, Roslagstullsbacken 21, S-106 91 Stockholm, Sweden.


DNA sequence :
ATGGGGCAGGCATTTTTTAAAAAAATTGTTAACTGTTTCTGTCTTGGTTATTTATTTTTATCTAGCACAATAGAAGCAGT
AGCACTTGACATTAAGAATTTTAATCGTGGTAGGGTGAAAGTGGTGAATAAGAAGATTGCTTATTTGGGAGATGAAAAAC
CTATTACGATTTGGACTTCATTAGACAATGTTACCGTGATCCAACTTGAAAAAGATGAAACTATTTCTTACATCACAACA
GGTTTCAATAAAGGTTGGAGTATTGTGCCTAATTCTAATCATATATTCATTCAACCTAAATCGGTAAAAAGTAATCTCAT
GTTTGAAAAAGAAGCAGTGAATTTTGCCCTAATGACAAGAGATTACCAAGAATTTTTAAAAACAAAAAAACTTATCGTAG
ATGTGCCTGACCCTAAAGAATTAGAAGAACAAAAAAAAGCTCTAGAAAAAGAAAAAGAAGCTAAAGAACAGGCGCAAAAA
GCGCAAAAAGATAAAAGAGAAAAAAGAAAAGAAGAGCGTGCAAAAAATAGAGCCAATTTAGAAAATCTCACTAACGCTAT
GAGTAACCCACAAAATTTGAGCAATAACAAAAATCTTAGCGAACTTATCAAGCAACAACGAGAAAATGAATTAGACCAAA
TGGAACGACTAGAGGACATGCAAGAACAGGCTCAAGCTAATGCGCTCAAACAAATTGAAGAACTCAACAAGAAACAAGCT
GAAGAGACAATCAAGCAAAGAGCCAAAGATAAAATCAGTATTAAGACAGATAAATCTCAAAAAAGTCCTGAGGATAACTC
CATAGAATTATCTCCTAGCGATAGCGCTTGGAGGACCAATCTTGTTGTGCGGACTAATAAAGCCTTGTATCAATTCATTT
TGAGAATAGCTCAAAAAGACAATTTTGCTTCGGCGTATCTAACAGTCAAATTAGAATACCCACAAAGACACGAAGTCTCT
AGCGTTATTGAAGAGGAATTAAAAAAGAGAGAAGAAGCAAAGAGGCAGAAAGAATTGATCAAGCAAGAAAATCTTAACAC
CACAGCCTACATCAATAGAGTAATGATGGCGAGCAATGAACAGATCATCAACAAAGAAAAAATAAGAGAAGAAAAACAAA
AAATTATCTTAGATCAAGCAAAGGCGCTAGAGACTCAATATGTGCATAATGCATTAAAAAGAAACCCCGTGCCTAGAAAC
TACAATTACTACCAAGCGCCTGAAAAACGCTCTAAACATATTATGCCCTCTGAAATTTTTGATGATGGCACATTCACTTA
TTTTGGTTTCAAAAACATCACTCTCCAACCTGCTATTTTTGTGGTTCAACCTGATGGGAAATTGAGCATGACTGATGCTG
CCATTGATCCTAACATGACCAATTCAGGATTGAGATGGTATAGAGTTAATGAAATTGCAGAAAAGTTTAAGCTCATTAAA
GACAAAGCCCTTGTAACAGTAATCAATAAAGGCTATGGGAAAAATCCATTGACAAAAAATTACAATATCAAAAACTATGG
TGAATTGGAGCGCGTGATTAAAAAGCTCCCTCTTGTCAGAGATAAATAA

Protein sequence :
MGQAFFKKIVNCFCLGYLFLSSTIEAVALDIKNFNRGRVKVVNKKIAYLGDEKPITIWTSLDNVTVIQLEKDETISYITT
GFNKGWSIVPNSNHIFIQPKSVKSNLMFEKEAVNFALMTRDYQEFLKTKKLIVDVPDPKELEEQKKALEKEKEAKEQAQK
AQKDKREKRKEERAKNRANLENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQA
EETIKQRAKDKISIKTDKSQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKDNFASAYLTVKLEYPQRHEVS
SVIEEELKKREEAKRQKELIKQENLNTTAYINRVMMASNEQIINKEKIREEKQKIILDQAKALETQYVHNALKRNPVPRN
YNYYQAPEKRSKHIMPSEIFDDGTFTYFGFKNITLQPAIFVVQPDGKLSMTDAAIDPNMTNSGLRWYRVNEIAEKFKLIK
DKALVTVINKGYGKNPLTKNYNIKNYGELERVIKKLPLVRDK