PAI Gene Information


Name : cagY
Accession : AGC69791.1
PAI name : cag PAI
PAI accession : JQ685139
Strain : Helicobacter pylori 2017
Virulence or Resistance : Virulence
Product : cag pathogenicity island protein Y
Function : -
Note : -
Homologs in the searched genomes : 3 hits (2 protein-level, 1 DNA-level)
Publication :
    -Barrozo,R.M., Cooke,C.L., Hansen,L.M., Lam,A.M., Gaddy,J.A., Johnson,E.M., Cariaga,T.A., Suarez,G., Peek,R.M. Jr., Cover,T.L. and Solnick,J.V., "Functional Plasticity in the Type IV Secretion System of Helicobacter pylori", PLoS Pathog. 9 (2), E1003189 (2013) PUBMED 23468628.

    -Hansen,L.M., "Direct Submission", Submitted (17-FEB-2012) Center for Comparative Medicine, University of California, Davis, County Road 98 & Hutchison Drive, Davis, CA 95616, USA.


DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGC
AACAGAGATCAATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTAACAACCCTACAGAAA
CTAAAACCAATTTTGATGGAGACAAGTCAGAAGAAACCCAAACTCAAATGGATTCTGGTGGTGATGAAACTTCAGAATCT
AGCAATGGCAGTCTAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAA
GAATTTAGATGAAGAAACCCAAGAACTGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAACGG
GCTTAATTGATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCAACAGAGATC
AATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCCTTATCTTGATAACCCCACAGAAACTAAAACCAA
TTTTGATGGAGACAAGTCAGAAGAAACCCAAACTCAAATGGATTCTGGTGGTGATGAAACTTCAGAATCTAGCAATGGCA
GTCTAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAGAATTTAGAT
GAAGAAACCCAAGAACTGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAACGGGCTTAATTGA
TGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCAACAGAGATCAATCACTTTG
AAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCCTTATCTTGATAACCCCACAGAAACTAAAACCAATTTTGATGGA
GACAAGTCAGAAGAAACCCAAACTCAAATGGATTCTGGTGGTGATGAAACTTCAGAATCTAGCAATGGCAGTCTAGCAGA
CAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAGAATTTAGATGAAGAAACCC
AAGAACTGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAACGGGCTTAATTGATGATGAAACT
TCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCAACAGAGATCAATCACTTTGAAGATTCTTC
AAAAGAATCCAAAGAAAGCTCAGATCCTTATCTTGATAACCCCACAGAAACTAAAACCAATTTTGATGAAGACAAGTCAG
AAGAAATAACTAACGACTCTAACGATCAAGAAATTATCAAAGGAAGCAAAAAGAAATATATTATTGGTGGCATTGTAGTC
GCTGTTCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTCATGCCTTTGGAAGATAAAAGCTCTCGTTT
TAGCAAAGACAGGAATCTTTATGTTAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTGAAAGAACGGAATG
AAAAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTATTTAAATATTGCA
GAAATTGAGGACAAAAACCCGTTGAGAGCCTTTTATGAATGTATTAGTAATGGTGGCAACTATGAAGAATGTTTGAAGCT
TATCAAAGACAAAAAACTTCAAGATCAGATGAAAAAGACTCTAGAGGCTTATAATGACTGCATCAAAAATGCCAAAACTG
AAGAAGAAAGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACAAAAAGTTCAA
GTGGCGCTAGATTGTTTGAAAAACGCTAAAACTGATGAAGAACGGAAAGAGTGCCTAAAACTCATAAATGACCCTGAGAT
TAGAGAGAAATTCCGTAAGGAATTAGAGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAAAACGCCAAAACAG
AAGCTGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAGAGAGCGTT
AAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCCGAGAAAAAAGAGTGTGAGAAATTACTCACCCCTGAAGC
GAGAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAAGCTTATTTGGACTGCGTATCTCAAGCCAAAAACGAAGATGAGA
AAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAACTTTTAGAGCAACAAGCGTTAGATTGTTTGAAAAATGCT
AAAACCGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGT
TAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCCGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAG
CGAGAAAACTCTTAGAAGAGGCTAAAGAGAGCATTAAAGCTTATAAAGACTGCGTATCTCAAGCCAAAAACGAAGATGAG
AAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAACTTTTAGAACAACAAGCGCTAGATTGTTTGAAAAACGC
TAAAACCGAAGCTGATAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACCTGCAGAAAAAGGTTTTAGCTAAAGAGAGCG
TTAAAGCTTATAAAGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAA
GCGAAAAAACTTTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACCGAAGCTGA
GAAAAAAGAATGCGAAAAATTGCTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAGAGAGCCTTAAAGCTTATA
AAGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAAAAATTGCTCACCCCTGAAGCGAAAAAACTT
TTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGATAAAAAAAGGTGTGTCAAAGATCTCCCTAA
AGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTATCAAGAGCTAGGAATGAAAAAG
AGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTAT
AAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAACTACTCACGCCTGAAGCGAGGAAACT
CTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAG
AATGCGAGAAATTACTCACCCCTGAAGCGAGAAAGTTCTTAGCGAAGCAAGTGCTAAATTGTTTGGAAAAAGCTGGAAAT
GAAGAAGAAAGAAAAGCATGTCTTAAAAATCTCCCTAAAGACTTACAGGAAAATGTTTTAGCTAAAGAGAGTCTTAAAGC
TTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTGCTCACCCCTGAAGCGAGAA
AACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAA
AAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGGAAATTTTTAGCGAAAGAACTCCAACAAAAAGATAAAGCGATCAA
AGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCGGCTATCATGAAGTGTTTGGATGGTTTGAGCGATGAAGAGAAGC
TCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAGGACCGATGAAGAAAAAAGGAAA
TGCCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAGCAAACAAAATCAATTGAGTAAAAC
AGAAAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGATCAACAAGCCATAGAGCAATGTTTAG
AGGGCTTGAGCGATAGTGAAAGGGCGCTAATTCTAGGAATTAAACGACAAGCTGATGAAGTGGATCGGATTTATAGCGAT
CTAAGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCGTTGTTGCCAATGGATTTCAAAAATGGCGGCGA
TATTGCCACTATTAACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATAATCCTATTTATGCTTCCATAGAGCCCG
ACATTACCAAACAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAAGCTAAATTAGCTAAGGCTTTAGGTGGC
AATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACAGCAGAAGCTAAAGTAGAAAGCAATAAGATAGACAAAGA
TGTCGCAGAAACTGCTAAGAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAATGGGGAATTTGTAGATGAAA
ATGGTAATCCCATTGATGACAAAAAGAAAGCAGAAAAACAAGATGAAACAAGCCCTGTCAAACAGGCCTTTATAGGCAAG
AGTGATCCCACATTTGTTTTAGCGCAATACACCCCCATTGAAATCACTCTGACTTCTAAAGTAGATGCCACTCTCACAGG
TATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCACTATGATTTTACTAGATAAAGGCACTAAGGTGT
ATGGGAATTATCAAAGCGTGAAAGGTGGCACACCTATTATGACACGCTTAATGATAGTCTTTACTAAAGCCATTACGCCT
GATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGCATGTTGGGTGAAGCAGGGGTAGATGGCTATGTGAATAA
TCACTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTGCAAACTGCGCCTATCATAGCTC
TAGATAAACTCATAGGTCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAATTACGCTTTGGGTCAAGCTATCAAT
GGTAGTATGCAAAGTTCAGCCCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCCCCAAGTTTTTACAAAAA
TGAGGGCGATAGTATTAAGATTCTCACAATGGACGATATTGATTTTAGTGGCGTGTATGATGTTAAAATTACCAACAAAT
CTGTGGTAGATGAAATTATCAAACAAAGCACTAAAACTTTGTCTAGAGAACATGAAGAAATCACCACAAGCCCCAAAGGT
GGCAATTAA

Protein sequence :
MNEENDKLETSKKTQQHSPQDLSNEEATEINHFEDSSKESKESSDHHLNNPTETKTNFDGDKSEETQTQMDSGGDETSES
SNGSLADKLFKKARKLVDNKRPFTQQKNLDEETQELNEEDDQENNGYQEETQTGLIDDETSKKTQQHSPQDLSNEEATEI
NHFEDSSKESKESSDPYLDNPTETKTNFDGDKSEETQTQMDSGGDETSESSNGSLADKLFKKARKLVDNKRPFTQQKNLD
EETQELNEEDDQENNGYQEETQTGLIDDETSKKTQQHSPQDLSNEEATEINHFEDSSKESKESSDPYLDNPTETKTNFDG
DKSEETQTQMDSGGDETSESSNGSLADKLFKKARKLVDNKRPFTQQKNLDEETQELNEEDDQENNGYQEETQTGLIDDET
SKKTQQHSPQDLSNEEATEINHFEDSSKESKESSDPYLDNPTETKTNFDEDKSEEITNDSNDQEIIKGSKKKYIIGGIVV
AVLIVIILFSRSIFHYFMPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIA
EIEDKNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSLLNQQKVQ
VALDCLKNAKTDEERKECLKLINDPEIREKFRKELELQKELQEYKDCIKNAKTEAEKKECEKLLTPEAKKLLEEEAKESV
KAYLDCVSQAKTEAEKKECEKLLTPEARKKLEEAKKSVKAYLDCVSQAKNEDEKKECEKLLTPEARKLLEQQALDCLKNA
KTDEERKKCLKDLPKDLQKKVLAKESVKAYLDCVSQAKTEAEKKECEKLLTPEARKLLEEAKESIKAYKDCVSQAKNEDE
KKECEKLLTPEAKKLLEQQALDCLKNAKTEADKKRCVKDLPKDLQKKVLAKESVKAYKDCVSRARNEKEKKECEKLLTPE
AKKLLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEEAKESLKAYKDCVSRARNEKEKKECEKLLTPEAKKL
LEQQALDCLKNAKTEADKKRCVKDLPKDLQKKVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAY
KDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSQAKTEAEKKECEKLLTPEARKFLAKQVLNCLEKAGN
EEERKACLKNLPKDLQENVLAKESLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEK
KECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRK
CQNLYSDLIQEIQNKRTQSKQNQLSKTERLHQASECLDNLDDPTDQQAIEQCLEGLSDSERALILGIKRQADEVDRIYSD
LRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKDKNLEAKLAKALGG
NKKDDDKEKSKKSTAEAKVESNKIDKDVAETAKNISEIALKNKKEKNGEFVDENGNPIDDKKKAEKQDETSPVKQAFIGK
SDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITP
DGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYALGQAIN
GSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKG
GN
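
The two sequence fields above can be checked against each other by translating the DNA sequence and comparing the result with the protein sequence. The following is a minimal sketch, not part of the original record. It assumes the standard codon assignments (which NCBI translation table 11, used for bacteria, shares for all 64 codons), and the helper name `translate` is hypothetical.

```python
# Build the standard codon table in TCAG order (assumption: standard
# codon assignments apply to this gene).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence block from the record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# The first 24 bases of the record's DNA sequence.
print(translate("ATGAATGAAGAAAACGATAAACTT"))
```

Passing the full DNA sequence block to `translate` and comparing the output with the protein sequence block (after removing line breaks) would confirm that the two fields agree.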