Gene Information

Name : cagY (HPP12_0534)
Accession : YP_002301168.1
Strain : Helicobacter pylori P12
Genome accession: NC_011498
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein Y VirB10-like protein
Function : -
COG functional category : U : Intracellular trafficking, secretion and vesicular transport
COG ID : COG2948
EC number : -
Position : 559964 - 565669 bp
Length : 5706 bp
Strand : -
Note : -

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAACCCAACAAGATTCACCCCAAGATCTATCTAATGAAGAAGC
AACAGAAGTCAATCATTTTGAAGATCTTTTAAAAGAAGAAAGCTCAGATAATCATCTTGACAACCCCACAGAAATTAAAA
CCAATTTTGATGGAGACAAGCTAGAAGAAACCCAAACTCAAATGGATTCTGGTGGTGATGAAACTTCAGAATCTAGCAAT
GGCAGTCTAGCAGACAAGTTGTTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAGAATTT
AGATGAAGAAACCCAAGAACTGAACGAAGAAGACGATCAAGAAAATAATGAGTATCAAGAAGAAACTCAAACGGGCTTAA
TTGATGATGAAACTTCTAAAAAAACCCAACAAGATTCACCCCAAGATTTATCCAATGAAGAAGCAACAGAAGTCAATCGC
TTTGAAGATTCTTTAAAAGAAGAAAGCTCAGATCAGCATCTTGACAATTCCGCAGAAACTCAAACCAATTTTGATAAAGA
CAAGTCAGAAGAAATAACTAACGACTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACATTATTAGTGGTC
TTGTAGTCGCTGTTCTTATCGTGATTATCTTATTTTCTAGAAGCATTTTTCACTACTTCATGCCTTTGGAAGATAAAAGC
TCTCGTTTTAGCAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTGAAAGA
ACGGAATGAAAAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTATTTAA
ATATTGCAGAAATTGAGGACAAAAACCCATTGAGAGCCTTTTATGAATGCATTAGTAATGGTGGCAACTATGAAGAATGT
TTGAAGCTTATCAAAGACAAAAAACTTCAAGATCAGATGAAAAAGACTCTAGAGGCTTATAAAGACTGCATCAAAAATGC
CAAAACTGAAGAAGAAAGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACAAA
AAGTTCAAGTGGCGCTAGATTGTTTGAAAAAGGCTAAAACCGATGAAGAACGAAACGAGTGCCTAAAACTCATAAATGAC
CCTGAGATTAGAGAGAAATTCCGTAAGGAATTAGGGCTTCAAAAAGAGCTTCAAGTGTATAAGGATTGTATCAAAAACGC
CAAAACAGAAGCTGAGAAAAACGAATGCTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAACAACAAGTGCTAG
ATTGTTTGAAAAACGCTAAAACTGATGAAGAACGAAACGAGTGCTTGAAAAATATTCCCCAAGACTTGCAAAAAGAACTA
CTAGCTGATATGAGTGTCAAGGCTTACAAGGACTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAA
ATTGCTCACGCCTGAAGCGAAAAAAAAGTTAGAACAACAAGTGCTAGATTGTTTGAAAAACGCTAAAACTGATGAAGAAC
GAAAAAAATGTTTGAAAGATCTCCCTAAAGACTTACAAAGCGATATTTTAGCCAAAGAGAGCGTTAAAGCTTATAAAGAC
TGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAACTTTTAGA
AGAAGAAGCCAAAGAGAGCGTTAAAGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCG
AGAAATTGCTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTACTTAGATTGCGTATCT
CAAGCCAAAAACGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAACTTTTAGAGCAACAAGC
GCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGATAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAAA
AGGTTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAAACGAAGCTGAGAAAAAAGAATGC
GAGAAATTACTCACCCCTGAAGCGAGGAAACTCTTAGAAGAGGCTAAAGAGAGTGTTAAAGCTTATAAAGACTGCGTATC
AAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAAAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGAAG
CCAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTG
CTCACCCCTGAAGCGAGGAAACTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAG
AAATGAAGAAGAAAGGAGAGCTTGCGAGAAACTACTCACCCCTGAAGCGAGGAAACTCTTAGAACAACAAGCGCTAGATT
GTTTGAAAAACGCTAAAACTGATGAAGAACGAAAAAAATGTTTGAAAGATCTCCCTAAAGACTTACAAAGCGATATTTTA
GCCAAAGAGAGCGTTAAAGCTTATAAAGACTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATT
GCTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAAAAAGCGTTAAAGCTTATTTGGATTGCGTATCTCAAGCCA
AAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAAAAAGC
GTTAAGGCTTACTTAGATTGCGTATCTCAAGCCAAAAACGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGA
AGCGAAAAAACTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGATAAAAAAAGGTGTGTCA
AAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCC
AAAAACGAAGCTGAGAAAAAAGAATGCGAAAAATTACTCACCCCTGAAGCGAGGAAACTCTTAGAAGAGGCTAAAGAGAG
TCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTGCTCACCCCTG
AAGCGAGGAAACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACTGCGTATCAAGAGCTAGGAATGAA
AAAGAGAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCAAGAAAATTTTTAGCGAAGCAAGTGCTAAATTGTTTGGA
AAAAGCTGGAAATGAAGAAGAAAGAAAAGCATGTCTTAAAAATCTCCCTAAAGACTTACAGGAAAATGTTTTAGCTAAAG
AGAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAACTACTCACC
CCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTAGATTGCGTTTCAAGAGCTAGGAA
TGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAGAAAATTCTTAGCGAAGCAAGTGCTAAATTGTT
TGGAAAAAGCTGGAAATGAAGAAGAAAGAAAAGCATGTCTTAAAAATCTCCCTAAAGACTTACAGGAAAATGTTTTAGCT
AAAGAGAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAACTACT
CACCCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTAGATTGCGTTTCAAGAGCTA
GGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAGAAAATTCTTAGCGAAAGAACTCCAACAA
AAAGATAAAGCGATCAAAGATTGCTTGAAAAATGCCGATCCTAACGACAGAGCGGCTATTATGAAGTGTTTGGATGGTTT
GAGCGATGAAGAGAAGCTCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTTGCGGATTGTTTGGCTATGGCTAAAACCG
ATGAAGAAAAAAGGAAATGCCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAGCAAACAA
AATCAATTGAGCAAAACAGAAAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACCGATCAACAAGC
CATAGAGCAATGTTTAGAGGGCTTGAGCGATAGTGAAAGGGCACTAATTCTAGGAATCAAACGACAAGCTGATGAAGTGG
ATCTGATTTATAGCGAACTAAGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCATTGTTACCAATGGAT
TTCAAAAATGGTGGCGATATTGCCACTATTAACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATAATCCTATTTA
TGCTTCCATAGAGCCTGATATTACCAAGCAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAAGCTAAATTGG
CTAAGGCTTTAGGTGGTAATAAAAAAGATGACGATAAAGAAAAAAGTGAAAAATCCACAGCAAAAGCTAAAGCAGAAAGC
AATAAGATAGACAAAGATGTCGCAGAAACTGCCAAGAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAGTGG
GGATTTTGTAGATGAAAATGGTAATCCCATTGATGACAAAAAGAAAGAAGAAAAACAAGATGAAACAAGCCCTGTCAAAC
AGGCCTTTATAGGCAAGAGTGATCCCACATTTGTTTTAGCGCAATACACCCCCATTGAAATCACTCTGACTTCTAAAGTA
GATGCCACTCTCACGGGTATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGTACTATGATCTTATTAGA
CAAAGGCACTAAGGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACGCCTATTATGACACGCTTAATGATAGTCTTCA
CTAAAGCCATTACGCCTGATGGTGTGATTATACCTCTAGCAAACGCTCAAGCAGCAGGCATGCTGGGTGAAGCAGGGGTA
GATGGCTATGTGAATAATCACTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTGCAAAC
TGCGCCTATCATAGCTCTAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAATTACGCTT
TGGGTCAAGCTATCAATGGCAGCATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCC
CCAAGTTTTTACAAAAATGAGGGCGATAGTATTAAGATTCTCACAATGGACGATATTGATTTTAGCGGTGTGTATGATGT
TAAAATTACTAACAAATCTGTGGTAGATGAAATTATCAAACAAAGCACCAAAACTTTGTCTAGAGAACATGAAGAAATCA
CCACAAGCCCCAAAGGTGGCAATTAA

Protein sequence :
MNEENDKLETSKKTQQDSPQDLSNEEATEVNHFEDLLKEESSDNHLDNPTEIKTNFDGDKLEETQTQMDSGGDETSESSN
GSLADKLFKKARKLVDNKRPFTQQKNLDEETQELNEEDDQENNEYQEETQTGLIDDETSKKTQQDSPQDLSNEEATEVNR
FEDSLKEESSDQHLDNSAETQTNFDKDKSEEITNDSNDQEIIKGSKKKYIISGLVVAVLIVIILFSRSIFHYFMPLEDKS
SRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGNYEEC
LKLIKDKKLQDQMKKTLEAYKDCIKNAKTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKKAKTDEERNECLKLIND
PEIREKFRKELGLQKELQVYKDCIKNAKTEAEKNECLKGLSKEAIERLKQQVLDCLKNAKTDEERNECLKNIPQDLQKEL
LADMSVKAYKDCVSKARNEKEKKECEKLLTPEAKKKLEQQVLDCLKNAKTDEERKKCLKDLPKDLQSDILAKESVKAYKD
CVSQAKTEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEEAKKSVKAYLDCVS
QAKNEAEKKECEKLLTPEAKKLLEQQALDCLKNAKTEADKKRCVKDLPKDLQKKVLAKESVKAYLDCVSQAKNEAEKKEC
EKLLTPEARKLLEEAKESVKAYKDCVSKARNEKEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKL
LTPEARKLLEEAKESLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQQALDCLKNAKTDEERKKCLKDLPKDLQSDIL
AKESVKAYKDCVSQAKTEAEKKECEKLLTPEAKKLLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEEAKKS
VKAYLDCVSQAKNEAEKKECEKLLTPEAKKLLEQQALDCLKNAKTEADKKRCVKDLPKDLQKKVLAKESVKAYLDCVSQA
KNEAEKKECEKLLTPEARKLLEEAKESLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNE
KEKQECEKLLTPEARKFLAKQVLNCLEKAGNEEERKACLKNLPKDLQENVLAKESLKAYKDCLSQARNEEERRACEKLLT
PEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKQVLNCLEKAGNEEERKACLKNLPKDLQENVLA
KESLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKELQQ
KDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVADCLAMAKTDEEKRKCQNLYSDLIQEIQNKRTQSKQ
NQLSKTERLHQASECLDNLDDPTDQQAIEQCLEGLSDSERALILGIKRQADEVDLIYSELRNRKTFDNMAAKGYPLLPMD
FKNGGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKDKNLEAKLAKALGGNKKDDDKEKSEKSTAKAKAES
NKIDKDVAETAKNISEIALKNKKEKSGDFVDENGNPIDDKKKEEKQDETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKV
DATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGV
DGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIP
PSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
HP0527 BAD13833.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 84
HP0527 BAD13998.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 84
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 84
HP0527 BAD14052.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 84
cagY YP_005774542.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 84
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 83
HP0527 BAD14026.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 83
HP0527 BAD13779.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 83
cagY YP_005779063.1 cag island protein Virulence cag PAI Protein 0.0 83
HP0527 NP_207323.1 cag pathogenicity island protein (cag7) Virulence cag PAI Protein 0.0 83
cagY YP_005777271.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 82
cag-Y AAF80198.1 Cag-Y Virulence cag PAI Protein 0.0 81
HP0527 BAD13806.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 80
cagY YP_003728737.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 79
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 77

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
cagY YP_002301168.1 cag pathogenicity island protein Y VirB10-like protein VFG0287 Protein 0.0 83