Gene Information

Name : HPAG1_0502 (HPAG1_0502)
Accession : YP_627243.1
Strain : Helicobacter pylori HPAG1
Genome accession: NC_008086
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein Y
Function : -
COG functional category : U : Intracellular trafficking, secretion and vesicular transport
COG ID : COG2948
EC number : -
Position : 520227 - 524114 bp
Length : 3888 bp
Strand : -
Note : -

DNA sequence :
TTGAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAAAGAGCGTTAAGGCTTATTTGGATTGCGTTTCAAA
AGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGAAAACTTTTAGAAGAAGCCAAAG
AAAGTCTTAAAGCTTATAAAGACTGCGTATCAAAAGCTAGGAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTGCTCACG
CCTGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAAGCTTATTTAGATTGCGTATCTCAAGCCAAAAC
TGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTA
AGGCTTACTTAGATTGCGTATCTCAAGCCAAAAACGAAGCTGAGAAAAAAGAATGCGAGAAACTACTCACCCCTGAAGCG
AAAAAACTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGATAAAAAAAGGTGTGTCAAAGA
TCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCCTGAAAGCTTATAAAGACTGCGTATCAAAAGCTAGGA
ATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAGAGAGC
GTTAAAGCTTATTTAGATTGCGTATCTCAAGCCAAAAATGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGA
AGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAAGCTTACTTAGATTGCGTATCTCAAGCCAAAAACGAAGCTG
AGAAAAAAGAATGCGAGAAACTACTCACCCCTGAAGCGAAAAAACTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAAC
GCTAAAACCGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAAAGAG
CGTTAAGGCTTATTTGGATTGCGTTTCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACCCCTG
AAGCGAGAAAACTTTTAGAAGAAGCCAAAGAAAGTCTTAAAGCTTATAAAGACTGCGTATCAAAAGCTAGGAATGAAGAA
GAAAGGAGAGCTTGCGAGAAATTGCTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAAGC
TTATTTAGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAA
AAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTACTTAGATTGCGTATCTCAAGCCAAAACCGAAGCTGATAAAAAA
GAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAACTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAAC
CGAAGCTGATAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAAAGAGCGTTAAGG
CTTATTTGGATTGCGTTTCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGA
AAACTTTTAGAAGAAGCCAAAGAGAGCCTGAAAGCTTATAAAGACTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAA
AGAATGCGAGAAATTGCTCACGCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGG
ACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGGAAATTTTTA
GCGAAGCAAGTGCTAAATTGTTTGGAAAAAGCTGGAAATGAAGAAGAAAGAAAAGCATGTCTTAAAAATCTCCCTAAAGA
CTTACAGGAAAATGTTTTAGCTAAAGAGAGCCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAA
GGAGAGCTTGCGAGAAACTACTCACCCCTGAAGCGAGAAAACTTTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTAT
TTGGATTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAGGAAATT
TTTAGCGAAAGAACTCCAACAAAAAGATAAAGCGATCAAAGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCGGCTA
TCATGAAATGTTTGGATGGTTTGAGCGATGAAGAGAAGCTCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGAT
TGTTTGAAAACGGCTAGGACCGATGAAGAAAAAAGGAAATGCCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAA
TAAAAGGACACAAAACAAACAAAATCAATTGAGTAAAACAGAAAGATTGCATCAAGCAAGCGAGTGCTTGGATAATTTAG
ATGACCCTACTGATCAAGAGGCCATAGAGCAATGTTTAGAAGGCTTGAGCGATAGTGAAAGGGCACTAATTCTAGGAATC
AAACGACAAGCTGATGAAGTGGATCTGATTTATAGCGAACTAAGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAGG
TTATCCATTGTTACCAATGGATTTCAAAAATGGCGGCGATATTGCCACTATTAACGCCACTAATGTTGATGCGGACAAAA
TAGCTAGCGATAATCCTATTTATGCTTCCATAGAGCCTGATATTACCAAGCAATACGAAACAGAAAAAACCATTAAGGAT
AAGAATTTAGAAGCTAAATTAGCTAAGGCTTTAGGTAGCAATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCAC
AGCAGAATCTAAAGTAGAAAGCAATAAGATAGACAAAGATGTCGCAGAAACTGCTAAGAATATCAGTGAAATCGCTCTTA
AGAACAAAAAAGAAAAGAATGGGGAATTTGTAGATGAAAATGGTAATCCCATTGATGACAAAAAGAAAGCAGAAAAACAA
GATGAAACAAGCCCTGTCAAACAGGCCTTTATAGGCAAGAGTGATCCCACATTTGTTTTAGCGCAATACACCCCTATTGA
AATCACTCTGACTTCTAAAGTAGATGCCACTCTCACAGGTATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAATATGA
ATGGCACTATGATCTTACTAGATAAAGGCACTAAGGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACACCCATTATG
ACACGCTTAATGATAGTCTTTACTAAAGCCATTACGCCTGATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGG
CATGTTGGGTGAAGCAGGGGTAGATGGCTATGTGAATAACCACTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCG
TGGTTAATAGCTTCTTGCAAACTGCGCCTATCATAGCTCTAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGG
ACACCTGAATTTAATTACGCTTTGGGTCAAGCTATCAATGGTAGTATGCAAAGTTCAGCGCAGATGTCTAATCAAATTCT
AGGACAACTGATGAATATCCCCCCAAGTTTTTACAAAAATGAGGGCGATAGTATTAAGATTCTCACAATGGACGATATTG
ATTTTAGCGGCGTGTATGATGTTAAAATTACCAACAAATCTGTGGTAGATGAAATCATCAAACAAAGCACCAAAACTTTG
TCTAGAGAACATGAAGAAATCACCACAAGCCCCAAAGGTGGCAATTGA

Protein sequence :
MKDLPKDLQKKVLAKKSVKAYLDCVSKARNEKEKKECEKLLTPEARKLLEEAKESLKAYKDCVSKARNEEERRACEKLLT
PEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKNEAEKKECEKLLTPEA
KKLLEQQALDCLKNAKTEADKKRCVKDLPKDLQKKVLAKESLKAYKDCVSKARNEKEKKECEKLLTPEAKKLLEEEAKES
VKAYLDCVSQAKNEAEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKNEAEKKECEKLLTPEAKKLLEQQALDCLKN
AKTDEERKKCLKDLPKDLQKKVLAKKSVKAYLDCVSKARNEKEKKECEKLLTPEARKLLEEAKESLKAYKDCVSKARNEE
ERRACEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKTEADKK
ECEKLLTPEAKKLLEQQALDCLKNAKTEADKKRCVKDLPKDLQKKVLAKKSVKAYLDCVSKARNEKEKKECEKLLTPEAR
KLLEEAKESLKAYKDCVSKARNEKEKKECEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKKECEKLLTPEARKFL
AKQVLNCLEKAGNEEERKACLKNLPKDLQENVLAKESLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAY
LDCVSKARNEKEKKECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLD
CLKTARTDEEKRKCQNLYSDLIQEIQNKRTQNKQNQLSKTERLHQASECLDNLDDPTDQEAIEQCLEGLSDSERALILGI
KRQADEVDLIYSELRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKD
KNLEAKLAKALGSNKKDDDKEKSKKSTAESKVESNKIDKDVAETAKNISEIALKNKKEKNGEFVDENGNPIDDKKKAEKQ
DETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIM
TRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSER
TPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTL
SREHEEITTSPKGGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
HP0527 BAD13998.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 86
cagY YP_003728737.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 86
HP0527 BAD14026.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 85
HP0527 NP_207323.1 cag pathogenicity island protein (cag7) Virulence cag PAI Protein 0.0 85
HP0527 BAD14052.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 85
cagY YP_005777271.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 84
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 84
HP0527 BAD13833.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 84
cagY YP_005774542.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 84
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 83
HP0527 BAD13779.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 83
cagY YP_005779063.1 cag island protein Virulence cag PAI Protein 0.0 83
HP0527 BAD13806.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 82
cag-Y AAF80198.1 Cag-Y Virulence cag PAI Protein 0.0 79

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
HPAG1_0502 YP_627243.1 cag pathogenicity island protein Y VFG0287 Protein 0.0 85