Gene Information

Name : HPPN135_02585 (HPPN135_02585)
Accession : YP_005789734.1
Strain : Helicobacter pylori Puno135
Genome accession: NC_017379
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 511766 - 517369 bp
Length : 5604 bp
Strand : -
Note : Cag7; COG2948 Type IV secretory pathway, VirB10 components

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAAC
AATAAAAGCCAATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACTCCACAGAAA
CTAAAACCAATTTTGATGAATACAAGTCAGAAGAAACCCAAACCCAAATGGATTTTGGAGGTAATGAAACTTCAGAATCT
AGCAATCTAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAGAATTT
AGATGAAGAAATCCAAGAACCGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAACAATAAAAGCCAATC
ACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCTTGACAACTCCACAGAAACTAAAACCAATTTTGAT
GGAGAAAAGTCAGAAGAAACCCAAACCCAAATGGATTCTGGAGATAATGAAACTTCAGAATCTAGCAATCTAGCAGACAA
GTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAGAATTTAGATGAAGAAATCCAAG
AACCGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAATTCAAATGGACTTAATTGATGATGAAACTTCT
AAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAACAATAAAAGCCAATCACTTTGAAGATTCTTCAAA
AGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACCCCACAGAAACTAAAACCAATTTTGATGAATACAAGTCAGAAG
AAATAACTAACGATTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACATTATTGGTGGCATTGTAGTCGCT
GTTCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTATTTCATACCTTTGGAAGATAAAAGCTCTCGTTTTAG
CAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTGAAAGAACGGAATGAAA
AAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTATTTGAATATTGCAGAA
ATTGAGGACAAAAACCCATTGAGGGCCTTTTATGAATGTATTAGTAATGGTGGCAACTATGAAGAATGTTTGAAGCTTAT
CAAAGACAAAAAACTTCAAGATCAAATGAAAAAGACTTTAGAGGCTTATAATGACTGCATCAAAAATGCCAAAACTGAAG
AAGAAAGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACAAAAAGTTCAAGTG
GCGCTAGATTGTTTGAAAAACGCTAAGACCGATGAAGAACGAAACGAGTGCCTAAAACTCATAAATGACCCTGAGATTAG
AGAGAAATTCCGTAAGGAATTAGAGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAAAACGCCAAAACAGAAG
CTGAGAAAAACAAATGCTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAACAGCAAGCGCTAGATTGTTTGAAA
AACGCTAAAACTGATGAAGAACGAAACGAGTGCTTGAAAAATATTCCCCAAGACTTGCAAAAAGAACTACTAGCTGATAT
GAGCGTCAAGGCTTACAAGGATTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACGC
CTGAAGCGAGAAAAAAGTTAGAACAACAGGTTCTAGATTGTTTGAAAAACGCTAAAACTGATGAAGAACGAAAAAAGTGT
TTGAAAGATCTCCCTAAAGACTTACAAAGCGATATTTTAGCTAAAGAGAGCGTTAAAGCTTATAAAGACTGCGTATCTCA
AGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGAAGCCA
AAGAGAGCGTTAAAGCTTACCTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGCTTGCGAGAAATTACTC
ACGCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAAC
TGAAGCTGAGAAAAAAGCTTGTGAGAAATTGCTCACGCCTGAAGCGAAAAAACTTTTAGAGCAACAAGCGCTAGATTGTT
TGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCC
AAAGAGAGCGTTAAGGCTTATTTGGACTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACT
CACGCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGA
ATGAAAAAGAGAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAGAGC
GTTAAAGCTTATAAAGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGA
AGCGAAAAAACTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAG
AGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCT
TATCTGGATTGCGTATCTCAAGCCAGAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAGAAA
AAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAAAAAAAAG
AATGCGAGAAATTACTCACGCCTGAAGCGAGAAAACTTTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGAC
TGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAAAAAACTTTTAGA
GCAACAAGCACTAGATTGTTTGAAAAACGCTAAAACCGAATCTGAGAGAAAAAGGTGTGTCAAAGATCTCCCTAAAGACT
TGCAGAAAAAGGTTTTAGCTAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAA
AAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAACTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGA
CTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGAAAAGCTTGTGAGAAATTGCTCACGCCTGAAGCGAGAAAACTCTTAG
AGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGC
GAGAAATTGCTCACGCCTGAAGCGAGAAAATTTTTAGCGAAGCAAGTGCTAAGTTGTTTGGAAAAAGCTAGAAATGAAGA
AGAAAGAAAAGCATGTCTTAAAGATATCCCTAAAGACTTACAGAAAAATGTTTTAGCTAAAGAGAGTCTTAAAGCTTATA
AAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGAAAAGCTTGTGAGAAATTACTCACCCCTGAAGCGAGAAAACTC
TTAGAGCAAGAAGTTAAAAAAAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGA
ATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTTTTAGCGAAAGAACTCCAACAAAAAGATAAAGCGATCAAAGATT
GCTTGAAAAACGCCGATCCTAACGACAGAGCAGCTATTATGAAGTGTTTGGATGGTTTGAGCGATGAAGAGAAGCTCAAA
TACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAGGACCGATGAAGAAAAAAGGAAATGTCA
AAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAGCAAACAAAATCAATTGAGTAAAACAGAAA
GATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGATCAAGAAGCCATAGAGCAATGTTTAGAAGGC
TTGAGCGATAGTGAAAGGGCGCTAATTCTAGGAATTAAACGACAAGCTGATGAAGTGGATCTGATTTATAGCGATCTAAG
AAACCGCAAAACCTTTGACAACATGGCGGCTAAAGGTTATCCATTGTTGCCAATGGATTTCAAAAATGGCGGCGATATTG
CCACTATTAACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATAATCCTATTTATGCTTCCATAGAGCCTGACATT
ACTAAGCAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAAGCTAAATTAGCTAAGGCTTTAGGTGGCAATAA
AAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACAGCAGAAGCTAGAGTAGAAAGCAATAAGATAGACAAAGATGTCG
CAGAAACTGCCAAAAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAGTGGGGAATTTGTAGATGAAAATGGT
AATCCCATTGACGACAAAAAGAAAACAGAAAAACAAGATGAAACAAGCCCTGTCAAACAGGCCTTTATAGGCAAGAGTGA
TCCCACATTTGTTTTAGCGCAATACACCCCTATTGAAATCACTCTGACTTCTAAAGTAGATGCCACTCTCACAGGTATAG
TGAGTGGGGTTGTGGCCAAAGATGTATGGAACATGAACGGCACTATGATCTTACTAGACAAAGGCACTAAGGTGTATGGG
AATTATCAAAGCGTGAAAGGTGGCACACCCATTATGACACGCTTAATGATAGTCTTTACTAAAGCCATTACGCCTGATGG
TGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGCATGCTGGGTGAAGCAGGGGTAGATGGCTATGTGAATAATCACT
TTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTGCAAACTGCGCCTATCATAGCTCTAGAT
AAACTCATAGGACTTGGCAAAGGCAGAAGTGAAAGAACGCCTGAATTTAATTACGCTTTGGGTCAAGCTATCAATGGTAG
TATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCCCCAAGTTTTTACAAAAATGAGG
GCGATAGTATTAAAATTCTCACAATGGACGATATTGATTTTAGTGGCGTGTATGATGTTAAAATTACCAACAAATCTGTG
GTAGATGAAATCATCAAACAAAGCACTAAAACTTTGTCTAGAGAGCATGAAGAAATCACCACAAGCCCCAAAGGTGGCAA
TTAA

Protein sequence :
MNEENDKLETSKKTQQHSPQDLSNEETIKANHFEDSSKESKESSDHHLDNSTETKTNFDEYKSEETQTQMDFGGNETSES
SNLADKLFKKARKLVDNKRPFTQQKNLDEEIQEPNEEDDQENNGYQEETIKANHFEDSSKESKESSDHLDNSTETKTNFD
GEKSEETQTQMDSGDNETSESSNLADKLFKKARKLVDNKRPFTQQKNLDEEIQEPNEEDDQENNGYQEEIQMDLIDDETS
KKTQQHSPQDLSNEETIKANHFEDSSKESKESSDHHLDNPTETKTNFDEYKSEEITNDSNDQEIIKGSKKKYIIGGIVVA
VLIVIILFSRSIFHYFIPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIAE
IEDKNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSLLNQQKVQV
ALDCLKNAKTDEERNECLKLINDPEIREKFRKELELQKELQEYKDCIKNAKTEAEKNKCLKGLSKEAIERLKQQALDCLK
NAKTDEERNECLKNIPQDLQKELLADMSVKAYKDCVSKARNEKEKKECEKLLTPEARKKLEQQVLDCLKNAKTDEERKKC
LKDLPKDLQSDILAKESVKAYKDCVSQAKTEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKACEKLL
TPEAKKKLEEAKKSVKAYLDCVSQAKTEAEKKACEKLLTPEAKKLLEQQALDCLKNAKTEAEKKRCVKDLPKDLQKKVLA
KESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEEAKESLKAYKDCVSRARNEKEKQECEKLLTPEARKLLEQEVKKS
VKAYKDCVSQAKTEAEKKECEKLLTPEAKKLLEEAKESLKAYKDCVSRARNEKEKKECEKLLTPEAKKLLEEEAKESVKA
YLDCVSQARTEAEKKECEKLLTPEARKKLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEARKLLEQEVKKSVKAYLD
CVSRARNEKEKKECEKLLTPEAKKLLEQQALDCLKNAKTESERKRCVKDLPKDLQKKVLAKESVKAYLDCVSRARNEKEK
KECEKLLTPEARKLLEEAKESLKAYKDCLSQARNEEERKACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQEC
EKLLTPEARKFLAKQVLSCLEKARNEEERKACLKDIPKDLQKNVLAKESLKAYKDCLSQARNEEERKACEKLLTPEARKL
LEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLK
YLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQEIQNKRTQSKQNQLSKTERLHQASECLDNLDDPTDQEAIEQCLEG
LSDSERALILGIKRQADEVDLIYSDLRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDI
TKQYETEKTIKDKNLEAKLAKALGGNKKDDDKEKSKKSTAEARVESNKIDKDVAETAKNISEIALKNKKEKSGEFVDENG
NPIDDKKKTEKQDETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYG
NYQSVKGGTPIMTRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALD
KLIGLGKGRSERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSV
VDEIIKQSTKTLSREHEEITTSPKGGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
HP0527 BAD13998.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 96
HP0527 BAD13833.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 93
cagY YP_005777271.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 93
HP0527 BAD14052.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 92
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 92
cagY YP_005774542.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 91
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 91
HP0527 BAD14026.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 91
cagY AGC69792.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 91
HP0527 NP_207323.1 cag pathogenicity island protein (cag7) Virulence cag PAI Protein 0.0 90
cagY AGC69789.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 89
cag-Y AAF80198.1 Cag-Y Virulence cag PAI Protein 0.0 87
HP0527 BAD13915.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 85
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 83

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
HPPN135_02585 YP_005789734.1 cag pathogenicity island protein VFG0287 Protein 0.0 90