Gene Information

Name : HPPN120_02565 (HPPN120_02565)
Accession : YP_005788189.1
Strain : Helicobacter pylori Puno120
Genome accession: NC_017378
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein (cag7)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 508876 - 514551 bp
Length : 5676 bp
Strand : -
Note : COG2948 Type IV secretory pathway, VirB10 components

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAAC
AATAAAAGCCAATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCTTGACAACTCCACAGAAACTA
AAACCAATTTTGATGGAGAAAAGTCAGAAGAAACCCAAACCCAAATGGATTCTGGAGATAATGAAACTTCAGAATCTAGC
AATCTAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAGAATTTAGA
TGAAGAAATCCAAGAACCGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAATTCAAATGGACTTAATTG
ATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAACAATAAAAGCCAATCACTTT
GAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCTTGACAACTCCACAGAAACTAAAACCAATTTTGATGGAGA
AAAGTCAGAAGAAACCCAAACCCAAATGGATTCTGGAGATAATGAAACTTCAGAATCTAGCAATCTAGCAGACAAGTTAT
TCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAGAATTTAGATGAAGAAATCCAAGAACCG
AACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAATTCAAATGGACTTAATTGATGATGAAACTTCTAAAAA
AACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAACAATAAAAGCCAATCACTTTGAAGATTCTTCAAAAGAAT
CCAAAGAAAGCTCAGATCATCTTGACAACTCCACAGAAACTAAAACCAATTTTGATGGAGAAAAGTCAGAAGAAATAACT
AACGATTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACATTATTGGTGGCATTGTAGTCGCTGTTCTTAT
TGTGATTATTTTATTTTCTAGAAGCATTTTTCACTATTTCATACCTTTGGAAGATAAAAGCTCTCGTTTTAGCAAAGATA
GGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTGAAAGAACGGAATGAAAAAGGCAAT
ATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTATTTGAATATTGCAGAAATTGAGGA
CAAAAACCCATTGAGAGCCTTTTATGAATGTATTAGTAATGGTGGCAACTATGAAGAATGTTTGAAGCTTATCAAAGACA
AAAAACTTCAAGATCAAATGAAAAAGACTTTAGAGGCTTATAATGACTGCATCAAAAATGCCAAAACTGAAGAAGAAAGG
ATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACAAAAAGTTCAAGTGGCGCTAGA
TTGTTTGAAAAACGCTAAGACCGATGAAGAACGAAACGAGTGCCTAAAACTCATAAATGACCCTGAGATTAGAGAGAAAT
TCCGTAAGGAATTAGAGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAAAACGCCAAAACAGAAGCTGAGAAA
AACAAATGCTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAACAGCAAGCGCTAGATTGTTTGAAAAACGCTAA
AACCGATGAAGAACGAAACGAGTGCTTGAAAAATATTCCCCAAGACTTGCAAAAAGAACTACTGGCTGATATGAGCGTCA
AGGCTTACAAGGATTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCG
AGAAAAAAGTTAGAACAACAGGTTCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAAAAAGTGTTTGAAAGA
TCTCCCCAAAGACTTACAAAGCGATATTTTAGCTAAAGAGAGCGTTAAAGCTTATAGAGACTGCGTATCTCAAGCCAAAA
CTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAGAGAGC
GTTAAGGCTTACCTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGA
AGCGAAAAAGAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTG
AGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCAAAAAAACTCTTAGAGCAACAAGCACTAGATTGTTTGAAAAAC
GCTAAAACCGATGAAGAACGAAAAAAGTGCTTGAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAAAG
CGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTG
AAGCGAAAAAACTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAA
GAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAGCTTTTAGAGCAAGAAGTTAAGAAGAGCGTTAAAGC
TTACCTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAAAA
AACTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGAAAA
GCTTGTGAGAAATTGCTCACCCCTGAAGCGAGAAAACTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGA
TTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGAAAAAAGTTAG
AAGAAGCTAAAAAAAGCGTTAAAGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAG
AAATTACTCACCCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACTGCGTTTC
AAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGAAAACTCTTAGAGCAACAAG
CGCTAGATTGTTTGAAAAGTGCTAAAACCGAAGCTGAGAGAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAA
AAGGTTTTAGCTAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATG
CGAGAAATTGCTCACACCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCCTCT
CTCAAGCTAGAAATGAAGAAGAAAGAAAAGCTTGTGAGAAATTGCTCACCCCTGAAGCGAGAAAACTCTTAGAGCAAGAA
GTTAAGAATAGCGTTAAGGCTTATTTAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATT
GCTCACCCCTGAAGCGAGAAAATTTTTAGCGAAGCAAGTGCTAAGTTGTTTGGAAAAAGCTAGAAATGAAGAAGAAAGAA
AAGCATGTCTTAAAAATATCCCTAAAGACTTACAGAAAGATGTTTTAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGC
CTCTCTCAAGCTAGAAATGAAGAAGAAAGAAAAGCTTGTGAGAAATTGCTCACCCCTGAAGCGAGAAAACTCTTAGAGCA
AGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGA
AATTGCTCACCCCTGAAGCGAGAAAATTTTTAGCGAAAGAACTCCAACAAAAAGATAAAGCGATCAAAGATTGCTTGAAA
AACGCCGATCCTAACGACAGAGTAGCTATTATGAAGTGTTTGGATGGTTTGAGCGATGAAGAGAAGCTCAAATACCTGCA
AGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAGGACCGATGAAGAAAAAAGGAAATGTCAAAACCTTT
ATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAGCAAACAAAATCAATTGAGTAAAACAGAAAGATTGCAT
CAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGATCAAGAAGCCATAGAGCAATGTTTAGAAGGCTTGAGCGA
TAGTGAAAGGGCGCTAATTCTAGGAATTAAACGACAAGCTGATGAAGTGGATCTGATTTATAGCGATCTAAGAAACCGCA
AAACCTTTGATAACATGGCGGCTAAAGGTTATCCATTGTTGCCAATGGATTTCAAAAATGGCGGCGATATTGCCACTATT
AACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATAATCCTATTTATGCTTCCATAGAGCCTGACATTACTAAGCA
ATACGAAACAGAGAAAACCATTAAGGATAAGAGTTTAGAAGCTAAATTAGCTAAGGCTTTAGGTGGCGATAAAAAAGATG
ACGATAAAGAAAAAAGTAAAAAACCCACAGCAGAAGCTAAAGCAGAAAGCAATAAGATAGACAAAGATGTCGCAGAAACT
GCCAAGAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAGTGGGGACTTTGTAGATGAAAATGGTAATCCCAT
TGACGATAAAAAGAAAACAGAAAAACAAGATGAAACAAGCCCTGTCAAACAGGCCTTTATAGGCAAGAGTGATCCCACAT
TTGTTTTAGCGCAATACACCCCTATTGAAATCACTCTGACTTCTAAAGTAGATGCCACTCTCACAGGTATAGTGAGTGGG
GTTGTGGCCAAAGATGTATGGAACATGAACGGCACTATGATCTTACTAGACAAAGGCACTAAGGTGTATGGGAATTATCA
AAGCGTGAAAGGTGGCACACCCATTATGACACGCTTAATGATAGTCTTTACTAAAGCCATTACGCCTGATGGGGTGATAA
TACCTCTAGCAAACGCTCAAGCAGCAGGCATGCTGGGTGAAGCAGGGGTAGATGGCTATGTGAATAATCACTTCATGAAG
CGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTGCAAACTGCGCCTATCATAGCTCTAGATAAACTCAT
AGGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAATTACGCTTTGGGTCAAGCTATCAATGGTAGCATGCAAA
GTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCCCCAAGTTTTTACAAAAACGAGGGCGATAGT
ATTAAGATTCTCACAATGGACGATATTGATTTTAGTGGCGTGTATGATGTTAAAATTACCAACAAATCTGTGGTAGATGA
AATTATCAAACAAAGCACCAAAACTTTGTCTAGAGAACATGAAGAAATCACCACAAGCCCCAAAGGTGGCAATTAA

Protein sequence :
MNEENDKLETSKKTQQHSPQDLSNEETIKANHFEDSSKESKESSDHLDNSTETKTNFDGEKSEETQTQMDSGDNETSESS
NLADKLFKKARKLVDNKRPFTQQKNLDEEIQEPNEEDDQENNGYQEEIQMDLIDDETSKKTQQHSPQDLSNEETIKANHF
EDSSKESKESSDHLDNSTETKTNFDGEKSEETQTQMDSGDNETSESSNLADKLFKKARKLVDNKRPFTQQKNLDEEIQEP
NEEDDQENNGYQEEIQMDLIDDETSKKTQQHSPQDLSNEETIKANHFEDSSKESKESSDHLDNSTETKTNFDGEKSEEIT
NDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFIPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGN
MIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEER
IKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLKLINDPEIREKFRKELELQKELQEYKDCIKNAKTEAEK
NKCLKGLSKEAIERLKQQALDCLKNAKTDEERNECLKNIPQDLQKELLADMSVKAYKDCVSKARNEKEKKECEKLLTPEA
RKKLEQQVLDCLKNAKTDEERKKCLKDLPKDLQSDILAKESVKAYRDCVSQAKTEAEKKECEKLLTPEAKKLLEEEAKES
VKAYLDCVSQAKTEAEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEQQALDCLKN
AKTDEERKKCLKDLPKDLQKKVLAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEEAKESLKAYKDCVSRARNEK
EKKECEKLLTPEARKLLEQEVKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEEAKESLKAYKDCLSQARNEEERK
ACEKLLTPEARKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEARKKLEEAKKSVKAYLDCVSQAKTEAEKKECE
KLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKKECEKLLTPEARKLLEQQALDCLKSAKTEAERKRCVKDLPKDLQK
KVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCLSQARNEEERKACEKLLTPEARKLLEQE
VKNSVKAYLDCVSRARNEKEKKECEKLLTPEARKFLAKQVLSCLEKARNEEERKACLKNIPKDLQKDVLAKESLKAYKDC
LSQARNEEERKACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKKECEKLLTPEARKFLAKELQQKDKAIKDCLK
NADPNDRVAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQEIQNKRTQSKQNQLSKTERLH
QASECLDNLDDPTDQEAIEQCLEGLSDSERALILGIKRQADEVDLIYSDLRNRKTFDNMAAKGYPLLPMDFKNGGDIATI
NATNVDADKIASDNPIYASIEPDITKQYETEKTIKDKSLEAKLAKALGGDKKDDDKEKSKKPTAEAKAESNKIDKDVAET
AKNISEIALKNKKEKSGDFVDENGNPIDDKKKTEKQDETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSG
VVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMK
RIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDS
IKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
HP0527 BAD13998.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 96
HP0527 BAD13833.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 94
HP0527 BAD14052.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 93
cagY YP_005777271.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 93
cagY AGC69789.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 93
cagY AGC69792.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 93
cagY YP_005774542.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 92
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 92
HP0527 NP_207323.1 cag pathogenicity island protein (cag7) Virulence cag PAI Protein 0.0 91
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 91
HP0527 BAD14026.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 91
cagY AGC69786.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 91
HP0527 BAD13915.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 88
cagY AGC69787.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 87
HP0527 BAD13860.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 85
cagY YP_005775730.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 85
HP0527 BAD13806.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 84
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 83
HP0527 BAD13779.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 83
cagY YP_005779063.1 cag island protein Virulence cag PAI Protein 0.0 83

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
HPPN120_02565 YP_005788189.1 cag pathogenicity island protein (cag7) VFG0287 Protein 0.0 91