Gene Information

Name : HPV225_0522 (HPV225_0522)
Accession : YP_005763455.1
Strain : Helicobacter pylori v225d
Genome accession: NC_017355
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 499882 - 505188 bp
Length : 5307 bp
Strand : -
Note : -

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAAC
AATAAAAGCCAATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCACCATCTTGACAACTCCACAGAGA
CTAAAACCAATTTTGATGAATACAAATCAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCT
AGCAATAGCAGTCTAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAA
GAATTTAGATGAAGAAATCCAAGAACCGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAATGG
ACTTAATTGATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAACAATAAAAGCC
AATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACTCCACAGAAACTAAAACCAA
TTTTGATGGAGAAAAGTCAGAAGAAATAACTAACAATTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACA
TTATTGGTGGCATTGTAGTCGCTGTTCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTATTTCATACCTTTG
GAAGATAAAAGCTCTCGTTTTAGCAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCG
ATTGCTGAAAGAACGGAATGAAAAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTAT
ACAACTATTTGAATATTGCAGAAATTGAGGACAAAAACCCATTGAGGGCCTTTTATGAATGTATTAGTAATGGTGGCAAC
TATGAAGAATGTTTGAAGCTTATCAAAGACAAAAAACTTCAAGATCAAATGAAAAAGACTTTAGAGGCTTATAATGACTG
CATCAAAAATGCCAAAACTGAAGAAGAAAGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTAC
TGAACCAACAAAAAGTTCAAGTGGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCCTAAAA
CTCATAAATGACCCTGAGATTAGAGAGAAATTCCGTAAGGAATTAGAGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTG
TATCAAAAACGCCAAAACAGAAGCTGAGAAAAACAAATGCTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAAC
AGCAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCTTGAAAAATATTCCCCAAGACTTG
CAAAAAGAACTACTAGCTGATATGAGCGTCAAGGCTTACAAGGATTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAA
AGAATGTGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAA
CCGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTACAAAGTGATATTTTAGCCAAAGAGAGCGTTAAA
GCTTATAAAGACTGCGTATCTCAAGCCAAAACTGAAGCCGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAA
AAAGCTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTACCTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGA
AAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTACCTG
GATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTT
AGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAAAAAGTGTTTGAAAAATCTCCCTAAAG
ACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAG
AAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTGGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAA
AGACTGCGTTTCAAGATCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTT
TGGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTACCTGGATTGCGTATCTCAAGCCAGAACTGAAGCTGAGAAAAAAGAA
TGCGAGAAATTACTCACGCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTACCTGGACTGCGT
ATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAG
AAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAA
TTGCTCACCCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTACCTGGATTGCGTATCTCAAGC
CAGAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGA
AGAGCGTTAAGGCTTATTTGGATTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACG
CCTGAAGCGAAAAAACTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAGGCTGAGAAAAAAAGGTG
TGTCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAA
GAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAA
GAGAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGAAAAGCTTGTGAGAAATTACTCAC
CCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGA
ATGAAAAAGAGAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTTTTAGCGAAGCAAGTGCTAAGTTGT
TTGGAAAAAGCTAGAAATGAAGAAGAAAGAAAAGCATGTCTTAAAAATATCCCTAAAGACTTACAGAAAAATGTTTTAGC
TAAAGAGAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGAAAAGCTTGTGAGAAATTAC
TCACCCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCT
AGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGAAAATTTTTAGCGAAAGAGCTCCAACA
AAAAGATAAAGCGATCAAAGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCAGCTATTATGAAGTGTTTGGATGGTT
TGAGCGATGAAGAGAAGCTCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAGGACC
GATGAAGAAAAAAGGAAATGTCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAGCAAACA
AAATCAATTGAGTAAAACAGAAAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGATCAAGAAG
CCATAGAGAAATGTTTAGAAGGCTTGAGCGATAGTGAAAGGGCACTAATTCTAGGAATTAAACGACAAGCTGATGAAGTG
GATCTGATTTATAGCGATCTAAGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCATTGTTGCCAATGGA
TTTCAAAAATGGTGGCGATATTGCCACTATTAACGCCACCAATGTTGATACGGACAAAATAGCTAGCGATAATCCTATTT
ATGCTTCCATAGAGCCTGACATTACTAAGCAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAAGCTAAATTA
GCTAAGGCTTTAGGTGGCAATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACAGCAGAAGCTAGAGTAGAAAG
CAATAAGATAGACAAAGATGTCGCAGAAACTGCCAAAAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAGTG
GGGAATTTGTAGATGAAAATGGTAATCCCATTGATGACAAAAAGAAAACAGAAAAACAAGATGAAACAAGCCCTGTCAAA
CAGGCCTTTATAGGCAAGAGTGATCCCACATTTGTTTTAGCGCAATACACCCCTATTGAAATCACTCTGACTTCTAAAGT
AGATGCCACTCTCACAGGTATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCACTATGATCTTACTAG
ACAAAGGCACTAAGGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACACCCATTATGACACGCTTAATGATAGTCTTT
ACTAAAGCCATTACGCCTGATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGCATGTTGGGTGAAGCAGGGGT
AGATGGCTATGTGAATAATCACTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTGCAAA
CTGCGCCTATCATAGCTCTAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAATTACGCT
TTGGGTCAAGCTATCAATGGTAGTATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCC
CCCAAGTTTTTACAAAAACGAGGGCGATAGTATTAAAATTCTCACAATGGACGATATTGATTTTAGTGGCGTATATGATG
TTAAAATTACCAACAAATCTGTGGTAGATGAAATTATCAAACAAAGCACTAAAACTTTGTCTAGAGAGCATGAAGAAATC
ACCACAAGCCCCAAAGGTGGCAATTAA

Protein sequence :
MNEENDKLETSKKTQQHSPQDLSNEETIKANHFEDSSKESKESSDHHLDNSTETKTNFDEYKSEETQTQMDSGGNETSES
SNSSLADKLFKKARKLVDNKRPFTQQKNLDEEIQEPNEEDDQENNGYQEETQMDLIDDETSKKTQQHSPQDLSNEETIKA
NHFEDSSKESKESSDHHLDNSTETKTNFDGEKSEEITNNSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFIPL
EDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGN
YEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLK
LINDPEIREKFRKELELQKELQEYKDCIKNAKTEAEKNKCLKGLSKEAIERLKQQALDCLKNAKTDEERNECLKNIPQDL
QKELLADMSVKAYKDCVSKARNEKEKKECEKLLTPEAKKLLEQQALDCLKNAKTDEERKKCLKDLPKDLQSDILAKESVK
AYKDCVSQAKTEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKKLEEAKKSVKAYL
DCVSQAKTEAEKKECEKLLTPEAKKLLEQQALDCLKNAKTDEERKKCLKNLPKDLQKKVLAKESVKAYLDCVSRARNEKE
KKECEKLLTPEAKKLLEEAKESLKAYKDCVSRSRNEKEKQECEKLLTPEAKKLLEEEAKESVKAYLDCVSQARTEAEKKE
CEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSRARNEKEKKECEK
LLTPEAKKKLEEAKKSVKAYLDCVSQARTEAEKKECEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKKECEKLLT
PEAKKLLEQQALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAK
ESLKAYKDCLSQARNEEERKACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKQVLSC
LEKARNEEERKACLKNIPKDLQKNVLAKESLKAYKDCLSQARNEEERKACEKLLTPEARKLLEQEVKKSVKAYLDCVSRA
RNEKEKQECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTART
DEEKRKCQNLYSDLIQEIQNKRTQSKQNQLSKTERLHQASECLDNLDDPTDQEAIEKCLEGLSDSERALILGIKRQADEV
DLIYSDLRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDTDKIASDNPIYASIEPDITKQYETEKTIKDKNLEAKL
AKALGGNKKDDDKEKSKKSTAEARVESNKIDKDVAETAKNISEIALKNKKEKSGEFVDENGNPIDDKKKTEKQDETSPVK
QAFIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVF
TKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYA
LGQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEI
TTSPKGGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
HP0527 BAD13998.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 97
HP0527 BAD13833.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 93
HP0527 BAD14052.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 93
cagY YP_005777271.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 92
cagY YP_005774542.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 92
cagY AGC69792.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 92
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 91
HP0527 NP_207323.1 cag pathogenicity island protein (cag7) Virulence cag PAI Protein 0.0 91
HP0527 BAD14026.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 90
cagY AGC69786.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 90
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 89
HP0527 BAD13915.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 88
cagY AGC69787.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 86
cagY AGC69791.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 86
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 84
HP0527 BAD13779.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 84
cagY YP_005779063.1 cag island protein Virulence cag PAI Protein 0.0 84
HP0527 BAD13860.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 83
cagY YP_005775730.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 83
HP0527 BAD13806.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 80

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
HPV225_0522 YP_005763455.1 cag pathogenicity island protein VFG0287 Protein 0.0 91