Gene Information

Name : HPSH_04285 (HPSH_04285)
Accession : YP_001910322.1
Strain : Helicobacter pylori Shi470
Genome accession: NC_010698
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein CagY
Function : -
COG functional category : U : Intracellular trafficking, secretion and vesicular transport
COG ID : COG2948
EC number : -
Position : 832665 - 838229 bp
Length : 5565 bp
Strand : +
Note : COG2948 Type IV secretory pathway, VirB10 components

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAAC
AATAAAAGCCAATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAAATCATCATCTTGACAACTCCACAGAAA
CTAAAACCAATTTTGATGAATACAAGTCAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCT
AGCAATCTAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAGAATTT
AGATGAAGAAATCCAAGAACCGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAATTCAAATGGATTTAA
TTGATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATCTATCCAATGAAGAAACAATAAAAGCCAATCAC
TTTGAAGATTCTTCAGAAGAATCCAAAGAAAACTCAGATCATCATCTTGACAACTCCACAGAAACTAAAACCAATTTTGA
TGGAGAAAAGTCAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCTAGCAATCTAGCAGACA
AGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAGAATTTAGATGAAGAAATCCAA
GAACCGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAATTCAAATGGATTTAATTGATGATGAAACTTC
TAAAAAAACCCAACAACATTCACCCCAAGATCTATCCAATGAAGAAACAATAAAAGCCAATCACTTTGAAGATTCTTCAA
AAGAATCCAAAGAAAACTCAGATCATCATCTTGACAACTCCACAGAAACTAAAACCAATTTTGATGGAGAAAAGTCAGAA
GAAATAACTAACGATTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACATTATTGGTGGCATTGTAGTCGC
TGTTCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTATTTCATACCTTTGGAAGATAAAAGCTCTCGTTTTA
GCAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTGAAAGAACGGAATGAA
AAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTATTTGAATATTGCAGA
AATTGAGGACAAAAACCCATTGAGAGCCTTTTATGAATGTATTAGTAATGGTGGTAACTATGAAGAATGTTTGAAGCTTA
TCAAAGACAAAAAACTTCAAGATCAAATGAAAAAGACTTTAGAGGCTTATAATGACTGCATCAAAAATGCCAAAACTGAA
GAAGAAAGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACAAAAAGTTCAAGT
GGCACTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCCTAAAACTCATAAATGACCCTGAGATTA
GAGAGAAATTCCGTAAGGAATTAGAGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAAAACGCCAAAACAGAA
GCTGAGAAAAACGAATGCTTGAAAGGCTTGTCTAAGGAAGCTATAGAAAGATTGAAACAGCAAGCGCTAGATTGTTTGAA
AAACGCTAAAACCGATGAAGAACGAAAAAAGTGTTTGAAAAATATTCCCCAAGACTTGCAAAAAGAACTACTAGCTGATA
TGAGCGTCAAGGCTTACAAGGATTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACG
CCTGAAGCGAAAAAAAAGTTAGAACAACAGGTTCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAAAAAGTG
TTTGAAAGATCTCCCTAAAGACTTACAAAGCGATATTTTAGCTAAAGAGAGCGTTAAAGCTTATAAAGACTGCGTATCTC
AAGCCAGAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGAAGCC
AAAGAGAGCGTTAAAGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACT
CACGCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTACCTGGATTGCGTATCTCAAGCCAAAA
CTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGAAAGCTTTTAGAGCAACAAGCGCTAGATTGT
TTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGC
TAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAAAAATTAC
TCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCGTATCTCAAGCCAGA
ACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGAAGCTAAAGAGAG
CGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTGCTCACCCCTG
AAGCGAGAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTACCTGGATTGCGTATCTCAAGCCAAAACTGAAGCT
GAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAACAACAAGCGCTAGATTGTTTGAAAAA
CGCTAAAACCGAAGCTGAGAAAAAAAGGTGCGTCAAAGATCTTCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGA
GTCTTAAGGCTTATAAAGACTGTGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACGCCT
GAAGCGAAAAAACTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAA
AGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGAAAACTTTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTT
ATTTGGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGAAAA
CTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGAAAAGC
TTGTGAGAAATTGCTCACGCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACT
GCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTTTTAGCG
AAGCAAGCGCTAAGTTGTTTGGAAAAAGCTAGAAATGAAGAAGAAAGAAAAGCATGTCTTAAAAATATCCCTAAAGACTT
ACAGAAAAATGTTTTAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGAA
AAGCTTGTGAGAAATTGCTCACCCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAATAGCGTTAAGGCTTATTTG
GACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTTTT
AGCGAAAGAGCTCCAACAAAAAGATAAAGCGATCAAAGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCAGCTATTA
TGAAGTGTTTGGATGGTTTGAGCGATGAAGAGAAGCTCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGT
TTGAAAACGGCTAAGACCGATGAAGAAAAAAGGAAATGTCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAA
AAGGGCACAAAGCAAACAAAATCAATTGAGTAAAACAGAAAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATG
ACCCTACTGATCAAGAAGCCATAGAGCAATGTTTAGAAGGCTTGAGCGATAGTGAAAGGGCGCTAATTCTAGGAATTAAA
CGACAAGCTGATGAAGTGGATCTGATTTATAGCGATCTAAGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTA
TCCATTGTTGCCAATGGATTTCAAAAATGGTGGCGATATTGCCACTATTAACGCCACTAATGTTGATGCGGACAAAATAG
CTAGCGATAATCCTATTTATGCTTCCATAGAGCCTGACATTACTAAGCAATACGAAACAGAAAAAACCATTAAGGATAAG
AATTTAGAAGCTAAATTAGCTAAGGCTTTAGGTGGCAATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACAGC
AGAAGCTAGAGTAGAAAGCAATAAGATAGACAAAGATGTCGCAGAAACTGCCAAAAATATCAGTGAAATCGCTCTTAAGA
ACAAAAAAGAAAAGAGTGGGGAATTTGTAGATGAAAATGGTAATCCCATTGATGACAAAAAGAAAACAGAAAAACAAGAT
GAAACAAGCCCTGTCAAACAGGCCTTTATAGGCAAGAGTGATCCCACATTTGTTTTAGCGCAATACACCCCTATTGAAAT
CACTCTGACTTCTAAAGTAGATGCCACTCTCACAGGTATAGTGAGTGGGGTTGTGGCCAAAGATGTATGGAACATGAACG
GCACCATGATCTTACTAGACAAAGGCACTAAGGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACACCCATTATGACA
CGCTTAATGATAGTCTTCACTAAAGCCATTACGCCTGATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGCAT
GCTGGGTGAAGCAGGGGTAGATGGCTATGTGAATAATCACTTTATGAAGCGCATAGGCTTTGCTGTGGTAGCAAGCGTGG
TTAATAGCTTCTTGCAAACTGCGCCTATCATAGCTCTAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGGACA
CCTGAATTTAATTACGCTTTGGGTCAAGCTATCAATGGTAGTATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGG
GCAACTGATGAATATCCCCCCAAGTTTTTACAAAAATGAGGGCGATAGTATTAAAATTCTCACAATGGACGATATTGATT
TTAGTGGCGTATATGATGTTAAAATTACCAACAAATCTGTGGTAGATGAAATTATCAAACAAAGCACTAAAACTTTGTCT
AGAGAGCATGAAGAAATCACCACAAGCCCCAAAGGTGGCAATTAA

Protein sequence :
MNEENDKLETSKKTQQHSPQDLSNEETIKANHFEDSSKESKESSNHHLDNSTETKTNFDEYKSEETQTQMDSGGNETSES
SNLADKLFKKARKLVDNKRPFTQQKNLDEEIQEPNEEDDQENNGYQEEIQMDLIDDETSKKTQQHSPQDLSNEETIKANH
FEDSSEESKENSDHHLDNSTETKTNFDGEKSEETQTQMDSGGNETSESSNLADKLFKKARKLVDNKRPFTQQKNLDEEIQ
EPNEEDDQENNGYQEEIQMDLIDDETSKKTQQHSPQDLSNEETIKANHFEDSSKESKENSDHHLDNSTETKTNFDGEKSE
EITNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFIPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNE
KGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTE
EERIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLKLINDPEIREKFRKELELQKELQEYKDCIKNAKTE
AEKNECLKGLSKEAIERLKQQALDCLKNAKTDEERKKCLKNIPQDLQKELLADMSVKAYKDCVSKARNEKEKKECEKLLT
PEAKKKLEQQVLDCLKNAKTDEERKKCLKDLPKDLQSDILAKESVKAYKDCVSQARTEAEKKECEKLLTPEAKKLLEEEA
KESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEARKLLEQQALDC
LKNAKTEAEKKRCVKDLPKDLQKKVLAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEEAKESLKAYKDCVSQAR
TEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSRARNEKEKQECEKLLTPEARKKLEEAKKSVKAYLDCVSQAKTEA
EKKECEKLLTPEAKKLLEQQALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESLKAYKDCVSRARNEKEKKECEKLLTP
EAKKLLEEAKESLKAYKDCVSRARNEKEKKECEKLLTPEARKLLEEAKKSVKAYLDCVSRARNEKEKKECEKLLTPEARK
LLEEAKESLKAYKDCLSQARNEEERKACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLA
KQALSCLEKARNEEERKACLKNIPKDLQKNVLAKESLKAYKDCLSQARNEEERKACEKLLTPEARKLLEQEVKNSVKAYL
DCVSRARNEKEKQECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDC
LKTAKTDEEKRKCQNLYSDLIQEIQNKRAQSKQNQLSKTERLHQASECLDNLDDPTDQEAIEQCLEGLSDSERALILGIK
RQADEVDLIYSDLRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKDK
NLEAKLAKALGGNKKDDDKEKSKKSTAEARVESNKIDKDVAETAKNISEIALKNKKEKSGEFVDENGNPIDDKKKTEKQD
ETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMT
RLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVVASVVNSFLQTAPIIALDKLIGLGKGRSERT
PEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLS
REHEEITTSPKGGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
HP0527 BAD14052.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 93
cag-Y AAF80198.1 Cag-Y Virulence cag PAI Protein 0.0 93
HP0527 BAD13833.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 93
cagY YP_005777271.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 93
cagY AGC69788.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 93
cagY AGC69785.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 93
cagY AGC69789.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 93
cagY YP_005774542.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 92
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 92
HP0527 BAD13998.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 92
cagY AGC69792.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 91
HP0527 NP_207323.1 cag pathogenicity island protein (cag7) Virulence cag PAI Protein 0.0 90
HP0527 BAD14026.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 90
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 88
HP0527 BAD13915.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 87
HP0527 BAD13860.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 87
cagY YP_005775730.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 87
HP0527 BAD13779.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 85
cagY YP_005779063.1 cag island protein Virulence cag PAI Protein 0.0 85
cagY AGC69786.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 85
cagY AGC69787.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 85
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 82
HP0527 BAD13806.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 82
HP0527 BAD13943.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 78
cagY YP_003728737.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 76

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
HPSH_04285 YP_001910322.1 cag pathogenicity island protein CagY VFG0287 Protein 0.0 90