Gene Information

Name : HPGAM_02695 (HPGAM_02695)
Accession : YP_005780262.1
Strain : Helicobacter pylori Gambia94/24
Genome accession: NC_017371
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein Y VirB10-like protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 537102 - 542918 bp
Length : 5817 bp
Strand : -
Note : COG2948 Type IV secretory pathway, VirB10 components

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAGCCCAACAAGATTCACCCCAAGATCTATCTAATGAAGAAGC
AACAGAAGCCAATCATTTTGAAGATCTTTTAAAAGAAGAAAGCTCAGACAATCATCTTGACAACCCCACAGAAACTAAAA
CCCATTTTGATGAAGACAAGCTAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCTAGCAAT
GGCAGTCTGGCAGACAAGTTATTCAAGAAAGCCAGAAAATTAGTTGATGATAAAAGACCTTTCACTCAGCAAAAGAGTTT
AGATGAAGAAGCCCAAAAACTGAACGAAGAAGATGATCAAGAAAATAATGAGCATCAAGAAGAAACTCAAACGGACTTGA
TTGATGGTGAAACTTCTGAAAAAGCCCAACAAGATTCACCCCAAGATCTATCTAATGAAGAAGCAACAGAAGCCAATCAT
TTTGAAGATCTTTTAAAAGAAGAAAGCTCAGACAATCATCTTGACAACCCCACAGAAAGTTCAGACAATCATCTTGACAA
TTCCGCAGAAACTAAAACCCAAGAAACTAAAACCCATTTTGATGAAGACAAGCTAGAAGAAATAACTGACGACTCTAACG
ATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACATTATTGGCGGCATTGTAGTCGCTGTTCTTATCGTGATTATTTTA
TTTTCTAGAAGCATTTTTCACTACTTTGTACCTTTGGAAGATAAAAGCTCTCGTTTTAGCAAAGACAGGAATCTTTATGT
CAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTGAAAGAACGGAATGAAAAAGGCAATATGATCGATAAGA
ATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTATTTGAATATTGCAGAAATTGAGGACAAAAACCCATTG
AGAGCCTTTTATGAGTGTATTAGTAATGGTGGCAACTATGAAGAATGTTTGAAGCTTATCAAAGACAAAAAACTTCAAGA
TCAAATGAAAAAGACTCTAGAGGCTTATAACGACTGCATCAAAAATGCCAAAACTGAAGAAGAAAGGATCAAGTGTTTAG
ATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACAAAAAGTTCAAGTGGCTCTAGATTGTTTGAAAAAC
GCTAAAACCGATGAAGAACGGAAAGAGTGCCTAAAACTCATAAATGACCCTGAGATTAGAGAGAAATTCCGCAAGGAATT
AGAGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAAAACGCCAAAACAGAAGCTGAGAAAAACGAATGCTTGA
AAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAACAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAA
CGAAACGAGTGCTTGAAAAATATTCCCCAAGACTTGCAAAAAGAACTACTAGCTGATATGAGCGTCAAGGCTTACAAGGA
CTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAAAAGTTAG
AACAACAGGTTCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGAC
TTACAAAGCGATATTTTAGCTAAAGAGAGCCTGAAAGCTTATAAAGACTGCGCATCTCAAGCCAAAACTGAAGCCGAAAA
AAAAGAATGCGAGAAATTACTCACGCCCGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATT
TGGATTGCGTATCTCAGGCCAAAACTGAAGCTGAGAAACAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAAAAG
TTAGAAGAAGCTAAAAAAAGCATTAGGGTTTATTTGGATTGCGTATCAAAAGCCAAAAACGAAGCCGAAAAAAAAGAATG
CGAGAAATTACTCACGCCCGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCG
TATCTCAGGCCAAAACTGAAGCTGAGAAACAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAAAAGTTAGAAGAA
GCTAAAAAAAGCATTAGGGTTTATTTGGATTGCGTATCAAAAGCCAAAAACGAAGCCGAAAAAAAAGAATGCGAGAAATT
ACTCACGCCCGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAGG
CCAAAACTGAAGCTGAGAAACAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAA
AGCATTAGGGTTTATTTGGATTGCGTATCTCAGGCCAAAACTGAAGCTGAGAAACAAGAATGCGAGAAATTGCTCACGCC
TGAAGCGAAAAAACTATTAGAAGAATCTAAAAAAAGCGTTAAGGCTTATTTGGATTGCGTATCAAAAGCCAAAAACGAAG
CTGAAAGAAAAGAGTGTGAGAAATTGCTCACGCCTGAAGCGAGAAAACTTTTAGAAGAAGCTAAAGAGAGCGTTAAGGCT
TACAAGGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAGAAA
ACTTTTAGAAGAATCTAAAAAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAAACGAAGCTGAAAGAAAAG
AATGCGAGAAATTGCTCACCCCTGAAGCGAGGAAACTATTAGAAGAAGCTAAAGAGAGTGTTAAGGCTTACAAGGACTGC
GTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAAAAGTTAGAACA
ACAGGTTCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTAC
AGAAAAAGGTTTTAGCCAAAGAGAGTGTTAGGGTTTATTTGGATTGCGTATCAAAAGCCAAAAACGAAGCTGAAAGAAAA
GAATGCGAGAAGTTACTCACGCCTGAAGCGAGAAAACTTTTAGAAGAAGCTAAAGAGAGTGTTAAGGCTTACAAGGACTG
CGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAGAAAACTATTAGAGC
AAGAAGTTAAAAAGAGCGTTAAGGCTTACAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAG
AAATTACTCACGCCTGAAGCTAGGAAACTTTTAGAGAATCAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGA
GAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTACAGAAAAAGGTTTTAGCCAAAGAGAGTGTTAGGGTTTATTTGG
ATTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGTGAGAAATTACTCACCCCCGAAGCGAGAAAACTATTA
GAAGAAGCCAAAGAGAGTGTTAGGGTTTATTTGGATTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGTGA
GAAATTACTCACCCCCGAAGCGAGAAAACTATTAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTATCTC
AAGCCAAAACTGAAGCTGAGAAACAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAGGAAACTTTTAGAAGAATCTAAA
AAAAGCGTTAAGGCTTATTTGGATTGCGTATCAAAAGCCAAAAACGAAGCTGAAAGAAAAGAATGTGAGAAATTACTCAC
TCCCGAAGCGAGAAAACTATTAGAAGAAGCTAAAGAGAGTGTTAAAGCCTATAAAGACTGCCTCTCTCAAGCTAGAAATG
AAACTGAAAGGAGAGCCTGCGAGAAATTACTCACCCCTGAAGCGAGGAAACTCTTAGAGCAAGAAGTTAAAAAAAGCGTT
AAGGCTTATTTAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATACCTCACGCCCGAAGC
TAGGAAATTTTTAGAGAAACAGCGCCAACAAAAAGATAAAGCGATAAAGGATTGCTTGAAAAACGCCGATCCTAACGACA
GAGCGGCTATCATGAAGTGTTTGGATGGTTTGAGCGATGAAGAGAAGCTCAAATACCTGCAAGAAGCCAGAGAAAAGGCT
GTCTTGGATTGTTTGAAAACGGCTAGGACCGATGAAGAAAAAAGGAAATGCCAAAACCTTTATAGCGATTTGATCCAAGA
AATCCAAAATAAAAAGGCACAGAACAAACAAAATCAATTGAGTAAAACAGAAAGGTTGCATCAAGCAAGCGAGTGCTTGG
ATAACTTAGATGACCCTACTGATCAAGAAGCCATAGAGCAATGTTTAGAGGGCTTGAGCGATAGCGAAAGGGCGCTAATT
CTAGGAATCAAACGACAAGCTGATGAAGTGGATCGGATTTATAGCGATCTAAGAAGCCGCAAAACCTTTGATAACATGGC
GGCTAAAGGTTATCCATTGTTACCAATGGATTTCAAAAATGGTGGCGATATTGCTACTATTAATGCCACTAATGTTGATG
CGGACAAAATAGCTAGCGATAATCCTATTTATGCTTCCATAGAGCCTGATATTACTAAGCAATACGAAACAGAGAAAACC
ATTAAGGATAAGAGTTTAGAAGCTAAATTAGCTAAGGCTTTAGGTGGCGATAAGAAAGATGACGATAAAGAAAAAAGTAA
AAAACCCACAGCAGAAACTAAAGCAGAAAGCAATAAGATAGACAAAGATGTCGCAGAAACTGCCAAAAATATCAGCGAAA
TCGCTCTTAAGAACAAAAAAGAAAATAATGGGGGATTTGTAGATGAAAATGGTAATCCCGTTGATGATAAAAAGAAAGAA
GAAAAACAAGATGAAACAAGCCCTGTCAAACAGGCCTTTATAGGCAAGAGTGATCCCACATTTGTTTTAGCGCAATACAC
TCCCATTGAAATCACTCTGACTTCTAAAGTAGATGCCACTCTCACGGGTATAGTGAGTGGGGTTGTAGCCAAAGATGTAT
GGAACATGAACGGCACTATGATCTTATTAGACAAAGGCACTAAGGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACG
CCTATTATGACTCGTTTAATGATAGTCTTTACTAAAGCCATTACGCCTGATGGGGTGATTATACCTCTAGCAAACGCTCA
AGCAGCAGGCATGTTGGGTGAAGCAGGGGTAGATGGCTATGTGAATAATCACTTCATGAAGCGCATAGGCTTTGCTGTGA
TAGCAAGCGTGGTTAATAGCTTCTTGCAAACCGCGCCTATCATAGCCCTAGATAAACTCATAGGCCTTGGCAAAGGCAGA
AGTGAAAGGACACCTGAATTTAATTACGCTTTGGGTCAAGCTATCAATGGCAGTATGCAAAGTTCAGCTCAGATGTCTAA
TCAAATTCTAGGGCAACTGATGAATATCCCCCCAAGTTTTTACAAAAATGAGGGCGATAGCATTAAGATTCTCACCATGG
ACGATATTGATTTTAGCGGCGTGTATGATGTTAAAATTACCAACAAATCTGTGGTAGATGAAATTATCAAACAAAGCACC
AAAACTTTGTCTAGAGAACATGAAGAAATCACCACAAGCCCCAAAGGTGGCAATTGA

Protein sequence :
MNEENDKLETSKKAQQDSPQDLSNEEATEANHFEDLLKEESSDNHLDNPTETKTHFDEDKLEETQTQMDSGGNETSESSN
GSLADKLFKKARKLVDDKRPFTQQKSLDEEAQKLNEEDDQENNEHQEETQTDLIDGETSEKAQQDSPQDLSNEEATEANH
FEDLLKEESSDNHLDNPTESSDNHLDNSAETKTQETKTHFDEDKLEEITDDSNDQEIIKGSKKKYIIGGIVVAVLIVIIL
FSRSIFHYFVPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPL
RAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKN
AKTDEERKECLKLINDPEIREKFRKELELQKELQEYKDCIKNAKTEAEKNECLKGLSKEAIERLKQQALDCLKNAKTDEE
RNECLKNIPQDLQKELLADMSVKAYKDCVSRARNEKEKKECEKLLTPEAKKKLEQQVLDCLKNAKTDEERKKCLKDLPKD
LQSDILAKESLKAYKDCASQAKTEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKQECEKLLTPEAKKK
LEEAKKSIRVYLDCVSKAKNEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKQECEKLLTPEAKKKLEE
AKKSIRVYLDCVSKAKNEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKQECEKLLTPEAKKKLEEAKK
SIRVYLDCVSQAKTEAEKQECEKLLTPEAKKLLEESKKSVKAYLDCVSKAKNEAERKECEKLLTPEARKLLEEAKESVKA
YKDCVSRARNEKEKKECEKLLTPEARKLLEESKKSVKAYLDCVSQAKNEAERKECEKLLTPEARKLLEEAKESVKAYKDC
VSRARNEKEKKECEKLLTPEAKKKLEQQVLDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVRVYLDCVSKAKNEAERK
ECEKLLTPEARKLLEEAKESVKAYKDCVSRARNEKEKKECEKLLTPEARKLLEQEVKKSVKAYKDCVSRARNEKEKKECE
KLLTPEARKLLENQALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVRVYLDCVSRARNEKEKKECEKLLTPEARKLL
EEAKESVRVYLDCVSRARNEKEKKECEKLLTPEARKLLEEAKESVKAYLDCVSQAKTEAEKQECEKLLTPEARKLLEESK
KSVKAYLDCVSKAKNEAERKECEKLLTPEARKLLEEAKESVKAYKDCLSQARNETERRACEKLLTPEARKLLEQEVKKSV
KAYLDCVSRARNEKEKQECEKYLTPEARKFLEKQRQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKA
VLDCLKTARTDEEKRKCQNLYSDLIQEIQNKKAQNKQNQLSKTERLHQASECLDNLDDPTDQEAIEQCLEGLSDSERALI
LGIKRQADEVDRIYSDLRSRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKT
IKDKSLEAKLAKALGGDKKDDDKEKSKKPTAETKAESNKIDKDVAETAKNISEIALKNKKENNGGFVDENGNPVDDKKKE
EKQDETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGT
PIMTRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGR
SERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQST
KTLSREHEEITTSPKGGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 81
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 78
HP0527 BAD13860.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 76
cagY YP_005775730.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 76
HP0527 BAD13943.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 76
cagY AGC69787.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 75
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 73