Gene Information

Name : HPGAM_02695 (HPGAM_02695)
Accession : YP_005780262.1
Strain : Helicobacter pylori Gambia94/24
Genome accession: NC_017371
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein Y VirB10-like protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 537102 - 542918 bp
Length : 5817 bp
Strand : -
Note : COG2948 Type IV secretory pathway, VirB10 components

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAGCCCAACAAGATTCACCCCAAGATCTATCTAATGAAGAAGC
AACAGAAGCCAATCATTTTGAAGATCTTTTAAAAGAAGAAAGCTCAGACAATCATCTTGACAACCCCACAGAAACTAAAA
CCCATTTTGATGAAGACAAGCTAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCTAGCAAT
GGCAGTCTGGCAGACAAGTTATTCAAGAAAGCCAGAAAATTAGTTGATGATAAAAGACCTTTCACTCAGCAAAAGAGTTT
AGATGAAGAAGCCCAAAAACTGAACGAAGAAGATGATCAAGAAAATAATGAGCATCAAGAAGAAACTCAAACGGACTTGA
TTGATGGTGAAACTTCTGAAAAAGCCCAACAAGATTCACCCCAAGATCTATCTAATGAAGAAGCAACAGAAGCCAATCAT
TTTGAAGATCTTTTAAAAGAAGAAAGCTCAGACAATCATCTTGACAACCCCACAGAAAGTTCAGACAATCATCTTGACAA
TTCCGCAGAAACTAAAACCCAAGAAACTAAAACCCATTTTGATGAAGACAAGCTAGAAGAAATAACTGACGACTCTAACG
ATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACATTATTGGCGGCATTGTAGTCGCTGTTCTTATCGTGATTATTTTA
TTTTCTAGAAGCATTTTTCACTACTTTGTACCTTTGGAAGATAAAAGCTCTCGTTTTAGCAAAGACAGGAATCTTTATGT
CAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTGAAAGAACGGAATGAAAAAGGCAATATGATCGATAAGA
ATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTATTTGAATATTGCAGAAATTGAGGACAAAAACCCATTG
AGAGCCTTTTATGAGTGTATTAGTAATGGTGGCAACTATGAAGAATGTTTGAAGCTTATCAAAGACAAAAAACTTCAAGA
TCAAATGAAAAAGACTCTAGAGGCTTATAACGACTGCATCAAAAATGCCAAAACTGAAGAAGAAAGGATCAAGTGTTTAG
ATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACAAAAAGTTCAAGTGGCTCTAGATTGTTTGAAAAAC
GCTAAAACCGATGAAGAACGGAAAGAGTGCCTAAAACTCATAAATGACCCTGAGATTAGAGAGAAATTCCGCAAGGAATT
AGAGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAAAACGCCAAAACAGAAGCTGAGAAAAACGAATGCTTGA
AAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAACAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAA
CGAAACGAGTGCTTGAAAAATATTCCCCAAGACTTGCAAAAAGAACTACTAGCTGATATGAGCGTCAAGGCTTACAAGGA
CTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAAAAGTTAG
AACAACAGGTTCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGAC
TTACAAAGCGATATTTTAGCTAAAGAGAGCCTGAAAGCTTATAAAGACTGCGCATCTCAAGCCAAAACTGAAGCCGAAAA
AAAAGAATGCGAGAAATTACTCACGCCCGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATT
TGGATTGCGTATCTCAGGCCAAAACTGAAGCTGAGAAACAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAAAAG
TTAGAAGAAGCTAAAAAAAGCATTAGGGTTTATTTGGATTGCGTATCAAAAGCCAAAAACGAAGCCGAAAAAAAAGAATG
CGAGAAATTACTCACGCCCGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCG
TATCTCAGGCCAAAACTGAAGCTGAGAAACAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAAAAGTTAGAAGAA
GCTAAAAAAAGCATTAGGGTTTATTTGGATTGCGTATCAAAAGCCAAAAACGAAGCCGAAAAAAAAGAATGCGAGAAATT
ACTCACGCCCGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAGG
CCAAAACTGAAGCTGAGAAACAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAA
AGCATTAGGGTTTATTTGGATTGCGTATCTCAGGCCAAAACTGAAGCTGAGAAACAAGAATGCGAGAAATTGCTCACGCC
TGAAGCGAAAAAACTATTAGAAGAATCTAAAAAAAGCGTTAAGGCTTATTTGGATTGCGTATCAAAAGCCAAAAACGAAG
CTGAAAGAAAAGAGTGTGAGAAATTGCTCACGCCTGAAGCGAGAAAACTTTTAGAAGAAGCTAAAGAGAGCGTTAAGGCT
TACAAGGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAGAAA
ACTTTTAGAAGAATCTAAAAAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAAACGAAGCTGAAAGAAAAG
AATGCGAGAAATTGCTCACCCCTGAAGCGAGGAAACTATTAGAAGAAGCTAAAGAGAGTGTTAAGGCTTACAAGGACTGC
GTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAAAAGTTAGAACA
ACAGGTTCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTAC
AGAAAAAGGTTTTAGCCAAAGAGAGTGTTAGGGTTTATTTGGATTGCGTATCAAAAGCCAAAAACGAAGCTGAAAGAAAA
GAATGCGAGAAGTTACTCACGCCTGAAGCGAGAAAACTTTTAGAAGAAGCTAAAGAGAGTGTTAAGGCTTACAAGGACTG
CGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAGAAAACTATTAGAGC
AAGAAGTTAAAAAGAGCGTTAAGGCTTACAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAG
AAATTACTCACGCCTGAAGCTAGGAAACTTTTAGAGAATCAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGA
GAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTACAGAAAAAGGTTTTAGCCAAAGAGAGTGTTAGGGTTTATTTGG
ATTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGTGAGAAATTACTCACCCCCGAAGCGAGAAAACTATTA
GAAGAAGCCAAAGAGAGTGTTAGGGTTTATTTGGATTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGTGA
GAAATTACTCACCCCCGAAGCGAGAAAACTATTAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTATCTC
AAGCCAAAACTGAAGCTGAGAAACAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAGGAAACTTTTAGAAGAATCTAAA
AAAAGCGTTAAGGCTTATTTGGATTGCGTATCAAAAGCCAAAAACGAAGCTGAAAGAAAAGAATGTGAGAAATTACTCAC
TCCCGAAGCGAGAAAACTATTAGAAGAAGCTAAAGAGAGTGTTAAAGCCTATAAAGACTGCCTCTCTCAAGCTAGAAATG
AAACTGAAAGGAGAGCCTGCGAGAAATTACTCACCCCTGAAGCGAGGAAACTCTTAGAGCAAGAAGTTAAAAAAAGCGTT
AAGGCTTATTTAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATACCTCACGCCCGAAGC
TAGGAAATTTTTAGAGAAACAGCGCCAACAAAAAGATAAAGCGATAAAGGATTGCTTGAAAAACGCCGATCCTAACGACA
GAGCGGCTATCATGAAGTGTTTGGATGGTTTGAGCGATGAAGAGAAGCTCAAATACCTGCAAGAAGCCAGAGAAAAGGCT
GTCTTGGATTGTTTGAAAACGGCTAGGACCGATGAAGAAAAAAGGAAATGCCAAAACCTTTATAGCGATTTGATCCAAGA
AATCCAAAATAAAAAGGCACAGAACAAACAAAATCAATTGAGTAAAACAGAAAGGTTGCATCAAGCAAGCGAGTGCTTGG
ATAACTTAGATGACCCTACTGATCAAGAAGCCATAGAGCAATGTTTAGAGGGCTTGAGCGATAGCGAAAGGGCGCTAATT
CTAGGAATCAAACGACAAGCTGATGAAGTGGATCGGATTTATAGCGATCTAAGAAGCCGCAAAACCTTTGATAACATGGC
GGCTAAAGGTTATCCATTGTTACCAATGGATTTCAAAAATGGTGGCGATATTGCTACTATTAATGCCACTAATGTTGATG
CGGACAAAATAGCTAGCGATAATCCTATTTATGCTTCCATAGAGCCTGATATTACTAAGCAATACGAAACAGAGAAAACC
ATTAAGGATAAGAGTTTAGAAGCTAAATTAGCTAAGGCTTTAGGTGGCGATAAGAAAGATGACGATAAAGAAAAAAGTAA
AAAACCCACAGCAGAAACTAAAGCAGAAAGCAATAAGATAGACAAAGATGTCGCAGAAACTGCCAAAAATATCAGCGAAA
TCGCTCTTAAGAACAAAAAAGAAAATAATGGGGGATTTGTAGATGAAAATGGTAATCCCGTTGATGATAAAAAGAAAGAA
GAAAAACAAGATGAAACAAGCCCTGTCAAACAGGCCTTTATAGGCAAGAGTGATCCCACATTTGTTTTAGCGCAATACAC
TCCCATTGAAATCACTCTGACTTCTAAAGTAGATGCCACTCTCACGGGTATAGTGAGTGGGGTTGTAGCCAAAGATGTAT
GGAACATGAACGGCACTATGATCTTATTAGACAAAGGCACTAAGGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACG
CCTATTATGACTCGTTTAATGATAGTCTTTACTAAAGCCATTACGCCTGATGGGGTGATTATACCTCTAGCAAACGCTCA
AGCAGCAGGCATGTTGGGTGAAGCAGGGGTAGATGGCTATGTGAATAATCACTTCATGAAGCGCATAGGCTTTGCTGTGA
TAGCAAGCGTGGTTAATAGCTTCTTGCAAACCGCGCCTATCATAGCCCTAGATAAACTCATAGGCCTTGGCAAAGGCAGA
AGTGAAAGGACACCTGAATTTAATTACGCTTTGGGTCAAGCTATCAATGGCAGTATGCAAAGTTCAGCTCAGATGTCTAA
TCAAATTCTAGGGCAACTGATGAATATCCCCCCAAGTTTTTACAAAAATGAGGGCGATAGCATTAAGATTCTCACCATGG
ACGATATTGATTTTAGCGGCGTGTATGATGTTAAAATTACCAACAAATCTGTGGTAGATGAAATTATCAAACAAAGCACC
AAAACTTTGTCTAGAGAACATGAAGAAATCACCACAAGCCCCAAAGGTGGCAATTGA

Protein sequence :
MNEENDKLETSKKAQQDSPQDLSNEEATEANHFEDLLKEESSDNHLDNPTETKTHFDEDKLEETQTQMDSGGNETSESSN
GSLADKLFKKARKLVDDKRPFTQQKSLDEEAQKLNEEDDQENNEHQEETQTDLIDGETSEKAQQDSPQDLSNEEATEANH
FEDLLKEESSDNHLDNPTESSDNHLDNSAETKTQETKTHFDEDKLEEITDDSNDQEIIKGSKKKYIIGGIVVAVLIVIIL
FSRSIFHYFVPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPL
RAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKN
AKTDEERKECLKLINDPEIREKFRKELELQKELQEYKDCIKNAKTEAEKNECLKGLSKEAIERLKQQALDCLKNAKTDEE
RNECLKNIPQDLQKELLADMSVKAYKDCVSRARNEKEKKECEKLLTPEAKKKLEQQVLDCLKNAKTDEERKKCLKDLPKD
LQSDILAKESLKAYKDCASQAKTEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKQECEKLLTPEAKKK
LEEAKKSIRVYLDCVSKAKNEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKQECEKLLTPEAKKKLEE
AKKSIRVYLDCVSKAKNEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKQECEKLLTPEAKKKLEEAKK
SIRVYLDCVSQAKTEAEKQECEKLLTPEAKKLLEESKKSVKAYLDCVSKAKNEAERKECEKLLTPEARKLLEEAKESVKA
YKDCVSRARNEKEKKECEKLLTPEARKLLEESKKSVKAYLDCVSQAKNEAERKECEKLLTPEARKLLEEAKESVKAYKDC
VSRARNEKEKKECEKLLTPEAKKKLEQQVLDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVRVYLDCVSKAKNEAERK
ECEKLLTPEARKLLEEAKESVKAYKDCVSRARNEKEKKECEKLLTPEARKLLEQEVKKSVKAYKDCVSRARNEKEKKECE
KLLTPEARKLLENQALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVRVYLDCVSRARNEKEKKECEKLLTPEARKLL
EEAKESVRVYLDCVSRARNEKEKKECEKLLTPEARKLLEEAKESVKAYLDCVSQAKTEAEKQECEKLLTPEARKLLEESK
KSVKAYLDCVSKAKNEAERKECEKLLTPEARKLLEEAKESVKAYKDCLSQARNETERRACEKLLTPEARKLLEQEVKKSV
KAYLDCVSRARNEKEKQECEKYLTPEARKFLEKQRQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKA
VLDCLKTARTDEEKRKCQNLYSDLIQEIQNKKAQNKQNQLSKTERLHQASECLDNLDDPTDQEAIEQCLEGLSDSERALI
LGIKRQADEVDRIYSDLRSRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKT
IKDKSLEAKLAKALGGDKKDDDKEKSKKPTAETKAESNKIDKDVAETAKNISEIALKNKKENNGGFVDENGNPVDDKKKE
EKQDETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGT
PIMTRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGR
SERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQST
KTLSREHEEITTSPKGGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 81
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 78
cagY YP_005775730.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 76
HP0527 BAD13860.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 76
HP0527 BAD13943.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 76
cagY AGC69787.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 75
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 73