Gene Information

Name : HPSH169_02700 (HPSH169_02700)
Accession : YP_006225732.1
Strain : Helicobacter pylori Shi169
Genome accession: NC_017740
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein (cag7)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 543831 - 549374 bp
Length : 5544 bp
Strand : -
Note : COG2948 Type IV secretory pathway, VirB10 components
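
The Position and Length fields above agree if the coordinates are read as 1-based and inclusive. The following is a minimal Python sketch of that arithmetic; the function name feature_length is hypothetical and not part of the source database, and the residue count assumes a single terminal stop codon.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(543831, 549374)

# Assumption: the ORF ends in exactly one stop codon, which is not translated.
codons = length_bp // 3
protein_len = codons - 1

print(length_bp)    # 5544
print(protein_len)  # 1847
```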

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAAC
AACAAAAGCCAATCGCTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACTCCACAGAAA
CTAAAACCAATTTTGATGAAGAAAAGTCAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCT
AGCAATCTAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAGAATTT
AGATGAAGAAATCCAAGAACCGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAATGGACTTAA
TTGATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAACAATAAAAGCCAATCAC
TTTGAAGATTCTTCAAAAGAATCCAACGAAAGCTCAGATCATCATCTTGACAACTCCACAGAAACTAAAACCAATTTTGA
TGAATACAAGTCAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCTAGCAATCTAGCAGACA
AGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAGAATTTAGATGAAGAAATCCAA
GAACCGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAATGGACTTAATTGATGATGAAACTTC
TAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAACAATAAAAGCCAATCACTTTGAAGATTCTTCAA
AAGAATCCAAAAAAAGCTCAGATCATCATCTTGACAACTACACAGAAACTAAAGCCAATTTTGATGAATACAAGTCAGAA
GAAATAACTAACGATTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACATTATTGGTGGCATTGTAGTCGC
TGTCCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTATTTCATACCTTTGGAAGATAAAAGCTCTCGTTTTA
GCAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTACTGAAAGAACGGAATGAA
AAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTATTTGAATATTGCAGA
AATTGAGGACAAAAACCCATTGAGGGCCTTTTATGAATGTATTAGTAATGGTGGCAACTATGAAGAATGTTTGAAACTTA
TCAAAGACAAAAAACTTCAAGATCAAATGAAAAAGACTTTAGAGGCTTATAATGACTGCATCAAAAATGCCAAAACTGAA
GAAGAAAGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACAAAAAGTTCAAGT
GGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCCTAAAACTCATAAATGACCCTGAGATTA
GAGAGAAATTCCGTAAGGAATTAGAGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAAAACGCCAAAACAGAA
GCTGAGAAAAACAAATGCTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAACAGCAAGCACTAGATTGTTTGAA
AAACGCTAAAACCGATGAAGAACGAAACGAGTGCTTGAAAAATATTCCCCAAGACTTGCAAAAAGAACTACTAGCTGATA
TGAGCGTCAAGGCTTACAAGGATTGCGTATCAAAAGCTAGAAATGAAGAAGAAAGAAAAGCTTGTGAGAAATTGCTCACC
CCTGAAGCGAAAAAACTTTTAGAGCGACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTG
TGTCAAAGATCTCCCTAAAGACTTACAAAGCGATATTTTAGCTAAAGAGAGCGTTAAAGCTTATAGAGACTGCGTATCTC
AAGCCAGAACTGAAGCTGAGAAAAAAGAATGTGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGAAGCC
AAAGAGAGCGTCAAAGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAACTGAGAAAAAAGAATGCGAGAAATTGCT
CACCCCTGAAGCGAGAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAAGCTTATTTGGATTGCGTATCTCAAGCCAAAA
CTGAAACTGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAACTTTTAGAGCAACAAGCGCTAGATTGT
TTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGC
CAAAGAGAGTCTTAAGGTTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGC
TCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGCCAAAGAGAGTCTTAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGG
AATGAAAAAGAGAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAACTCTTAGAGCAACAAGCGCTAGATTG
TTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAG
CCAAAGAGAGTCTTAAGGTTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTG
CTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGCCAAAGAGAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAG
AAATGAAGAAGAAAGAAAAGCTTGTGAGAAATTGCTCACCCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAGA
GCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACGCCT
GAAGCGAGAAAGCTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGT
CAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCTAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAG
CTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAGAG
AGTCTTAAAGCTTATAAAGACTGCGTATCTCAAGCTAGAAATGAAGAAGAAAGAAAAGCTTGTGAGAAATTACTCACGCC
TGAAGCGAGAAAACTTTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAATG
AAAAAGAGAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTTTTAGCGAAGCAAGCACTAAGTTGTTTG
GAAAAAGCTAGAAATGAAGAAGAAAGAAAAGCATGTCTTAAAAATATCCCTAAAGACTTACAGAAAAATGTTTTAGCTAA
AGAGAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGAAAAGCTTGTGAGAAATTGCTCA
CCCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGG
AATGAAAAAGAGAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTTTTAGCGAAAGAGCTCCAACAAAA
AGATAAAGCGATCAAAGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCAGCTATTATGAAGTGTTTGGATGGTTTGA
GCGATGAAGAGAAGCTCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAGGAGCGAT
GAAGAAAAAAGGAAATGTCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAGCAAACAAAA
TCAATTGAGTAAAACAGAAAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGATCAAGAAGCCA
TAGAGCAATGTTTAGAAGGCTTGAGCGATAGTGAAAGAGCGCTAATTCTAGGAATTAAACGACAAGCTGATGAAGTGGAT
CTGATTTATAGCGATCTAAGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCATTGTTGCCAATGGATTT
CAAAAATGGCGGCGATATTGCCACTATTAACGCCACCAATGTTGATGCGGACAAAATAGCTAGCGATAATCCTATTTATG
CTTCTATAGAGCCTGACATTACTAAGCAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAAGCTAAATTAGCT
AAGGCTTTAGGTGGCAATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACAGCAGAAGCTAGAGTAGAAAGCAA
TAAGATAGACAAAGATGTCGCAGAAACTGCCAAAAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAGTGGGG
AATTTGTAGATGAAAATGGTAATCCCATTGATGACAAAAAGAAAACAGAAAAGCAAGATGAAACAAGCCCTGTCAAACAG
GCCTTTATAGGCAAGAGTGATCCCACATTTGTTTTAGCGCAATACACCCCTATTGAAATCACTCTGACTTCTAAAGTAGA
TGCCACTCTCACAGGTATAGTGAGTGGGGTTGTGGCCAAAGATGTATGGAACATGAACGGCACTATGATCTTACTAGACA
AAGGCACTAAGGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACACCCATTATGACACGCTTAATGATAGTCTTTACT
AAAGCCATTACGCCTGATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGTATGTTGGGTGAAGCAGGGGTAGA
TGGCTATGTGAATAATCACTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTGCAAACTG
CGCCTATCATAGCTCTAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAATTACGCTTTG
GGTCAAGCTATCAATGGTAGTATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCCCC
AAGTTTTTACAAAAATGAGGGCGATAGTATTAAAATTCTCACAATGGACGATATTGATTTTAGTGGCGTATATGATGTTA
AAATTACCAACAAATCTGTGGTAGATGAAATTATCAAACAAAGCACTAAAACTTTGTCTAGAGAGCATGAAGAAATCACC
ACAAGCCCCAAAGGTGGCAATTAA

Protein sequence :
MNEENDKLETSKKTQQHSPQDLSNEETTKANRFEDSSKESKESSDHHLDNSTETKTNFDEEKSEETQTQMDSGGNETSES
SNLADKLFKKARKLVDNKRPFTQQKNLDEEIQEPNEEDDQENNGYQEETQMDLIDDETSKKTQQHSPQDLSNEETIKANH
FEDSSKESNESSDHHLDNSTETKTNFDEYKSEETQTQMDSGGNETSESSNLADKLFKKARKLVDNKRPFTQQKNLDEEIQ
EPNEEDDQENNGYQEETQMDLIDDETSKKTQQHSPQDLSNEETIKANHFEDSSKESKKSSDHHLDNYTETKANFDEYKSE
EITNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFIPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNE
KGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTE
EERIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLKLINDPEIREKFRKELELQKELQEYKDCIKNAKTE
AEKNKCLKGLSKEAIERLKQQALDCLKNAKTDEERNECLKNIPQDLQKELLADMSVKAYKDCVSKARNEEERKACEKLLT
PEAKKLLERQALDCLKNAKTEAEKKRCVKDLPKDLQSDILAKESVKAYRDCVSQARTEAEKKECEKLLTPEAKKLLEEEA
KESVKAYLDCVSQAKTETEKKECEKLLTPEARKKLEEAKKSVKAYLDCVSQAKTETEKKECEKLLTPEAKKLLEQQALDC
LKNAKTEAEKKRCVKDLPKDLQKKVLAKESLKVYLDCVSQAKTEAEKKECEKLLTPEAKKLLEEAKESLKAYKDCVSRAR
NEKEKQECEKLLTPEAKKLLEQQALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESLKVYLDCVSQAKTEAEKKECEKL
LTPEAKKLLEEAKESLKAYKDCLSQARNEEERKACEKLLTPEARKLLEQEVKKSVKAYLDCVSQAKTEAEKKECEKLLTP
EARKLLEQQALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVKAYLDCVSRARNEKEKQECEKLLTPEAKKLLEEAKE
SLKAYKDCVSQARNEEERKACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKQALSCL
EKARNEEERKACLKNIPKDLQKNVLAKESLKAYKDCLSQARNEEERKACEKLLTPEARKLLEQEVKKSVKAYLDCVSRAR
NEKEKQECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARSD
EEKRKCQNLYSDLIQEIQNKRTQSKQNQLSKTERLHQASECLDNLDDPTDQEAIEQCLEGLSDSERALILGIKRQADEVD
LIYSDLRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKDKNLEAKLA
KALGGNKKDDDKEKSKKSTAEARVESNKIDKDVAETAKNISEIALKNKKEKSGEFVDENGNPIDDKKKTEKQDETSPVKQ
AFIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFT
KAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYAL
GQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEIT
TSPKGGN
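
The protein sequence above can be spot-checked against the DNA sequence by translating codons in frame. The sketch below is illustrative only: the translate helper is hypothetical, it uses the standard codon assignments, and it assumes the DNA is listed in coding orientation (it begins with ATG and ends with TAA), so no reverse complement is applied despite the minus-strand annotation.

```python
BASES = "TCAG"
# Standard codon table, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        residue = AMINO[16 * a + 4 * b + c]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 nt and last 24 nt of the DNA sequence in this record.
print(translate("ATGAATGAAGAAAACGATAAA"))     # MNEENDK
print(translate("ACAAGCCCCAAAGGTGGCAATTAA"))  # TSPKGGN
```

The two fragments reproduce the first and last seven residues of the listed protein sequence.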

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
HP0527 | BAD13943.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 92
HP0527 | BAD14052.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 91
HP0527 | BAD13998.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 91
cagY | YP_005774542.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 90
HP0527 | BAD13833.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 90
cagY | YP_005777271.1 | cag pathogenicity island protein Y VirB10-like protein | Virulence | cag PAI | Protein | 0.0 | 90
cagY | AGC69786.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 90
HP0527 | BAD13970.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 89
HP0527 | NP_207323.1 | cag pathogenicity island protein (cag7) | Virulence | cag PAI | Protein | 0.0 | 88
HP0527 | BAD14026.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 88
cagY | AGC69792.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 88
HP0527 | BAD13779.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 88
cagY | YP_005779063.1 | cag island protein | Virulence | cag PAI | Protein | 0.0 | 88
HP0527 | BAD13888.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 87
cagY | AGC69785.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 87
cagY | AGC69788.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 87
cagY | AGC69789.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 87
cag-Y | AAF80198.1 | Cag-Y | Virulence | cag PAI | Protein | 0.0 | 86
HP0527 | BAD13806.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 86
HP0527 | BAD13915.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 84
orf13/14 | NP_223194.1 | cag island protein | Virulence | cag PAI | Protein | 0.0 | 84
HP0527 | BAD13860.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 83
cagY | YP_005775730.1 | cag pathogenicity island protein Y VirB10-like protein | Virulence | cag PAI | Protein | 0.0 | 83
cagY | AGC69787.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 83
cagY | YP_003728737.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 82

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
HPSH169_02700 | YP_006225732.1 | cag pathogenicity island protein (cag7) | VFG0287 | Protein | 0.0 | 88