Gene Information

Name : HPELS_03975 (HPELS_03975)
Accession : YP_005424915.1
Strain : Helicobacter pylori ELS37
Genome accession: NC_017063
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein (cag7)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 807016 - 812586 bp
Length : 5571 bp
Strand : -
Note : COG2948 Type IV secretory pathway, VirB10 components

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAGCCCAACAAGATTCACCCCAAGATCTATCTAATGAAGAAGC
AACAGAAGCCAATCATTTTGAAGATCCTTTAAAAGAAGAAAGTTCAGATAATCATCTTGACAATTCCACAGAAACTAAAA
CCCAAGAAACTAAAACCCATTTTGATGAAGACAAGCTAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACT
TCAGAATCTAGCAATGGCAGTCTGGCAGACAAGTTATTCAAGAAAGCCAGAAAATTAGTTGATGATAAAAGACCTTTCAC
TCAGCAAAAGAGTTTAGATGAAGAAGCCCAAAAACTGAACAAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAA
CTCAAACGGACTTAATTGATGATGAAACTTCTAAAAAAGCCCAACAAGATTCACCCCAAGATCTATCTAATGAAGAAGCA
ACAGAAGCCAATCATTTTGAAGATCTTTTAAAAGAATCCAAAGAAAGCTCAGACAATCATCTTGACAACCCCACAGAAAC
TAAAACCCATTTTGATAAAGACAAGCTAGAAGAAACCCAAACTCAAATAGATTCTGGAGGTAATGAAACTTCAGAATCTA
GCAATGGCAGTCTAGCAGATAAGTTATTCAAGAAAGCCAGAAAATTAGTTGATGATAAAAGACCTTTCACTCAGCAAAAG
AGTTTAGATGAAGAAGCCCAAAAACTGAACAAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAACGGA
CTTAATTGATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATCTATCTAATGAAGAAGCAACAGAAGTCA
ATCATTTTGAAGATTCTTTAAAAGAAGAAAGCTCAGACAATCATCTTGACAACCCCACAGAAACTAAAACCAATTTTGAT
GGAGACAAGTCAGAAGAAATAACTAACGACTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACATTATTGG
CGGCATTGTAGTCGCTGTTCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTTGTACCTTTGGAAGATA
AAAGCTCTCGTTTTAGCAAAGACAGGAATCTTTATGTTAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTG
AAAGAACGAAATGAAAAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTA
TTTGAATATTGCAGAAATTGAGGACAAAAACCCATTGAGAGCCTTTTATGAGTGTATTAGTAATGGTGGCAACTATGAAG
AATGTTTGAAGCTTATCAAAGACAAAAAACTTCAAGATCAAATGAAAAAGACTCTAGAGGCTTATAATGACTGCATCAAA
AATGCCAAAACTGAAGAAGAAAGGATTAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCA
ACAAAAAGTTCAAGTGGCTCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGGAAAGAGTGCCTAAAACTCATAA
ATGACCCTGAGATTAGAGAGAAATTCCGTAAGGAATTAGAGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAA
AACGCCAAAACAGAAGCTGAGAAAAACGAATGCTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAACAACAAGC
GCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCTTGAAAAATATTCCCCAAGACTTGCAAAAAG
AACTACTAGCTGATATGAGCGTCAAGGCTTACAAGGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAGTGT
GAGAAATTGCTCACGCCTGAAGCGAGAAAAAAGTTAGAACAACAGGTTCTAGATTGTTTGAAAAACGCTAAAACCGATGA
AGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTACAAAGCGATATTTTAGCTAAAGAGAGCGTTAAAGCTTATA
AAGATTGCGTATCTCAAGCCAAAACAGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAACTT
TTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTACTTGGATTGCGTATCAAAAGCCAAAAACGAAGCTGAGAAAAAAGA
ATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAGAGCGTTAAGGCTTACTTGGATTGCG
TATCTCAAGCTAAAACTGAAGCTGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAGCAA
CAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCA
GAAAAAGGTTTTAGCCAAAGAGAGCGTTAGGGTTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAAAGAAAAG
AATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAGCTATTAGAAGAAGCTAAAAAAAGCGTTAAAGCTTATAAAGACTGC
GTATCAAGAGCCAGGAATGAAAAAGAGAAAAAAGAATGTGAGAAATTACTCACCCCTGAAGCGAGGAAACTATTAGAAGA
AGCTAAAAAAAGCGTTAAAGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAAT
TGCTCACGCCTGAAGCGAGGAAACTATTAGAAGAAGCTAAAAAAAGCGTTAAGCTATTTGGATTGCGTATCTCAGGCAAA
CTGAGCGGAAAAAAAAGGCGAGAAATTGACTCACGCCCTGAAGCGAGGAAACTTTTAGAGAATCAAGCGCTAGATTGTTT
GAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCTA
AAGAGAGTGTTAGGGTTTATTTGGATTGCGTGTCAAAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTC
ACGCCTGAAGCGAGAAAACTTTTAGAAGAAGCTAAAGAGAGTGTTAAGGCTTACAAAGACTGCGTATCAAGAGCTAGGAA
TGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACCCCTGAAGCGAAAAAACTATTAGAGCAAGAAGTTAAAAAGAGCG
TTAAGGCTTATTTAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAA
GCTAGGAAATTTTTAGAGAATCAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAA
AGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCTAAAGAGAGTGTTAAGGCTTATTTAGACTGCGTTTCAAGAGCTA
GGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAAAAAACTATTAGAAGAAGCTAAAGAGAGC
GTTAAGGCTTATTTGGATTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAGTGTGAGAAATTGCTCACGCCTGA
AGCGAGGAAACTATTAGAAGAAGCTAAAGAGAGCCTTAAAGCCTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAACTG
AAAGGAGAGCCTGCGAGAAATTACTCACCCCTGAAGCGAGAAAACTATTAGAGCAAGAAGTTAAAAAGAGCGTCAAGGCT
TATTTAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAGGAA
ATTTTTAGAGAAAGAACTCCAACAAAAAGATAAAGCGATAAAAGATTGCTTGAAGAACGCCGATCCTAACGACAGAGCGG
CTATCATGAAGTGTTTGGATGGTTTGAGCGATGAAGAGAAGCTCAAATACCTGCAAGAAGCCAGAGAAAAGGCTGTCTTG
GATTGTTTGAAAACGGCTAGGACCGATGAAGAAAAAAGGAAATGCCAAAACCTTTATAGCGATTTGATCCAAGAAATCCA
AAATAAAAAGGCACAGAACAAACAAAATCAATTGAGTAAAACAGAAAGGTTGCATCAAGCAAGCGAGTGCTTGGATAACT
TAGATGACCCTACTGATCAAGAAGCCATAGAGCAATGTTTAGAGGGCTTGAGCGATAGCGAAAGGGCGCTAATTCTAGGA
ATCAAACGACAAGCTGATGAAGTGGATCGGATTTATAGCGATCTAAGAAGCCGCAAAACCTTTGATAACATGGCGGCTAA
AGGTTATCCATTGTTGCCAATGGATTTTAAAAATGGTGGCGATATTGCCACTATTAACGCCACTAATGTTGATGCGGACA
AAATAGCTAGCGATAATCCTATTTATGCTTCCATAGAGCCTGATATTACTAAGCAATACGAAACAGAGAAAACCATTAAG
GATAAGAGTTTAGAAGCTAAATTAGCTAAGGCTTTAGGTGGCGATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAACC
CACAACAGAAACTAAAGCAGAAAGCAATAAGATAGACAAAGATGTCGCAGAAACTGCCAAAAATATCAGCGAAATCGCTC
TTAAGAACAAAAAAGAAAAGAGTGGGGATTTTGTAGATGAAAATGGTAATCCCATTGACGATAAAAAGAAAGAAGAAAAA
CAAGATGAAACAAGCCCTGTCAAACAGGCCTTTATAGGCAAGAGTGATCCCACATTTGTTTTAGCACAATACACCCCCAT
TGAAATCACTCTGACTTCTAAAGTAGATGCCACCCTCACGGGTATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACA
TGAACGGCACTATGATCTTATTAGACAAAGGCACTAAGGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACGCCTATT
ATGACTCGCTTAATGATAGTCTTTACTAAAGCCATTACGCCTGATGGGGTGATTATACCTCTAGCAAACGCTCAAGCAGC
AGGCATGCTAGGTGAAGCAGGGGTAGATGGCTATGTGAATAATCACTTCATGAAGCGCATAGGCTTTGCTGTGATAGCAA
GCGTGGTTAATAGCTTCTTGCAAACCGCGCCTATCATAGCATTAGATAAACTCATAGGTCTTGGCAAAGGCAGAAGTGAA
AGGACACCTGAATTTAATTACGCTTTGGGTCAAGCTATCAATGGCAGTATGCAAAGTTCAGCTCAGATGTCTAATCAAAT
TCTAGGGCAACTGATGAATATCCCCCCAAGTTTTTACAAAAACGAGGGCGATAGTATTAAGATTCTCACAATGGACGATA
TTGATTTTAGTGGCGTGTATGATGTTAAAATTACCAACAAATCTGTGGTAGATGAAATCATCAAACAAAGCACCAAAACT
TTGTCTAGAGAACATGAAGAAATCACCACAAGCCCCAAAGGTGGCAATTGA

Protein sequence :
MNEENDKLETSKKAQQDSPQDLSNEEATEANHFEDPLKEESSDNHLDNSTETKTQETKTHFDEDKLEETQTQMDSGGNET
SESSNGSLADKLFKKARKLVDDKRPFTQQKSLDEEAQKLNKEDDQENNGYQEETQTDLIDDETSKKAQQDSPQDLSNEEA
TEANHFEDLLKESKESSDNHLDNPTETKTHFDKDKLEETQTQIDSGGNETSESSNGSLADKLFKKARKLVDDKRPFTQQK
SLDEEAQKLNKEDDQENNGYQEETQTDLIDDETSKKTQQHSPQDLSNEEATEVNHFEDSLKEESSDNHLDNPTETKTNFD
GDKSEEITNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFVPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLL
KERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIK
NAKTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERKECLKLINDPEIREKFRKELELQKELQEYKDCIK
NAKTEAEKNECLKGLSKEAIERLKQQALDCLKNAKTDEERNECLKNIPQDLQKELLADMSVKAYKDCVSRARNEKEKKEC
EKLLTPEARKKLEQQVLDCLKNAKTDEERKKCLKDLPKDLQSDILAKESVKAYKDCVSQAKTEAEKKECEKLLTPEAKKL
LEEEAKESVKAYLDCVSKAKNEAEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKTEAEKQECEKLLTPEAKKLLEQ
QALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVRVYLDCVSQAKTEAERKECEKLLTPEARKLLEEAKKSVKAYKDC
VSRARNEKEKKECEKLLTPEARKLLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEARKLLEEAKKSVKLFGLRISGK
LSGKKRREIDSRPEARKLLENQALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVRVYLDCVSKAKTEAEKKECEKLL
TPEARKLLEEAKESVKAYKDCVSRARNEKEKQECEKLLTPEAKKLLEQEVKKSVKAYLDCVSRARNEKEKKECEKLLTPE
ARKFLENQALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAKES
VKAYLDCVSRARNEKEKKECEKLLTPEARKLLEEAKESLKAYKDCLSQARNETERRACEKLLTPEARKLLEQEVKKSVKA
YLDCVSRARNEKEKQECEKLLTPEARKFLEKELQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVL
DCLKTARTDEEKRKCQNLYSDLIQEIQNKKAQNKQNQLSKTERLHQASECLDNLDDPTDQEAIEQCLEGLSDSERALILG
IKRQADEVDRIYSDLRSRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIK
DKSLEAKLAKALGGDKKDDDKEKSKKPTTETKAESNKIDKDVAETAKNISEIALKNKKEKSGDFVDENGNPIDDKKKEEK
QDETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPI
MTRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSE
RTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKT
LSREHEEITTSPKGGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
cag-Y AAF80198.1 Cag-Y Virulence cag PAI Protein 0.0 91
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 89
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 89
HP0527 BAD14052.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 88
HP0527 NP_207323.1 cag pathogenicity island protein (cag7) Virulence cag PAI Protein 0.0 88
HP0527 BAD13833.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 88
cagY YP_005777271.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 88
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 88
cagY AGC69792.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 88
HP0527 BAD13998.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 87
HP0527 BAD14026.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 87
cagY YP_005774542.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 86
cagY AGC69789.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 86
cagY AGC69786.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 85
HP0527 BAD13915.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 84
cagY YP_005775730.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 84
HP0527 BAD13860.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 83
cagY AGC69787.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 81
cagY AGC69785.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 80
cagY AGC69788.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 80
HP0527 BAD13806.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 79
HP0527 BAD13779.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 76
cagY YP_005779063.1 cag island protein Virulence cag PAI Protein 0.0 76

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
HPELS_03975 YP_005424915.1 cag pathogenicity island protein (cag7) VFG0287 Protein 0.0 88