Gene Information

Name : HPLT_02610 (HPLT_02610)
Accession : YP_005772666.1
Strain : Helicobacter pylori Lithuania75
Genome accession: NC_017362
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein (cag7, cagY)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 513266 - 519031 bp
Length : 5766 bp
Strand : -
Note : COG2948 Type IV secretory pathway, VirB10 components

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAACCCAACAAGATTCACCCCAAGATTTATCCAATGAAGAAGC
GACAGAAGTCAATCATTTTGAAGATCTTTTAAAAGAAGAAAGCTCAGACAATCATCTTGACAACCCCACAGAAACTAAAA
CCCATTTTGATGGAGACAAGTCAGAAGAAACCCAAACTCAAATAGATTCTGAAGGTAATGAAACTTCAGAATCTAGCAAT
GGCAGTCTGGCAGATAAGTTATTCAAGAAAGCCAGAAAATTAGTTGATGATAAAAGACCTTTCACTCAGCAAAAGAGTTT
AGATGAAGAAGCCCAAAAACTGAACGAAGAAGACGATCAAGAAAATAATGAGTATCAAGAAGAAACTCAAATAGACTTAA
TTGATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCGACAGAAGTCAATCAT
TTTGAAGATCTTTTAAAAGAAGAAAGCTCAAACAATCATCTTGACAACCCCACAGAAACTAAAACCCATTTTGATGGAGA
CAAGTCAGAAGAAACCCAAACTCAAATAGATTCTGAAGGTAATGAAACTTCAGAATCTAGCAATGGCAGTCTGGCAGATA
AGTTATTCAAGAAAGCCAGAAAATTAGTTGATGATAAAAGACCTTTCACTCAGCAAAAGAGTTTAGATGAAGAAGCCCAA
AAACTGAACGAAGAAGACGATCAAGAAAATAATGAGTATCAAGAAGATACTCAAATAGACTTAATTGATGATGAAACTTC
TAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCGACAGAAGTCAATCATTTTGAAGATCTTTTAA
AAGAAGAAAGCTCAGACAATCATCTTGACAACCCCACAGAAACTAAAACCCATTTTGATGGAGACAAGTCAGAAGAAATA
ACTAACGACTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAAAAATACATTATTGGTGGCATTGTAGTCGCTGTTCT
TATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTCATGCCTTTGGAAGATAAAAGCTCTCGTTTTAGCAAAG
ACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTGAAAGAACGGAATGAAAAAGGC
AATATGATCGATAAAAATCTTTTCTTCAATGACGATCCTAATAGAACCTTATACAACTATTTGAATATTGCAGAAATTGA
GGACAAAAACCCGTTGAGAGCCTTTTATGAGTGTATTGGTAATGGTGGCAACTATGAAGAATGTTTGAAGCTTATCAAAG
ACAAAAAACTTCAAGAGCAAATGAAAAAGACTTTAGAGGCTTATAATGACTGCATCAAAAATGCCAAAACTGAAGAAGAA
AGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACAAAAAGTTCAGGTGGCGCT
AGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCTTGAAACTCATAAATGACCCTGAGATTAGAGAGA
AATTCCGTAAGGAATTAGAGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAAAACGCCAAAACAGAGGCTGAG
AAAAATGAATGCTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAACAGCAAGCGCTAGATTGTTTGAAAAACGC
TAAAACCGATGAAGAACGAAACGAGTGCTTAAAAAATATTCCCCAAGACTTGCAAAAAGAACTACTAGCTGATATGAGCG
TCAAGGCTTATAAGGACTGCGTTTCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAA
GCGAGGAAAAAGTTAGAACAACAGGTTCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAAAAAGTGTTTGAA
AGATCTCCCTAAAGACTTACAAAGCGATATTTTAGCTAAAGAGAGTCTGAAAGCTTATAAAGACTGCGTATCAAGAGCCA
AAAACGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAGAG
AGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAACTGAGAAAAAAGAATGCGAGAAATTGCTCACCCC
TGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAG
CTGAGAAAAAAGAATGCGAAAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAGCAACAAGCGCTAGATTGTTTGAAA
AACGCTAAAACCGAAGCTGATAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGA
GAGCGTTAAAGCTTATTTGGACTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACCC
CTGAAGCGAGGAAACTCTTAGAAGAGGCTAAAGAGAGTCTGAAAGCTTATAAAGACTGCGTATCAAGAGCCAAAAACGAA
GCTGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAACTCTTAGAAGAAGAAGCCAAAGAGAGCGTTAA
TGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAACTGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGA
AAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAA
AAAGAATGCGAAAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAA
AACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGTTA
AAGCTTATTTGGACTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCG
AGGAAACTCTTAGAAGAGGCTAAAGAGAGCCTGAAAGCTTATAAAGACTGCGTATCAAGAGCCAAAAACGAAGCTGAGAA
AAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAGAGAGCGTTAAAGCTTATAAAG
ACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAAAAATTGCTCACGCCTGAAGCGAAAAAACTTTTA
GAGCAACAAGCACTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGA
CTTGCAGAAAAAGGTTTTAGCTAAAGAAAGCGTTAAGGCTTATTTGGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGA
AAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAAGAGGCTAAAGAGAGCCTTAAAGCTTATAAA
GACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTGCTCACCCCTGAAGCGAGGAAACTCTT
AGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAAT
GCGAGAAATTACTCACGCCTGAAGCGAGGAAATTTTTAGCGAAGCAAGTGCTAAATTGTTTGGAAAAAGCTGGAAATGAA
GAAGAAAGAAAAGCATGTCTTAAAAATCTCCCTAAAGACTTACAGGAAAATGTTTTAGCTAAAGAGAGCCTTAAAGCTTA
TAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAACTACTCACCCCTGAAGCGAGGAAAC
TCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACTGCGTATCAAGAGCTAGGAATGAAAAAGAAAAACAG
GAATGCGAGAAATTACTCACCCCTGAAGCGAGAAAGTTCTTAGCGAAAGAACTCCAACAAAAAGATAAAGCGATCAAAGA
TTGCTTGAAAAACGCCGATCCTAACGACAGAGCGGCTATCATGAAGTGTTTGGATGGTTTGAGCGATGAAGAGAAGCTCA
AATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAGGACCGATGAAGAAAAAAGGAAATGC
CAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAACAAACAAAATCAATTGAGTAAAACAGA
AAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACCGATCAACAAGCCATAGAGCAATGTTTAGAGG
GCTTGAGCGATAGTGAAAGGGCGCTAATTCTAGGAATTAAACGACAAGCTGATGAAGTGGATCTGATTTATAGCGAACTA
AGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCGTTGTTGCCAATGGATTTCAAAAATGGCGGCGATAT
TGCCACTATTAACGCTACTAATGTTGATGCGGACAAAATAGCTAGCGATAATCCTATTTATGCTTCCATAGAGCCTGACA
TTACCAAACAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAAGCTAAATTAGCTAAGGCTTTAGGTGGCAAT
AAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACAGCAAAAGCTAAAGCAGAAAGCAATAAGATAGACAAAGATGT
CGCAGAAACTGCTAAGAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAATGGGGAATTTGTAGATGAAAATG
GTAATCCCATTGACGATAAAAAGAAAGAAGAAAAACAAGATGAAACAAGCCCTGTCAAACAGGCCTTTATAGGCAAGAGT
GATCCCACATTTGTTTTAGCGCAATACACCCCTATTGAAATCACTCTGACTTCTAAAGTAGATGCCACTCTCACAGGTAT
AGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCACTATGATCTTACTAGACAAAGGCACTAAGGTGTATG
GGAATTATCAAAGCGTGAAAGGTGGCACACCTATTATGACACGCTTAATGATAGTTTTTACTAAAGCCATTACGCCTGAT
GGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGCATGTTGGGTGAAGCAGGGGTAGATGGCTATGTGAATAATCA
TTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTGCAAACTGCGCCTATCATAGCTCTAG
ATAAACTCATAGGCCTTGGCAAAGGCAGAAGTGAAAGGACACCTGAATTTAATTACGCTTTGGGTCAAGCTATCAATGGT
AGTATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCCCCAAGTTTTTACAAAAACGA
GGGCGATAGTATTAAGATTCTCACAATGGACGATATTGATTTTAGTGGCGTGTATGATGTTAAAATTACCAATAAATCTG
TGGTAGATGAAATTATCAAACAAAGCACCAAAACTTTGTCTAGAGAACATGAAGAGATCACCACAAGCCCCAAAGGTGGC
AATTAA

Protein sequence :
MNEENDKLETSKKTQQDSPQDLSNEEATEVNHFEDLLKEESSDNHLDNPTETKTHFDGDKSEETQTQIDSEGNETSESSN
GSLADKLFKKARKLVDDKRPFTQQKSLDEEAQKLNEEDDQENNEYQEETQIDLIDDETSKKTQQHSPQDLSNEEATEVNH
FEDLLKEESSNNHLDNPTETKTHFDGDKSEETQTQIDSEGNETSESSNGSLADKLFKKARKLVDDKRPFTQQKSLDEEAQ
KLNEEDDQENNEYQEDTQIDLIDDETSKKTQQHSPQDLSNEEATEVNHFEDLLKEESSDNHLDNPTETKTHFDGDKSEEI
TNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFMPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKG
NMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECIGNGGNYEECLKLIKDKKLQEQMKKTLEAYNDCIKNAKTEEE
RIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLKLINDPEIREKFRKELELQKELQEYKDCIKNAKTEAE
KNECLKGLSKEAIERLKQQALDCLKNAKTDEERNECLKNIPQDLQKELLADMSVKAYKDCVSKARNEKEKKECEKLLTPE
ARKKLEQQVLDCLKNAKTDEERKKCLKDLPKDLQSDILAKESLKAYKDCVSRAKNEAEKKECEKLLTPEAKKLLEEEAKE
SVKAYLDCVSQAKTETEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEQQALDCLK
NAKTEADKKRCVKDLPKDLQKKVLAKESVKAYLDCVSQAKTEAEKKECEKLLTPEARKLLEEAKESLKAYKDCVSRAKNE
AEKKECEKLLTPEAKKLLEEEAKESVNAYLDCVSQAKTETEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKTEAEK
KECEKLLTPEAKKLLEQQALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVKAYLDCVSQAKTEAEKKECEKLLTPEA
RKLLEEAKESLKAYKDCVSRAKNEAEKKECEKLLTPEAKKLLEEAKESVKAYKDCVSRARNEKEKKECEKLLTPEAKKLL
EQQALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYK
DCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKKECEKLLTPEARKFLAKQVLNCLEKAGNE
EERKACLKNLPKDLQENVLAKESLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQ
ECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKC
QNLYSDLIQEIQNKRTQNKQNQLSKTERLHQASECLDNLDDPTDQQAIEQCLEGLSDSERALILGIKRQADEVDLIYSEL
RNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKDKNLEAKLAKALGGN
KKDDDKEKSKKSTAKAKAESNKIDKDVAETAKNISEIALKNKKEKNGEFVDENGNPIDDKKKEEKQDETSPVKQAFIGKS
DPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITPD
GVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYALGQAING
SMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGG
N

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
HP0527 NP_207323.1 cag pathogenicity island protein (cag7) Virulence cag PAI Protein 0.0 95
HP0527 BAD14052.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 95
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 95
HP0527 BAD14026.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 95
HP0527 BAD13833.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 95
cagY YP_005777271.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 95
cagY AGC69792.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 95
cagY YP_005774542.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 94
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 93
cagY AGC69786.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 93
cagY AGC69789.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 93
HP0527 BAD13998.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 92
cagY AGC69787.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 91
HP0527 BAD13806.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 87
HP0527 BAD13860.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 86
cagY YP_005775730.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 86
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 75

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
HPLT_02610 YP_005772666.1 cag pathogenicity island protein (cag7, cagY) VFG0287 Protein 0.0 95