Gene Information

Name : cag7 (HPOK310_0802)
Accession : YP_007538436.1
Strain : Helicobacter pylori OK310
Genome accession: NC_020509
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 844905 - 850598 bp
Length : 5694 bp
Strand : +
Note : -
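
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual convention for genome annotations) and that the final codon is a stop codon; the helper name `feature_length` is hypothetical and not part of any database tool.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


# Values copied from the Position field of this record.
start, end = 844905, 850598

length_bp = feature_length(start, end)   # should agree with the Length field
remainder = length_bp % 3                # 0 if the ORF is a whole number of codons
protein_aa = length_bp // 3 - 1          # assumes the last codon is a stop codon

print(length_bp, remainder, protein_aa)  # 5694 0 1897
```

The result of 5694 bp agrees with the Length field, and 1897 residues agrees with the length of the protein sequence listed in this record.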

DNA sequence :
ATGAATGAAGAAAACGATAAATTTGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAAC
AACAGAAGCCAATCGCTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACCCCACAGAAA
CTAAAACCAATTTTGATGAATACAAGTCAGAAGAAACCCAAACTCAAATGGATTCTGAAGGTAATGAAACTTCAGAATCT
AGCAATGGCAGTCTAGCAGACAAGTTATTCAAAAAAGCTAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAA
GAATTTAGATGAAGAAATCCAAGAACCGAACGAAGAAGACAATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAATGG
ACTTAATTGATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAACAACAAAAGCC
AATCGCTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACCCCACAGAAACTAAAACCAA
TTTTGATGAATACGAGTCAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCTAGCAATGGCA
GTCTAGCAGACAAGTTATTCAAAAAAGCTAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAGAATTTAGAT
GAAGAAATCCAAGAACCGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAATGGACTTAATTGA
TGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCAACAGAAGCCAATCGCTTTG
AAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACCCCACAGAAACTAAAACCAATTTTGATGAA
TACGAGTCAGAAGAAATAACTAACGATTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACATTATTGGTGG
CATTGTAGTCGCTGTTCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTCATGCCTTTGGAAGATAAAA
GCTCTCGTTTTAGCAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTGAAA
GAACGGAATGAAAAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTATTT
GAATATTGCAGAAATTGAGGACAAAAACCCATTGAGGGCCTTTTATGAATGTATTAGTAATGGTGGCAACTATGAAGAAT
GTTTGAAGCTTATCAAAGACAAAAAACTTCAAGATCAGATGAAAAAGACTTTAGAGGCTTATAATGACTGCATCAAAAAT
GCCAAAACTGAAGAAGAAAGGATCAAGTGTTTAGATCTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACA
AAAAGTTCAAGTGGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAAAGAAACGAGTGCCTAAAACTCATAAATG
ACCCTGAGATTAGAGAGAAATTCCGTAAGGAATTAGGGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAAAAC
GCCAAAACAGAAGCTGAGAAAAACGAATGCTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAACAGCAAGCGCT
AGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAAAGAGTGCTTGAAAAATATTCCCCAAGACTTGCAAAAAGAAC
TACTAGCTGATATGAGCGTCAAGGCTTACAAGGATTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGTGAG
AAATTACTCACGCCTGAAGCGAAAAAAAAGTTAGAACAACAGGTTCTAGATTGTTTGAAAAACGCTAAAACTGATGAAGA
ACGAAAAAAATGTTTGAAAGATCTCCCTAAAGACTTACAAAGCGATATTTTAGCCAAAGAAAGTCTTAAAGCTTATAAAG
ACTGCGTATCTCAAGCTAAAACTGAAGATGAGAAAAAAGAATGTGAGAAATTACTTACGCCTGAAGCGAAAAAACTTTTA
GAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATG
CGAGAAATTGCTCACGCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTATTTGGATTGCGTAT
CTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCAAAAAAGCTTTTAGAGCGACAA
GCGCTAGATTGTTTGAAAAACGCTAAAACTGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTGCAGAA
AAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTACTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAAT
GCGAGAAATTGCTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAAGCTAAAGAAAGTCTTAAAGCTTATAAAGACTGTCTC
TCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGTGAGAAATTGCTCACCCCTGAAGCGAAAAAACTTTTAGAGCAAGA
AGTTAAGAAGAGTGTTAAGGCTTATTTGGACTGCATATCAAAAGCTAGGAATGAAAGAGAGAAACAAGAATGCGAGAAAT
TGCTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCGTTTCAAGAGCT
AGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAGA
GAGCGTTAAGGCTTACTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACCC
CTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTACTTGGATTGCGTATCTCAAGCCAAAACTGAA
GCTGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAAAAAGCGTTAAGGC
TTACTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAA
AACTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATATT
CCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAATGA
AAAAGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAAGCTAAAGAAAGTCTTAAAG
CTTATAAAGACTGTCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGTGAGAAATTGCTCACCCCTGAAGCGAAA
AAACTTTTAGAGCAAGAAGTTAAGAAGAGTGTTAAGGCTTATTTGGACTGCGTATCAAAAGCTAGGAATGAAAGAGAGAA
ACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGGAAATTTTTAGCGAAGCAAGTGCTAAGTTGTTTGGAAAAAGCTA
GAAATGAAGAAGAAAGAAAAGCATGTCTTAAAAATATCCCTAAAGACTTACAGAAAAATGTTTTAGCTAAAGAGAGTCTT
AAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGTGAGAAATTACTCACCCCTGAAGC
GAGGAAACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAAAGCTAGGAACGAAAAAG
AGAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGGAAATTTTTAGCGAAAGAACTCCAACAAAAAGATAAAGCG
ATCAAAGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCGGCTATTATGAAGTGTTTGGATGGTTTGAGCGATGAAGA
GAAGCTCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAGGACCGATGAAGAAAAAA
GGAAATGTCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAGCAAACAAAATCAATTGAGT
AAAACAGAAAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGATCAAGAAGCCATAGAGCAATG
TTTAGAGGGTTTGAGCGATAGTGAAAGGGCGCTAATTCTAGGAATTAAACGACAAGCTGATGAAGTGGATCTGATTTATA
GCGATCTAAGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCATTGTTGCCAATGGATTTCAAAAATGGC
GGCGATATTGCCACTATTAACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATAATCCTATTTATGCTTCCATAGA
GCCTGACATTACCAAGCAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAGGCTAAATTAGCTAATGCTTTAG
GTGGCAATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACAGCAGAAGCTAAAGTAGAAAGCAATAAGATAGAC
AAAGATGTCGCAGAAACTGCCAAAAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAGTGGGGAATTTGTAGA
TGAAAATGGTAATCCCATTGATGACAAAAAGAAAACAGAAAAACAAGATGAAACAAGCCCTGTCAAACAGGCCTTTATAG
GCAAGAGTGATCCCACATTTGTTTTAGCGCAATACACCCCTATTGAAATCACTCTGACTTCTAAAGTAGATGCCACTCTC
ACAGGTATAGTGAGTGGGGTTGTGGCCAAAGATGTATGGAACATGAACGGCACTATGATCTTACTAGACAAAGGCACTAA
GGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACACCCATTATGACACGCTTAATGATAGTCTTTACTAAAGCCATTA
CGCCTGATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGCATGCTGGGTGAAGCAGGGGTAGATGGCTATGTG
AATAATCATTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTACAAACTGCGCCTATCAT
AGCTCTAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAATTACGCTTTGGGTCAAGCTA
TCAATGGCAGTATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCCCCAAGTTTTTAC
AAAAATGAGGGCGATAGTATTAAGATTCTCACAATGGACGATATTGATTTTAGTGGCGTGTATGATGTTAAAATCACCAA
CAAATCTGTGGTAGATGAAATTATCAAACAAAGCACTAAAACTTTGTCTAGAGAGCATGAAGAAATCACCACAAGCCCCA
AAGGTGGCAATTAA

Protein sequence :
MNEENDKFETSKKTQQHSPQDLSNEETTEANRFEDSSKESKESSDHHLDNPTETKTNFDEYKSEETQTQMDSEGNETSES
SNGSLADKLFKKARKLVDNKRPFTQQKNLDEEIQEPNEEDNQENNGYQEETQMDLIDDETSKKTQQHSPQDLSNEETTKA
NRFEDSSKESKESSDHHLDNPTETKTNFDEYESEETQTQMDSGGNETSESSNGSLADKLFKKARKLVDNKRPFTQQKNLD
EEIQEPNEEDDQENNGYQEETQMDLIDDETSKKTQQHSPQDLSNEEATEANRFEDSSKESKESSDHHLDNPTETKTNFDE
YESEEITNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFMPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLK
ERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKN
AKTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLKLINDPEIREKFRKELGLQKELQEYKDCIKN
AKTEAEKNECLKGLSKEAIERLKQQALDCLKNAKTDEERKECLKNIPQDLQKELLADMSVKAYKDCVSKARNEKEKKECE
KLLTPEAKKKLEQQVLDCLKNAKTDEERKKCLKDLPKDLQSDILAKESLKAYKDCVSQAKTEDEKKECEKLLTPEAKKLL
EEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLERQ
ALDCLKNAKTDEERKKCLKDLPKDLQKKVLAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEEAKESLKAYKDCL
SQARNEEERRACEKLLTPEAKKLLEQEVKKSVKAYLDCISKARNEREKQECEKLLTPEAKKLLEEAKESLKAYKDCVSRA
RNEKEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEEAKKSVKAYLDCVSQAKTE
AEKKECEKLLTPEAKKLLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEQQALDCLKNAKTEAEKKRCVKDI
PKDLQKKVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCLSQARNEEERRACEKLLTPEAK
KLLEQEVKKSVKAYLDCVSKARNEREKQECEKLLTPEARKFLAKQVLSCLEKARNEEERKACLKNIPKDLQKNVLAKESL
KAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSKARNEKEKQECEKLLTPEARKFLAKELQQKDKA
IKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQEIQNKRTQSKQNQLS
KTERLHQASECLDNLDDPTDQEAIEQCLEGLSDSERALILGIKRQADEVDLIYSDLRNRKTFDNMAAKGYPLLPMDFKNG
GDIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKDKNLEAKLANALGGNKKDDDKEKSKKSTAEAKVESNKID
KDVAETAKNISEIALKNKKEKSGEFVDENGNPIDDKKKTEKQDETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDATL
TGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYV
NNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFY
KNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGGN
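
The protein sequence above is the conceptual translation of the DNA sequence. As a minimal sketch, the snippet below translates the first 30 bases of the DNA sequence and the last 9 bases of the open reading frame. It assumes the bacterial genetic code (NCBI translation table 11), whose codon assignments match the standard table used here; the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codon table in the NCBI ordering (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string in frame 0; optionally stop at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the DNA sequence in this record.
print(translate("ATGAATGAAGAAAACGATAAATTTGAAACT"))  # MNEENDKFET
# Last 9 bases of the open reading frame (ends in the TAA stop codon).
print(translate("GGCAATTAA"))  # GN
```

Both results match the start (MNEENDKFET) and the end (...GGN) of the protein sequence listed above.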

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity
HP0527 | BAD13998.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 95
HP0527 | BAD14052.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 93
cagY | AGC69792.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 91
HP0527 | NP_207323.1 | cag pathogenicity island protein (cag7) | Virulence | cag PAI | Protein | 0.0 | 90
HP0527 | BAD14026.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 90
cagY | AGC69786.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 90
HP0527 | BAD13888.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 89
HP0527 | BAD13915.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 88
cagY | AGC69787.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 86
orf13/14 | NP_223194.1 | cag island protein | Virulence | cag PAI | Protein | 0.0 | 84
cagY | YP_003728737.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 82
HP0527 | BAD13806.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 81
cagY | AGC69785.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 79
cagY | AGC69788.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 79

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity
cag7 | YP_007538436.1 | cag pathogenicity island protein | VFG0287 | Protein | 0.0 | 90