Gene Information

Name : cagY (HPF32_0506)
Accession : YP_005775730.1
Strain : Helicobacter pylori F32
Genome accession: NC_017366
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein Y VirB10-like protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 541445 - 547453 bp
Length : 6009 bp
Strand : -
Note : -

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTCAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAAC
AACAGAAGCCAATCGCTTTGAAGATTCTTCAAAAGAATCCGAAGAAAGCTCAGATCATCTTGACAACCCCACAGAAACTA
AAACCAATTTTGATGAATACGAGTCAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCTAGC
AATGGCAGTCTAGCAGACAAGTTATTCAAAAAAGCTAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAGAA
TTTAGATGAAGAAACCCAAGAACCGAACGAAGAAGACGATCAAGAAAATAATAGGTATCAAGAAGAAACTCAAATGGACT
TAATTGATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCGACAGAAGCCAAT
CACTTTGAAGATTCTTCAAAAGAATCCCAAGAAAGCTCAGAACATCATCTTGACAACCCTACAGAAACTAAAACCAATTT
TGATGAATACGAGTCAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCTAGCAATGGCAGTC
TAGCAGACAAGTTATTCAAAAAAGCTAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAGAATTTAGATGAA
GAAACCCAAGAACCGAACGAAGAAGACGATCAAGAAAATAATAGGTATCAAGAAGAAACTCAAATGGACTTAATTGATGA
TGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCGACAGAAGCCAATCACTTTGAAG
ATTCTTCAAAAGAATCCCAAGAAAGCTCAGAACATCATCTTGACAACCCTACAGAAACTAAAACCAATTTTGATGAATAC
GAGTCAGAAGAAATAACTAACGATTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACATTATTGGTGGCAT
TGTAGTCGCTGTTCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTCATACCTTTGGAAGATAAAAGCT
CTCGTTTTAGCAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTGAAAGAA
CGGAATGAAAAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTATTTGAA
TATTGCAGAAATTGAGGACAAAAACCCATTGAGAGCCTTTTATGAATGTATTAGCAATGGTGGCAACTATGAAGAATGTT
TGAAGCTTATCAAAGACAAAAAACTTCAAGATCAAATGAAAAAGACTTTAGAGGCTTATAATGACTGCATCAAAAATGCC
AAAACTGAAGAAGAAAGAATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACAAAA
AGTTCAAGTGGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCCTAAAACTCATAAATGACC
CTGAGATTAGAGAGAAATTCCGTAAGGAATTAGGGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAAAACGCC
AAAACAGAAGCTGAGAAAAACGAATGTTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAACAGCAAGCACTAGA
TTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCTTGAAAAATATTCCCCAAGACTTACAAAAAGAACTAC
TAGCTGATATGAGCGTCAAGGCTTACAAGGATTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAA
TTACTCACTCCTGAAGCGAAAAAAAAGTTAGAACAACAGGTTCTAGATTGTTTGAAAAACGCTAAAACTGATGAAGAACG
AAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTACAAAGCGATATTTTAGCTAAAGAGAGCCTGAAAGCTTATAAAGACT
GCGTGTCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAA
GAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTATCTCAAGCCAAAACTGAAGTTGAGAAAAAAGAATGCGA
GAAATTACTCACGCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTATTTGGACTGCGTATCTC
AAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAGCTTTTAGAGCAACAAGCG
CTAGATTGTTTGAAAAACGCTAAAACTGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTGCAGAAAAA
GGTTTTAGCTAAAGAGAGCGTTAAAGCTTACTTGGATTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCG
AGAAATTACTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAAGCTAAAGAGAGCCTGAAAGCTTATAAAGACTGCGTGTCT
CAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGAAGC
CAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTAC
TCACGCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTATTTGGACTGCGTATCTCAAGCCAAA
ACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAGCTTTTAGAGCAACAAGCGCTAGATTG
TTTGAAAAACGCTAAAACTGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAG
CTAAAGAGAGCGTTAAAGCTTACTTGGATTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTA
CTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAAGCTAAAGAGAGCCTGAAAGCTTATAAAGACTGCGTGTCTCAAGCCAA
AACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAGCTTTTAGAAGAAGCTAAAGAGAGCC
TGAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAA
GCGAAAAAGCTTTTAGAAGAAGCTAAAGAGAGCCTGAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGA
GAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAGCTTTTAGAAGAAGCTAAAGAGAGCCTGAAAGCTTATA
AAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCAAAAAAGCTT
TTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAGAGGTGTGTCAAAGATCTCCCTAA
AGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAAAGCTAGGAATGAAAAAG
AAAGAAAAGCTTGCGAGAAACTACTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGCCAAAGAGAGTCTTAAAGCTTAT
AAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTGCTCACCCCTGAAGCGAGGAAACT
CTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGATTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAACAAG
AATGCGAGAAATTGCTCACGCCTGAAGCGAGAAAATTCTTAGCGAAGCAAGCGCTAAGTTGTTTGGAAAAAGCTGGAAAT
GAAGAAGAAAGAAAAGCATGTCTTAAAAATCTCCCTAAAGACTTACAGGAAAATGTTTTAGCTAAAGAGAGTCTTAAAGC
TTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTGCTCACCCCTGAAGCGAGGA
AACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGAAATGAAAAAGAGAAA
CAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGAAAATTTTTAGCGAAAGAACTCCAACAAAAAGATAAAGCGATCAA
AGATTGCTTGAAAAACGCCGATCCTAACGACAGAGTGGCTATTATGAAGTGTTTGGATGGTTTGAGCGATGAAGAGAAGC
TCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAGGACCGATGAAGAAAAAAGGAAA
TGTCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAATAAACAAAATCAATTGAGTAAAAC
AGAAAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGATCAGGAAGCCATAGAGCAATGTTTAG
AGGGCTTGAGCGATAGTGAAAGGGCGCTAATTCTAGGAATTAAACGACAAGCTGATGAAGTGGATCTGATTTATAGCGAT
CTAAGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCATTGTTGCCAATGGATTTCAAAAATGGTGGCGA
TATTGCCACTATTAACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATAATCCTATTTATGCTTCCATAGAGCCTG
ATATTACTAAGCAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAAGCTAAATTAGCTAAGGCTTTAGGTGGC
AATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACAGCAGAAGCTAGAGTAGAAAGCAATAAGATAGACAAAGA
TGTCGCAGAAACTGCCAAAAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAGTGGGGAATTTGTAGATGAAA
ATGGTAATCCCATTGATGACAAAAAGAAAACAGAAACACAAGATGAAACAAGCCCTGTCAAACAAGCCTTTATAGGCAAG
AGTGATCCCACATTTGTTTTAGCGCAATACACCCCTATTGAAATCACTCTGACTTCTAAAGTAGATGCCACTCTCACAGG
TATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCACTATGATCTTACTAGACAAAGGCACTAAGGTGT
ATGGGAATTACCAAAGCGTGAAAGGTGGCACACCCATTATGACACGCTTAATGATAGTCTTTACTAAAGCCATTACGCCT
GATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGCATGTTGGGTGAAGCAGGGGTAGATGGCTATGTGAATAA
CCACTTCATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTGCAAACTGCGCCTATCATAGCTC
TAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAATTACGCTTTGGGTCAAGCTATCAAT
GGTAGTATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCCCCAAGTTTTTACAAAAA
TGAGGGCGATAGTATTAAGATTCTCACAATGGACGATATTGATTTTAGTGGCGTGTATGATGTTAAAATCACCAACAAAT
CTGTGGTAGATGAAATTATCAAACAAAGCACTAAAACTTTGTCTAGAGAGCATGAAGAAATCACCACAAGTCCCAAAGGT
GGCAATTAA

Protein sequence :
MNEENDKLETSQKTQQHSPQDLSNEETTEANRFEDSSKESEESSDHLDNPTETKTNFDEYESEETQTQMDSGGNETSESS
NGSLADKLFKKARKLVDNKRPFTQQKNLDEETQEPNEEDDQENNRYQEETQMDLIDDETSKKTQQHSPQDLSNEEATEAN
HFEDSSKESQESSEHHLDNPTETKTNFDEYESEETQTQMDSGGNETSESSNGSLADKLFKKARKLVDNKRPFTQQKNLDE
ETQEPNEEDDQENNRYQEETQMDLIDDETSKKTQQHSPQDLSNEEATEANHFEDSSKESQESSEHHLDNPTETKTNFDEY
ESEEITNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFIPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKE
RNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKNA
KTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLKLINDPEIREKFRKELGLQKELQEYKDCIKNA
KTEAEKNECLKGLSKEAIERLKQQALDCLKNAKTDEERNECLKNIPQDLQKELLADMSVKAYKDCVSKARNEKEKKECEK
LLTPEAKKKLEQQVLDCLKNAKTDEERKKCLKDLPKDLQSDILAKESLKAYKDCVSQAKTEAEKKECEKLLTPEAKKLLE
EEAKESVKAYLDCVSQAKTEVEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEQQA
LDCLKNAKTDEERKKCLKDLPKDLQKKVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCVS
QAKTEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQAK
TEAEKKECEKLLTPEAKKLLEQQALDCLKNAKTDEERKKCLKDLPKDLQKKVLAKESVKAYLDCVSRARNEKEKKECEKL
LTPEAKKLLEEAKESLKAYKDCVSQAKTEAEKKECEKLLTPEAKKLLEEAKESLKAYKDCVSRARNEKEKKECEKLLTPE
AKKLLEEAKESLKAYKDCVSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCVSRARNEKEKKECEKLLTPEAKKL
LEQQALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVKAYLDCVSKARNEKERKACEKLLTPEAKKLLEEAKESLKAY
KDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKQALSCLEKAGN
EEERKACLKNLPKDLQENVLAKESLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEK
QECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRVAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRK
CQNLYSDLIQEIQNKRTQNKQNQLSKTERLHQASECLDNLDDPTDQEAIEQCLEGLSDSERALILGIKRQADEVDLIYSD
LRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKDKNLEAKLAKALGG
NKKDDDKEKSKKSTAEARVESNKIDKDVAETAKNISEIALKNKKEKSGEFVDENGNPIDDKKKTETQDETSPVKQAFIGK
SDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITP
DGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYALGQAIN
GSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKG
GN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
cagY YP_005775730.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 100
HP0527 BAD13860.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagY YP_003728737.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 81
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 76