Gene Information

Name : cagY (HPF30_0797)
Accession : YP_005774542.1
Strain : Helicobacter pylori F30
Genome accession: NC_017365
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 855260 - 861040 bp
Length : 5781 bp
Strand : +
Note : -

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGC
AACAGAAGCCAATCATTTTGAAGATTTTTCAAAAGAATCCAAAGAAAGCTCAGATCATCTTGACAACCCCACAGAAACTA
AAACCAATTTTGATGAATACGAGTCAGAAGAAACCCAAACTCAAATGGATTTTGGAGGTAATGAAACTTCAGAATCTAGC
AATGGCAGTCTAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAAACCTTTCACTCAGCAAAAGAA
TTTAGATGAAGAAACCCAAGAACTAAACGAAGAATACGATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAATGGACT
TAATTGATGATGAAACTTCTAAAAAAACCCAACAATATTCACCCCAAGATTTATCCAATGAAGAAACAACAAAAGCCAAT
CACTTTGAAGATTCTTCAAAAGAATCCCAAGAAAGCTCAGATCATCATCTTGACAACCCTACAGAAACTAAAACCAATTT
TGATGAATACGAGTCAGAAGAAACCCAAACTCAAATGGATTTTGGAGGTAATGAAACTTCAGAATCTAGCAATGGCAGTC
TAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAAACCTTTCACTCAGCAAAAGAATTTAGATGAA
GAAACCCAAGAACTAAACGAAGAATACGATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAATGGACTTAATTGATGA
TGAAACTTCTAAAAAAACCCAACAATATTCACCCCAAGATTTATCCAATGAAGAAACAACAAAAGCCAATCACTTTGAAG
ATTCTTCAAAAGAATCCCAAGAAAGCTCAGATCATCATCTTGACAACCCTACAGAAACTAAAACCAATTTTGATGAATAC
GAGTCAGAAGAAATAACTAACGATTCTAATGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACATTATTGGTGGCAT
TGTAGTCGCTGTTCTTATTGTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTCATACCTTTGGAAGATAAAAGCT
CTCGTTTTAGCAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTGAAAGAA
CGGAATGAAAAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTATTTGAA
TATTGCAGAAATTGAGGACAAAAACCCATTGAGAGCCTTTTATGAATGTATTAGTAATGGTGGCAACTATGAAGAATGTT
TGAAGCTTATCAAAGACAAAAAACTTCAAGATCAAATGAAAAAGACTTTAGAGGCTTATAATGACTGCATCAAAAATGCC
AAAACTGAAGAAGAAAGGATCAAGTGTTTAGATCTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACAAAA
AGTTCAAGTGGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCCTAAAACTCATAAATGACC
CTGAGATTAGAGAGAAATTCCGTAAGGAATTAGGGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAAAACGCC
AAAACAGAAGCTGAGAAAAACGAATGCTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAACAGCAAGCGCTAGA
TTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAAAGAGTGCTTGAAAAATATTCCCCAAGACTTGCAAAAAGAACTAC
TAGCTGATATGAGCGTCAAGGCTTACAAGGATTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGTGAGAAA
TTACTCACGCCTGAAGCGAAAAAAAAGTTAGAACAACAGGTTCTAGATTGTTTGAAAAACGCTAAAACTGATGAAGAACG
AAAAAAATGTTTGAAAGATCTCCCTAAAGACTTACAAAGCGATATTTTAGCCAAAGAAAGTCTTAAAGCTTATAAAGACT
GCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAAAAAGCTTTTAGAA
GAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGA
AAAATTACTCACGCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAAGCTTATTTGGATTGCGTATCTC
AAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAGTTACTCACGCCTGAAGCGAAAAAACTTTTAGAGCAACAAGCG
CTAGATTGTTTGAAAAACGCTAAAACTGATGGAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTGCAGAAAAA
GGTTTTAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCG
AGAAATTACTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCGTATCT
CAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAGTTACTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGAAGC
CAAAGAGAGCATTAAGGCTTACTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTAC
TCACGCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAAGCTTATTTGGATTGCGTATCTCAAGCCAAA
ACTGAAGCTGAGAAAAAAGAATGCGAGAAGTTACTCACGCCTGAAGCGAAAAAACTTTTAGAGCAACAAGCGCTAGATTG
TTTGAAAAACGCTAAAACTGATGGAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAG
CTAAAGAGAGTCTTAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTG
CTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAG
AAATGAAGAAGAAAGGAGAGCTTGCGAGAAGTTACTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAGAGAGCC
TGAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAA
GCGAAAAAACTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGCCAA
AGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGTTAAGGCGTATTTGGATTGCGTTTCAAGAGCTA
GGAATGAAAAAGATAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAGAGAGT
CTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTACTCACCCCTGA
AGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAGAGTGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAATGAAA
AAGAGAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTTTTAGCGAAGCAAGTGCTAAATTGTTTGGAA
AAAGCTAGAAATGAAGAAGAAAGAAAAGCATGTCTTAAAAATATCCCTAAAGACTTACAGAAAAATGTTTTAGCCAAAGA
GAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTACTCACCC
CTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAGAGTGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAAT
GAAAAAGAGAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTTTTAGCGAAAGAACTCCAACAAAAAGA
TAAAGCGATCAAAGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCGGCTATTATGAAGTGTTTGGATGGTTTGAGCG
ATGAAGAGAAGCTCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAGGACCGATGAA
GAAAAAAGGAAATGCCAAAATCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAGCAAACAAAATCA
ATTGAGTAAAACAGAAAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGATCAAGAAGCCATAG
AGCAATGTTTAGAGGGCTTAAGCGATAGTGAAAGGGCGCTAATTCTAGGAATTAAACGACAAGCTGATGAAGTGGATCTG
ATTTATAGCGATCTAAGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCATTGTTGCCAATGGATTTCAA
AAATGGCGGCGATATTGCCACTATTAACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATAATCCTATTTATGCTT
CCATAGAGCCTGACATTACCAAGCAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAGGCTAAATTAGCTAAG
GCTTTAGGTGGCAATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACAGCAGAAGCTAGAGTAGAAAGCAATAA
GATAGACAAAGATGTCGCAGAAACTGCCAAAAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAGTGGGGAAT
TTGTAGATGAAAATGGTAATCCCATTGATGACAAAAAGAAAACAGAAACACAAGATGAAACAAGCCCTGTCAAACAGGCC
TTTATAGGCAAGAGTGATCCCACATTTGTTTTAGCGCAATACACCCCTATTGAAATCACTCTGACTTCTAAAGTAGATGC
CACTCTCACAGGTATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCACTATGATCTTACTAGACAAAG
GCACTAAGGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACACCCATTATGACACGCTTAATGATAGTCTTTACTAAA
GCCATTACGCCTGATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGCATGCTGGGTGAAGCAGGGGTAGATGG
CTATGTGAATAACCACTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTGCAAACTGCGC
CTATCATAGCTCTAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAATTACGCTTTGGGT
CAAGCTATCAATGGTAGTATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCCCCAAG
TTTTTACAAAAATGAGGGTGATAGTATTAAGATTCTCACAATGGACGACATTGATTTTAGTGGCGTGTATGATGTTAAAA
TTACCAACAAATCTGTGGTAGATGAAATTATCAAACAAAGCACTAAAACTTTGTCTAGAGAGCATGAAGAAATCACCACA
AGCCCCAAAGGTGGCAATTAA

Protein sequence :
MNEENDKLETSKKTQQHSPQDLSNEEATEANHFEDFSKESKESSDHLDNPTETKTNFDEYESEETQTQMDFGGNETSESS
NGSLADKLFKKARKLVDNKKPFTQQKNLDEETQELNEEYDQENNGYQEETQMDLIDDETSKKTQQYSPQDLSNEETTKAN
HFEDSSKESQESSDHHLDNPTETKTNFDEYESEETQTQMDFGGNETSESSNGSLADKLFKKARKLVDNKKPFTQQKNLDE
ETQELNEEYDQENNGYQEETQMDLIDDETSKKTQQYSPQDLSNEETTKANHFEDSSKESQESSDHHLDNPTETKTNFDEY
ESEEITNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFIPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKE
RNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKNA
KTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLKLINDPEIREKFRKELGLQKELQEYKDCIKNA
KTEAEKNECLKGLSKEAIERLKQQALDCLKNAKTDEERKECLKNIPQDLQKELLADMSVKAYKDCVSKARNEKEKKECEK
LLTPEAKKKLEQQVLDCLKNAKTDEERKKCLKDLPKDLQSDILAKESLKAYKDCVSQAKTEAEKKECEKLLTPEAKKLLE
EEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEQQA
LDCLKNAKTDGERKKCLKDLPKDLQKKVLAKESLKAYKDCVSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCVS
QAKTEAEKKECEKLLTPEAKKLLEEEAKESIKAYLDCVSQAKTEAEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQAK
TEAEKKECEKLLTPEAKKLLEQQALDCLKNAKTDGERKKCLKDLPKDLQKKVLAKESLKAYKDCVSRARNEKEKKECEKL
LTPEAKKLLEEAKESLKAYKDCLSQARNEEERRACEKLLTPEAKKLLEEAKESLKAYKDCVSRARNEKEKQECEKLLTPE
AKKLLEQQALDCLKNAKTEAEKKRCAKDLPKDLQKKVLAKESVKAYLDCVSRARNEKDKKECEKLLTPEAKKLLEEAKES
LKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKQVLNCLE
KARNEEERKACLKNIPKDLQKNVLAKESLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARN
EKEKQECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDE
EKRKCQNLYSDLIQEIQNKRTQSKQNQLSKTERLHQASECLDNLDDPTDQEAIEQCLEGLSDSERALILGIKRQADEVDL
IYSDLRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKDKNLEAKLAK
ALGGNKKDDDKEKSKKSTAEARVESNKIDKDVAETAKNISEIALKNKKEKSGEFVDENGNPIDDKKKTETQDETSPVKQA
FIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFTK
AITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYALG
QAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITT
SPKGGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
cagY YP_005774542.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 100
HP0527 BAD13833.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 97
cagY YP_005777271.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 97
HP0527 BAD14052.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 96
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 95
cagY AGC69792.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 95
HP0527 NP_207323.1 cag pathogenicity island protein (cag7) Virulence cag PAI Protein 0.0 94
HP0527 BAD14026.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 94
cagY AGC69786.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 93
cagY AGC69789.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 93
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 92
cagY AGC69787.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 90
HP0527 BAD13998.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 89
HP0527 BAD13860.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 89
cagY YP_005775730.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 89
HP0527 BAD13806.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 86
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 83

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
cagY YP_005774542.1 cag pathogenicity island protein VFG0287 Protein 0.0 94