Gene Information

Name : cag7 (KHP_0792)
Accession : YP_005793184.1
Strain : Helicobacter pylori 51
Genome accession: NC_017382
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein Y
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 845111 - 849745 bp
Length : 4635 bp
Strand : +
Note : DNA transport pore protein; similar to HP0527

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGC
AACAGAAGCCAATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACCCCACAGAAA
CTAAAACCAATTTTGATGAATACGAGTCAGAAGAAACCCAAACTCAAATAGATTCTGGAGGTAATGAAACTTCAGAATCT
AGCAATGGCAGTCTAGCAGACAAGTTATTCAAAAAAGCTAGAAAATTAGTTGATAATAAAAAACCTTTCACTCAGCAAAA
GAATTTAGATGAAGAAACCCAAGAACCGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAATGG
ACTTAATTGATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAACAACAGAAGCC
AATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACCCCACAGAAACTAAAACCAA
TTTTGATGAATACGAGTCAGAAGAAATAACTAACGATTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACA
TTATTGGTGGCATTGTAGTCGCTGTTCTTATTGTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTCATACCTTTG
GAAGATAAAAGCTCTCGTTTTAGCAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCG
ATTGCTGAAAGAACGGAATGAAAAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTAT
ACAACTATTTGAATATTGCAGAAATTGAGGACAAAAACCCATTGAGAGCCTTTTATGAATGTATTAGTAATGGTGGCAAC
TATGAAGAATGTTTGAAGCTTATCAAAGACAAAAAACTTCAAGATCAAATGAAAAAGACTTTAGAGGCTTATAATGACTG
CATCAAAAATGCCAAAACTGAAGAAGAAAGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACTTGAAAAAAAGCTTAC
TGAACCAACAAAAAGTTCAAGTGGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCCTAAAA
CTCATAAATGACCCTGAGATTAGAGAGAAATTCCGTAAGGAATTAGGGATTCAAAAAGAGCTTCAAGAGTATAAGGATTG
TATCAAAAACGCCAAAACAGAAGCTGAGAAAAACGAATGCTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAAC
AACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCTTGAAAAATATTCCCCAAGACTTG
CAAAAAGAACTACTAGCTGATATGAGCGTCAAGGCTTACAAGGATTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAA
AGAATGCGAAAAATTGCTCACGCCTGAAGCGAAAAAAAAGTTAGAGCAACAGGTTCTAGATTGTTTGAAAAACGCTAAAA
CCGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTACAAAGCGATATTTTAGCTAAAGAGAGTCTTAAA
GCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAA
AAAAAAGTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGA
AACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAGAAAAGTGTTAAAGCTTATTTG
GATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAGCTTTT
AGAGCAACAAGCGCTAGATTGTTTGAAAAATGCTAAAACTGATGGAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAG
ACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGTCTTAAAGCTTATAAAGACTGCGTTTCAAAAGCTAGGAATGAAAAAGAG
AAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAA
AGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAGCTTT
TAGAAGAAGCTAAAGAGAGCCTGAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGC
GAGAAATTACTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCCTCTC
TCAAGCCAGAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGAAAACTTTTAGAGCAAGAAG
TTAAGAAGAGCGTTAAGGCTTACTTGGATTGCATTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTA
CTCACCCCTGAAGCGAGAAAATTTTTAGCGAAGCAAGTGCTAAGTTGTTTGGAAAAAGCTAGAAATGAAGAAGAAAGAAA
AGCATGTCTTAAAAATATCCCTAAAGACTTACAGAAAAATGTTTTAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCC
TCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTGCTCACCCCTGAAGCGAGAAAACTCTTAGAGCAA
GAAGTTAAGAAGAGTGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGAAATGAAAAAGAGAAACAAGAATGCGAGAA
ATTGCTCACCCCTGAAGCGAGAAAATTTTTAGCGAAAGAACTCCAACAAAAAGATAAAGCGATCAAAGATTGCTTGAAAA
ACGCCGATCCTAACGACAGAGCGGCTATTATGAAGTGTTTGGATGGTTTGAGCGATGAAGAGAAGCTCAAATACCTGCAA
GAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAGGACCGATGAAGAAAAAAGGAAATGTCAAAACCTTTA
TAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAGCAAACAAAATCAATTGAGTAAAACAGAAAGATTGCATC
AAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGATCAAGAAGCCATAGAGCAATGTTTAGAGGGCTTGAGCGAT
AGTGAAAGGGCGCTAATTCTAGGAATTAAACGACAAGCTGATGAAGTGGATCTGATTTATAGCGATCTAAGAAACCGCAA
AACCTTTGATAACATGGCGGCTAAAGGTTATCCATTGTTGCCAATGGATTTCAAAAATGGCGGCGATATTGCCACTATTA
ACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATAATCCTATTTATGCTTCCATAGAGCCTGACATTACTAAGCAA
TACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAAGCTAAATTAGCTAAGGCTTTAGGTGGCAATAAAAAAGATGA
CGATAAAGAAAAAAGTAAAAAATCCACAGCAGAAGCTAAAGTAGAAAGCAATAAGATAGACAAAGATGTCGCAGAAACTG
CTAAGAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAGTGGGGAATTTGTAGATGAAAATGGTAATCCCATT
GATGACAAAAAGAAAACAGAAACACAAGATGAAACAAGCCCTGTCAAACAGGCCTTTATAGGCAAGAGTGATCCCACATT
TGTTTTAGCGCAATACACCCCTATTGAAATCACTCTGACTTCTAAAGTAGATGCCACTCTCACAGGTATAGTGAGTGGGG
TTGTAGCCAAAGATGTATGGAATATGAACGGCACTATGATCTTACTAGACAAAGGCACTAAGGTGTATGGGAATTACCAA
AGCGTGAAAGGTGGCACACCCATTATGACACGCTTAATGATAGTTTTTACTAAAGCCATTACGCCTGATGGTGTGATAAT
ACCTCTAGCAAACGCTCAAGCAGCAGGCATGTTGGGTGAAGCAGGGGTAGATGGCTATGTGAATAATCATTTTATGAAGC
GCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTGCAAACTGCGCCTATCATAGCTCTAGATAAACTCATA
GGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAATTACGCTTTGGGTCAAGCTATCAATGGTAGTATGCAAAG
TTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCCCCAAGTTTTTACAAAAATGAGGGCGATAGTA
TTAAGATTCTCACAATGGACGATATTGATTTTAGTGGCGTGTATGATGTTAAAATCACCAACAAATCTGTGGTAGATGAA
ATTATCAAACAAAGCACTAAAACTTTGTCTAGAGAGCATGAAGAAATCACCACAAGCCCCAAAGGTGGCAATTAA

Protein sequence :
MNEENDKLETSKKTQQHSPQDLSNEEATEANHFEDSSKESKESSDHHLDNPTETKTNFDEYESEETQTQIDSGGNETSES
SNGSLADKLFKKARKLVDNKKPFTQQKNLDEETQEPNEEDDQENNGYQEETQMDLIDDETSKKTQQHSPQDLSNEETTEA
NHFEDSSKESKESSDHHLDNPTETKTNFDEYESEEITNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFIPL
EDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGN
YEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLK
LINDPEIREKFRKELGIQKELQEYKDCIKNAKTEAEKNECLKGLSKEAIERLKQQALDCLKNAKTDEERNECLKNIPQDL
QKELLADMSVKAYKDCVSRARNEKEKKECEKLLTPEAKKKLEQQVLDCLKNAKTDEERKKCLKDLPKDLQSDILAKESLK
AYKDCVSRARNEKEKKECEKLLTPEAKKKLEEEAKESVKAYLDCVSQAKTEAEKQECEKLLTPEAKKKLEEAKKSVKAYL
DCVSQAKTEAEKKECEKLLTPEAKKLLEQQALDCLKNAKTDGERKKCLKDLPKDLQKKVLAKESLKAYKDCVSKARNEKE
KKECEKLLTPEAKKLLEEAKESLKAYKDCVSRARNEKEKQECEKLLTPEAKKLLEEAKESLKAYKDCVSRARNEKEKQEC
EKLLTPEAKKLLEEAKESLKAYKDCLSQARTEAEKKECEKLLTPEARKLLEQEVKKSVKAYLDCISRARNEKEKQECEKL
LTPEARKFLAKQVLSCLEKARNEEERKACLKNIPKDLQKNVLAKESLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQ
EVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQ
EAREKAVLDCLKTARTDEEKRKCQNLYSDLIQEIQNKRTQSKQNQLSKTERLHQASECLDNLDDPTDQEAIEQCLEGLSD
SERALILGIKRQADEVDLIYSDLRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDITKQ
YETEKTIKDKNLEAKLAKALGGNKKDDDKEKSKKSTAEAKVESNKIDKDVAETAKNISEIALKNKKEKSGEFVDENGNPI
DDKKKTETQDETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQ
SVKGGTPIMTRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLI
GLGKGRSERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDE
IIKQSTKTLSREHEEITTSPKGGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
cagY AGC69791.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 86
cag-Y AAF80198.1 Cag-Y Virulence cag PAI Protein 0.0 84
HP0527 BAD13806.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 83
HP0527 BAD13943.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 79
HP0527 BAD13833.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 79
cagY YP_005779063.1 cag island protein Virulence cag PAI Protein 0.0 78
HP0527 BAD13779.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 78
cagY YP_005777271.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 78
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 78