Name : HP0527 (HP0527)
Accession : NP_207323.1
PAI name : cag PAI
PAI accession : NC_000915_P1
Strain : Helicobacter pylori 2017
Virulence or Resistance: Virulence
Product : cag pathogenicity island protein (cag7)
Function : -
Note : similar to GP:1800165 percent identity: 94.57; identified by sequence similarity
Homologs in the searched genomes : 31 hits ( 30 protein-level, 1 DNA-level )
Publication :
-Marais,A., Mendz,G.L., Hazell,S.L. and Megraud,F., "Metabolism and genetics of Helicobacter pylori: the genome era", Microbiol. Mol. Biol. Rev. 63 (3), 642-674 (1999) PUBMED 10477311.
-Raymond,J., Thiberge,J.M., Kalach,N., Bergeret,M., Dupont,C., Labigne,A. and Dauga,C., "Using macro-arrays to study routes of infection of Helicobacter pylori in three families", PLoS ONE 3 (5), E2259 (2008) PUBMED 18493595 REMARK Publication Status: Online-Only.
-Tomb,J.-F., White,O., Kerlavage,A.R., Clayton,R.A., Sutton,G.G., Fleischmann,R.D., Ketchum,K.A., Klenk,H.P., Gill,S., Dougherty,B.A., Nelson,K., Quackenbush,J., Zhou,L., Kirkness,E.F., Peterson,S., Loftus,B., Richardson,D., Dodson,R., Khalak,H.G., Glodek,, "The complete genome sequence of the gastric pathogen Helicobacter pylori", Nature 388 (6642), 539-547 (1997) PUBMED 9252185.
-Tomb,J.-F., White,O., Kerlavage,A.R., Clayton,R.A., Sutton,G.G., Fleischmann,R.D., Ketchum,K.A., Klenk,H.P., Gill,S., Dougherty,B.A., Nelson,K., Quackenbush,J., Zhou,L., Kirkness,E.F., Peterson,S., Loftus,B., Richardson,D., Dodson,R., Khalak,H.G., Glodek,, "Direct Submission", Submitted (18-SEP-2001) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.
-Tomb,J.-F., White,O., Kerlavage,A.R., Clayton,R.A., Sutton,G.G., Fleischmann,R.D., Ketchum,K.A., Klenk,H.P., Gill,S., Dougherty,B.A., Nelson,K., Quackenbush,J., Zhou,L., Kirkness,E.F., Peterson,S., Loftus,B., Richardson,D., Dodson,R., Khalak,H.G., Glodek,, "Direct Submission", Submitted (06-AUG-1997) The Institute for Genomic Research, 9712 Medical Center Dr, Rockville, MD 20850, USA.
-Wen,Y., Marcus,E.A., Matrubutham,U., Gleeson,M.A., Scott,D.R. and Sachs,G., "Acid-adaptive genes of Helicobacter pylori", Infect. Immun. 71 (10), 5921-5939 (2003) PUBMED 14500513.
DNA sequence : | |
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAGCCCAACAAGATTCACCCCAAGATTTATCCAATGAAGAAGC
AACAGAAGCCAATCATTTTGAAAATCTTTTAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACCCCACAGAAA
CTCAAACCCATTTTGATGGAGACAAGTCAGAAGAAACCCAAACTCAAATGGATTCTGAAGGTAATGAAACTTCAGAATCT
AGCAATGGCAGTCTAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAAACCTTTCACTCAGCAAAA
GAATTTAGATGAAGAAACCCAAGAACTGAACGAAGAAGACGATCAAGAAAATAATGAGTATCAAGAAGAAACTCAAACGG
ACTTAATTGATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCAACAGAAGCC
AATCATTTTGAAAATCTTTTAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACCCCACAGAAACTCAAACCAA
TTTTGATGGAGACAAGTCAGAAGAAACCCAAACTCAAATGGATTCTGAAGGTAATGAAACTTCAGAATCTAGCAATGGCA
GTCTAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAAACCTTTCACTCAGCAAAAGAATTTAGAT
GAAGAAACCCAAGAACTGAACGAAGAAGACGATCAAGAAAATAATGAGTATCAAGAAGAAACTCAAACGGACTTAATTGA
TGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCAACAGAAGCCAATCATTTTG
AAAATCTTTTAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACCCCACAGAAACTCAAACCAATTTTGATGGA
GACAAGTCAGAAGAAATAACTGACGACTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATATATTATTGGTGG
CATTGTAGTCGCTGTTCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTCATGCCTTTGGAAGATAAAA
GCTCTCGTTTTAGCAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTGAAA
GAACGGAATGAAAAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTATTT
GAATATTGCAGAAATTGAGGACAAAAACCCGTTGAGAGCCTTTTATGAATGTATTAGTAATGGTGGCAACTATGAAGAAT
GTTTGAAGCTTATCAAAGACAAAAAACTTCAAGATCAGATGAAAAAGACTCTAGAGGCTTATAACGACTGCATCAAAAAT
GCCAAAACTGAAGAAGAAAGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTAAAAAAAAGCTTACTGAACCAACA
AAAAGTTCAAGTGGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCCTAAAACTCATAAATG
ACCCTGAGATTAGAGAGAAATTCCGTAAGGAATTAGAGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAAAAC
GCCAAAACAGAAGCTGAGAAAAACAAATGCTTGAAAGGCTTGTCTAAAGAAGCTATAGAGAGATTGAAACAGCAAGCGCT
AGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCTTGAAAAATATTCCCCAAGACTTGCAAAAAGAAC
TATTAGCTGATATGAGCGTCAAGGCTTACAAGGATTGCGTATCAAAAGCTAGAAATGAAAAAGAGAAACAAGAATGCGAG
AAATTGCTCACGCCTGAAGCGAGGAAAAAGTTAGAACAACAGGTTCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGA
ACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTACAAAGCGATATTCTAGCCAAAGAGAGCCTGAAAGCTTATAAAG
ACTGCGTATCTCAAGCCAAAACCGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAAAAAACTTTTA
GAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACCGAAGCTGAGAAAAAAGAATG
CGAGAAATTGCTCACCCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAAGCTTACTTGGATTGCGTAT
CAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAACTTTTAGAGCAACAA
GCACTAGATTGTTTGAAAAACGCTAAAACCGATAAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTGCAGAA
AAAGGTTTTAGCTAAAGAAAGCGTTAAAGCTTACTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAAT
GCGAGAAATTACTCACCCCTGAAGCGAGAAAACTTTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTATTTGGATTGCGTA
TCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGAAAACTTTTAGAAGAAGA
KGCCAAAGAGAGCGTTAAAGCTTACTTGGATTGCGTATCTCAAGCCAAAAACGAAGCTGAGAAAAAAGAATGCGAGAAAT
TGCTCACCCTTGAATCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCC
AAAACCGAAGCTGAGAAAAAAGAATGCGAAAAATTGCTCACGCCTGAAGCGAAAAAACTTTTAGAGCAACAAGCGCTAGA
TTGTTTGAAAAACGCTAAAACCGAAGCTGATAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTT
TAGCCAAAGAGAGCCTGAAAGCTTATAAAGACTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAA
TTACTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTACTTGGATTGCGTATCTCAAGC
CAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGAAAACTCTTAGAAGAAGCTAAAGAGA
GCGTTAAAGCTTATAAAGACTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACGCCT
GAAGCGAAAAAACTTTTAGAGCAACAAGTGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGATAAAAAAAGGTGTGT
CAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCTAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTATCAAGAG
CTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGCCAAAGAG
AGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAACTACTCACGCC
TGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAAAGCATTAAGGCTTATTTGGACTGCGTATCAAGAGCTAGGAATG
AAAAAGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAGAAAATTTTTAGCGAAGCAAGTGCTAAATTGTTTG
GAAAAAGCTGGAAATGAAGAAGAAAGAAAAGCATGTCTTAAAAATCTCCCTAAAGACTTACAGGAAAATATTTTAGCTAA
AGAGAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAACTACTCA
CGCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAAAGCGTTAAGGCTTATTTGGACTGCGTATCAAGAGCTAGG
AATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAGAAAATTTTTAGCGAAAGAACTCCAACAAAA
AGATAAAGCGATCAAAGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCGGCTATCATGAAGTGTTTGGATGGTTTGA
GCGATGAAGAGAAGCTCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTTGCGGATTGTTTGGCTATGGCTAAAACCGAT
GAAGAAAAAAGGAAATGCCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAACAAACAAAA
TCAATTGAGTAAAACAGAAAGGTTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGATCAAGAGGCCA
TAGAGCAATGTTTAGAGGGCTTGAGCGATAGTGAAAGGGCGCTAATTCTAGGAATTAAACGACAAGCTGATGAAGTGGAT
CTGATTTATAGCGATCTAAGAAACCGTAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCATTGTTACCAATGGATTT
CAAAAATGGCGGCGATATTGCCACTATTAACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATAATCCTATTTATG
CTTCCATAGAGCCTGATATTGCCAAGCAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAAGCTAAATTAGCT
AAGGCTTTAGGTGGCAATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACAGCAGAAGCTAAAGCAGAAAACAA
TAAGATAGACAAAGATGTCGCAGAAACTGCCAAGAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAGTGGGG
AATTTGTAGATGAAAATGGTAATCCCATTGATGACAAAAAGAAAGCAGAAAAACAAGATGAAACAAGCCCTGTCAAACAG
GCCTTTATAGGCAAGAGTGATCCCACATTTGTTTTAGCGCAATACACCCCCATTGAAATCACTCTGACTTCTAAAGTAGA
TGCCACTCTCACAGGTATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCACTATGATCTTATTAGACA
AAGGCACTAAGGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACACCCATTATGACACGCTTAATGATAGTCTTTACT
AAAGCCATTACGCCTGATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGCATGTTGGGTGAAGCAGGGGTAGA
TGGCTATGTGAATAATCACTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTGCAAACTG
CGCCTATCATAGCTCTAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAATTACGCTTTG
GGTCAAGCTATCAATGGTAGCATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCCCC
AAGTTTTTACAAAAACGAGGGCGATAGTATTAAGATTCTCACAATGGACGATATTGATTTTAGCGGTGTGTATGATGTTA
AAATTACTAACAAATCTGTGGTAGATGAAATTATCAAACAAAGCACCAAAACTTTGTCTAGAGAACATGAAGAAATCACC
ACAAGCCCCAAAGGTGGCAATTAA
|
Protein sequence : | |
MNEENDKLETSKKAQQDSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTHFDGDKSEETQTQMDSEGNETSES
SNGSLADKLFKKARKLVDNKKPFTQQKNLDEETQELNEEDDQENNEYQEETQTDLIDDETSKKTQQHSPQDLSNEEATEA
NHFENLLKESKESSDHHLDNPTETQTNFDGDKSEETQTQMDSEGNETSESSNGSLADKLFKKARKLVDNKKPFTQQKNLD
EETQELNEEDDQENNEYQEETQTDLIDDETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDG
DKSEEITDDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFMPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLK
ERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKN
AKTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLKLINDPEIREKFRKELELQKELQEYKDCIKN
AKTEAEKNKCLKGLSKEAIERLKQQALDCLKNAKTDEERNECLKNIPQDLQKELLADMSVKAYKDCVSKARNEKEKQECE
KLLTPEARKKLEQQVLDCLKNAKTDEERKKCLKDLPKDLQSDILAKESLKAYKDCVSQAKTEAEKKECEKLLTPEAKKLL
EEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEQQ
ALDCLKNAKTDKERKKCLKDLPKDLQKKVLAKESVKAYLDCVSQAKTEAEKKECEKLLTPEARKLLEEAKKSVKAYLDCV
SQAKTEAEKKECEKLLTPEARKLLEEXAKESVKAYLDCVSQAKNEAEKKECEKLLTLESKKKLEEAKKSVKAYLDCVSQA
KTEAEKKECEKLLTPEAKKLLEQQALDCLKNAKTEADKKRCVKDLPKDLQKKVLAKESLKAYKDCVSKARNEKEKKECEK
LLTPEAKKLLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEARKLLEEAKESVKAYKDCVSKARNEKEKKECEKLLTP
EAKKLLEQQVLDCLKNAKTEADKKRCVKDLPKDLQKKVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAKE
SLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSIKAYLDCVSRARNEKEKKECEKLLTPEARKFLAKQVLNCL
EKAGNEEERKACLKNLPKDLQENILAKESLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRAR
NEKEKKECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVADCLAMAKTD
EEKRKCQNLYSDLIQEIQNKRTQNKQNQLSKTERLHQASECLDNLDDPTDQEAIEQCLEGLSDSERALILGIKRQADEVD
LIYSDLRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDIAKQYETEKTIKDKNLEAKLA
KALGGNKKDDDKEKSKKSTAEAKAENNKIDKDVAETAKNISEIALKNKKEKSGEFVDENGNPIDDKKKAEKQDETSPVKQ
AFIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFT
KAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYAL
GQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEIT
TSPKGGN
|
|