> JQ685139|1|G1|442739063|AGC69791.1|Helicobacter_pylori_J166_isolate_B16-6.5/mOut4|+|1..5529|cagY|-|cag
pathogenicity island
protein Y
Length=1842
Score = 1276 bits (4840), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1125/1311 (86%), Positives = 1184/1311 (91%), Gaps = 62/1311 (4%)
Query 470 NECLKNIPQD-LQKELLADMSVKAYKDCVSKARNekekkecekLLTPEA--KKLLEQQ-- 524
+ECLK I + LQ ++ ++ AY+DC+ +A+ E E+ C L+ E K LL+QQ
Sbjct 582 EECLKLIKDKKLQDQM--KKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSLLNQQKV 639
Query 525 --ALDCLKNAKTDEERKKCLK-----DLPKDLQSDILAKESVKAYKDCVSQakteaekke 577
ALDCLKNAKTDEERK CLK ++ ++ ++ + ++ YKDC+ +AKTEAEKKE
Sbjct 640 QVALDCLKNAKTDEERKECLKLINDPEIREKFRKELELQKELQEYKDCIKNAKTEAEKKE 699
Query 578 ceklltpeakklleeeakeSVKAYLDCVSQakteaekkeceklltpeakkkleeakksvk 637
CEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEA+KKLEEAKKSVK
Sbjct 700 CEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEARKKLEEAKKSVK 759
Query 638 aYLDCVSQakteaekkeceklltPEAKKLLEQQALDCLKNAKTDEERKKCLKNLPKDLQK 697
AYLDCVSQAK E EKKECEKLLTPEA+KLLEQQALDCLKNAKTDEERKKCLK+LPKDLQK
Sbjct 760 AYLDCVSQAKNEDEKKECEKLLTPEARKLLEQQALDCLKNAKTDEERKKCLKDLPKDLQK 819
Query 698 KVLAKESVKAYLDCVSRARNekekkeceklltpeakklleeakeslkaYKDCVSRSRNEK 757
KVLAKESVKA Y DCVS+++ E
Sbjct 820 KVLAKESVKA--------------------------------------YLDCVSQAKTEA 841
Query 758 EKQECeklltpeakklleeeakeSVKAYLDCVSQARTEAEKKECeklltpeakkkleeak 817
EK+ECEKLLTPEA+KLLEE S+KAY DCVSQA+ E EKKECEKLLTPEAKK LE+
Sbjct 842 EKKECEKLLTPEARKLLEEAKE-SIKAYKDCVSQAKNEDEKKECEKLLTPEAKKLLEQQA 900
Query 818 ksvkaYLDCVSQakteaekkeceklltpeakklleeeakeSVKAYLDCVSRARNekekke 877
LDC+ +AKTEA+KK C K L + +K + SVKAY DCVSRARNEKEKKE
Sbjct 901 ------LDCLKNAKTEADKKRCVKDLPKDLQKKVLAKE--SVKAYKDCVSRARNEKEKKE 952
Query 878 ceklltpeakkkleeakksvkaYLDCVSQARTEAEKKECEKLLTPEARKLLEQEVKKSVK 937
CEKLLTPEAKK LEEAKKSVKAYLDCVSQA+TEAEKKECEKLLTPEA+KLLE E K S+K
Sbjct 953 CEKLLTPEAKKLLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLE-EAKESLK 1011
Query 938 AYLDCVSRARNekekkecekLLTPEAKKLLEQQALDCLKNAKTEAEKKRCVKDLPKDLQK 997
AY DCVSRARNEKEKKECEKLLTPEAKKLLEQQALDCLKNAKTEA+KKRCVKDLPKDLQK
Sbjct 1012 AYKDCVSRARNEKEKKECEKLLTPEAKKLLEQQALDCLKNAKTEADKKRCVKDLPKDLQK 1071
Query 998 KVLAKESVKAYLDCVSRARNekekkeceklltpeakklleeakeslkaYKDCLSQARNEE 1057
KVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCLSQARNEE
Sbjct 1072 KVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCLSQARNEE 1131
Query 1058 ERKACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKQV 1117
ER+ACEKLLTPEARKLLEQEVKKSVKAYLDCVS+A+ E EK+ECEKLLTPEARKFLAKQV
Sbjct 1132 ERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSQAKTEAEKKECEKLLTPEARKFLAKQV 1191
Query 1118 LSCLEKARNEEERKACLKNIPKDLQKNVLAKESLKAYKDCLSQARNEEERKACEKLLTPE 1177
L+CLEKA NEEERKACLKN+PKDLQ NVLAKESLKAYKDCLSQARNEEER+ACEKLLTPE
Sbjct 1192 LNCLEKAGNEEERKACLKNLPKDLQENVLAKESLKAYKDCLSQARNEEERRACEKLLTPE 1251
Query 1178 ARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKELQQKDKAIKDCLK 1237
ARKLLEQEVKKSVKAYLDCVSRARNEKEK+ECEKLLTPEARKFLAKELQQKDKAIKDCLK
Sbjct 1252 ARKLLEQEVKKSVKAYLDCVSRARNEKEKKECEKLLTPEARKFLAKELQQKDKAIKDCLK 1311
Query 1238 NADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQE 1297
NADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQE
Sbjct 1312 NADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQE 1371
Query 1298 IQNKRTQSKQNQLSKTERLHQASECLDNLDDPTDQEAIEKCLEGLSDSERALILGIKRQA 1357
IQNKRTQSKQNQLSKTERLHQASECLDNLDDPTDQ+AIE+CLEGLSDSERALILGIKRQA
Sbjct 1372 IQNKRTQSKQNQLSKTERLHQASECLDNLDDPTDQQAIEQCLEGLSDSERALILGIKRQA 1431
Query 1358 DEVDLIYSDLRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDTDKIASDNPIYASI 1417
DEVD IYSDLRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVD+DKIASDNPIYASI
Sbjct 1432 DEVDRIYSDLRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASI 1491
Query 1418 EPDITKQYETEKTIkdknleaklakalggnkkdddkekskksTAEARVESNKIDKDVAET 1477
EPDITKQYETEKTIKDKNLEAKLAKALGGNKKDDDKEKSKKSTAEA+VESNKIDKDVAET
Sbjct 1492 EPDITKQYETEKTIKDKNLEAKLAKALGGNKKDDDKEKSKKSTAEAKVESNKIDKDVAET 1551
Query 1478 AKNISEIALKNKKEKSGEFVDENGNPIddkkktekqdetSPVKQAFIGKSDPTFVLAQYT 1537
AKNISEIALKNKKEK+GEFVDENGNPIDDKKK+EKQDETSPVKQAFIGKSDPTFVLAQYT
Sbjct 1552 AKNISEIALKNKKEKNGEFVDENGNPIDDKKKAEKQDETSPVKQAFIGKSDPTFVLAQYT 1611
Query 1538 PIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLM 1597
PIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLM
Sbjct 1612 PIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLM 1671
Query 1598 IVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPI 1657
IVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPI
Sbjct 1672 IVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPI 1731
Query 1658 IALDKLIGLGKGRSERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDS 1717
IALDKLIGLGKGRSERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDS
Sbjct 1732 IALDKLIGLGKGRSERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDS 1791
Query 1718 IKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGGN 1768
IKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGGN
Sbjct 1792 IKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGGN 1842
Score = 525 bits (1981), Expect = 8e-150, Method: Compositional matrix adjust.
Identities = 493/568 (87%), Positives = 520/568 (92%), Gaps = 8/568 (1%)
Query 9 ETSKKTQQHSPQDLSNEETIKANHFedsskeskessdHHLDNSTETKTNFDEYKSEETQT 68
ETSKKTQQHSPQDLSNEE+ NHFEDSSKESKESSD LDN+TETKTNFD KSEETQT
Sbjct 269 ETSKKTQQHSPQDLSNEEATEINHFEDSSKESKESSDPYLDNPTETKTNFDGDKSEETQT 328
Query 69 QMDsggnetsessnssLADKLFKKARKLVDNKRPFTQQKNLdeeiqepneeddqennGYQ 128
QMDSGG+ETSESSN+SLADKLFKKARKLVDNKRPFTQQKNLDEE QE NEEDDQENNGYQ
Sbjct 329 QMDSGGDETSESSNGSLADKLFKKARKLVDNKRPFTQQKNLDEETQELNEEDDQENNGYQ 388
Query 129 EETQMDLIDDETSKKTQQHSPQDLSNEETIKANHFedsskeskessdHHLDNSTETKTNF 188
EETQ +LIDDETSKKTQQHSPQDLSNEE+ NHFEDSSKESKESSD LDN+TETKTNF
Sbjct 389 EETQTGLIDDETSKKTQQHSPQDLSNEEATEINHFEDSSKESKESSDPYLDNPTETKTNF 448
Query 189 DGEKSEEITNNSNDQEiikgskkkyiiggivvavliviilFSRSIFHYFIPLEDKSSRFS 248
D +KSEEITN+SNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYF+PLEDKSSRFS
Sbjct 449 DEDKSEEITNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFMPLEDKSSRFS 508
Query 249 KDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPL 308
KDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPL
Sbjct 509 KDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPL 568
Query 309 RAFYECISNGGNYEEClklikdkklqdqmkktlEAYNDCIKNAKTEEERIKCLDLIKDEN 368
RAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDEN
Sbjct 569 RAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDEN 628
Query 369 LKKSLLNQQKVQVALDCLKNAKTDEERNECLKLINDPEIrekfrkelelqkelqeYKDCI 428
LKKSLLNQQKVQVALDCLKNAKTDEER+ECLKLINDPEIREKFRKELELQKELQEYKDCI
Sbjct 629 LKKSLLNQQKVQVALDCLKNAKTDEERKECLKLINDPEIREKFRKELELQKELQEYKDCI 688
Query 429 KNAKTEAEKNKCLKGLSKEAIERLKQQA-------LDCLKNAKTDEERNECLKNIPQDLQ 481
KNAKTEAEK+ C K L+ EA L ++A LDC+ +AKT+ E++EC K + + +
Sbjct 689 KNAKTEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAR 748
Query 482 KEL-LADMSVKAYKDCVSKARNekekkecekLLTPEAKKLLEQQALDCLKNAKTDEERKK 540
K L A SVKAY DCVS+A+NE EKKECEKLLTPEA+KLLEQQALDCLKNAKTDEERKK
Sbjct 749 KKLEEAKKSVKAYLDCVSQAKNEDEKKECEKLLTPEARKLLEQQALDCLKNAKTDEERKK 808
Query 541 CLKDLPKDLQSDILAKESVKAYKDCVSQ 568
CLKDLPKDLQ +LAKESVKAY DCVSQ
Sbjct 809 CLKDLPKDLQKKVLAKESVKAYLDCVSQ 836
Score = 247 bits (923), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 206/393 (53%), Positives = 273/393 (70%), Gaps = 29/393 (7%)
Query 900 YLDCVSQARTEAEKKECEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNekekkecekLL 959
+ +C+S++ ++ EC KL+ K L++++KK++ AY DC+ A+ E E+ C L+
Sbjct 571 FYECISNG---GNYEECLKLIKD---KKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLI 624
Query 960 TPEA--KKLLEQQ----ALDCLKNAKTEAEKKRCVK-----DLPKDLQKKVLAKESVKAY 1008
E K LL+QQ ALDCLKNAKT+ E+K C+K ++ ++K + + ++ Y
Sbjct 625 KDENLKKSLLNQQKVQVALDCLKNAKTDEERKECLKLINDPEIREKFRKELELQKELQEY 684
Query 1009 LDCVSRARNekekke-ceklltpeakklleeakeslkaYKDCLSQARNEEERKACEKLLT 1067
DC+ A+ E EKKE L K L EEAKES+KAY DC+SQA+ E E+K CEKLLT
Sbjct 685 KDCIKNAKTEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLT 744
Query 1068 PEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKQVLSCLEKARNE 1127
PEARK LE E KKSVKAYLDCVS+A+NE EK+ECEKLLTPEARK+L +Q L CL +A+ +
Sbjct 745 PEARKKLE-EAKKSVKAYLDCVSQAKNEDEKKECEKLLTPEARKLLEQQALDCLKNAKTD 803
Query 1128 EERKACLKNIPKDLQKNVLAKESLKAYKDCLSQARNEEERKACEKLLTPEARKLLEQEVK 1187
EERK CLK++PKDLQK+VLAKES+KAY DC+SQA+ E E+K CEKLLTPEARKLLE E K
Sbjct 804 EERKKCLKDLPKDLQKKVLAKESVKAYLDCVSQAKTEAEKKECEKLLTPEARKLLE-EAK 862
Query 1188 KSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAI 1247
S+KAY DCVS+A+NE EK+ECEKLLTPEA+K+L ++ DCLKNA + A
Sbjct 863 ESIKAYKDCVSQAKNEDEKKECEKLLTPEAKKLL-------EQQALDCLKNAKTE--ADK 913
Query 1248 MKCLDGLSDEEKLKYLQEAREKAVLDCLKTART 1280
+C+ +L+ + + K L KA DC+ AR
Sbjct 914 KRCVKDLPKDLQKKVLAKESVKAYKDCVSRARN 946
Score = 166 bits (615), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 173/196 (89%), Positives = 182/196 (93%), Gaps = 0/196 (0%)
Query 1 MNEENDKLETSKKTQQHSPQDLSNEETIKANHFedsskeskessdHHLDNSTETKTNFDE 60
MNEENDKLETSKKTQQHSPQDLSNEE+ NHFEDSSKESKESSDHHL+N+TETKTNFD
Sbjct 1 MNEENDKLETSKKTQQHSPQDLSNEEATEINHFEDSSKESKESSDHHLNNPTETKTNFDG 60
Query 61 YKSEETQTQMDsggnetsessnssLADKLFKKARKLVDNKRPFTQQKNLdeeiqepneed 120
KSEETQTQMDSGG+ETSESSN+SLADKLFKKARKLVDNKRPFTQQKNLDEE QE NEED
Sbjct 61 DKSEETQTQMDSGGDETSESSNGSLADKLFKKARKLVDNKRPFTQQKNLDEETQELNEED 120
Query 121 dqennGYQEETQMDLIDDETSKKTQQHSPQDLSNEETIKANHFedsskeskessdHHLDN 180
DQENNGYQEETQ +LIDDETSKKTQQHSPQDLSNEE+ NHFEDSSKESKESSD LDN
Sbjct 121 DQENNGYQEETQTGLIDDETSKKTQQHSPQDLSNEEATEINHFEDSSKESKESSDPYLDN 180
Query 181 STETKTNFDGEKSEEI 196
+TETKTNFDG+KSEE
Sbjct 181 PTETKTNFDGDKSEET 196
Score = 151 bits (556), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 164/188 (88%), Positives = 172/188 (92%), Gaps = 0/188 (0%)
Query 9 ETSKKTQQHSPQDLSNEETIKANHFedsskeskessdHHLDNSTETKTNFDEYKSEETQT 68
ETSKKTQQHSPQDLSNEE+ NHFEDSSKESKESSD LDN+TETKTNFD KSEETQT
Sbjct 139 ETSKKTQQHSPQDLSNEEATEINHFEDSSKESKESSDPYLDNPTETKTNFDGDKSEETQT 198
Query 69 QMDsggnetsessnssLADKLFKKARKLVDNKRPFTQQKNLdeeiqepneeddqennGYQ 128
QMDSGG+ETSESSN+SLADKLFKKARKLVDNKRPFTQQKNLDEE QE NEEDDQENNGYQ
Sbjct 199 QMDSGGDETSESSNGSLADKLFKKARKLVDNKRPFTQQKNLDEETQELNEEDDQENNGYQ 258
Query 129 EETQMDLIDDETSKKTQQHSPQDLSNEETIKANHFedsskeskessdHHLDNSTETKTNF 188
EETQ +LIDDETSKKTQQHSPQDLSNEE+ NHFEDSSKESKESSD LDN+TETKTNF
Sbjct 259 EETQTGLIDDETSKKTQQHSPQDLSNEEATEINHFEDSSKESKESSDPYLDNPTETKTNF 318
Query 189 DGEKSEEI 196
DG+KSEE
Sbjct 319 DGDKSEET 326
Score = 124 bits (453), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 109/307 (36%), Positives = 167/307 (55%), Gaps = 76/307 (24%)
Query 1048 DCLSQARNEEERKACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTP 1107
+C+S++ N EE C KL+ K L++++KK++ AY DC+ A+ E E+ C L+
Sbjct 573 ECISNGGNYEE---CLKLIKD---KKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKD 626
Query 1108 EA--RKFLAKQ----VLSCLEKARNEEERKACLK-----NIPKDLQKNVLAKESLKAYKD 1156
E + +L +Q L CL +A+ +EERK CLK +I ++K++ + L+ YKD
Sbjct 627 ENLKKSLLNQQKVQVALDCLKNAKTDEERKECLKLINDPEIREKFRKELELQKELQEYKD 686
Query 1157 CLSQARNEEERKACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPE 1216
C+ +A+ E E+K CEKLLTPEA+KLLE+E K SVKAYLDCVS+A+ E EK+ECEKLLTPE
Sbjct 687 CIKNAKTEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPE 746
Query 1217 ARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLK 1276
AR K L++ K++K YL DC+
Sbjct 747 AR----KKLEEAKKSVKA---------------------------YL---------DCVS 766
Query 1277 TARTDEEKRKCQNLYSDLIQEIQNKRTQSKQNQLSKTERLHQASECLDNLDDPTDQEAIE 1336
A+ ++EK+ C++L + + + + +QA +CL N + +E
Sbjct 767 QAKNEDEKKECEKLLTPEARKLLE----------------QQALDCLKN---AKTDEERK 807
Query 1337 KCLEGLS 1343
KCL +L+
Sbjct 808 KCLKDLP 814 |