Gene Information

Name : hp2018_0531 (hp2018_0531)
Accession : YP_005791362.1
Strain : Helicobacter pylori 2018
Genome accession: NC_017381
Putative virulence/resistance : Virulence
Product : cag island protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 530235 - 533789 bp
Length : 3555 bp
Strand : +
Note : -
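
The Position and Length fields above are consistent if the coordinates are read as 1-based and inclusive, which is an assumption here (the record does not state its convention). A minimal sketch of that arithmetic follows; the helper name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


start, end = 530235, 533789          # Position field of this record
length_bp = feature_length(start, end)

# A complete coding sequence should be a whole number of codons.
codons, remainder = divmod(length_bp, 3)

# The final codon is the stop codon, so the protein is one residue shorter.
protein_len = codons - 1

print(length_bp, codons, remainder, protein_len)
```

Under that assumption the computed length matches the Length field (3555 bp), and the 1185 codons correspond to a 1184-residue protein plus one stop codon.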

DNA sequence :
ATGACTAACGAAACTATTGACCAACAACCACAAACCGAAGCGGCTTTTAACCCGCAGCAATTTATCAATAATCTTCAAGT
AGCTTTTCTTAAGGTTGATAACGCTATCGCTTCATTCGATCCTGATCAAAAACCAATCGTTGATAAGAACGATAGGGATA
ACAGGCAAGCTTTTGATGGAATCTCGCAATTAAGGGAAGAATACTCCAATAAAGCGATCAAAAATCCTACCAAAAAGAAT
CAGTATTTTTCAGACTTCATCAATAAGAGCAATGATTTAATCAACAAAGACAATCTCATTGATGTAGAATCTTCCACAAA
GAGCTTTCAGAAATTTGGGGATCAGCGTTACCGAATTTTCACAAGTTGGGTGTCCCATCAAAACGATCCGTCTAAAATCA
ACACCCGATCGATCCGAAATTTTATGGAACATGCCATACAACCCCCTATCCCTGATGACAAAGAAAAAGCAGAGTTTTTG
AAATCTGCCAAACAATCTTTTGCAGGAATCATCATAGGGAATCAAATCCGAACGGATCAAAAGTTCATGGGCGTGTTTGA
TGAATCCTTGAAAGAAAGGCAAGAAGCGGAAAAAAATGGAGGGTCTACTGGTGGGGATTGGTTGGATATTTTTTTATCAT
TTATATTTGACAAAAAACAATCTTCTGATGTCAAAGAAGCGATCAATCAAGAACCAGTTCCTCATGTCCAACCAGATATA
GCCACTAGCACCACTCACATACAAGGCTTACCGCCTGAATCTAGGGATTTGCTTGATGAAAGGGGTAATTTTTCTAAATT
CACTCTTGGCGATATGGAAATGTTAGACGTTGAGGGCGTCGCTGACATGGATCCTAATTACAAGTTCAATCAATTATTGA
TTCACAATAACGCTCTGTCTTCTGTGTTAATGGGGAGTCATGATGGCATAGAACCTGAAAAAGTTTCATTATTGTATGCG
GGCAATGGTGGTTTTGGAGACAAGCACGATTGGAACGCCACCGTTGGTTATAAAGACCAACAAGGTAACAATGTGGCTAC
AATAATTAATGTGCATATGAAAAACGGCAGTGGCTTAATCATAGCAGGTGGTGAGAAAGGGATTAACAACCCTAGTTTTT
ATCTCTACAAAGAAGACCAACTCACAGGCTCACAACGAGCATTAAGTCAAGAAGAGATCCAAAACAAAATAGATTTCATG
GAATTTCTTGCACAAAACAATGCTAAATTAGACAGCTTGAGCGAGAAAGAGAAAGAAAAATTCAAAAATGAGATTAAGGA
TTTCCAAAAAGACTCTAAGCCTTATTTAGACGCCCTAGGGAATGATCGTATTGCTTTTGTTTCTAAAAAAGACCCAAAAC
ATTCAGCTTTAATTACTGAGTTTAATAAGGGGGATTTGAGCTACACTCTCAAAGATTATGGGAAAAAAGCAGATAAAGCT
TTAGATAGGGAGAAAAATGTCACTCTCCAAGGTAACCTAAAACATGATGGCGTGATGTTTGTTAATTATTCTAATTTCAA
ATACACCAACGCCTCCAAGAGTCCCAATAAGGGTGTAGGCGTTACGAATGGCGTTTCCCATTTAGAAGCAGGCTTTAGCA
AGGTAGCTGTCTTTAATTTGCCTAATTTAAATAATCTCGCTATCACTAGTGTCGTAAGGCGGGATTTAGAGGATAAACTA
ATCGCTAAAGGATTGTCCCCACAAGAAGCTAATAAGCTTGTCAAAGATTTTTTGAGCAGCAACAAAGAATTGGTTGGAAA
AGCTTTAAACTTCAATAAAGCTGTAGCTGAAGCTAAAAACACAGGCAACTATGACGAGGTGAAACGAGCTCAGAAAGATC
TTGAAAAATCTCTAAAGAAACGAGAGCATTTGGAGAAAGATGTAGCGAAAAATTTGGAGAGCAAAAGCGGCAACAAAAAT
AAAATGGAAGTAAAATCTCAAGCTAACAGCCAAAAAGATGAGATTTTTGCGTTGATCAATAAAGAGGCTAATAGGGATGC
AAGAGCAATCGCTTACGCTCAAAATCTTAAAGACATCAAAAGGGAATTGTCTGATAAACTTGAAAATATCAGCAAGGATT
TGAAAGACTTTAGTAAATCTTTTGATGAATTCAAAAATGGCAAAAGTAAGGATTTCAGCAAGGTAGAAGAAACGCTAAAA
GCCCTTAAAGGCTCGGTGAAAGATTTAGGTATCAATCCAGAATGGATTTCAAAAGTTGAAAACCTTAATGCAGCTTTGAA
TGAATTCAAAAATGGCAAAAATAAGGATTTCAGTAAGGTAACGCAAGCAAAAAGCGACCTTGAAAATTCCATTAAAGATG
TGATCATCAATCAAAAGATAACGGATAAAGTTGATAATCTCAATCAAGCGGTATCAATGGCTAAAATAGCGGGTAATTTC
AGTGGGGTAGAGCAAGCGTTAGCCGATCTCAAAAATTTCTCAAAGGAGCAATTGGCTCAACAAGCTCAAAAAAATGAAAG
TTTCAATGTTGGAAAATCTGAAATATACCAATCCGTTAAGAATGGTGTGAACGGAACCCTAGTCGGTAATGGATTATCTG
GAATAGAGGCCACAGCTCTCGCCAAAAATTTTTCGGATATCAAGAAAGAATTGAATGAGAAATTTAAAAATTTCAATAAC
AATAACAATGGTCTCAAAAACGGCAAGGATAAAGGACCTGAAGAACCCATTTACGCTCAGGTTAATAAAAAGAAAACAGG
ACAAGTAGCTAGCCCTGAAGAACCCATTTATGCTCAAGTTGCTAAAAAGGTAACTCAAAAAATTGACCAACTCAATCAAG
CAGCAAGTGGTTTCGGTGGTGTAGGGCAAGCGGGCTTCCCTTTGAAAAGGCATGATAAAGTTGAAGATCTCAGTAAGGTA
GGGCGATCAGTTAGCCCTGAACCCATTTATGCTACGATTGATGATCTCGGCGGATCTTTCCCTTTGAGAAGAAGTGCCGC
AGTTGATGATCTCAGTAAGGTAGGGCGATCAAGGGAGCAAGAATTGACTCAGAAAATTGACAATCTCAGTCAAGCGGTAT
CAGAAGCTAAAGCAGGTTTTTTTGGCAATCTAGAGCGAACGATAGACAAGCTCAAAGATTCTACAAAAAACAATCCTGTG
AATCTATGGGCTGAAAATGCAAAAAAAGTGCCTGCTAGTTTGTCAGCGAAACTAGACAATTACGCTACTAACAGCCACAC
ACGCATTAATAGCAATATCCAAAATGGAGCGATCAATGAAAAAGCGACCGGTATGCTAACGCAAAAAAACCCTGAGTGGC
TCAAGCTCGTGAATGATAAGATCGTTGCGCATAATGTGGGAAGCGTTCCTTTGTCAGAGTATGATAAAATTGGCTTCAAC
CAGAAGAATATGAAAGATTATTCTGATTCGTTCAAGTTTTCCACCAAGTTGAACAATACTGTAAAAGACGTTAAGTCTGG
CTTTACGCAATTTTTAGCCAATGCATTTTCTACAGGGTATTACTCCTTGGCGCGGGAAAATGCAGAGCATGGAATCAAAA
ATGCTAATACAAAAGGTGGTTTCCAAAAATCTTAA

Protein sequence :
MTNETIDQQPQTEAAFNPQQFINNLQVAFLKVDNAIASFDPDQKPIVDKNDRDNRQAFDGISQLREEYSNKAIKNPTKKN
QYFSDFINKSNDLINKDNLIDVESSTKSFQKFGDQRYRIFTSWVSHQNDPSKINTRSIRNFMEHAIQPPIPDDKEKAEFL
KSAKQSFAGIIIGNQIRTDQKFMGVFDESLKERQEAEKNGGSTGGDWLDIFLSFIFDKKQSSDVKEAINQEPVPHVQPDI
ATSTTHIQGLPPESRDLLDERGNFSKFTLGDMEMLDVEGVADMDPNYKFNQLLIHNNALSSVLMGSHDGIEPEKVSLLYA
GNGGFGDKHDWNATVGYKDQQGNNVATIINVHMKNGSGLIIAGGEKGINNPSFYLYKEDQLTGSQRALSQEEIQNKIDFM
EFLAQNNAKLDSLSEKEKEKFKNEIKDFQKDSKPYLDALGNDRIAFVSKKDPKHSALITEFNKGDLSYTLKDYGKKADKA
LDREKNVTLQGNLKHDGVMFVNYSNFKYTNASKSPNKGVGVTNGVSHLEAGFSKVAVFNLPNLNNLAITSVVRRDLEDKL
IAKGLSPQEANKLVKDFLSSNKELVGKALNFNKAVAEAKNTGNYDEVKRAQKDLEKSLKKREHLEKDVAKNLESKSGNKN
KMEVKSQANSQKDEIFALINKEANRDARAIAYAQNLKDIKRELSDKLENISKDLKDFSKSFDEFKNGKSKDFSKVEETLK
ALKGSVKDLGINPEWISKVENLNAALNEFKNGKNKDFSKVTQAKSDLENSIKDVIINQKITDKVDNLNQAVSMAKIAGNF
SGVEQALADLKNFSKEQLAQQAQKNESFNVGKSEIYQSVKNGVNGTLVGNGLSGIEATALAKNFSDIKKELNEKFKNFNN
NNNGLKNGKDKGPEEPIYAQVNKKKTGQVASPEEPIYAQVAKKVTQKIDQLNQAASGFGGVGQAGFPLKRHDKVEDLSKV
GRSVSPEPIYATIDDLGGSFPLRRSAAVDDLSKVGRSREQELTQKIDNLSQAVSEAKAGFFGNLERTIDKLKDSTKNNPV
NLWAENAKKVPASLSAKLDNYATNSHTRINSNIQNGAINEKATGMLTQKNPEWLKLVNDKIVAHNVGSVPLSEYDKIGFN
QKNMKDYSDSFKFSTKLNNTVKDVKSGFTQFLANAFSTGYYSLARENAEHGIKNANTKGGFQKS
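
The protein sequence above can be cross-checked against the DNA sequence by translating it codon by codon. The sketch below assumes the standard codon assignments (the bacterial translation table 11 uses the same assignments and differs only in permitted start codons); `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (standard layout).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; a single trailing stop is dropped."""
    dna = "".join(dna.split()).upper()   # tolerate line breaks in the input
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 36 bases of the DNA sequence in this record.
print(translate("ATGACTAACGAAACTATTGACCAACAACCACAAACC"))  # MTNETIDQQPQT
```

Passing the full 3555 bp sequence (with line breaks left in) should yield a string that can be compared directly with the protein sequence listed above.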

• Homologs from PAI DB

Gene GenBank Accn Product Virulence or Resistance PAI or REI Alignment Type E-val Identity
HP0547 BAD14045.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 92
cagA BAD51747.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 91
cagA AAR03970.1 CagA Virulence cag PAI Protein 0.0 91
HP0547 NP_207343.1 cag pathogenicity island protein (cag26) Virulence cag PAI Protein 0.0 90
cagA AAR03881.1 CagA Virulence cag PAI Protein 0.0 90
cagA AAC44706.1 CagA Virulence cag PAI Protein 0.0 87
cagA AAR03909.1 CagA Virulence cag PAI Protein 0.0 87
cagA BAC10428.1 CagA Virulence cag PAI Protein 0.0 87
HP0547 BAD13908.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 87
cagA AAF17598.1 CagA Virulence cag PAI Protein 0.0 86
HP0547 BAD13935.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 84
cagA BAC10429.1 CagA Virulence cag PAI Protein 0.0 84
cagA NP_223213.1 cag island protein, cytotoxicity associated immunodominant antigen Virulence cag PAI Protein 0.0 83
cagA BAD51756.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 83
cagA AGC69806.1 cag pathogenicity island protein A Virulence cag PAI Protein 0.0 83
cagA BAD51750.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 82
cagA BAD51758.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 82
cagA BAD51751.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 82
cagA BAD51760.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 82
cagA YP_003728762.1 cytotoxin-associated protein A Virulence cag PAI Protein 0.0 82
cagA BAD51753.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 82
cagA AAR03939.1 CagA Virulence cag PAI Protein 0.0 82
cagA BAD51752.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 81
cagA BAD51766.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 81
cagA BAD51761.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 81
cagA BAD51759.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 80
cagA BAD51764.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 80
cagA BAC10430.1 CagA Virulence cag PAI Protein 0.0 79
cagA BAC10419.1 CagA Virulence cag PAI Protein 0.0 78
cagA BAC10435.1 CagA Virulence cag PAI Protein 0.0 78
HP0547 BAD14072.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 78
cagA BAC10424.1 CagA Virulence cag PAI Protein 0.0 78
cagA YP_005774524.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 77
cagA YP_005777288.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 77
cagA BAC10426.1 CagA Virulence cag PAI Protein 0.0 77
cagA BAD51749.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 77
cagA YP_005779046.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 77
cagA BAD51755.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 77
cagA BAC10431.1 CagA Virulence cag PAI Protein 0.0 77
HP0547 BAD13963.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 77
cagA BAC10421.1 CagA Virulence cag PAI Protein 0.0 77
HP0547 BAD13799.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 77
cagA YP_005775747.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 77
HP0547 BAD13880.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 77
cagA AAF17597.1 CagA Virulence cag PAI Protein 0.0 77
cagA BAC10432.1 CagA Virulence cag PAI Protein 0.0 77
HP0547 BAD13990.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 77
cagA BAD51744.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 76
cagA BAC10420.1 CagA Virulence cag PAI Protein 0.0 76
cagA BAD51754.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 76
cagA BAC10433.1 CagA Virulence cag PAI Protein 0.0 76
HP0547 BAD14018.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 76
cagA BAD51762.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 76
HP0547 BAD13853.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 74
HP0547 BAD13826.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 74
cagA BAC10422.1 CagA Virulence cag PAI Protein 0.0 74
cagA BAC10427.1 CagA Virulence cag PAI Protein 0.0 73
cagA BAC10423.1 CagA Virulence cag PAI Protein 0.0 68
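
Because the rows above are space-separated and the Product column contains spaces, splitting on whitespace alone does not recover the columns. One possible approach is to anchor on the fixed vocabulary of the neighbouring columns, as sketched below. The pattern assumes the category is either `Virulence` or `Resistance` and the alignment type is `Protein` or `Nucleotide`; only `Virulence` and `Protein` actually occur in this record, so the other values are assumptions, and `parse_homolog_row` is a hypothetical helper.

```python
import re

ROW = re.compile(
    r"^(?P<gene>\S+)\s+(?P<accession>\S+)\s+(?P<product>.+?)\s+"
    r"(?P<category>Virulence|Resistance)\s+(?P<island>.+?)\s+"
    r"(?P<alignment>Protein|Nucleotide)\s+(?P<evalue>\S+)\s+(?P<identity>\d+)$"
)


def parse_homolog_row(line: str):
    """Split one flattened homolog row into named fields, or return None."""
    match = ROW.match(line.strip())
    if match is None:
        return None
    row = match.groupdict()
    row["evalue"] = float(row["evalue"])      # E-val column
    row["identity"] = int(row["identity"])    # Identity column, as listed
    return row


example = ("cagA NP_223213.1 cag island protein, cytotoxicity associated "
           "immunodominant antigen Virulence cag PAI Protein 0.0 83")
print(parse_homolog_row(example))
```

The match is case-sensitive, so the lowercase word "protein" inside a product name is not mistaken for the Alignment Type column.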

• Homologs from VFDB (virulence genes)

Gene GenBank Accn Product ID of source DB Alignment Type E-val Identity
hp2018_0531 YP_005791362.1 cag island protein VFG0306 Protein 0.0 90