Gene Information

Name : HPSH_04215 (HPSH_04215)
Accession : YP_001910308.1
Strain : Helicobacter pylori Shi470
Genome accession: NC_010698
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein CagA
Function : -
COG functional category : D : Cell cycle control, cell division, chromosome partitioning
COG ID : COG1196
EC number : -
Position : 819912 - 823169 bp
Length : 3258 bp
Strand : -
Note : -
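
The Position and Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the "Position" field (assumed 1-based, inclusive).
start, end = 819912, 823169

# Inclusive interval length in base pairs.
length_bp = end - start + 1

# A complete coding sequence is a whole number of codons.
codons, remainder = divmod(length_bp, 3)

# Assuming the last codon is a stop, the protein is one residue shorter.
protein_len = codons - 1

print(length_bp, codons, remainder, protein_len)
```

Under these assumptions the computed length agrees with the listed Length of 3258 bp.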

DNA sequence :
ATGGCTAACGAAACCATCAATCAAACAAAAAATCCAGATCAAACACCAAACCAAACGGCTTTTGATCCACAACAATTTAT
CAATAATCTTCAAGTGGCTTTCATTAAAGTTGATAGCGCTGTCGCTTCATTTGATCCCGATCAAAAACCAATCGTTGATA
AGAACGATAGGGATAACAGGCAAGCTTTTAATGGAATCTCGCAATTAAGGGAAGAATACGCCAATAAAGCGATCAAAAAT
CCCAACAAAAAGAATCAGTATTTTTCAGACTTTATCAATAAGAGCAATGATTTGATCAACAAAGACAATCTCATTGATAC
AGATTCTTCCACAAAGAGCTTTCAGAAATTTGGGCCTGAGCCTTACCAAATTTTTATGAATTGGGTGTCCCATCAAAAAG
ATCCGTCTAAAATCAACACCCAAAAAATCCGAGATTTTATGGAAAATATCATACAACCCCCTATCTCTGATGATAAAGAA
AAAGCGGAGTTTTTGAGGTCTGCCAAACAATCTTTTGCAGGAATTATCATAGGAAACCAAACCCGATCGGATGAAAAATT
CATGGGCGTGTTTGGTGAATCTTTGGATGAATCTTTGGATGAATCTTTGGATGAATCTTTGGATGAATCTTTGGATGAAT
CTTTGAGAGAGAAGCAAGAAGCAGGAAAAAATGGGGATTGGCTTGATATTTTTTTATCGTTTGTGTTTAACAAAAAACAA
TCTTCCGATCTCAAAGAAACGCTCAATCAAGAGCCAAGGTCTAATGTTGAACAAAATATAGCCACTACCCCCACCCCCAT
ACAAGGCTTACCGCCTGAAGCTAGGGATTTGCTTGATGAAAGGGGTGATTTTTCTAAATTCACTCTTGGTGATATGGAAA
TGTTGGATGTTGAGAGGCTCAACCAGTTATTGATTCACAATAACGCTCTGTCTTCTATGCTAATGGGGAGTCATAGCAAC
ATAAAACCTGAAAAAGTTTCATTGTTGTATGGGGATAATGGTGGCCCTGAAGCTAGGCATGATTGGAACGCCACCGTTGG
TTATAAAAACCAACAAGGCAACAATGTGGCCACACTCATCAATGCACATCTTAAAAACAGCAGTGGGTTAATCATAGCGG
GTAATGAAAATGGGATTAACAACCCTAGCTTCTATCTCTACAAAAAAGACCAACTCACAGGCTTGGAACAAGCGTTGAGT
CAAGAAGAGATCCAAAACAAACTAGGTTTCATGGAATTTCTTGCACAAAACAGCGCTAGACATGTTGGATTAAATAACTT
GAGCAAGGAAGAGAAAGAAAAGTTCCAAACTGAAATTGGAAATTTCCAAAAAGACCCTAAGGCTTATTTAGACACCCTAG
GGAGTGATCACATTGCTTTTGTTTCTAAAAAAGACCAAAAGCATTTAGCTTTGGTTACTGAGTTTGGCAATGGGGAATTG
AGCTATACCCTCAAAGATTATGGGAAAAAACCAGATAGAGCTTTAGATAGGGAGACAAAAACCACTCTTCAAGGTAACCT
AAAAGATGATGGCGTGATGTTTGTCAATTATTCCAATTTCAAATACACCAACGCCTCCAAGAGTCCTAATGAGGGTATAG
GCGCTACGAATGGCGTTTCCCATTTGGAAGCAAATTTTAGCAAGGTAGCTGTCTTTAATTTGCCTGATTTAAATGGTCTC
GCTGTCTCTAGTTTTGCAAGGCGGAATTTAGAGGATAAACTGGCCGCTAAAGGATTGTCCGGAAAAGAATCTAATAAGAT
CATCAAAGACTTTTTGAACAGCAACAAGGAATTGCTTGAAAAAGTTTTAAACTTCAATAAAGCTGTAGCTGAAGCTAAAA
ATACAGGCAATTATGGCGGAGTGAAAAAAGCTCAAAAAGATCTTGAAAAATCTATAAGGAAACGAGAGCTTTTAGAGAAA
GAAGTAACGAAACAATTTGAGAGCAAGAGCGGCAACAAAAATAAAATGGAAGCAAAAGCTCAAGCTAACAGCCAAAAAGA
TGAGATTTTTAAGCTTATCAATGAAGGGGCTTATAAGGAAGCAAGAATCATCGCTTACGCTCAGAATCTTAAAGGCATCA
GGAGGGAATTGAGGGAATTGTTTGATAAAATTGGAAATATCAACAAGAATTTGAAAGACTTTAATCAATCTTTTGATGCG
CTCAAAAGTGGTAAAAATAAGGATTTCAGCAAGGTAGAAGAAACGCTAAAAGCCCTTAAAAGCTCGGTGAAAGATTTGAA
TATCAATCCAGAATGGATTTCAAAAGTTGAAAACCTTAATGTAGCTTTGAATGAATTCAAAAATGGCAAAAATAAGGATT
TCAGCAAGGTAACACAAGCAAAAAGCGACCTTGAAAATTCCATTAAGGATGTGCACATCAATCAACAGATAACGGATAAA
GTTGACAATCTCAATCAGGCTGTATTAGTGGCTAAAGCGACAGGCGATTTTAGTGGGGTAGAGCAAGCGCTAGCCGGACT
CAAAAACTTCAATGTTGGAAAAAATTCTGATAGGTCTGAACCCATTTACGCCACGATTGATGATCTCGACGGATCTTCCC
CTTTGAAAAGGTATGCTAAAGTTGATGATCTCAGTAAGGTAGGGCAATCAGATAGCCCTGAACCCATTTACGCTAATCTC
GGCGGATCTTCCCCTTTGAAAAGGCATGCTAAAGTTGATGATCTCAGTAAGGTAGGGCTTTCAAGGGAGCAAGAATTGAC
TCAGAAAATTGGCAATCTCAATCAGGCAGTGTCAGAAGCTAAAGCAGGTTCTTTTGGCAACCTAGAACAAACGATGGATG
GACTCAAAGATTCTACAAAAAAGAATGTTGTGAATCTATGGTTTGAAGGTGCAAGAAAAGTGCCTATTAGTTTGCCTAGT
TCGCAAGCGAAATTGGACAATTACGCTACTAACAGCCACACACGCATTAATAGCAATGTCAAAAATGGAACAGTCAATGA
AAAAGCGACCATCATGCTAACGCAAAAAAACCCTGAGTGGCTTAAGCTCGTGAATGATAAGATAGTTGCGCATAATGTGG
GAAGCACTCCTTTGTCAGATTATGATAAAATTGGATTCAACCAAAAGAATATGAAAGATTACTCTGATTCGTTCAAGTTT
TCCATCAAGTTGAGTAATGCCGTAAAAAACATTAAGTCTGGCTTTGTGCAATGTTTAACCGATTGCATTTCTGCAGGATC
TTACAGCCCAAAGAAAGCGGAACATGGAGTTACAAAAAGTGGTTTCCAGAAATCTTAA

Protein sequence :
MANETINQTKNPDQTPNQTAFDPQQFINNLQVAFIKVDSAVASFDPDQKPIVDKNDRDNRQAFNGISQLREEYANKAIKN
PNKKNQYFSDFINKSNDLINKDNLIDTDSSTKSFQKFGPEPYQIFMNWVSHQKDPSKINTQKIRDFMENIIQPPISDDKE
KAEFLRSAKQSFAGIIIGNQTRSDEKFMGVFGESLDESLDESLDESLDESLDESLREKQEAGKNGDWLDIFLSFVFNKKQ
SSDLKETLNQEPRSNVEQNIATTPTPIQGLPPEARDLLDERGDFSKFTLGDMEMLDVERLNQLLIHNNALSSMLMGSHSN
IKPEKVSLLYGDNGGPEARHDWNATVGYKNQQGNNVATLINAHLKNSSGLIIAGNENGINNPSFYLYKKDQLTGLEQALS
QEEIQNKLGFMEFLAQNSARHVGLNNLSKEEKEKFQTEIGNFQKDPKAYLDTLGSDHIAFVSKKDQKHLALVTEFGNGEL
SYTLKDYGKKPDRALDRETKTTLQGNLKDDGVMFVNYSNFKYTNASKSPNEGIGATNGVSHLEANFSKVAVFNLPDLNGL
AVSSFARRNLEDKLAAKGLSGKESNKIIKDFLNSNKELLEKVLNFNKAVAEAKNTGNYGGVKKAQKDLEKSIRKRELLEK
EVTKQFESKSGNKNKMEAKAQANSQKDEIFKLINEGAYKEARIIAYAQNLKGIRRELRELFDKIGNINKNLKDFNQSFDA
LKSGKNKDFSKVEETLKALKSSVKDLNINPEWISKVENLNVALNEFKNGKNKDFSKVTQAKSDLENSIKDVHINQQITDK
VDNLNQAVLVAKATGDFSGVEQALAGLKNFNVGKNSDRSEPIYATIDDLDGSSPLKRYAKVDDLSKVGQSDSPEPIYANL
GGSSPLKRHAKVDDLSKVGLSREQELTQKIGNLNQAVSEAKAGSFGNLEQTMDGLKDSTKKNVVNLWFEGARKVPISLPS
SQAKLDNYATNSHTRINSNVKNGTVNEKATIMLTQKNPEWLKLVNDKIVAHNVGSTPLSDYDKIGFNQKNMKDYSDSFKF
SIKLSNAVKNIKSGFVQCLTDCISAGSYSPKKAEHGVTKSGFQKS
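
The protein sequence above corresponds to a codon-by-codon translation of the DNA sequence as listed. A minimal sketch of that translation follows, assuming the standard codon assignments; the helper `translate` and the constants are hypothetical names introduced for illustration, and only the first 30 nucleotides of the record are used.

```python
# Standard codon assignments, laid out in T, C, A, G order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the DNA sequence in this record.
prefix = "ATGGCTAACGAAACCATCAATCAAACAAAA"
print(translate(prefix))
```

The result can be compared with the first ten residues of the protein sequence listed above.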

• Homologs from PAI DB

Gene GenBank Accn Product Virulence or Resistance PAI or REI Alignment Type E-val Identity (%)
cagA BAD51750.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 79
cagA BAD51759.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 78
cagA BAC10429.1 CagA Virulence cag PAI Protein 0.0 76
HP0547 BAD13935.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 76
cagA BAD51747.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 75
cagA BAC10432.1 CagA Virulence cag PAI Protein 0.0 71
HP0547 BAD13990.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 71
cagA BAD51752.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 70
cagA BAD51758.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 70
cagA BAD51766.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 70
cagA NP_223213.1 cag island protein, cytotoxicity associated immunodominant antigen Virulence cag PAI Protein 0.0 70
cagA AAR03939.1 CagA Virulence cag PAI Protein 0.0 70
cagA BAD51760.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 69
cagA BAD51751.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 69
cagA BAD51761.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 69
cagA AAF17598.1 CagA Virulence cag PAI Protein 0.0 69
cagA AAC44706.1 CagA Virulence cag PAI Protein 0.0 68
cagA BAD51756.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 67
cagA YP_005774524.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 67
cagA BAC10419.1 CagA Virulence cag PAI Protein 0.0 67
cagA BAD51755.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 66
cagA BAC10431.1 CagA Virulence cag PAI Protein 0.0 66
HP0547 BAD13963.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 66
cagA BAD51753.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 66
cagA BAC10420.1 CagA Virulence cag PAI Protein 0.0 66
cagA YP_005777288.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 66
cagA BAD51754.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 66
cagA YP_005775747.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 66
cagA BAC10435.1 CagA Virulence cag PAI Protein 0.0 66
HP0547 BAD14072.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 66
HP0547 BAD13880.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 66
cagA AAF17597.1 CagA Virulence cag PAI Protein 0.0 66
cagA BAC10421.1 CagA Virulence cag PAI Protein 0.0 66
HP0547 BAD13799.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 66
cagA YP_005779046.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 66
cagA BAD51749.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 66
cagA BAD51762.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 66
HP0547 BAD14018.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 66
cagA BAC10433.1 CagA Virulence cag PAI Protein 0.0 66
cagA AAR03909.1 CagA Virulence cag PAI Protein 0.0 65
cagA BAD51764.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 65
cagA BAC10426.1 CagA Virulence cag PAI Protein 0.0 65
cagA BAD51744.1 cytotoxin associated protein A Virulence cag PAI Protein 0.0 65
cagA YP_003728762.1 cytotoxin-associated protein A Virulence cag PAI Protein 0.0 65
cagA AGC69806.1 cag pathogenicity island protein A Virulence cag PAI Protein 0.0 65
cagA AAR03881.1 CagA Virulence cag PAI Protein 0.0 65
cagA AAR03970.1 CagA Virulence cag PAI Protein 0.0 64
HP0547 NP_207343.1 cag pathogenicity island protein (cag26) Virulence cag PAI Protein 0.0 64
HP0547 BAD14045.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 63
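
Because the columns of the table above are separated only by spaces and the Product field may itself contain spaces, rows are easiest to split from both ends. The sketch below assumes every row ends with a one-word category, a two-word island name (such as "cag PAI"), the alignment type, the E-value, and the identity; `Homolog` and `parse_row` are hypothetical names, not part of PAI DB.

```python
from typing import NamedTuple

class Homolog(NamedTuple):
    gene: str
    accession: str
    product: str
    category: str    # "Virulence or Resistance" column
    island: str      # "PAI or REI" column
    alignment: str   # "Alignment Type" column
    evalue: float
    identity: int

def parse_row(line: str) -> Homolog:
    """Split one flattened row, taking fixed fields from both ends."""
    t = line.split()
    return Homolog(
        gene=t[0],
        accession=t[1],
        product=" ".join(t[2:-6]),   # everything between accession and category
        category=t[-6],
        island=" ".join(t[-5:-3]),   # assumes a two-word island name
        alignment=t[-3],
        evalue=float(t[-2]),
        identity=int(t[-1]),
    )

row = "HP0547 NP_207343.1 cag pathogenicity island protein (cag26) Virulence cag PAI Protein 0.0 64"
print(parse_row(row))
```

Rows with a different number of words in the trailing columns would need a different split, so this layout assumption should be checked before reuse.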

• Homologs from VFDB (virulence genes)

Gene GenBank Accn Product ID of source DB Alignment Type E-val Identity (%)
HPSH_04215 YP_001910308.1 cag pathogenicity island protein CagA VFG0306 Protein 0.0 64