Gene Information

Name : HPCU_03005 (HPCU_03005)
Accession : YP_005766675.1
Strain : Helicobacter pylori Cuz20
Genome accession: NC_017358
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein (cagA, cag26)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 596990 - 600400 bp
Length : 3411 bp
Strand : +
Note : -
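
The Position and Length fields above agree if the coordinates are read as 1-based and inclusive. A minimal Python sketch of that arithmetic follows; the function name `feature_length` is illustrative and not part of any database API, and the inclusive-coordinate convention is an assumption based on the reported length.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


length_bp = feature_length(596990, 600400)  # coordinates from the Position field
codons = length_bp // 3                     # codon count, including the terminal stop codon
residues = codons - 1                       # expected protein length without the stop

print(length_bp, codons, residues)  # 3411 1137 1136
```

The length is a multiple of three, as expected for a complete coding sequence on a single strand.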

DNA sequence :
ATGGCTAACGAAACCATCGATCAAACAGAAAAACTAGATCAAACACCAATCGTTGATAAGAACGATAATAAAGCGATCAA
AAATCCCACCAAAAAGAATCAGTTTTTTTCAACTTTTTTTACATCTAGCAAAAAACAATCTTCCGATCTCAAAGAAACGC
TCAATCAAGAGCCAAAGCCTAGCGTTGAACAAAATATAGCCACTACCACCACCACCAACATACAAGGTTTACCTGAAGCT
AGGGATTTGCTTGATGGAAGGGGTAATTTTCCTAAATTCACTCTCGGTGATATGGAAATGTTGGATGTTGAGGGTATCGC
CGACATTGATCCTAATTACAAGTTCAACCAGTTATTGATTCACAATAATGCTCTGTCTTCTATGTTAATGGGGAGTCATA
GCAACATAGGACCTGAAAAAGTTTCATTGTTGTATGGGGATAATGGTGGCCCTGAAGCTAGGCATGATTGGAACGCCACC
GTTGGTTATAAAAACCAACAAGGCAACAATGTGGCCACACTCATCAATATGCATCTTAAAAACGGCAGTGGGTTAGTCAT
AGCGGGTAATGAAGGTGGGATTAACAACCCTAGCTTCTATCTCTACAAAAAAGACCAACTCACAGGCTTGGAACAAGCGT
TGAGTCAAGAAGAGATCCAAAACAAAGTGGATTTCATGGAACTTCTTGCACAAAACAGTGCTAGGTTAGATAACTTGAGC
GAGAAAGAGAAAGAAAAGTTCCAAACTGAAATTGGAAATTTCCAAAAAGACCCTAAGGCTTATTTAGACACCCTAGGGAG
TGATCACATTGCTTTTGTTTCTAAAAAAGACCAAAAGCATTTAGCTTTGGTTACTGAGTTTGGTAATGGGGAATTGAGCT
ATACCCTCAAAGATTATGGGAAAGAACAAGATAGAGCTTTAGATAGGGAGATAAAAACCACTCTTCAAGGTAACCTAAAA
CATGATGGCGTGATGTTTGTCAATTATTCCAATTTCAAATACACCAACGCCTCCAAGAGTCCTAATGAGGGTATAGGCGC
TACGAATGGCGTTTCCCATTTGGAAGCAAATTTTAGCAAGGTAGCTGTCTTTGATTTGCCTAAATTAAATGGTCTCGTTC
TCTCTAATCATCCTGTAAGGCAAAATTTAGAGGATAAACTGGCCGCTAAAGGATTGTCCCCAAAAGAAGCTAATAAGCTC
GTCAAAGACTTTTTGAACAGTAACAAGGAATTGGTTGGAAAAGTTTCAAGTTTCAATAAAGCTGTAGCTGAAGCTAAAAA
TACAGGCAATTATGACGGAGTGAAAAAAGCTCAAAAAGATCTTGAAAAATCTATAAGGAAACGAGAGCATTTAGAGAAAG
AAGTAACGAAAAAAATTGAGAGCAAGAGCGGCAACAAAAATAAAATGGAAGCAAAAGCTCAAGCTAACAGCCAAAAAGAT
GAGATTTTTGCGCTTATCAATCAAGAGACTCATAAGGAAGCAAGAAATGCCAGTTACGCTCAGAATCTTGAAGGCATCAG
GAGGGAATTGTCTGATAAAATTGAAAATATCAACAAGAATTTGAAAGACTTTAATAAATCTTTTGATGCACTCAAAAGTG
GCAAAAATAAGGATTTCAGCAAGGTAGAAGAAACGCTAAAAGCCCTTAAAAGCTCGGTGAAAGATTTGGGTATCAATCCA
GAATGGATTTCAAAAGTTGAAAACCTTAGTACAGCTTTGAATGAATTTAAAAATGGCAAAAATAAGGATTTCAGCAAGGT
AACACAAGCAAAAAGCGACCTTGAAAATTCCATTAAGGATGTGATCATCAATCAAGAAATAACGGATAAAGTTGACAATC
TCAATCAGGCTGTATTAGTGGCTAAAACGGCATACAATTTCAGTATGTTAGATCAAGCGCTAGCCGACCTCAAAAACTTC
TCAACGGATCAAAAATTGGATCAAAAAAATGAAAGTTTCAATGTTGGAAAAGATTCTGATCTACAAAAATCCGTTAAAAA
TGGTGTGAATGGAACCCTAGTCGGTAATGGGTTATCTAAAACAGAAGCCACAACGCTCACCAGAAAAATTTCGGATATTA
GGAAAGAATTGAATGAGAAATTTGCAAATTTCAACAAAAATAATGATGGACTCAAAAACAGCGCAGAGCCCATTTACGCT
CAAGTTAATAAAAGGAAAACAGGACAAGTAGCTAGCCCTGAAGGGTCCATTTACGATCAAGTTGCTAAAGCGGTAAATGA
AAAAATTGACCGACTCAACGAAAAAGCATCAGCAAGTAAAGGAGTGGGCAATTTTAGTGGAGCAGGGCGATTAGATAGCC
GTGAACCCATTTACGCTACGATTGATGATTTCGGCGGACCTTCCTCTTTGAAAAGGTATGCTAAAGTTGATGATCTCAGT
AAGGTAGGGCTTTCAAGAGAATCAGATAGCCGTGAACCCATTTACGCTACGATTGATGATCTCGGCGGACCTTCCTCTTT
GAAAAGGTATGCTAAAGTTGATGATCTCAGTAAGGTAGGGCTTTCAAGAGAATCAGATAGCCGTGAACCCATTTACGCTA
CGATTGATGATTTCGGCGGACCTTCCTCTTTGAAAAGGTATGCTAAAGTTGATGATCTCAGTAAGGTAGGGCTTTCAAGA
GAATCAGATAGCCATAAACCCATTTACGCTACGATTGATGATCTCGGCGGACCTTCCTCTTTGAAAAGGTATGCTAAAGT
TGATGATCTCAGTAAGGTAGGGCTTTCAAGAGAATCAGATAGCCATAAACCCATTTATGTTACGATTGATGATCTCGGCG
GACCTTATACTTTGAAAATGTATGTTGAAAGGGAGCAAGAATTGACTCAGAAAATTAGCAATCTCAATCAGGCAGCGTTA
GAAGCTAAAGCAAGTTCTTTTTGCAGCCCAAAACAAATGGCACATCTAGAACAAGCGATACAAGGACTCAAAGATTCTAC
AAAAAAGAATGTTATGAATCTATGGGTTGAAGATACAAAAAAAGTGTCTCCTAGTTTGCAAGCGAAATTGGACAATTACG
CTACTAACAGCCACATACACATTAATAGCAATGTCAAAAATGGAACAGTCAATGAAAAAGCGACCATCATGCTAACGCAA
AAAAACCCTGAGTGGCTTAAGCTCGTGAATGATAAGATAGTTGCACATAATGTGGGAAGCACTCCTTTGTCAGATTATGA
TAAAATTGGATTCAACCAAAAGGATATGAAAGGTTATTCTGATTCGTTCAAGTTTTCCACCAAGTTGAACAATGCCGCAA
AAAACACTAAGTCTGGCTTTGCGCAATTTTTAATCGATTGCATTTCTGCAGGATCTTACAGCCCGAAGAAAGCGGAATAT
GAAGATGGAGTTAAAAATATTAATACAAAAAGTGGTTTCCAAAAATCTTAA

Protein sequence :
MANETIDQTEKLDQTPIVDKNDNKAIKNPTKKNQFFSTFFTSSKKQSSDLKETLNQEPKPSVEQNIATTTTTNIQGLPEA
RDLLDGRGNFPKFTLGDMEMLDVEGIADIDPNYKFNQLLIHNNALSSMLMGSHSNIGPEKVSLLYGDNGGPEARHDWNAT
VGYKNQQGNNVATLINMHLKNGSGLVIAGNEGGINNPSFYLYKKDQLTGLEQALSQEEIQNKVDFMELLAQNSARLDNLS
EKEKEKFQTEIGNFQKDPKAYLDTLGSDHIAFVSKKDQKHLALVTEFGNGELSYTLKDYGKEQDRALDREIKTTLQGNLK
HDGVMFVNYSNFKYTNASKSPNEGIGATNGVSHLEANFSKVAVFDLPKLNGLVLSNHPVRQNLEDKLAAKGLSPKEANKL
VKDFLNSNKELVGKVSSFNKAVAEAKNTGNYDGVKKAQKDLEKSIRKREHLEKEVTKKIESKSGNKNKMEAKAQANSQKD
EIFALINQETHKEARNASYAQNLEGIRRELSDKIENINKNLKDFNKSFDALKSGKNKDFSKVEETLKALKSSVKDLGINP
EWISKVENLSTALNEFKNGKNKDFSKVTQAKSDLENSIKDVIINQEITDKVDNLNQAVLVAKTAYNFSMLDQALADLKNF
STDQKLDQKNESFNVGKDSDLQKSVKNGVNGTLVGNGLSKTEATTLTRKISDIRKELNEKFANFNKNNDGLKNSAEPIYA
QVNKRKTGQVASPEGSIYDQVAKAVNEKIDRLNEKASASKGVGNFSGAGRLDSREPIYATIDDFGGPSSLKRYAKVDDLS
KVGLSRESDSREPIYATIDDLGGPSSLKRYAKVDDLSKVGLSRESDSREPIYATIDDFGGPSSLKRYAKVDDLSKVGLSR
ESDSHKPIYATIDDLGGPSSLKRYAKVDDLSKVGLSRESDSHKPIYVTIDDLGGPYTLKMYVEREQELTQKISNLNQAAL
EAKASSFCSPKQMAHLEQAIQGLKDSTKKNVMNLWVEDTKKVSPSLQAKLDNYATNSHIHINSNVKNGTVNEKATIMLTQ
KNPEWLKLVNDKIVAHNVGSTPLSDYDKIGFNQKDMKGYSDSFKFSTKLNNAAKNTKSGFAQFLIDCISAGSYSPKKAEY
EDGVKNINTKSGFQKS
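
The protein sequence above corresponds to a codon-by-codon translation of the DNA sequence. The sketch below illustrates this on the first 24 nucleotides and on the final line of the DNA listing. It assumes the standard codon assignments (which the bacterial translation table shares for sense and stop codons); the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    # A KeyError here signals a character other than A, C, G, or T.
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 24 nt and last 51 nt of the DNA sequence listed in this record.
print(translate("ATGGCTAACGAAACCATCGATCAA"))  # MANETIDQ
print(translate("GAAGATGGAGTTAAAAATATTAATACAAAAAGTGGTTTCCAAAAATCTTAA"))  # EDGVKNINTKSGFQKS*
```

The trailing `*` marks the TAA stop codon, which is not shown in the protein listing.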

Homologs from PAI DB

Gene    GenBank Accn  Product                           Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity (%)
cagA    AAR03939.1    CagA                              Virulence                cag PAI     Protein         0.0    75
cagA    BAC10423.1    CagA                              Virulence                cag PAI     Protein         0.0    69
HP0547  BAD13853.1    cag pathogenicity island protein  Virulence                cag PAI     Protein         0.0    69
cagA    BAC10430.1    CagA                              Virulence                cag PAI     Protein         0.0    69
cagA    BAC10427.1    CagA                              Virulence                cag PAI     Protein         0.0    68
cagA    BAC10424.1    CagA                              Virulence                cag PAI     Protein         0.0    68
cagA    BAC10422.1    CagA                              Virulence                cag PAI     Protein         0.0    67
HP0547  BAD13826.1    cag pathogenicity island protein  Virulence                cag PAI     Protein         0.0    67
cagA    BAC10429.1    CagA                              Virulence                cag PAI     Protein         0.0    66
HP0547  BAD13935.1    cag pathogenicity island protein  Virulence                cag PAI     Protein         0.0    66
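
Because the Product and PAI columns contain spaces, the homolog rows cannot be split on whitespace alone. The following sketch shows one possible way to parse a row by anchoring on the fixed vocabulary of the category and alignment-type columns. The `Homolog` type, the `parse_row` helper, and the alternative values `Resistance` and `Nucleotide` are assumptions for illustration; only `Virulence` and `Protein` appear in this record.

```python
import re
from typing import NamedTuple, Optional


class Homolog(NamedTuple):
    gene: str
    accession: str
    product: str
    category: str    # "Virulence" (or, by assumption, "Resistance")
    island: str      # PAI or REI name
    alignment: str   # "Protein" (or, by assumption, "Nucleotide")
    evalue: float
    identity: int


ROW = re.compile(
    r"^(\S+)\s+(\S+)\s+(.+?)\s+(Virulence|Resistance)\s+(.+?)\s+"
    r"(Protein|Nucleotide)\s+(\S+)\s+(\d+)$"
)


def parse_row(line: str) -> Optional[Homolog]:
    """Parse one homolog row; return None for headers or unrecognized lines."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    gene, accn, prod, cat, island, aln, evalue, ident = m.groups()
    return Homolog(gene, accn, prod, cat, island, aln, float(evalue), int(ident))


row = parse_row(
    "HP0547 BAD13853.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 69"
)
print(row.product, row.island, row.identity)
```

The pattern tolerates any amount of whitespace between columns, so it applies to both single-space and column-aligned layouts of the table.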