Gene Information

Name : HPKB_0799 (HPKB_0799)
Accession : YP_005762307.1
Strain : Helicobacter pylori 52
Genome accession: NC_017354
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein E
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 836393 - 839344 bp
Length : 2952 bp
Strand : +
Note : Consensus COG by CONSORF: COG3451

DNA sequence :
GTGTTTGTGGCAAGCAAGCAAGCCGATGAACAAAAAAAGCTAATCATAGAGCAAGAGGTTCAAAAGCGGCAGTTTCAAAA
AATAGAAGAGCTTAAAGCAGACATGCAAAAGGGTGTCAATCCCTTTTTTAAAGTCTTGTTTGATGGGGGGAATAGGTTGT
TTGGTTTCCCTGAAACTTTTATTTATTCCTCTATATTTATATTGTTTGTAACCATTGTATTATCTGTTATTCTTTTTCAA
GCCTATGAACCTGTTTTGATTGTAGCGATTGTTATTGTGCTTGTAGCTCTTGGATTCAAGAAAGACTATAGGCTTTATCA
AAGAATGGAGCGAGCGATGAAATTTAAAAAACCTTTTTTGTTTAAGGGCGTGAAAAACAAAGCGTTCATGAGCATTTTTT
CCATGAAGCCTAGTAAAGAAATGGCTAATGACATCCACTTAAATCCAAACAGAGAAGACAGACTTGTGAGTGCTGCAAAC
TCCTATCTAGCGAATAACTATGAATGTTTTTTAGATGATGGGGTGATCCTTACTAACAACTATTCTCTTTTAGGCACAAT
CAAATTGGGGGGCATTGATTTTTTAACCACTTCCAAAAAAGATCTCATAGAGTTACACGCTTCTATTTATAGCGTTTTTA
GGAATTTTGTTACCCCTGAATTCAAATTTTATTTCCACACTATTAAAAAGAAAATCGTTATTGATGAAACCAATAGGGAC
TATGGTCTTATTTTTTCTAATGATTTTATGCGAGCCTATAACGAGAAGCAAAAGAGAGAAAGTTTTTATGACATTAGTTT
TTTTCTGACCATAGAGCAAGATTTATTAGACACTCTCAATGAACCCGTTATGAATAAAAAGCATTTTGCAGACAATAATT
TTGAAGAGTTTCAAAGGATTATTAGAGCCAAGCTTGAAAACTTCAAGGATAGGATAGAGCTCATAGAAGAGCTGTTGAGT
AAATACCACCCCACTAGATTAAAAGAATACACCAAAGATGGCGTTATTTACTCCAAACAATGCGAGTTTTACAATTTTCT
TGTGGGAATGAATGAAGCCCCTTTTATTTGCAACAGAAAAGACTTGTATCTCAAAGAAAAAATGCATGGTGGGGTGAAAG
AAGTTTATTTTGCCAATAAGCATGGAAAAATCTTAAATGACGATTTGAGTGAAAAATATTTTAGCGCTATTGAGATTAGT
GAATACGCCCCTAAATCACAAAGCGATTTGTTTGACAAGATCAACGCTCTAGACAGCGAGTTCATCTTTATGCATGCTTA
TTCGCCTAAAAACTCACAGGTTTTAAAGGACAAACTAGCTTTCACCTCTAGAAGAATTATTATTAGTGGAGGCTCTAAAG
AGCAGGGCATGACTTTAGGTTGCTTGAGCGAATTAGTGGGTAATGGTGATATTACGCTAGGCAGTTATGGTAATTCTTTA
GTGTTGTTTGCTGATAGCTTTGAAAAAATGAAACAAAGCGTTAAGGAATGCGTCTCTAGTCTTAACGCTAAAGGTTTTTT
AGCCAACGCAGCGACTTTCTCTATGGAAAATTACTTTTTTGCCAAACATTGCTCTTTTATCACGCTTCCTTTTATTTTTG
ATGTAACTTCTAATAATTTTGCTGATTTCATCGCTATGAGAGCGATGAGTTTTGATGGCAATCAAGAGAATAACGCTTGG
GGTAATAGCGTCATGACGCTAAAAAGCGAGATCAATTCGCCTTTTTATCTGAACTTCCACATGCCTACTGATTTTGGTTC
AGCTTCAGCAGGACACACTTTAATACTTGGCTCAACCGGTTCAGGTAAGACAGTGTTTATGTCAATGACCTTGAACGCTA
TGGGACAATTTGTTCACAATTTTCCTGCTAATGTCAGCAAAGACAAGCAAAAGCTCACTATGGTCTATATGGATAAAGAT
TATGGCGCTTATGGGAATATTGTCGCAATGGGTGGGGAGTATGTCAAAATTGAGCTAGGGACAGATACAGGATTAAATCC
TTTTGCTTGGGCGGCTTGTGTGCAAAAAACAAATGCAACAATGGAGCAAAAACAAACAGCTATTTCTGTTGTCAAAGAGC
TTGTGAAAAACTTAGCAACCAAAAGCGATGAAAAAGATGAAAACGGCAACAGCACCACTTTTAGCCTAGCAGATTCTAAT
ACGCTTGCAGCGGCAGTAACCAACCTTATCACAGGAGATATGAACCTAGATTATCCCATCACTCAACTTATTAATGCTTT
CGGAAAAGATCACAATGATCCTAATGGGCTTGTCGCGCGATTAGCGCCTTTTTGCAAATCAACCAATGGTGAATTTCAAT
GGCTTTTTGATAATAAAGCAACAGATCGCTTAGATTTTTCAAAAACGATTATTGGCGTTGATGGGTCAAGTTTCTTAGAC
AATAATGATGTTTCGCCCTTTATTTGTTTTTACCTTTTCGCTCGTATCCAAGAGGCAATGGATGGGCGTAGATTTGTCTT
AGATATTGATGAAGCTTGGAAATATTTAGGCGATCCAAAGGTCGCTTATTTTGTAAGAGACATGCTAAAAACTGCAAGGA
AAAGAAACGCTATTGTCAGGCTTGCGACTCAAAGCATCACTGATCTTTTGGCTTGCCCCATTGCTGATACGATTAGAGAA
CAATGCCCTACAAAGATTTTTTTGAGAAACGATGGGGGCAATCTTTCTGATTACCAAAGATTAGCTAATGTTACAGAAAA
AGAATTTGAAATCATCACTAAAGGGCTAGATAGGAAAATCCTCTACAAACAAGATGGAAGCCCTAGCGTTATCGCTAGTT
TTAATTTGAGAGGCATTCCTAAAGAATATTTGAAAATTTTATCCACAGATACTGTATTTGTCAAAGAAATTGACAAGATT
ATCCAAAACCATAGTATCATAGATAAATATCAGGCCTTGAGGCAAATGTATCAACAAATAGAGGAGTATTAA

Protein sequence :
MFVASKQADEQKKLIIEQEVQKRQFQKIEELKADMQKGVNPFFKVLFDGGNRLFGFPETFIYSSIFILFVTIVLSVILFQ
AYEPVLIVAIVIVLVALGFKKDYRLYQRMERAMKFKKPFLFKGVKNKAFMSIFSMKPSKEMANDIHLNPNREDRLVSAAN
SYLANNYECFLDDGVILTNNYSLLGTIKLGGIDFLTTSKKDLIELHASIYSVFRNFVTPEFKFYFHTIKKKIVIDETNRD
YGLIFSNDFMRAYNEKQKRESFYDISFFLTIEQDLLDTLNEPVMNKKHFADNNFEEFQRIIRAKLENFKDRIELIEELLS
KYHPTRLKEYTKDGVIYSKQCEFYNFLVGMNEAPFICNRKDLYLKEKMHGGVKEVYFANKHGKILNDDLSEKYFSAIEIS
EYAPKSQSDLFDKINALDSEFIFMHAYSPKNSQVLKDKLAFTSRRIIISGGSKEQGMTLGCLSELVGNGDITLGSYGNSL
VLFADSFEKMKQSVKECVSSLNAKGFLANAATFSMENYFFAKHCSFITLPFIFDVTSNNFADFIAMRAMSFDGNQENNAW
GNSVMTLKSEINSPFYLNFHMPTDFGSASAGHTLILGSTGSGKTVFMSMTLNAMGQFVHNFPANVSKDKQKLTMVYMDKD
YGAYGNIVAMGGEYVKIELGTDTGLNPFAWAACVQKTNATMEQKQTAISVVKELVKNLATKSDEKDENGNSTTFSLADSN
TLAAAVTNLITGDMNLDYPITQLINAFGKDHNDPNGLVARLAPFCKSTNGEFQWLFDNKATDRLDFSKTIIGVDGSSFLD
NNDVSPFICFYLFARIQEAMDGRRFVLDIDEAWKYLGDPKVAYFVRDMLKTARKRNAIVRLATQSITDLLACPIADTIRE
QCPTKIFLRNDGGNLSDYQRLANVTEKEFEIITKGLDRKILYKQDGSPSVIASFNLRGIPKEYLKILSTDTVFVKEIDKI
IQNHSIIDKYQALRQMYQQIEEY

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
cag23 AAR03906.1 Cag23 Virulence cag PAI Protein 0.0 99
cag23 AAR03878.1 Cag23 Virulence cag PAI Protein 0.0 99
HP0544 BAD13932.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
HP0544 BAD13905.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
HP0544 BAD13960.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51914.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51918.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005775744.1 DNA transfer protein Not tested cag PAI Protein 0.0 99
HP0544 BAD14042.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51923.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51907.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13987.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51909.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51928.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD14069.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE YP_005779049.1 DNA transfer protein Not tested cag PAI Protein 0.0 99
cagE BAD51913.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51917.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51927.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD14015.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE AAF80209.1 CagE Virulence cag PAI Protein 0.0 99
HP0544 BAD13850.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51896.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_003728720.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51932.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51920.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51891.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51906.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51899.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51895.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51916.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 NP_207340.1 cag pathogenicity island protein (cag23) Virulence cag PAI Protein 0.0 99
cagE AGC69804.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51925.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51903.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51910.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51905.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51898.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51931.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51921.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cag23 AAR03936.1 Cag23 Virulence cag PAI Protein 0.0 99
cagE BAD51926.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51904.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51911.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13796.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
HP0544 BAD13823.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51915.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51897.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005774527.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cag23 AAR03967.1 Cag23 Virulence cag PAI Protein 0.0 99
cagE BAD51922.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51908.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51902.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51893.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51894.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13877.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51929.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005777285.1 DNA transfer protein Virulence cag PAI Protein 0.0 99
cagE BAD51924.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51919.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51912.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51892.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51930.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51900.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51901.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE NP_223210.1 DNA transfer protein Virulence cag PAI Protein 0.0 99

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
HPKB_0799 YP_005762307.1 cag pathogenicity island protein E VFG0303 Protein 0.0 99