Gene Information

Name : HMPREF4655_21058 (HMPREF4655_21058)
Accession : YP_005770145.1
Strain : Helicobacter pylori 35A
Genome accession: NC_017360
Putative virulence/resistance : Virulence
Product : CAG pathogenicity island protein 23
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1040586 - 1043537 bp
Length : 2952 bp
Strand : +
Note : COG: COG3451; Pfam: PF03135; InterPro: IPR004346

DNA sequence :
GTGTTTGTGGCAAGCAAGCAAGCCGATGAACAAAAAAAGCTAATCATAGAGCAAGAGGTTCAAAAGCGGCAGTTTCAAAA
AATAGAAGAACTTAAAGCAGACATGCAAAAAGGTGTCAATCCCTTTTTTAAAGTCTTGTTTGATGGGGGGAATAGGTTGT
TTGGTTTCCCTGAAACTTTTATTTATTCCTCTATATTTATATTGTTTGTAACAATTGTATTATCTGTTATTCTTTTTCAA
GCCTATGAACCTGTTTTGATTGTAGCGATTGTTATTGTGCTTGTAGCTCTTGGATTCAAGAAAGACTATAGGCTTTATCA
AAGAATGGAGCGAGCGATGAAATTTAAAAAACCTTTTTTGTTTAAGGGCGTGAAAAACAAAGCATTCATGAGCATTTTTT
CCATGAAGCCTAGTAAAGAAATGGCTAATGACATCCACTTAAATCCAAACAGAGAAGACAGACTTGTGAGCGCTGCAAAC
TCCTATCTAGCGAATAACTATGAATGTTTTTTAGATGATGGGGTGATCCTTACTAACAACTATTCTCTTTTAGGCACAAT
CAAATTGGGGGGCATTGATTTTTTAACCACTTCCAAAAAAGATCTCATAGAGTTACACGCTTCTATTTATAGCGTTTTTA
GGAATTTTGTTACCCCTGAATTCAAATTTTATTTTCACACTGTTAAAAAGAAAATCGTTATTGATGAAACCAATAGGGAC
TATAGTCTTGCTTTTTCTAATGATTTTATGCGAGCCTATAATGAGAAGCAAAAAAGAGAAAGTTTTTATGATATTAGTTT
TTTTCTGACCATAGAGCAAGATTTATTAGACACTCTCAATGAACCTGTTATGAATAAAAAGCATTTTGCAGACAATAATT
TTGAAGAGTTTCAAAGGATTATTAGAGCCAAGCTTGAAAACTTCAAGGATAGAATAGAGCTCATAGAAGAGCTATTGAGT
AAATACCACCCCACTAGATTAAAAGAATACACTAAAGATGGCGTTGTTTATTCCAAACAATGCGAGTTTTACAATTTTCT
TGTGGGAATGAATGAAGCCCCTTTTATTTGCAACAGAAAAGACTTGTATCTCAAGGAAAAAATGCATGGTGGGGTGAAAG
AAGTTTATTTTGCCAATAAGCATGGAAAAATCTTAAATGACGATTTGAGTGAAAAATATTTTAGCGCTATTGAGATTAGT
GAATACGCCCCTAAATCACAGAGCGATTTGTTTGATAAGATCAACGCTCTAGACAGCGAATTCATCTTTATGCATGCTTA
TTCGCCTAAAAACTCACAGGTTTTAAAGGACAAACTAGCTTTCACCTCTAGAAGGATTATTATTAGTGGGGGTTCTAAAG
AGCAGGGCATGACTTTAGGTTGCTTGAGCGAATTAGTGGGTAATGGTGATATTACGCTAGGCAGTTATGGTAATTCTTTA
GTGCTGTTTGCTGATAGCTTTGAAAAAATGAAACAAAGCGTCAAGGAATGCGTCTCTAGTCTTAACGCTAAAGGTTTTTT
AGCCAACGCAGCGACTTTCTCTATGGAAAATTACTTTTTTGCCAAACATTGCTCTTTTATCACGCTTCCTTTTATTTTTG
ATGTAACTTCTAATAATTTTGCTGATTTTATCGCTATGAGAGCGATGAGTTTTGATGGCAATCAAGAGAATAACGCTTGG
GGTAATAGCGTCATGACTCTAAAAAGCGAGATCAATTCGCCTTTTTATCTAAACTTCCACATGCCCACTGATTTTGGTTC
AGCTTCAGCAGGACACACTTTGATACTTGGCTCAACCGGTTCAGGTAAGACGGTGTTCATGTCAATGACTCTAAACGCTA
TGGGACAATTTGCTCACAATTTTCCTGCTAATGTCAGCAAAGACAAGCAAAAGCTCACTATGGTCTATATGGATAAAGAT
TATGGCGCTTATGGGAATATTGTCGCAATGGGTGGGGAGTATGTCAAGATTGAGCTAGGGACAGATACAGGATTAAATCC
TTTTGCTTGGGCGGCTTGTGTGCAAAAATCCAATGCAACAATGGAGCAAAAACAAACAGCTATTTCTGTTGTCAAAGAGC
TTGTGAAAAACTTAGCCACCAAAAGCGATGAAAAAGATGAAAATGGCAACAGCATCTCTTTTAGCCTAGCTGATTCTAAT
ACGCTTGCAGCGGCAGTAACCAACCTTATCACAGGAGATATGAACCTAGATTATCCCATCACTCAACTCATTAATGCTTT
CGGAAAAGACCACAATGATCCTAATGGGCTTGTCGCGCGATTAGCGCCTTTTTGCAAATCAACCAATGGTGAATTTCAAT
GGCTTTTTGATAATAAAGCAACCGATCGCTTAGATTTTTCAAAAACGATTATTGGCGTTGATGGGTCAAGTTTCTTAGAC
AACAATGATGTTTCGCCCTTTATTTGTTTTTACCTTTTCGCTCGTATTCAAGAGGCAATGGATGGGCGTAGATTTGTCTT
AGATATTGATGAAGCTTGGAAATATTTAGGCGATCCAAAGGTCGCTTATTTTGTAAGAGACATGCTAAAAACTGCAAGGA
AAAGAAACGCTATTGTTAGACTTGCGACTCAAAGCATCACTGATCTTTTAGCTTGCCCTATTGCTGATACTATTAGAGAA
CAATGCCCTACAAAGATTTTTTTGAGAAACGATGGGGGCAATCTTTCTGATTACCAAAGATTAGCCAATGTTACAGAAAA
AGAATTTGAAATCATCACTAAGGGGCTAGATAGGAAAATTCTCTACAAACAAGATGGAAGCCCTAGCGTTATCGCTAGTT
TTAATTTGAGAGGCATTCCTAAAGAATATTTGAAAATTTTATCCACAGATACTGTATTTGTCAAAGAAATTGACAAGATT
ATCCAAAACCATAGTATCATAGATAAATATCAGGCCTTGAGGCAGATGTATCAACAAATAAAGGAGTATTAA

Protein sequence :
MFVASKQADEQKKLIIEQEVQKRQFQKIEELKADMQKGVNPFFKVLFDGGNRLFGFPETFIYSSIFILFVTIVLSVILFQ
AYEPVLIVAIVIVLVALGFKKDYRLYQRMERAMKFKKPFLFKGVKNKAFMSIFSMKPSKEMANDIHLNPNREDRLVSAAN
SYLANNYECFLDDGVILTNNYSLLGTIKLGGIDFLTTSKKDLIELHASIYSVFRNFVTPEFKFYFHTVKKKIVIDETNRD
YSLAFSNDFMRAYNEKQKRESFYDISFFLTIEQDLLDTLNEPVMNKKHFADNNFEEFQRIIRAKLENFKDRIELIEELLS
KYHPTRLKEYTKDGVVYSKQCEFYNFLVGMNEAPFICNRKDLYLKEKMHGGVKEVYFANKHGKILNDDLSEKYFSAIEIS
EYAPKSQSDLFDKINALDSEFIFMHAYSPKNSQVLKDKLAFTSRRIIISGGSKEQGMTLGCLSELVGNGDITLGSYGNSL
VLFADSFEKMKQSVKECVSSLNAKGFLANAATFSMENYFFAKHCSFITLPFIFDVTSNNFADFIAMRAMSFDGNQENNAW
GNSVMTLKSEINSPFYLNFHMPTDFGSASAGHTLILGSTGSGKTVFMSMTLNAMGQFAHNFPANVSKDKQKLTMVYMDKD
YGAYGNIVAMGGEYVKIELGTDTGLNPFAWAACVQKSNATMEQKQTAISVVKELVKNLATKSDEKDENGNSISFSLADSN
TLAAAVTNLITGDMNLDYPITQLINAFGKDHNDPNGLVARLAPFCKSTNGEFQWLFDNKATDRLDFSKTIIGVDGSSFLD
NNDVSPFICFYLFARIQEAMDGRRFVLDIDEAWKYLGDPKVAYFVRDMLKTARKRNAIVRLATQSITDLLACPIADTIRE
QCPTKIFLRNDGGNLSDYQRLANVTEKEFEIITKGLDRKILYKQDGSPSVIASFNLRGIPKEYLKILSTDTVFVKEIDKI
IQNHSIIDKYQALRQMYQQIKEY

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
cagE BAD51895.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51927.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51893.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51914.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51907.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51897.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE NP_223210.1 DNA transfer protein Virulence cag PAI Protein 0.0 99
cagE BAD51911.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51924.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cag23 AAR03906.1 Cag23 Virulence cag PAI Protein 0.0 99
cagE BAD51902.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51900.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE AGC69804.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cag23 AAR03878.1 Cag23 Virulence cag PAI Protein 0.0 99
HP0544 BAD13823.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE YP_005774527.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51923.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51926.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51912.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13960.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51891.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51918.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51894.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005777285.1 DNA transfer protein Virulence cag PAI Protein 0.0 99
cagE BAD51917.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51932.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51928.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13987.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51904.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51915.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51896.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005779049.1 DNA transfer protein Not tested cag PAI Protein 0.0 99
cagE BAD51920.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13850.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51892.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51908.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD14069.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51929.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 NP_207340.1 cag pathogenicity island protein (cag23) Virulence cag PAI Protein 0.0 99
cagE BAD51925.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD14042.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51898.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51909.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51922.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51916.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51901.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005775744.1 DNA transfer protein Not tested cag PAI Protein 0.0 99
HP0544 BAD13905.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51903.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cag23 AAR03936.1 Cag23 Virulence cag PAI Protein 0.0 99
cagE BAD51910.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13796.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51930.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51931.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51921.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51906.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51913.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cag23 AAR03967.1 Cag23 Virulence cag PAI Protein 0.0 99
cagE BAD51899.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13877.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51905.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13932.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE AAF80209.1 CagE Virulence cag PAI Protein 0.0 99
cagE YP_003728720.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD14015.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51919.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
HMPREF4655_21058 YP_005770145.1 CAG pathogenicity island protein 23 VFG0303 Protein 0.0 99