Gene Information

Name : HMPREF4655_21058 (HMPREF4655_21058)
Accession : YP_005770145.1
Strain : Helicobacter pylori 35A
Genome accession: NC_017360
Putative virulence/resistance : Virulence
Product : CAG pathogenicity island protein 23
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1040586 - 1043537 bp
Length : 2952 bp
Strand : +
Note : COG: COG3451; Pfam: PF03135; InterPro: IPR004346

DNA sequence :
GTGTTTGTGGCAAGCAAGCAAGCCGATGAACAAAAAAAGCTAATCATAGAGCAAGAGGTTCAAAAGCGGCAGTTTCAAAA
AATAGAAGAACTTAAAGCAGACATGCAAAAAGGTGTCAATCCCTTTTTTAAAGTCTTGTTTGATGGGGGGAATAGGTTGT
TTGGTTTCCCTGAAACTTTTATTTATTCCTCTATATTTATATTGTTTGTAACAATTGTATTATCTGTTATTCTTTTTCAA
GCCTATGAACCTGTTTTGATTGTAGCGATTGTTATTGTGCTTGTAGCTCTTGGATTCAAGAAAGACTATAGGCTTTATCA
AAGAATGGAGCGAGCGATGAAATTTAAAAAACCTTTTTTGTTTAAGGGCGTGAAAAACAAAGCATTCATGAGCATTTTTT
CCATGAAGCCTAGTAAAGAAATGGCTAATGACATCCACTTAAATCCAAACAGAGAAGACAGACTTGTGAGCGCTGCAAAC
TCCTATCTAGCGAATAACTATGAATGTTTTTTAGATGATGGGGTGATCCTTACTAACAACTATTCTCTTTTAGGCACAAT
CAAATTGGGGGGCATTGATTTTTTAACCACTTCCAAAAAAGATCTCATAGAGTTACACGCTTCTATTTATAGCGTTTTTA
GGAATTTTGTTACCCCTGAATTCAAATTTTATTTTCACACTGTTAAAAAGAAAATCGTTATTGATGAAACCAATAGGGAC
TATAGTCTTGCTTTTTCTAATGATTTTATGCGAGCCTATAATGAGAAGCAAAAAAGAGAAAGTTTTTATGATATTAGTTT
TTTTCTGACCATAGAGCAAGATTTATTAGACACTCTCAATGAACCTGTTATGAATAAAAAGCATTTTGCAGACAATAATT
TTGAAGAGTTTCAAAGGATTATTAGAGCCAAGCTTGAAAACTTCAAGGATAGAATAGAGCTCATAGAAGAGCTATTGAGT
AAATACCACCCCACTAGATTAAAAGAATACACTAAAGATGGCGTTGTTTATTCCAAACAATGCGAGTTTTACAATTTTCT
TGTGGGAATGAATGAAGCCCCTTTTATTTGCAACAGAAAAGACTTGTATCTCAAGGAAAAAATGCATGGTGGGGTGAAAG
AAGTTTATTTTGCCAATAAGCATGGAAAAATCTTAAATGACGATTTGAGTGAAAAATATTTTAGCGCTATTGAGATTAGT
GAATACGCCCCTAAATCACAGAGCGATTTGTTTGATAAGATCAACGCTCTAGACAGCGAATTCATCTTTATGCATGCTTA
TTCGCCTAAAAACTCACAGGTTTTAAAGGACAAACTAGCTTTCACCTCTAGAAGGATTATTATTAGTGGGGGTTCTAAAG
AGCAGGGCATGACTTTAGGTTGCTTGAGCGAATTAGTGGGTAATGGTGATATTACGCTAGGCAGTTATGGTAATTCTTTA
GTGCTGTTTGCTGATAGCTTTGAAAAAATGAAACAAAGCGTCAAGGAATGCGTCTCTAGTCTTAACGCTAAAGGTTTTTT
AGCCAACGCAGCGACTTTCTCTATGGAAAATTACTTTTTTGCCAAACATTGCTCTTTTATCACGCTTCCTTTTATTTTTG
ATGTAACTTCTAATAATTTTGCTGATTTTATCGCTATGAGAGCGATGAGTTTTGATGGCAATCAAGAGAATAACGCTTGG
GGTAATAGCGTCATGACTCTAAAAAGCGAGATCAATTCGCCTTTTTATCTAAACTTCCACATGCCCACTGATTTTGGTTC
AGCTTCAGCAGGACACACTTTGATACTTGGCTCAACCGGTTCAGGTAAGACGGTGTTCATGTCAATGACTCTAAACGCTA
TGGGACAATTTGCTCACAATTTTCCTGCTAATGTCAGCAAAGACAAGCAAAAGCTCACTATGGTCTATATGGATAAAGAT
TATGGCGCTTATGGGAATATTGTCGCAATGGGTGGGGAGTATGTCAAGATTGAGCTAGGGACAGATACAGGATTAAATCC
TTTTGCTTGGGCGGCTTGTGTGCAAAAATCCAATGCAACAATGGAGCAAAAACAAACAGCTATTTCTGTTGTCAAAGAGC
TTGTGAAAAACTTAGCCACCAAAAGCGATGAAAAAGATGAAAATGGCAACAGCATCTCTTTTAGCCTAGCTGATTCTAAT
ACGCTTGCAGCGGCAGTAACCAACCTTATCACAGGAGATATGAACCTAGATTATCCCATCACTCAACTCATTAATGCTTT
CGGAAAAGACCACAATGATCCTAATGGGCTTGTCGCGCGATTAGCGCCTTTTTGCAAATCAACCAATGGTGAATTTCAAT
GGCTTTTTGATAATAAAGCAACCGATCGCTTAGATTTTTCAAAAACGATTATTGGCGTTGATGGGTCAAGTTTCTTAGAC
AACAATGATGTTTCGCCCTTTATTTGTTTTTACCTTTTCGCTCGTATTCAAGAGGCAATGGATGGGCGTAGATTTGTCTT
AGATATTGATGAAGCTTGGAAATATTTAGGCGATCCAAAGGTCGCTTATTTTGTAAGAGACATGCTAAAAACTGCAAGGA
AAAGAAACGCTATTGTTAGACTTGCGACTCAAAGCATCACTGATCTTTTAGCTTGCCCTATTGCTGATACTATTAGAGAA
CAATGCCCTACAAAGATTTTTTTGAGAAACGATGGGGGCAATCTTTCTGATTACCAAAGATTAGCCAATGTTACAGAAAA
AGAATTTGAAATCATCACTAAGGGGCTAGATAGGAAAATTCTCTACAAACAAGATGGAAGCCCTAGCGTTATCGCTAGTT
TTAATTTGAGAGGCATTCCTAAAGAATATTTGAAAATTTTATCCACAGATACTGTATTTGTCAAAGAAATTGACAAGATT
ATCCAAAACCATAGTATCATAGATAAATATCAGGCCTTGAGGCAGATGTATCAACAAATAAAGGAGTATTAA

Protein sequence :
MFVASKQADEQKKLIIEQEVQKRQFQKIEELKADMQKGVNPFFKVLFDGGNRLFGFPETFIYSSIFILFVTIVLSVILFQ
AYEPVLIVAIVIVLVALGFKKDYRLYQRMERAMKFKKPFLFKGVKNKAFMSIFSMKPSKEMANDIHLNPNREDRLVSAAN
SYLANNYECFLDDGVILTNNYSLLGTIKLGGIDFLTTSKKDLIELHASIYSVFRNFVTPEFKFYFHTVKKKIVIDETNRD
YSLAFSNDFMRAYNEKQKRESFYDISFFLTIEQDLLDTLNEPVMNKKHFADNNFEEFQRIIRAKLENFKDRIELIEELLS
KYHPTRLKEYTKDGVVYSKQCEFYNFLVGMNEAPFICNRKDLYLKEKMHGGVKEVYFANKHGKILNDDLSEKYFSAIEIS
EYAPKSQSDLFDKINALDSEFIFMHAYSPKNSQVLKDKLAFTSRRIIISGGSKEQGMTLGCLSELVGNGDITLGSYGNSL
VLFADSFEKMKQSVKECVSSLNAKGFLANAATFSMENYFFAKHCSFITLPFIFDVTSNNFADFIAMRAMSFDGNQENNAW
GNSVMTLKSEINSPFYLNFHMPTDFGSASAGHTLILGSTGSGKTVFMSMTLNAMGQFAHNFPANVSKDKQKLTMVYMDKD
YGAYGNIVAMGGEYVKIELGTDTGLNPFAWAACVQKSNATMEQKQTAISVVKELVKNLATKSDEKDENGNSISFSLADSN
TLAAAVTNLITGDMNLDYPITQLINAFGKDHNDPNGLVARLAPFCKSTNGEFQWLFDNKATDRLDFSKTIIGVDGSSFLD
NNDVSPFICFYLFARIQEAMDGRRFVLDIDEAWKYLGDPKVAYFVRDMLKTARKRNAIVRLATQSITDLLACPIADTIRE
QCPTKIFLRNDGGNLSDYQRLANVTEKEFEIITKGLDRKILYKQDGSPSVIASFNLRGIPKEYLKILSTDTVFVKEIDKI
IQNHSIIDKYQALRQMYQQIKEY

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
cagE BAD51911.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51924.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51895.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51927.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51893.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51914.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51907.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51897.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE NP_223210.1 DNA transfer protein Virulence cag PAI Protein 0.0 99
cagE BAD51923.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cag23 AAR03906.1 Cag23 Virulence cag PAI Protein 0.0 99
cagE BAD51902.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51900.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE AGC69804.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cag23 AAR03878.1 Cag23 Virulence cag PAI Protein 0.0 99
HP0544 BAD13823.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE YP_005774527.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51917.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51926.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51912.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13960.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51891.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51918.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51894.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005777285.1 DNA transfer protein Virulence cag PAI Protein 0.0 99
cagE BAD51920.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51932.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51928.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13987.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51904.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51915.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51896.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005779049.1 DNA transfer protein Not tested cag PAI Protein 0.0 99
cagE BAD51925.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD14042.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
HP0544 BAD13850.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51892.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51908.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD14069.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51929.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 NP_207340.1 cag pathogenicity island protein (cag23) Virulence cag PAI Protein 0.0 99
HP0544 BAD13905.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51903.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51898.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51909.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51922.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51916.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51901.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005775744.1 DNA transfer protein Not tested cag PAI Protein 0.0 99
cagE BAD51906.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51913.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cag23 AAR03936.1 Cag23 Virulence cag PAI Protein 0.0 99
cagE BAD51910.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13796.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51930.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51931.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51921.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD14015.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51919.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cag23 AAR03967.1 Cag23 Virulence cag PAI Protein 0.0 99
cagE BAD51899.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13877.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51905.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13932.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE AAF80209.1 CagE Virulence cag PAI Protein 0.0 99
cagE YP_003728720.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
HMPREF4655_21058 YP_005770145.1 CAG pathogenicity island protein 23 VFG0303 Protein 0.0 99