Gene Information

Name : C694_02805 (C694_02805)
Accession : YP_006934467.1
Strain : Helicobacter pylori 26695
Genome accession: NC_018939
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein (cag23)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 575165 - 578116 bp
Length : 2952 bp
Strand : -
Note : COG3451 Type IV secretory pathway, VirB4 components
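
The Position and Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the open reading frame ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the Position field (assumed 1-based, inclusive).
start, end = 575165, 578116

# With inclusive coordinates both endpoints count as bases.
gene_length = end - start + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, gene_length % 3, protein_length)  # 2952 0 983
```

Under these assumptions the result agrees with the Length field (2952 bp) and with a 983-residue protein.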

DNA sequence :
GTGTTTGTGGCAAGCAAACAAGCTGACGAACAAAAAAAGCTAGTCATAGAGCAAGAGGTTCAAAAGCGCCAATTTAAAAA
AATAGAAGAACTTAAAGCAGACATGCAAAAGGGTGTCAATCCCTTTTTTAAAGTCTTGTTTGATGGGGGGAATAGGTTGT
TTGGTTTCCCTGAAACTTTCATTTATTCCTCTATATTTATATTGTTTGTAACAATTGTACTATCTGTTATTCTTTTTCAA
GCCTATGAACCTGTTTTGATTGTAGCGATTGTTATTGTGCTTGTAGCTCTTGGATTCAAGAAAGATTATAGGCTTTATCA
AAGAATGGAGCGAGCGATGAAATTTAAAAAACCTTTTTTGTTTAAGGGCGTGAAAAACAAAGCGTTCATGAGCATTTTTT
CCATGAAGCCTAGTAAAGAAATGGCTAATGACATCCACTTAAATCCAAACAGAGAAGACAGGCTTGTGAGCGCTGCAAAC
TCCTATCTAGCGAATAACTATGAATGTTTTTTAGATGATGGGGTGATCCTTACTAACAACTATTCTCTTTTAGGCACAAT
CAAATTGGGGGGCATTGATTTTTTAACCACTTCCAAAAAAGATCTCATAGAGTTACACGCTTCTATTTATAGCGTTTTTA
GGAATTTTGTTACCCCTGAATTCAAATTCTATTTTCACACTGTTAAAAAGAAAATCGTTATTGATGAAACCAATAGGGAC
TATAGTCTTATTTTTTCTAATGATTTCATGCGAGCCTATAATGAGAAGCAAAAGAGAGAAAGTTTTTATGATATTAGTTT
TTATCTCACCATAGAGCAAGATTTATTAGACACTCTCAATGAACCCGTTATGAATAAAAAGCATTTTGCAGACAATAATT
TTGAAGAGTTTCAAAGGATTATTAGAGCCAAGCTTGAAAACTTCAAGGATAGGATAGAGCTCATAGAAGAGCTATTGAGT
AAATACCACCCCATTAGATTAAAAGAATACACTAAAGATGGCGTTATTTACTCCAAACAATGCGAGTTTTATAATTTCCT
TGTGGGAATGAATGAAGCCCCTTTTATTTGCAACAGAAAAGACTTGTATCTCAAGGAAAAAATGCATGGTGGGGTGAAAG
AAGTTTATTTTGCCAATAAGCATGGAAAAATCTTAAATGACGATTTGAGTGAAAAATATTTTAGCGCTATTGAGATTAGT
GAATACGCCCCTAAATCACAAAGCGATTTGTTTGATAAAATCAACGCCCTAGACAGCGAATTTATTTTCATGCATGCTTA
TTCGCCTAAAAACTCACAGGTTTTAAAGGACAAACTAGCTTTCACCTCTAGAAGAATTATTATTAGTGGAGGCTCTAAAG
AGCAGGGCATGACTTTAGGTTGCTTGAGCGAATTAGTGGGTAATGGTGATATTACGCTAGGCAGTTATGGTAATTCTTTA
GTGTTGTTTGCTGATAGCTTTGAAAAAATGAAACAAAGCGTTAAGGAATGCGTCTCTAGTCTTAACGCTAAAGGTTTTTT
AGCCAACGCAGCGACTTTCTCTATGGAAAATTACTTTTTCGCCAAACATTGCTCTTTTATCACGCTTCCTTTTATTTTTG
ATGTAACTTCTAATAATTTTGCTGATTTCATCGCTATGAGGGCTATGAGTTTTGATGGCAATCAGGAGAATAACGCTTGG
GGCAATAGTGTGATGACGCTAAAAAGCGAGATCAATTCGCCTTTTTATCTGAACTTCCACATGCCTACTGATTTTGGTTC
AGCTTCAGCAGGACACACTTTGATACTTGGCTCAACCGGTTCAGGTAAGACAGTGTTTATGTCAATGACCTTGAACGCTA
TGGGGCAATTTGTTCACAATTTTCCTGCTAATGTCAGCAAAGACAAGCAAAAGCTCACTATGGTTTATATGGATAAAGAT
TATGGCGCTTATGGGAATATTGTCGCAATGGGTGGGGAGTATGTCAAGATTGAGCTAGGGACAGATACAGGATTAAATCC
TTTTGCTTGGGCGGCTTGTGTGCAAAAAACAAATGCAACAATGGAGCAAAAACAAACAGCTATTTCTGTTGTCAAAGAGC
TTGTGAAAAACTTAGCAACCAAAAGCGATGAAAAAGATGAGAATGGCAACAGCATCTCTTTTAGCCTAGCAGATTCTAAT
ACGCTTGCAGCGGCAGTAACCAACCTTATCACAGGAGATATGAACCTAGATTATCCCATCACTCAACTTATTAATGCTTT
CGGGAAAGACCACAATGATCCTAATGGGCTTGTCGCGCGATTAGCACCTTTTTGCAAATCAACCAATGGTGAATTTCAAT
GGCTTTTTGATAATAAAGCAACGGATCGCTTAGATTTTTCAAAAACGATTATTGGCGTTGATGGGTCAAGTTTCTTAGAC
AATAATGATGTTTCGCCCTTTATTTGTTTTTACCTTTTCGCTCGTATCCAAGAGGCAATGGATGGGCGTAGATTTGTCTT
AGATATTGATGAAGCTTGGAAATATTTAGGCGATCCAAAGGTCGCTTATTTTGTAAGAGACATGCTAAAAACTGCAAGGA
AAAGAAACGCTATTGTCAGACTTGCGACTCAAAGCATCACTGATCTTTTGGCTTGCCCTATTGCTGATACGATTAGAGAA
CAATGCCCTACAAAGATTTTTTTGAGAAACGATGGGGGCAATCTTTCTGATTACCAAAGATTGGCTAATGTTACAGAAAA
AGAATTTGAAATCATCACTAAGGGACTAGATAGGAAAATTCTCTACAAACAGGATGGAAGCCCTAGCGTTATCGCTAGTT
TTAATTTGAGGGGCATTCCTAAAGAATATTTGAAAATTTTATCCACAGATACTGTATTTGTCAAAGAAATTGACAAGATT
ATCCAAAACCATAGTATCATAGATAAATATCAGGCCTTGAGGCAAATGTATCAACAAATAAAGGAGTATTAA

Protein sequence :
MFVASKQADEQKKLVIEQEVQKRQFKKIEELKADMQKGVNPFFKVLFDGGNRLFGFPETFIYSSIFILFVTIVLSVILFQ
AYEPVLIVAIVIVLVALGFKKDYRLYQRMERAMKFKKPFLFKGVKNKAFMSIFSMKPSKEMANDIHLNPNREDRLVSAAN
SYLANNYECFLDDGVILTNNYSLLGTIKLGGIDFLTTSKKDLIELHASIYSVFRNFVTPEFKFYFHTVKKKIVIDETNRD
YSLIFSNDFMRAYNEKQKRESFYDISFYLTIEQDLLDTLNEPVMNKKHFADNNFEEFQRIIRAKLENFKDRIELIEELLS
KYHPIRLKEYTKDGVIYSKQCEFYNFLVGMNEAPFICNRKDLYLKEKMHGGVKEVYFANKHGKILNDDLSEKYFSAIEIS
EYAPKSQSDLFDKINALDSEFIFMHAYSPKNSQVLKDKLAFTSRRIIISGGSKEQGMTLGCLSELVGNGDITLGSYGNSL
VLFADSFEKMKQSVKECVSSLNAKGFLANAATFSMENYFFAKHCSFITLPFIFDVTSNNFADFIAMRAMSFDGNQENNAW
GNSVMTLKSEINSPFYLNFHMPTDFGSASAGHTLILGSTGSGKTVFMSMTLNAMGQFVHNFPANVSKDKQKLTMVYMDKD
YGAYGNIVAMGGEYVKIELGTDTGLNPFAWAACVQKTNATMEQKQTAISVVKELVKNLATKSDEKDENGNSISFSLADSN
TLAAAVTNLITGDMNLDYPITQLINAFGKDHNDPNGLVARLAPFCKSTNGEFQWLFDNKATDRLDFSKTIIGVDGSSFLD
NNDVSPFICFYLFARIQEAMDGRRFVLDIDEAWKYLGDPKVAYFVRDMLKTARKRNAIVRLATQSITDLLACPIADTIRE
QCPTKIFLRNDGGNLSDYQRLANVTEKEFEIITKGLDRKILYKQDGSPSVIASFNLRGIPKEYLKILSTDTVFVKEIDKI
IQNHSIIDKYQALRQMYQQIKEY
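
The DNA sequence begins with GTG while the protein begins with M. A minimal sketch of why, assuming the standard bacterial genetic code, in which alternative start codons such as GTG are read as methionine at the first position. The function `translate` and the `head`/`tail` fragments are illustrative; the fragments are copied from the first 30 and last 12 bases of the DNA sequence above.

```python
from itertools import product

# Standard codon table, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: these start codons are read as Met when they open the ORF.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, first_is_start=True):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")  # initiator codon read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


head = "GTGTTTGTGGCAAGCAAACAAGCTGACGAA"  # first 30 bases of the record
tail = "AAGGAGTATTAA"                    # last 12 bases of the record

print(translate(head))                        # MFVASKQADE
print(translate(tail, first_is_start=False))  # KEY
```

Both fragments match the corresponding ends of the protein sequence listed above (MFVASKQADE... and ...KEY).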

• Homologs from PAI DB

Gene GenBank Accn Product Virulence or Resistance PAI or REI Alignment Type E-val Identity (%)
HP0544 NP_207340.1 cag pathogenicity island protein (cag23) Virulence cag PAI Protein 0.0 100
cagE BAD51913.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13932.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51926.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51908.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51910.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51931.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13823.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51912.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51907.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13905.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51899.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cag23 AAR03878.1 Cag23 Virulence cag PAI Protein 0.0 99
cagE BAD51930.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51894.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51921.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cag23 AAR03967.1 Cag23 Virulence cag PAI Protein 0.0 99
cagE BAD51922.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD14015.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51905.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51895.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51928.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD14069.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51897.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005777285.1 DNA transfer protein Virulence cag PAI Protein 0.0 99
cagE AGC69804.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51903.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51906.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51923.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51927.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13850.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51915.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51929.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005774527.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cag23 AAR03906.1 Cag23 Virulence cag PAI Protein 0.0 99
cagE BAD51911.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51892.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13796.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
HP0544 BAD13877.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51916.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51896.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005779049.1 DNA transfer protein Not tested cag PAI Protein 0.0 99
cag23 AAR03936.1 Cag23 Virulence cag PAI Protein 0.0 99
cagE BAD51917.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51904.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51891.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13987.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
HP0544 BAD13960.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51901.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005775744.1 DNA transfer protein Not tested cag PAI Protein 0.0 99
cagE BAD51932.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51920.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51919.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51893.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51898.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51909.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE AAF80209.1 CagE Virulence cag PAI Protein 0.0 99
cagE NP_223210.1 DNA transfer protein Virulence cag PAI Protein 0.0 99
HP0544 BAD14042.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51925.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51924.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51902.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51900.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51914.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51918.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_003728720.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99

• Homologs from VFDB (virulence genes)

Gene GenBank Accn Product ID of source DB Alignment Type E-val Identity (%)
C694_02805 YP_006934467.1 cag pathogenicity island protein (cag23) VFG0303 Protein 0.0 100