Gene Information

Name : C695_02805 (C695_02805)
Accession : YP_006893032.1
Strain : Helicobacter pylori Rif1
Genome accession: NC_018937
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein (cag23)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 575162 - 578113 bp
Length : 2952 bp
Strand : -
Note : COG3451 Type IV secretory pathway, VirB4 components

DNA sequence :
GTGTTTGTGGCAAGCAAACAAGCTGACGAACAAAAAAAGCTAGTCATAGAGCAAGAGGTTCAAAAGCGCCAATTTAAAAA
AATAGAAGAACTTAAAGCAGACATGCAAAAGGGTGTCAATCCCTTTTTTAAAGTCTTGTTTGATGGGGGGAATAGGTTGT
TTGGTTTCCCTGAAACTTTCATTTATTCCTCTATATTTATATTGTTTGTAACAATTGTACTATCTGTTATTCTTTTTCAA
GCCTATGAACCTGTTTTGATTGTAGCGATTGTTATTGTGCTTGTAGCTCTTGGATTCAAGAAAGATTATAGGCTTTATCA
AAGAATGGAGCGAGCGATGAAATTTAAAAAACCTTTTTTGTTTAAGGGCGTGAAAAACAAAGCGTTCATGAGCATTTTTT
CCATGAAGCCTAGTAAAGAAATGGCTAATGACATCCACTTAAATCCAAACAGAGAAGACAGGCTTGTGAGCGCTGCAAAC
TCCTATCTAGCGAATAACTATGAATGTTTTTTAGATGATGGGGTGATCCTTACTAACAACTATTCTCTTTTAGGCACAAT
CAAATTGGGGGGCATTGATTTTTTAACCACTTCCAAAAAAGATCTCATAGAGTTACACGCTTCTATTTATAGCGTTTTTA
GGAATTTTGTTACCCCTGAATTCAAATTCTATTTTCACACTGTTAAAAAGAAAATCGTTATTGATGAAACCAATAGGGAC
TATAGTCTTATTTTTTCTAATGATTTCATGCGAGCCTATAATGAGAAGCAAAAGAGAGAAAGTTTTTATGATATTAGTTT
TTATCTCACCATAGAGCAAGATTTATTAGACACTCTCAATGAACCCGTTATGAATAAAAAGCATTTTGCAGACAATAATT
TTGAAGAGTTTCAAAGGATTATTAGAGCCAAGCTTGAAAACTTCAAGGATAGGATAGAGCTCATAGAAGAGCTATTGAGT
AAATACCACCCCATTAGATTAAAAGAATACACTAAAGATGGCGTTATTTACTCCAAACAATGCGAGTTTTATAATTTCCT
TGTGGGAATGAATGAAGCCCCTTTTATTTGCAACAGAAAAGACTTGTATCTCAAGGAAAAAATGCATGGTGGGGTGAAAG
AAGTTTATTTTGCCAATAAGCATGGAAAAATCTTAAATGACGATTTGAGTGAAAAATATTTTAGCGCTATTGAGATTAGT
GAATACGCCCCTAAATCACAAAGCGATTTGTTTGATAAAATCAACGCCCTAGACAGCGAATTTATTTTCATGCATGCTTA
TTCGCCTAAAAACTCACAGGTTTTAAAGGACAAACTAGCTTTCACCTCTAGAAGAATTATTATTAGTGGAGGCTCTAAAG
AGCAGGGCATGACTTTAGGTTGCTTGAGCGAATTAGTGGGTAATGGTGATATTACGCTAGGCAGTTATGGTAATTCTTTA
GTGTTGTTTGCTGATAGCTTTGAAAAAATGAAACAAAGCGTTAAGGAATGCGTCTCTAGTCTTAACGCTAAAGGTTTTTT
AGCCAACGCAGCGACTTTCTCTATGGAAAATTACTTTTTCGCCAAACATTGCTCTTTTATCACGCTTCCTTTTATTTTTG
ATGTAACTTCTAATAATTTTGCTGATTTCATCGCTATGAGGGCTATGAGTTTTGATGGCAATCAGGAGAATAACGCTTGG
GGCAATAGTGTGATGACGCTAAAAAGCGAGATCAATTCGCCTTTTTATCTGAACTTCCACATGCCTACTGATTTTGGTTC
AGCTTCAGCAGGACACACTTTGATACTTGGCTCAACCGGTTCAGGTAAGACAGTGTTTATGTCAATGACCTTGAACGCTA
TGGGGCAATTTGTTCACAATTTTCCTGCTAATGTCAGCAAAGACAAGCAAAAGCTCACTATGGTTTATATGGATAAAGAT
TATGGCGCTTATGGGAATATTGTCGCAATGGGTGGGGAGTATGTCAAGATTGAGCTAGGGACAGATACAGGATTAAATCC
TTTTGCTTGGGCGGCTTGTGTGCAAAAAACAAATGCAACAATGGAGCAAAAACAAACAGCTATTTCTGTTGTCAAAGAGC
TTGTGAAAAACTTAGCAACCAAAAGCGATGAAAAAGATGAGAATGGCAACAGCATCTCTTTTAGCCTAGCAGATTCTAAT
ACGCTTGCAGCGGCAGTAACCAACCTTATCACAGGAGATATGAACCTAGATTATCCCATCACTCAACTTATTAATGCTTT
CGGGAAAGACCACAATGATCCTAATGGGCTTGTCGCGCGATTAGCACCTTTTTGCAAATCAACCAATGGTGAATTTCAAT
GGCTTTTTGATAATAAAGCAACGGATCGCTTAGATTTTTCAAAAACGATTATTGGCGTTGATGGGTCAAGTTTCTTAGAC
AATAATGATGTTTCGCCCTTTATTTGTTTTTACCTTTTCGCTCGTATCCAAGAGGCAATGGATGGGCGTAGATTTGTCTT
AGATATTGATGAAGCTTGGAAATATTTAGGCGATCCAAAGGTCGCTTATTTTGTAAGAGACATGCTAAAAACTGCAAGGA
AAAGAAACGCTATTGTCAGACTTGCGACTCAAAGCATCACTGATCTTTTGGCTTGCCCTATTGCTGATACGATTAGAGAA
CAATGCCCTACAAAGATTTTTTTGAGAAACGATGGGGGCAATCTTTCTGATTACCAAAGATTGGCTAATGTTACAGAAAA
AGAATTTGAAATCATCACTAAGGGACTAGATAGGAAAATTCTCTACAAACAGGATGGAAGCCCTAGCGTTATCGCTAGTT
TTAATTTGAGGGGCATTCCTAAAGAATATTTGAAAATTTTATCCACAGATACTGTATTTGTCAAAGAAATTGACAAGATT
ATCCAAAACCATAGTATCATAGATAAATATCAGGCCTTGAGGCAAATGTATCAACAAATAAAGGAGTATTAA

Protein sequence :
MFVASKQADEQKKLVIEQEVQKRQFKKIEELKADMQKGVNPFFKVLFDGGNRLFGFPETFIYSSIFILFVTIVLSVILFQ
AYEPVLIVAIVIVLVALGFKKDYRLYQRMERAMKFKKPFLFKGVKNKAFMSIFSMKPSKEMANDIHLNPNREDRLVSAAN
SYLANNYECFLDDGVILTNNYSLLGTIKLGGIDFLTTSKKDLIELHASIYSVFRNFVTPEFKFYFHTVKKKIVIDETNRD
YSLIFSNDFMRAYNEKQKRESFYDISFYLTIEQDLLDTLNEPVMNKKHFADNNFEEFQRIIRAKLENFKDRIELIEELLS
KYHPIRLKEYTKDGVIYSKQCEFYNFLVGMNEAPFICNRKDLYLKEKMHGGVKEVYFANKHGKILNDDLSEKYFSAIEIS
EYAPKSQSDLFDKINALDSEFIFMHAYSPKNSQVLKDKLAFTSRRIIISGGSKEQGMTLGCLSELVGNGDITLGSYGNSL
VLFADSFEKMKQSVKECVSSLNAKGFLANAATFSMENYFFAKHCSFITLPFIFDVTSNNFADFIAMRAMSFDGNQENNAW
GNSVMTLKSEINSPFYLNFHMPTDFGSASAGHTLILGSTGSGKTVFMSMTLNAMGQFVHNFPANVSKDKQKLTMVYMDKD
YGAYGNIVAMGGEYVKIELGTDTGLNPFAWAACVQKTNATMEQKQTAISVVKELVKNLATKSDEKDENGNSISFSLADSN
TLAAAVTNLITGDMNLDYPITQLINAFGKDHNDPNGLVARLAPFCKSTNGEFQWLFDNKATDRLDFSKTIIGVDGSSFLD
NNDVSPFICFYLFARIQEAMDGRRFVLDIDEAWKYLGDPKVAYFVRDMLKTARKRNAIVRLATQSITDLLACPIADTIRE
QCPTKIFLRNDGGNLSDYQRLANVTEKEFEIITKGLDRKILYKQDGSPSVIASFNLRGIPKEYLKILSTDTVFVKEIDKI
IQNHSIIDKYQALRQMYQQIKEY

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
HP0544 NP_207340.1 cag pathogenicity island protein (cag23) Virulence cag PAI Protein 0.0 100
HP0544 BAD13960.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51901.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005775744.1 DNA transfer protein Not tested cag PAI Protein 0.0 99
cag23 AAR03936.1 Cag23 Virulence cag PAI Protein 0.0 99
cagE BAD51917.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51904.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51891.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13987.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51909.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE AAF80209.1 CagE Virulence cag PAI Protein 0.0 99
cagE NP_223210.1 DNA transfer protein Virulence cag PAI Protein 0.0 99
cagE BAD51932.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51920.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51919.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51893.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51898.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51918.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_003728720.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD14042.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51925.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51924.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51902.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51900.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51914.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13823.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51913.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13932.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51926.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51908.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51910.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51931.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51894.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51921.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51912.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51907.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13905.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51899.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cag23 AAR03878.1 Cag23 Virulence cag PAI Protein 0.0 99
cagE BAD51930.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD14069.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51897.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005777285.1 DNA transfer protein Virulence cag PAI Protein 0.0 99
cag23 AAR03967.1 Cag23 Virulence cag PAI Protein 0.0 99
cagE BAD51922.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD14015.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51905.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51895.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51928.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51915.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51929.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005774527.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE AGC69804.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51903.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51906.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51923.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51927.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13850.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
cagE BAD51916.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51896.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE YP_005779049.1 DNA transfer protein Not tested cag PAI Protein 0.0 99
cag23 AAR03906.1 Cag23 Virulence cag PAI Protein 0.0 99
cagE BAD51911.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
cagE BAD51892.1 cag pathogenicity island protein E Virulence cag PAI Protein 0.0 99
HP0544 BAD13796.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99
HP0544 BAD13877.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 99

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
C695_02805 YP_006893032.1 cag pathogenicity island protein (cag23) VFG0303 Protein 0.0 100