Gene Information

Name : HPSJM_02650 (HPSJM_02650)
Accession : YP_003928443.1
Strain : Helicobacter pylori SJM180
Genome accession: NC_014560
Putative virulence/resistance : Virulence
Product : cag island protein
Function : -
COG functional category : U : Intracellular trafficking, secretion and vesicular transport
COG ID : COG2948
EC number : -
Position : 525301 - 530781 bp
Length : 5481 bp
Strand : +
Note : COG2948 Type IV secretory pathway, VirB10 components

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAGCCCAACAACATTCACCCCAAGATCTATCTAATGAAGAAGC
AACAGAAGCCAATCATTTTGAAGATCTTTTAAAAGAAGAAAGCTCAGACAATCATCTTGACAATTCCACAGAAACTCAAA
CCCATTTTGATGAAGACAAGCTAGAAGAAACCCAAACTCAAATGGATTCTGAAGGTAATGAAACTTCAGAATCTAGCAAT
GGCAGTCTAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAGAATTT
AGATGAAGAAACCCAAGAATTGAACGAAGAAGACGATCAAGAAAATAATGAGTATCAAGAAGAAACTCAAACGGGCTTAA
TTGATGATGAAACTTCTAAAAAAGCCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCAACAGAAGTCAATCAT
TTTGAAGATCTTTTAAAAGAAGAAAGCTCAGACAATCATCTTGACAACCCCACAGAAAGCTCAGACAATCATCTTGACAA
TTCCACAGAAAGCTCAGACAATCATCTTGACAACCCCACAGAAACTAAAACCCAAGAAACTAAAACCCATTTTGATGAAG
ACAAGCTAGAAGAAATAACTGACGACTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACATTATTGGTGGC
ATTGTAGTCGCTGTTCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTTATGCCTTTGGAAGATAAAAG
CTCTCGTTTTAGCAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTGAAAG
AACGGAATGAAAAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTATTTG
AATATTGCAGAAATTGAGGACAAAAACCCATTGAGAGCCTTTTATGAGTGTATTGGTAATGGTGGCAACTATGAAGAATG
TTTGAAGCTTATCAAAGACAAAAAACTTCAAGAGCAAATGAAAAAGACTCTAGAGGCTTATAATGATTGTATCAAAAATG
CCAAAACTGAAGAAGAAAGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACAA
AAAGTTCAAGTGGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGGAAAGAGTGCCTAAAACTCATAAATGA
CCCTGAGATTAGAGAGAAATTCCGTAAGGAATTAGAGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAAAACG
CCAAAACAGAAGCTGAGAAAAACGAATGCTTGAAAGGCTTATCTAAAGAAGCTATAGAAAGATTGAAACAACAAGCGCTA
GATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCTTGAAAAATATTCCCCAAGACTTGCAAAAAGAACT
ACTAGCTGATATGAGCGTCAAGGCTTACAAGGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAGTGTGAGA
AATTGCTCACGCCTGAAGCGAAAAAAAAGTTAGAACAACAGGTTCTAGATTGTCTGAAAAACGCTAAAACCGATGAAGAA
CGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTACAAAGCGATATTTTAGCTAAAGAGAGTCTGAAAGCTTATAAAGA
CTGCACATCTCAAGCCAAAACTGAAGATGAGAAAAAAGAGTGTGAGAAATTACTCACCCCTGAAGCGAAAAAACTTTTAG
AAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGATGAGAAAAAAGAGTGT
GAGAAATTACTCACCCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTATTTGGATTGCGTATC
TCAAGCCAAAACTGAAGATGAGAAAAAAGAGTGTGAGAAATTACTCACCCCTGAAGCGAAAAAACTTTTAGAGCAACAAG
CACTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGGAAAGAGTGCTTGAAAGATCTCCCTAAAGACTTACAGAAA
AAGGTTTTAGCCAAAGAGAGTGTTAGGGTTTATTTGGATTGCGTATCAAAAGCCAAAAACGAAGCTGAAAGAAAAGAATG
CGAGAAATTGCTCACGCCTGAAGCGAGGAAACTATTAGAAGAAGCTAAAGAGAGCGTTAAGGCTTATTTAGACTGCGTTT
CAAGAGCTAGGAATGAAAAAGAGAAAAAAGAGTGTGAGAAATTACTCACTCCTGAAGCGAGAAAACTATTAGAAGAATCT
AAAAAAAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAAACGAAGCTGAAAGAAAAGAATGCGAGAAATTGCT
CACCCCTGAAGCGAGAAAACTTTTAGAAGAAGCTAAAGAGAGCGTTAAGGCTTACAAGGACTGCGTATCAAGAGCTAGGA
ATGAAAAAGAGAAAAAAGAGTGTGAGAAATTACTCACGCCTGAAGCGAGGAAACTATTAGAAGAATCTAAAAAAAGCGTT
AAGGCTTATTTGGATTGCGTATCTCAAGCCAAAAACGAAGCTGAAAGAAAAGAATGCGAGAAATTACTCACGCCTGAAGC
GAGAAAACTTTTAGAAGAAGCTAAAAAGAGCGTTAAGGCTTATTTAGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGA
AAAAAGAGTGTGAGAAATTACTCACCCCTGAAGCGAGGAAACTATTAGAGAATCAAGCACTAGATTGTTTGAAAAACGCT
AAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTACAGAAAAAGGTTTTAGCCAAAGAGAGTGT
TAGGGTTTATTTGGATTGCGTGTCAAAAGCCAAAAACGAAGCTGAAAGAAAAGAATGCGAGAAATTACTCACGCCTGAAG
CGAGAAAACTTTTAGAAGAAGCTAAAGAGAGCGTTAAGGCTTACAAAGACTGCGTATCAAGAGCTAGGAACGAAAAAGAG
AAGCAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGGAAACTATTAGAGCAAGAAGTTAAAAAGAGCGTTAAGGCTTA
TTTAGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAGTGTGAGAAATTGCTCACCCCTAAAGCGAGGAAAC
TATTAGAGAATCAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCT
AAAGACTTACAGAAAAAGGTTTTAGCTAAAAAAAGCGTTAAGGCTTATTTAGACTGCGTTTCAAGAGCTAGGAATGAAAA
AGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAGAAAGCTATTAGAAGAAGCTAAAAAAAGCGTTAAGGCTT
ATTTAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAGAAAG
CTATTAGAAGAAGCTAAAGAGAGCCTTAAAGCTTATAAAGACTGTCTCTCTCAAGCTAGAAATGAAACTGAAAGGAGAGC
CTGCGAGAAATTACTCACCCCTGAAGCGAGAAAACTATTAGAGCAAGAAGTTAAAAAGAGCGTTAAGGCTTATTTAGACT
GCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAGGAAATTTTTAGAG
AAACAGCGCCAACAAAAAGATAAAGCGATAAAGGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCGGCTATCATGAA
GTGTTTGGATGGTTTGAGCGATGAAGAGAAGCTCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGA
AAACGGCTAGGACCGATGAAGAAAAAAGAAAATGCCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGG
GCACAGAACAAACAAAATCAATTGAGTAAAACAGAAAGGTTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCC
TACTGATCAAGAAGCCATAGAGCAATGTTTAGAGGGATTGAGCGATAGCGAAAGGGCGCTAATTCTAGGAATCAAACGAC
AAGCTGATGAAGTGGATCGGATTTATAGCGATCTAAGAAGCCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCA
TTGTTACCAATGGATTTCAAAAATGGTGGCGATATTGCCACTATTAACGCCACTAATGTTGATGCGGACAAAATAGCTAG
CGATAATCCTATTTATGCTTCCATAGAGCCTGATATTACTAAGCAATACGAAACAGAGAAAACCATTAAGGATAAGAGTT
TAGAAGCTAAATTAGCTAAGGCTTTAGGTGGCGATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAACCCACAGCAGAA
ACTAAAGCAGAAAGCAATAAGATAGACAAAGATGTCGCAGAAACTGCCAAAAATATCAGCGAAATCGCTCTTAAGAACAA
AAAAGAAAAGAGTGGGGATTTTGTAGATGAAAATGGTAATCCCATTGACGATAAAAAGAAAGAAGAAAAACAAGATGAAA
CAAGCCCTGTCAAACAGGCCTTTATAGGCAAGAGTGATCCCACATTTGTTTTAGCGCAATACACTCCCATTGAAATCACT
CTGACTTCTAAAGTAGATGCCACTCTCACGGGTATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCAC
TATGATCTTATTAGACAAAGGCACTAAGGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACACCTATTATGACTCGTT
TAATGATAGTCTTTACTAAAGCCATTACGCCTGATGGGGTGATTATACCTCTAGCAAACGCTCAAGCAGCAGGCATGTTG
GGTGAAGCAGGGGTAGATGGCTATGTGAATAATCACTTCATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAA
TAGCTTCTTGCAAACCGCGCCTATCATAGCCCTAGATAAACTCATAGGCCTTGGCAAAGGCAGAAGTGAAAGGACACCTG
AATTTAATTACGCTTTGGGTCAAGCTATCAATGGCAGTATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAA
CTGATGAATATCCCCCCAAGTTTTTACAAAAATGAGGGCGATAGCATTAAGATTCTCACAATGGACGATATTGATTTTAG
CGGCGTGTATGATGTTAAAATTACCAACAAATCTGTGGTAGATGAAATTATCAAACAAAGCACCAAAACTTTGTCTAGAG
AACATGAAGAAATCACCACAAGCCCCAAAGGTGGCAATTGA

Protein sequence :
MNEENDKLETSKKAQQHSPQDLSNEEATEANHFEDLLKEESSDNHLDNSTETQTHFDEDKLEETQTQMDSEGNETSESSN
GSLADKLFKKARKLVDNKRPFTQQKNLDEETQELNEEDDQENNEYQEETQTGLIDDETSKKAQQHSPQDLSNEEATEVNH
FEDLLKEESSDNHLDNPTESSDNHLDNSTESSDNHLDNPTETKTQETKTHFDEDKLEEITDDSNDQEIIKGSKKKYIIGG
IVVAVLIVIILFSRSIFHYFMPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYL
NIAEIEDKNPLRAFYECIGNGGNYEECLKLIKDKKLQEQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSLLNQQ
KVQVALDCLKNAKTDEERKECLKLINDPEIREKFRKELELQKELQEYKDCIKNAKTEAEKNECLKGLSKEAIERLKQQAL
DCLKNAKTDEERNECLKNIPQDLQKELLADMSVKAYKDCVSRARNEKEKKECEKLLTPEAKKKLEQQVLDCLKNAKTDEE
RKKCLKDLPKDLQSDILAKESLKAYKDCTSQAKTEDEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEDEKKEC
EKLLTPEAKKKLEEAKKSVKAYLDCVSQAKTEDEKKECEKLLTPEAKKLLEQQALDCLKNAKTDEERKECLKDLPKDLQK
KVLAKESVRVYLDCVSKAKNEAERKECEKLLTPEARKLLEEAKESVKAYLDCVSRARNEKEKKECEKLLTPEARKLLEES
KKSVKAYLDCVSQAKNEAERKECEKLLTPEARKLLEEAKESVKAYKDCVSRARNEKEKKECEKLLTPEARKLLEESKKSV
KAYLDCVSQAKNEAERKECEKLLTPEARKLLEEAKKSVKAYLDCVSRARNEKEKKECEKLLTPEARKLLENQALDCLKNA
KTEAEKKRCVKDLPKDLQKKVLAKESVRVYLDCVSKAKNEAERKECEKLLTPEARKLLEEAKESVKAYKDCVSRARNEKE
KQECEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKKECEKLLTPKARKLLENQALDCLKNAKTEAEKKRCVKDLP
KDLQKKVLAKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKLLEEAKKSVKAYLDCVSRARNEKEKQECEKLLTPEARK
LLEEAKESLKAYKDCLSQARNETERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLE
KQRQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQEIQNKR
AQNKQNQLSKTERLHQASECLDNLDDPTDQEAIEQCLEGLSDSERALILGIKRQADEVDRIYSDLRSRKTFDNMAAKGYP
LLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKDKSLEAKLAKALGGDKKDDDKEKSKKPTAE
TKAESNKIDKDVAETAKNISEIALKNKKEKSGDFVDENGNPIDDKKKEEKQDETSPVKQAFIGKSDPTFVLAQYTPIEIT
LTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITPDGVIIPLANAQAAGML
GEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYALGQAINGSMQSSAQMSNQILGQ
LMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 96
HP0527 BAD13915.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 86
cagY YP_005775730.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 85
HP0527 BAD13860.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 85
HP0527 BAD13806.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 80
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 79
cagY AGC69786.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 79
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 78
HP0527 BAD14052.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 78
HP0527 NP_207323.1 cag pathogenicity island protein (cag7) Virulence cag PAI Protein 0.0 78
cagY AGC69787.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 78
HP0527 BAD13998.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 77
HP0527 BAD13833.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 77
cagY YP_005777271.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 77
HP0527 BAD14026.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 77
cagY YP_003728737.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 77
HP0527 BAD13943.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 77
cagY YP_005774542.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 77
HP0527 BAD13779.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 76
cagY YP_005779063.1 cag island protein Virulence cag PAI Protein 0.0 76
cag-Y AAF80198.1 Cag-Y Virulence cag PAI Protein 0.0 76
cagY AGC69789.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 76
cagY AGC69785.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 74
cagY AGC69788.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 74

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
HPSJM_02650 YP_003928443.1 cag island protein VFG0287 Protein 0.0 78