Gene Information

Name : HPSNT_02715 (HPSNT_02715)
Accession : YP_005786683.1
Strain : Helicobacter pylori Santal49
Genome accession: NC_017376
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein (cag7)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 545813 - 551206 bp
Length : 5394 bp
Strand : -
Note : COG2948 Type IV secretory pathway, VirB10 components
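
The Position and Length fields agree if the coordinates are read as 1-based and inclusive, which is an assumption about the convention used in this record. The sketch below checks that arithmetic and derives the expected protein length; `gene_length` and `protein_length` are hypothetical helper names, not part of any database API.

```python
# Hypothetical helpers; coordinates are assumed to be 1-based and inclusive.
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature spanning start..end, both ends included."""
    return end - start + 1


def protein_length(n_bp: int) -> int:
    """Residues encoded by a complete CDS of n_bp bases, excluding the stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(545813, 551206)
print(length_bp)                  # 5394
print(protein_length(length_bp))  # 1797
```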

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTCAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGC
GACAGAAGTCAATCGCTTTGAAGATTCTTCAAAAGAATCCGAAGAAAATTCAGATCACCATCTTGACAACCCCACAGAAA
CTAAAACCAATTTTGATGAATACGAGTCAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCT
AGCCATGGCAGTCTGGCAGACAAGTTATTCAAGAAAGCCAGAAAATTAGTTGATAATAAAAAACCTTTCACTCAGCAAAA
GAATTTAGATGAAGAAACCCAAGAACTGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAATAG
ACTTAATTGATGATGAAACTTCTCAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCGACAGAAGTC
AATCGCTTTGAAGATTCTTCAAAAGAATCCGAAGAAAATTCAGATCACCATCTTGACAACCCCACAGAAACTAAAACCAA
TTTTGATGAATACAAGTCAGAAGAAATAACTAACGACTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATATA
TTATTGGTGGCATTGTAGTCGCTGTTCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTCATGCCTTTG
GAAGATAAAAGCTCTCGTTTTAGCAAAGACAGGAATCTTTATGTCAATGATGAAATCAAAATAAGGCAAGAATATAACCG
ATTGCTGAAAGAACGGAATGAAAAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTAT
ACAACTATTTGAATATTGCAGAAATTGAGGACAAAAACCCATTGAAAGCCTTTTATGAATGTATCAGTAATGGCGGCAAC
TATGAAGAATGTTTGAAGCTTATCAAAGACAAAAAACTTCAAGATCAAATGAAAAAGACTCTAGAGGCTTATAATGACTG
CATCAAAAATGCCAAAACTGAAGAAGAAAGGATCAAATGTCTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTAC
TGAACCAACAAAAAGTTCAAGTGGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCCTAAAA
CTCATAAATGACCCTGAGATCAGAGAGAAATTCCGTAAGGAATTAGGGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTG
TATCAAAAACGCCAAAACAGAAGCTGAGAAAAACGAATGCTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAAC
AGCAAGCTCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCTTGAAAAATATTCCCCAAGACTTG
CAAAAAGAACTACTAGCTGATATGAGCGTTAAGGCTTACAAGGATTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAA
AGAATGCGAGAAATTACTCACCCCTGAAGCGAAAAAAAAGTTAGAACAACAGGTTCTAGATTGTTTGAAAAACGCTAAAA
CTGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTACAAAGCGATATTTTAGCTAAAGAGAGTCTTAAA
GCTTATAAAGACTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAA
AAAACTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTACTTGGATTGCGTATCTCAAGCCAAAAACGAAGCTGAGA
AAAAAGAATGCGAAAAATTGCTCACCCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTACTTG
GATTGCGTATCTCAAGCCAGAACTGAAGCTGAGAAAAAAGAATGCGAAAAATTACTCACCCCTGAAGCGAAAAAACTCTT
AGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGATAAAAAAAGGTGTGTCAAAGATCTCCCTAAAG
ACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAG
AAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGAAAACTCTTAGAAGAGGCTAAAGAGAGCGTTAAAGCTTATAA
AGATTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAACTTT
TAGAAGAAGAAGCCAAAGAAAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAA
TGCGAGAAATTACTCACCCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGCTAAAGCTTATTTGGATTGCGT
ATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGGAAACTCTTAGAGCAAC
AAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAG
AAAAAGGTTTTAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGA
ATGCGAAAAATTGCTCACCCCTGAAGCGAAAAAGCTTTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTATTTGGATTGCG
TATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAGGAAACTCTTAGAAGAG
GCTAAAGAGAGCCTGAAAGCTTATAAAGATTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAAAAATT
GCTCACCCCTGAAGCGAAAAAACTCTTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGACGCTGAGAAAA
AAAGGTGTGCCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGT
GTTTCAAGAGCTAGGAATGAAAAAGAAAGAAAAGCTTGCGAGAAATTGCTCACCCCTGAAGCGAAAAAACTTTTAGAAGA
AGCCAAAGAGAGCCTGAAGGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAAT
TACTCACCCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAATAGCGTTAAGGCTTATTTGGATTGCGTTTCAAGA
GCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGGAAATTTTTAGCGAAGCAAGTGCT
AAGTTGTTTGGAAAAAGCTAAAAATGAAGAAGAAAGAAAAGCATGTCTTAAAAATCTCCCTAAAGACTTACAAGAAAATG
TTTTAGCCAAAGAGAGCCTGAAGGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAG
AAATTACTCACCCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAATAGCGTTAAGGCTTATTTGGATTGCGTTTC
AAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGGAAATTTTTAGCGAAAGAAC
TCCAACAAAAAGATAAAGCGATCAAAGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCGGCTATTATGAAGTGTTTG
GATGGTTTGAGCGATGAAGAGAAGCTCAAATATCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGC
TAGGACCGATGAAGAAAAAAGGAAATGCCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAA
ATAAACAAAATCAATTGAGTAAAACAGAAAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACCGAT
CAACAAGCTATAGAGCAATGTTTAGAGGGCTTGAGCGATAGTGAAAGGGCGCTAATTCTAGGAATTAAACGACAAGCTGA
TGAAGTGGATCTGATTTATAGCGATCTAAGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCGTTGTTGC
CAATGGATTTCAAAAATGGCGGCGATATTGCCACTATTAACGCTACTAATGTTGATGCGGACAAAATAGCTAGCGATAAT
CCTATTTATGCTTCCATAGAGCCTGATATTACCAAGCAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAAGC
TAAATTGGCTAAGGCTTTAGGTGGCAATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACAGCAGAAGCTAAAG
CAGAAAGCAATAAGATAGACAAAGATGTCGCAGAAACTGCTAAGAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAA
AAGAGTGGGGAATTTGTAGATGAAAATGGTAATCCCATTGATGACAAAAAGAAAGCAGAAAAACAAGATGAAACAAGCCC
TGTCAAACAAGCCTTTATAGGCAAGAGTGATCCCACATTTGTTTTAGCGCAATACACCCCCATTGAAATCACTCTGACTT
CTAAAGTAGATGCCACTCTCACAGGTATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCACTATGATC
TTACTAGACAAAGGCACTAAGGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACACCCATTATGACACGCTTAATGAT
AGTCTTTACTAAAGCCATTACGCCTGATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGCATGTTGGGTGAAG
CAGGGGTAGATGGCTATGTGAATAATCACTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTC
TTACAAACTGCGCCTATCATAGCTCTAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAA
TTACGCTTTGGGTCAAGCTATCAATGGCAGCATGCAAAGTTCAGCTCATATGTCTAATCAAATTCTAGGGCAACTGATGA
ATATCCCCCCAAGTTTTTACAAAAACGAGGGTGATAGTATTAAGATTCTCACAATGGACGATATTGATTTTAGTGGCGTG
TATGATGTTAAAATTACCAACAAATCTGTGGTAGATGAAATTATCAAACAAAGCACCAAAACTTTGTCTAGAGAACATGA
AGAGATCACCACAAGCCCCAAAGGTGGCAATTAA

Protein sequence :
MNEENDKLETSQKTQQHSPQDLSNEEATEVNRFEDSSKESEENSDHHLDNPTETKTNFDEYESEETQTQMDSGGNETSES
SHGSLADKLFKKARKLVDNKKPFTQQKNLDEETQELNEEDDQENNGYQEETQIDLIDDETSQKTQQHSPQDLSNEEATEV
NRFEDSSKESEENSDHHLDNPTETKTNFDEYKSEEITNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFMPL
EDKSSRFSKDRNLYVNDEIKIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLKAFYECISNGGN
YEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLK
LINDPEIREKFRKELGLQKELQEYKDCIKNAKTEAEKNECLKGLSKEAIERLKQQALDCLKNAKTDEERNECLKNIPQDL
QKELLADMSVKAYKDCVSKARNEKEKKECEKLLTPEAKKKLEQQVLDCLKNAKTDEERKKCLKDLPKDLQSDILAKESLK
AYKDCVSQAKTEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKNEAEKKECEKLLTPEAKKKLEEAKKSVKAYL
DCVSQARTEAEKKECEKLLTPEAKKLLEQQALDCLKNAKTEADKKRCVKDLPKDLQKKVLAKESVKAYLDCVSQAKTEAE
KKECEKLLTPEARKLLEEAKESVKAYKDCVSRARNEKEKKECEKLLTPEARKLLEEEAKESVKAYLDCVSQAKTEAEKKE
CEKLLTPEAKKKLEEAKKSAKAYLDCVSQAKTEAEKKECEKLLTPEARKLLEQQALDCLKNAKTEAEKKRCVKDLPKDLQ
KKVLAKESLKAYKDCVSRARNEKEKKECEKLLTPEAKKLLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEARKLLEE
AKESLKAYKDCVSRARNEKEKKECEKLLTPEAKKLLEQQALDCLKNAKTDAEKKRCAKDLPKDLQKKVLAKESVKAYLDC
VSRARNEKERKACEKLLTPEAKKLLEEAKESLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKNSVKAYLDCVSR
ARNEKEKQECEKLLTPEARKFLAKQVLSCLEKAKNEEERKACLKNLPKDLQENVLAKESLKAYKDCLSQARNEEERRACE
KLLTPEARKLLEQEVKNSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCL
DGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQEIQNKRTQNKQNQLSKTERLHQASECLDNLDDPTD
QQAIEQCLEGLSDSERALILGIKRQADEVDLIYSDLRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDN
PIYASIEPDITKQYETEKTIKDKNLEAKLAKALGGNKKDDDKEKSKKSTAEAKAESNKIDKDVAETAKNISEIALKNKKE
KSGEFVDENGNPIDDKKKAEKQDETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMI
LLDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSF
LQTAPIIALDKLIGLGKGRSERTPEFNYALGQAINGSMQSSAHMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGV
YDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGGN
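
The protein sequence can be reproduced from the DNA sequence by codon-wise translation. The following is a minimal sketch, assuming the DNA is listed 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand) and that the standard codon assignments apply. `translate` is a hypothetical helper name, and the two inputs are the first 30 and last 15 bases of the sequence above.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; first base varies slowest).
# Bacterial translation table 11 assigns the same amino acids to all 64 codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGAATGAAGAAAACGATAAACTTGAAACT"))  # MNEENDKLET
print(translate("AAAGGTGGCAATTAA"))                 # KGGN
```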

• Homologs from PAI DB

Gene      GenBank Accn    Product                                                 Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity (%)
HP0527    BAD13970.1      cag pathogenicity island protein                        Virulence                cag PAI     Protein         0.0    97
HP0527    BAD13833.1      cag pathogenicity island protein                        Virulence                cag PAI     Protein         0.0    96
cagY      YP_005777271.1  cag pathogenicity island protein Y VirB10-like protein  Virulence                cag PAI     Protein         0.0    96
cagY      YP_005774542.1  cag pathogenicity island protein                        Virulence                cag PAI     Protein         0.0    96
HP0527    NP_207323.1     cag pathogenicity island protein (cag7)                 Virulence                cag PAI     Protein         0.0    96
HP0527    BAD14052.1      cag pathogenicity island protein                        Virulence                cag PAI     Protein         0.0    96
cagY      AGC69792.1      cag pathogenicity island protein Y                      Virulence                cag PAI     Protein         0.0    96
HP0527    BAD14026.1      cag pathogenicity island protein                        Virulence                cag PAI     Protein         0.0    95
cagY      AGC69786.1      cag pathogenicity island protein Y                      Virulence                cag PAI     Protein         0.0    94
cagY      AGC69789.1      cag pathogenicity island protein Y                      Virulence                cag PAI     Protein         0.0    94
HP0527    BAD13998.1      cag pathogenicity island protein                        Virulence                cag PAI     Protein         0.0    93
HP0527    BAD13888.1      cag pathogenicity island protein                        Virulence                cag PAI     Protein         0.0    93
cagY      AGC69787.1      cag pathogenicity island protein Y                      Virulence                cag PAI     Protein         0.0    91
cagY      YP_005775730.1  cag pathogenicity island protein Y VirB10-like protein  Virulence                cag PAI     Protein         0.0    87
HP0527    BAD13860.1      cag pathogenicity island protein                        Virulence                cag PAI     Protein         0.0    87
HP0527    BAD13915.1      cag pathogenicity island protein                        Virulence                cag PAI     Protein         0.0    86
HP0527    BAD13806.1      cag pathogenicity island protein                        Virulence                cag PAI     Protein         0.0    86
orf13/14  NP_223194.1     cag island protein                                      Virulence                cag PAI     Protein         0.0    85
HP0527    BAD13779.1      cag pathogenicity island protein                        Virulence                cag PAI     Protein         0.0    83
cagY      YP_005779063.1  cag island protein                                      Virulence                cag PAI     Protein         0.0    83
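
Because the product names contain spaces, a homolog row is easier to split from the right than from the left. The sketch below is one way to do that, assuming every row ends with the same six tokens seen in this record (category, a two-word island name such as "cag PAI", alignment type, E-value, identity). `Homolog` and `parse_homolog_row` are hypothetical names introduced for illustration.

```python
from typing import NamedTuple


class Homolog(NamedTuple):
    gene: str
    accession: str
    product: str
    category: str
    island: str
    alignment: str
    e_value: float
    identity: int


def parse_homolog_row(line: str) -> Homolog:
    """Split one PAI DB homolog row into fields.

    Assumption: the last six whitespace-separated tokens are the category,
    a two-word island name, the alignment type, the E-value and the identity.
    Everything between the accession and those tokens is the product name.
    """
    tokens = line.split()
    if len(tokens) < 9:
        raise ValueError(f"unexpected row: {line!r}")
    return Homolog(
        gene=tokens[0],
        accession=tokens[1],
        product=" ".join(tokens[2:-6]),
        category=tokens[-6],
        island=" ".join(tokens[-5:-3]),
        alignment=tokens[-3],
        e_value=float(tokens[-2]),
        identity=int(tokens[-1]),
    )


row = ("cagY YP_005777271.1 cag pathogenicity island protein Y "
       "VirB10-like protein Virulence cag PAI Protein 0.0 96")
print(parse_homolog_row(row).product)
# cag pathogenicity island protein Y VirB10-like protein
```

Splitting on any run of whitespace means the same function works whether the columns are separated by single spaces or padded for alignment.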

• Homologs from VFDB (virulence genes)

Gene         GenBank Accn    Product                                  ID of source DB  Alignment Type  E-val  Identity (%)
HPSNT_02715  YP_005786683.1  cag pathogenicity island protein (cag7)  VFG0287          Protein         0.0    96