Gene Information

Name : HPIN_04260 (HPIN_04260)
Accession : YP_005782161.1
Strain : Helicobacter pylori India7
Genome accession: NC_017372
Putative virulence/resistance : Virulence
Product : cag pathogenicity island protein (cag7)
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 862073 - 867352 bp
Length : 5280 bp
Strand : +
Note : COG2948 Type IV secretory pathway, VirB10 components

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTCAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGC
GACAGAAGCCAATCATTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACCCCACAGAAA
CTAAAACCCATTTTGATGAATACGAGTTAGAAGAAACCCAAACTCAAATAGATTCTGGAGGTAATGAAACTTCAGAATCT
AGCGATGGCAGTCTAGCAGACAAGTTATTCAAAAAAGCCAGGAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAA
AAATTTAGATGAAGAAACCCAAGAACTGAACGAAGAAGACGATCAAGGAAATAATGGGTATCAAGAAGAAACTCAAATAG
ACTTAATTGATGATGAAACTTCTCAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCGACAGAAGCC
AATCATTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACTACACAGAAACTGAAATCAA
TTTTGATGGAGACAAGTCAGAAGAAATAACTAACGACTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACA
TTATTGGTGGCATTGTAGTCGCTGTTCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTCATGCCTTTG
GAAGATAAAAGCTCTCGTTTTAGCAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCG
ATTGCTGAAAGAACGGAATGAAAAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTAT
ACAACTATTTGAATATTGCAGAAATTGAGGACAAAAATCCGTTGAGAGCCTTTTATGAATGTATCAGTAATGGCGGCAAC
TATGAAGAATGTTTGAAGCTTATCAAAGACAAAAAACTTCAAGATCAAATGAAAAAGACTCTAGAGGCTTATAATGACTG
CATCAAAAATGCCAAAACTGAAGAAGAAAGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTAC
TGAACCAACAAAAAGTTCAAGTGGCACTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCCTAAAA
CTCATCAATGACCCTGAGATCAGAGAGAAATTCCGTAAGGAATTAGGGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTG
TATCAAAAACGCCAAAACAGAAGCTGAGAAAAACGAATGCTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAAC
AGCAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCTTGAAAAATATTCCCCAAGACTTG
CAAAAAGAACTACTAGCTGATATGAGCGTTAAGGCTTACAAGGATTGCATATCAAAAGCTAGGAATGAAAAAGAGAAAAA
AGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAAAAGTTAGAACAACAGGTTCTAGATTGTTTGAAAAACGCTAAAA
CCGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTACAAAGCGATATCTTAGCCAAAGAGAGCCTGAAA
GCTTATAAAGACTGCGTATCTCAAGCCAGAACTGAAGCTGAGAAAAAAGAATGCGAGAAACTACTCACCCCTGAAGCGAA
AAAACTTTTAGAAGAAGAAGCCAAAGAAAGCGTTAAGGCTTACTTGGATTGCGTATCTCAAGCCAAAAACGAAGCTGAGA
AAAAAGAATGCGAGAAACTACTCACCCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTATTTG
GATTGTGCATCTCAAGCCAGAACTGAAGCTGAGAAAAAAGAATGCGAAAAATTACTCACGCCTGAAGCGAGAAAACTTTT
AGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAG
ACTTGCAGAAAAAGGTTTTAGCCAAAGAAAGCGTTAAGGCTTATTTGGACTGCGTATCTCAAGCCAAAACTGAAGCTGAG
AAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGGAAACTCTTAGAAGAGGCTAAAGAGAGTCTTAAAGCTTATAA
AGACTGCGTATCAAGAGCTAGGAATAAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAACTTT
TAGAAGAAGAGGCTAAAGAGAGCGTTAAGGCTTACTTGGATTGCGTATCTCAAGCCAAAAACGAAGCTGAGAAAAAAGAA
TGCGAAAAATTGCTCACCCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTACTTGGATTGCGT
ATCAAAAGCCAGAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAAAAAACTTTTAGAGCAAC
AAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGATAAAAAAAGGTGTGTCAAAGATCTCCCCAAAGACTTACAG
AAAAAGGTTTTAGCTAAAGAGAGCCTGAAAGCTTATAAAGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGA
ATGCGAGAAATTACTCACCCCTGAAGCGAAAAAACTTTTAGAAGAGGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCG
TATCTCAAGCTAGAAATGAAAAAGAGAAAAAAGAATGCGAGAAACTACTCACCCCTGAAGCGAAAAAACTTTTAGAGCAA
CAAGCACTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCA
GAAAAAGGTTTTAGCCAAAGAGAGTGTTAAGGCTTATTTGGACTGCGTTTCAAAAGCTAGGAATGAAAAAGAGAAAAAAG
AATGCGAGAAATTACTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGCCAAAGAGAGCCTGAAAGCTTATAAAGACTGC
CTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTGCTCACCCCTGAAGCGAGAAAACTCTTAGAGCA
AGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGA
AATTACTCACCCCTGAAGCGAGAAAATTTTTAGCGAAGCAAGTGCTAAATTGTTTGGAAAAAGCTGGAAATGAAGAAGAA
AGAAAAGCATGTCTTAAAAATCTCCCTAAAGACTTACAGGAAAATGTTTTAGCCAAAGAGAGTCTTAAAGCTTATAAAGA
CTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTACTCACCCCTGAAGCGAGGAAACTCTTAG
AACAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGATTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAACAAGAATGC
GAGAAATTGCTCACGCCTGAAGCGAGAAAATTCTTAGCGAAAGAACTCCAACAAAAAGATAAAGCGATCAAAGATTGCTT
GAAAAACGCCGATCCTAACGACAGAGCGGCCATTATGAAGTGTTTGGATGGTTTGAGCGATGAAGAGAAGCTCAAATACC
TGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAGGACCGATGAAGAAAAAAGGAAATGCCAAAAC
CTTTATAGCGATTTGATTCAAGAAATCCAAAATAAAAGGACACAAAATAAACAAAATCAATTGAGTAAAACAGAAAGATT
GCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGATGAGCAAGCCATAGAGCAATGTTTAGAGGGCTTGA
GCGATAGTGAAAGGGCGCTAATTCTAGGAATTAAACGACAAGCTGATGAAGTGGATCTGATTTATAGCGATCTAAGAAAC
CGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCGTTGTTGCCAATGGATTTCAAAAATGGCGGCGATATTGCCAC
TATTAACGCTACTAATGTTGATGCGGACAAAATAGCTAGTGATAACCCTATTTATGCTTCCATAGAGCCTGACATTACCA
AGCAATATGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAAGCTAAATTAGCTAAGGCTTTAGGTGGTAATAAAAAA
GATGACGATAAAGAAAAAAGTAAAAAATCTGCCACAGAAGCTAAAACAGAAAGCAATAAAATAGACAAAGATGTCGCAGA
AACTGCCAAGAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAATGGGGAATTTGTAGATGAAAATGGTAATC
CCATTGATGACAAAAAGAAAGCAGAAAAACAAGATGAAACAAGCCCTGTCAAACAGGCCTTTATAGGCAAGAGTGATCCC
ACATTTGTTTTAGCGCAATACACCCCTATTGAAATCACTCTGACTTCTAAAGTAGATGCCACTCTCACAGGTATAGTGAG
TGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCACTATGATCTTACTAGACAAAGGCACTAAGGTGTATGGGAATT
ATCAAAGCGTGAAAGGTGGCACACCTATTATGACACGCTTAATGATAGTCTTTACTAAAGCCATTACGCCTGATGGTGTG
ATAATACCTCTAGCAAACGCTCAAGCAGCAGGCATGCTGGGTGAAGCAGGGGTAGATGGCTATGTGAATAATCACTTTAT
GAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTGCAAACTGCGCCTATCATAGCTCTAGATAAAC
TCATAGGCCTTGGCAAAGGCAGAAGTGAAAGGACACCTGAATTTAATTACGCTTTGGGTCAAGCTATCAATGGTAGTATG
CAAAGTTCAGCTCAAATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCCCCAAGTTTTTACAAAAACGAGGGCGA
TAGTATTAAGATTCTCACAATGGACGATATTGATTTTAGCGGCGTGTATGATGTTAAAATTACCAACAAATCTGTGGTAG
ATGAAATCATCAAACAAAGCACTAAAACTTTGTCTAGAGAACATGAAGAAATCACCACAAGCCCCAAAGGTGGCAATTAA

Protein sequence :
MNEENDKLETSQKTQQHSPQDLSNEEATEANHFEDSSKESKESSDHHLDNPTETKTHFDEYELEETQTQIDSGGNETSES
SDGSLADKLFKKARKLVDNKRPFTQQKNLDEETQELNEEDDQGNNGYQEETQIDLIDDETSQKTQQHSPQDLSNEEATEA
NHFEDSSKESKESSDHHLDNYTETEINFDGDKSEEITNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFMPL
EDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGN
YEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLK
LINDPEIREKFRKELGLQKELQEYKDCIKNAKTEAEKNECLKGLSKEAIERLKQQALDCLKNAKTDEERNECLKNIPQDL
QKELLADMSVKAYKDCISKARNEKEKKECEKLLTPEAKKKLEQQVLDCLKNAKTDEERKKCLKDLPKDLQSDILAKESLK
AYKDCVSQARTEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKNEAEKKECEKLLTPEAKKKLEEAKKSVKAYL
DCASQARTEAEKKECEKLLTPEARKLLEQQALDCLKNAKTDEERKKCLKDLPKDLQKKVLAKESVKAYLDCVSQAKTEAE
KKECEKLLTPEARKLLEEAKESLKAYKDCVSRARNKKEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKNEAEKKE
CEKLLTPEAKKKLEEAKKSVKAYLDCVSKARTEAEKKECEKLLTPEAKKLLEQQALDCLKNAKTEADKKRCVKDLPKDLQ
KKVLAKESLKAYKDCVSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCVSQARNEKEKKECEKLLTPEAKKLLEQ
QALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVKAYLDCVSKARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDC
LSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKQVLNCLEKAGNEEE
RKACLKNLPKDLQENVLAKESLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSKARNEKEKQEC
EKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKCQN
LYSDLIQEIQNKRTQNKQNQLSKTERLHQASECLDNLDDPTDEQAIEQCLEGLSDSERALILGIKRQADEVDLIYSDLRN
RKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKDKNLEAKLAKALGGNKK
DDDKEKSKKSATEAKTESNKIDKDVAETAKNISEIALKNKKEKNGEFVDENGNPIDDKKKAEKQDETSPVKQAFIGKSDP
TFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITPDGV
IIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYALGQAINGSM
QSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
cagY AGC69789.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 97
HP0527 BAD13833.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 95
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 95
cagY YP_005777271.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 95
HP0527 BAD14052.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 95
cagY AGC69792.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 95
HP0527 BAD13998.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 94
cagY YP_005774542.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 94
HP0527 NP_207323.1 cag pathogenicity island protein (cag7) Virulence cag PAI Protein 0.0 94
HP0527 BAD14026.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 93
cag-Y AAF80198.1 Cag-Y Virulence cag PAI Protein 0.0 90
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 90
cagY AGC69787.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 89
HP0527 BAD13779.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 86
cagY YP_005779063.1 cag island protein Virulence cag PAI Protein 0.0 86
cagY AGC69786.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 86
HP0527 BAD13915.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 85
HP0527 BAD13806.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 85
HP0527 BAD13860.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 83
cagY YP_005775730.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 83
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 82
HP0527 BAD13943.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 80

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
HPIN_04260 YP_005782161.1 cag pathogenicity island protein (cag7) VFG0287 Protein 0.0 94