Gene Information

Name : K750_04135 (K750_04135)
Accession : YP_007982183.1
Strain : Helicobacter pylori UM037
Genome accession: NC_021217
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 777386 - 783151 bp
Length : 5766 bp
Strand : -
Note : Derived by automated computational analysis using gene prediction method: GeneMarkS+.

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATCTATCCAATGAAGAAGC
GACAGAAGCCAATCATTTTAAAGATCTTTTAAAAGAAGAAAGCTCAGACAATCATCTTGAAAACTCCACAGAAACTCAAA
CCAATTTTGATGGAGACAAGCTAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCTAGCAAT
GGCAGTCTAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAGAATTT
AGATGAAGAAACCCAAGAACTGAACGAAGAATACGATCAAGAAAATAGTGAGTATCAAGAAGAAACTCAAACGGGCTTAA
TTGATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATCTATCCAATGAAGAAGCGACAGAAGCCAATCAT
TTTAAAGATCTTTTAAAAGAAGAAAGCTCAGACAATCATCTTGACAATTCCACAGAAACTAAGAGCAATTTTGATGGAGA
CAAGTTAGAAGAAACCCAAACTCAAATAGATTCTGGAGGTAATGAAACTTCAGAATCTAGCGATGGCAGTCTAGCAGACA
AGTTATTCAAAAAAGCCAGAAAATTAGTTGATAATAAAAGACCTTTCACTCAGCAAAAAAATTTAGATGAAGAAACCCAA
GAATTGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAACGGACTTAATTGATGATGAAACTTC
TAAAAAAACCCAACAAGATTCACCCCAATATTCATCCAATGAAGAAGCGACAGAAGTCAATCATTTTGAAGATCTTTTAA
AAGAAGAAAGCTCAGACAATCATCTTGACAACCCCACAGAAACTCAAACCAATTTTGATGGAGACAAGTCAGAAGAAATA
ACTAACGACTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATATATTATTGGTGGCATTGTAGTCGCTGTTCT
TATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTCATGCCTTTGGAAGATAAAAGCTCTCGTTTTAGCAAAG
ACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTGAAAGAACGGAATGAAAAAGGC
AATATGATCGATAAAAATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTATTTGAATATTGCAGAAATTGA
GGACAAAAACCCGTTGAGAGCCTTTTATGAATGTATTAGTAATGGTGGCAACTATGAAGAATGTTTGAAGCTTATCAAAG
ACAAAAAACTTCAAGATCAGATGAAAAAGACTCTAGAGGCTTATAAAGACTGCATCAAAAATGCCAAAACTGAAGAAGAA
AGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACAAAAAGTTCAAGTGGCGCT
AGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCCTAAAACTCATAAATGACCCTGAGATTAGAGAGA
AATTCCGTAAGGAATTAGGGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAAAACGCCAAAACAGAAGCTGAG
AAAAACAAATGCTTGAAAAGCTTGTCTAAAGAAGCTATAGAAAGATTGAAACAGCAAGCGCTAGATTGTTTGAAAAACGC
TAAAACCGATGAAGAACGAAACGAGTGCTTGAAAAATATTCCCCAAGACTTGCAAAAAGAACTACTAGCTGATATGAGCG
TCAAGGCTTACAAGGATTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAA
GCGAGGAAAAAGTTAGAACAACAAGTGCTAGATTGTTTGAAAAACGCTAAAACTGATGAAGAACGAAAAAAGTGTTTGAA
AGATCTCCCTAAAGACTTGCAAAGCGATATTTTAGCTAAAGAGAGCGTTAAAGCTTATAAAGACTGCGTATCTCAAGCCA
AAACCGAAGATGAGAAAAAAGAGTGTGAGAAATTACTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAAAA
AGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACCGAAGATGAGAAAAAAGAGTGTGAGAAATTACTCACCCC
TGAAGCGAAAAAAAAGTTAGAAGAATCTAAAAAAAGCGTTAAGGCTTACTTGGACTGCGTATCTCAAGCCAAAACTGAAG
CTGAGAAAAAAGAATGCGAAAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAACAACAAGCGCTAGATTGTTTGAAA
AACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCTAAAGA
AAGCGTTAAGGCTTACTTGGACTGCGTATCTCAAGCCAAAAACGAAGCTGAGAAAAAAGAATGCGAAAAATTGCTCACGC
CTGAAGCGAGGAAACTCTTAGAAGAGGCTAAAGAGAGCGTTAAAGCTTATAAAGACTGCGTATCTCAAGCCAAAACCGAA
GATGAGAAAAAAGAGTGTGAGAAATTACTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAAAAAGCGTTAA
GGCTTATTTGGATTGCGTATCTCAAGCCAAAACCGAAGATGAGAAAAAAGAGTGTGAGAAATTACTCACCCCTGAAGCGA
AAAAAAAGTTAGAAGAATCTAAAAAAAGCGTTAAGGCTTACTTGGACTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAA
AAAGAATGCGAAAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAACAACAAGCGCTAGATTGTTTGAAAAACGCTAA
AACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCTAAAGAAAGCGTTA
AGGCTTATTTGGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCG
AAAAAAAAGTTAGAAGAATCTAAAGAAAGCGTTAAGGCTTATTTGGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAA
AAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAAAAAAAAGTTAGAAGAATCTAAAAAAAGCGTTAAGGCTTACTTGG
ACTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAAAAATTACTCACGCCTGAAGCGAAAAAACTTTTA
GAACAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGA
CTTGCAGAAAAAGGTTTTAGCTAAAGAAAGCGTTAAGGCTTACTTGGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGA
AAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAA
GACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTACTCACCCCTGAAGCGAGAAAACTCTT
AGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAAT
GCGAGAAATTACTCACCCCTGAAGCGAGAAAATTTTTAGCGAAGCAAGTGCTAAATTGTTTGGAAAAAGCTGGAAATGAA
GAAGAAAGGAAAGCATGTCTTAAAAATCTCCCTAAAGACTTACAGGAAAATGTTTTAGCTAAAGAGAGTCTTAAAGCTTA
TAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAACTACTCACCCCTGAAGCGAGAAAAC
TCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAACAA
GAATGCGAGAAATTACTCACCCCTGAAGCGAGAAAATTCTTAGCGAAAGAACTCCAACAAAAAGATAAAGCGATCAAAGA
TTGCTTGAAAAATGCCGATCCTAACGACAGAGCGGCTATTATGAAGTGTTTGGATGGTTTGAGCGATGAAGAGAAGCTCA
AATATCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAGGACCGATGAAGAAAAAAGGAAATGC
CAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAATAAACAAAATCAATTGAGCAAAACAGA
AAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACCGATCAACAAGCCATAGAGCAATGTTTAGAGG
GCTTGAGCGATAGTGAAAGGGCGCTAATTCTAGGAATCAAACGACAAGCTGATGAAGTGGATCTGATTTATAGCGATCTA
AGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCGTTGTTGCCAATGGATTTCAAAAATGGCGGCGATAT
TGCCACTATTAACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATAATCCTATTTATGCTTCCATAGAGCCCGACA
TTACCAAACAATACGAAACAGAAAAAACCATTAAGGATAAAAATTTAGAAGCTAAATTAGCTAAGGCTTTAGGTGGCAAT
AAAAAAGATGACGATAAAGAAAAAAGTGAAAAATCCACAGCAAAAGCTAAAGCAGAAAGCAATAAGATAGACAAAGATGT
CGCAGAAACTGCTAAGAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAGTGGGGACTTTGTAGATGAAAATG
GTAATCCCATTGATGACAAAAAGAAAGCAGAAACACAAGATGAAACAAGCCCTGTCAAACAAGCCTTTATAGGCAAGAGT
GATCCCACATTTGTTTTAGCACAATACACCCCCATTGAAATCACTCTGACTTCTAAAGTAGATGCCACTCTCACAGGTAT
AGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCACTATGATCTTACTAGACAAAGGCACTAAGGTGTATG
GGAATTATCAAAGCGTGAAAGGTGGCACACCTATTATGACACGCTTAATGATAGTTTTTACTAAAGCCATTACTCCTGAT
GGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGCATGTTGGGTGAAGCAGGGGTAGATGGCTATGTGAATAATCA
CTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTGCAAACTGCGCCTATCATAGCTCTAG
ATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAATTACGCTTTGGGTCAAGCTATCAATGGT
AGTATGCAAAGTTCAGCTCAAATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCTCCAAGTTTTTACAAAAACGA
GGGCGATAGTATTAAGATTCTCACAATGGACGATATTGATTTTAGTGGCGTGTATGATGTTAAAATTACCAACAAATCTG
TGGTAGATGAAATTATCAAACAAAGCACTAAAACTTTGTCTAGAGAACATGAAGAAATCACCACAAGCCCCAAAGGTGGC
AATTAA

Protein sequence :
MNEENDKLETSKKTQQHSPQDLSNEEATEANHFKDLLKEESSDNHLENSTETQTNFDGDKLEETQTQMDSGGNETSESSN
GSLADKLFKKARKLVDNKRPFTQQKNLDEETQELNEEYDQENSEYQEETQTGLIDDETSKKTQQHSPQDLSNEEATEANH
FKDLLKEESSDNHLDNSTETKSNFDGDKLEETQTQIDSGGNETSESSDGSLADKLFKKARKLVDNKRPFTQQKNLDEETQ
ELNEEDDQENNGYQEETQTDLIDDETSKKTQQDSPQYSSNEEATEVNHFEDLLKEESSDNHLDNPTETQTNFDGDKSEEI
TNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFMPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKG
NMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYKDCIKNAKTEEE
RIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLKLINDPEIREKFRKELGLQKELQEYKDCIKNAKTEAE
KNKCLKSLSKEAIERLKQQALDCLKNAKTDEERNECLKNIPQDLQKELLADMSVKAYKDCVSKARNEKEKKECEKLLTPE
ARKKLEQQVLDCLKNAKTDEERKKCLKDLPKDLQSDILAKESVKAYKDCVSQAKTEDEKKECEKLLTPEAKKLLEEEAKK
SVKAYLDCVSQAKTEDEKKECEKLLTPEAKKKLEESKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEQQALDCLK
NAKTEAEKKRCVKDLPKDLQKKVLAKESVKAYLDCVSQAKNEAEKKECEKLLTPEARKLLEEAKESVKAYKDCVSQAKTE
DEKKECEKLLTPEAKKLLEEEAKKSVKAYLDCVSQAKTEDEKKECEKLLTPEAKKKLEESKKSVKAYLDCVSQAKTEAEK
KECEKLLTPEAKKLLEQQALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEA
KKKLEESKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKKLEESKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLL
EQQALDCLKNAKTEAEKKRCVKDLPKDLQKKVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYK
DCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKKECEKLLTPEARKFLAKQVLNCLEKAGNE
EERKACLKNLPKDLQENVLAKESLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQ
ECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKC
QNLYSDLIQEIQNKRTQNKQNQLSKTERLHQASECLDNLDDPTDQQAIEQCLEGLSDSERALILGIKRQADEVDLIYSDL
RNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKDKNLEAKLAKALGGN
KKDDDKEKSEKSTAKAKAESNKIDKDVAETAKNISEIALKNKKEKSGDFVDENGNPIDDKKKAETQDETSPVKQAFIGKS
DPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITPD
GVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYALGQAING
SMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGG
N

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
HP0527 NP_207323.1 cag pathogenicity island protein (cag7) Virulence cag PAI Protein 0.0 95
HP0527 BAD13833.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 95
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 95
cagY YP_005777271.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 95
HP0527 BAD14052.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 94
cagY YP_005774542.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 94
HP0527 BAD14026.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 94
cagY AGC69792.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 94
cagY AGC69789.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 93
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 92
cagY AGC69786.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 92
HP0527 BAD13998.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 91
cagY AGC69787.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 91
HP0527 BAD13860.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 89
cagY YP_005775730.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 89
HP0527 BAD13943.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 87
HP0527 BAD13806.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 86
cagY YP_003728737.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 85
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 84
HP0527 BAD13915.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 83
cagY YP_005779063.1 cag island protein Virulence cag PAI Protein 0.0 82
HP0527 BAD13779.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 82
cagY AGC69785.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 82
cagY AGC69788.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 82
cag-Y AAF80198.1 Cag-Y Virulence cag PAI Protein 0.0 79

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
K750_04135 YP_007982183.1 hypothetical protein VFG0287 Protein 0.0 95