Gene Information

Name : K747_11675 (K747_11675)
Accession : YP_008357563.1
Strain : Helicobacter pylori UM032
Genome accession : NC_021215
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 921336 - 925106 bp
Length : 3771 bp
Strand : +
Note : Derived by automated computational analysis using gene prediction method: GeneMarkS+.
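The Length field is consistent with the Position coordinates if those are read as 1-based and inclusive. That reading is an assumption, not something the record states. A minimal sketch of the arithmetic, using a hypothetical helper name:

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Position field of this record.
length_bp = feature_length(921336, 925106)

# A complete ORF should be a whole number of codons, stop codon included.
codons, remainder = divmod(length_bp, 3)

print(length_bp, codons, remainder)  # 3771 1257 0
```

With 1257 codons, an intact reading frame would encode 1256 residues followed by one stop codon.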

DNA sequence :
ATGAAGAACGAAAAAAAATGTTTGAAAAATATTCCCAAAGACTTGCAAAAAGAACTACTAGCTGATATGAGCGTCAAGGC
TTACAAGGATTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAAAAATTGCTCACGCCTGAAGCGAAAA
AAAAGTTAGAACAACAGGTTCTAGATTGTTTAAAAAACGCTAAAACTGATGAAGAACGAAAAAAATGTTTGAAAGATCTC
CCTAAAGACTTACAAAGCGATATTTTAGCTAAAAAGAGCCTGAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGA
AAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAAGAAGCCAAAGAGAGCGTTA
AGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAGTTACTCACGCCTGAAGCG
AAAAAAAAGTTAGAAGAAGCCAAAAAAAGCGTTAAAGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAA
AAAAGAATGCGAGAAGTTACTCACGCCTGAAGCGAAAAAGCTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAGTGCTA
AAACTGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCCAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGTCTT
AAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGC
GAAAAAGCTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTG
AGAAAAAAGAATGCGAGAAGTTACTCACGCCTGAAGCGAAAAAGCTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAGT
GCTAAAACTGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAG
TCTTAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTG
AAGCGAAAAAACTTTTAGAAGAAGCTAAAAAGAGCCTGAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAA
GAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAAAGAGCCTGAAAGCTTA
TAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAATC
TTTTAGAACAACAAGCGCTAGATTGTTTGAAAAGTGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCT
AAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAATGAAAA
AGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAAGCCAAAGAGAGCCTGAAAGCTT
ATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTACTCACCCCTGAAGCGAGGAAA
CTTTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGATTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACA
AGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTTTTAGCGAAGCAAGTGCTAAGTTGTTTGGAAAAAGCTAGAA
ATGAAGAAGAAAGAAAAGCATGTCTTAAAAATATCCCTAAAGACTTACAGAAAAATGTTTTAGCTAAAGAGAGCCTGAAA
GCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTACTCACCCCTGAAGCGAG
GAAACTTTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGATTGCGTTTCAAGAGCTAGGAATGAAAAAGAGA
AACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTTTTAGCGAAAGAACTCCAACAAAAAGATAAAGCGATC
AAAGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCGGCTATTATGAAGTGTTTGGATGGTTTGAGCGATGAAGAGAA
ACTCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAGGACCGATGAAGAAAAAAGGA
AATGTCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAATAAACAAAATCAATTGAGTAAA
ACAGAAAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGATCAAGAAGCCATAGAGCAATGTTT
AGAAGGCTTGAGCGATAGTGAAAGGGCGCTAATTCTAGGAATTAAACGACAAGCTGATGAAGTGGATCTGATTTATAGCG
ATCTAAGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCATTGTTGCCAATGGATTTCAAAAATGGCGGC
GATATTGCCACTATTAACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATAATCCTATTTATGCTTCCATAGAGCC
TGACATTACCAAGCAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAAGCTAAATTAGCTAAGGCTTTAGGTG
GCAATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACCGCAGAAGCTAAAGTAGAAAGCAATAAGATAGACAAA
GATGTCGCAGAAACTGCCAAAAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAGTGGGGAATTTGTAGATGA
AAATGGTAATCCCATTGATGACAAAAAGAAAGCAGAAACACAAGATGAAACAAGCCCTGTCAAACAGGCCTTTATAGGCA
AGAGTGATCCCACATTTGTTTTAGCGCAATACACCCCTATTGAAATCACTCTGACTTCTAAAGTAGATGCCACTCTCACA
GGTATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCACTATGATCTTACTAGACAAAGGCACTAAGGT
GTATGGGAATTACCAAAGCGTGAAAGGTGGCACACCCATTATGACACGCTTAATGATAGTCTTTACTAAAGCCATTACGC
CTGATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGCATGCTGGGTGAAGCAGGGGTAGATGGCTATGTGAAT
AACCACTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTGCAAACTGCACCTATCATAGC
TCTAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAATTACGCTTTGGGTCAAGCTATCA
ATGGTAGTATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCCCCAAGTTTTTACAAA
AATGAGGGCGATAGTATTAAGATTCTCACAATGGACGATATTGATTTTAGTGGCGTGTATGATGTTAAAATCACCAACAA
ATCTGTGGTGGATGAAATTATCAAACAAAGCACTAAAACTTTGTCTAGAGAACATGAAGAAATCACCACAAGCCCCAAAG
GTGGCAATTAA

Protein sequence :
MKNEKKCLKNIPKDLQKELLADMSVKAYKDCVSKARNEKEKKECEKLLTPEAKKKLEQQVLDCLKNAKTDEERKKCLKDL
PKDLQSDILAKKSLKAYKDCVSRARNEKEKQECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEA
KKKLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEQQALDCLKSAKTDEERKKCLKDLPKDLQKKVLAKESL
KAYKDCVSRARNEKEKQECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEQQALDCLKS
AKTDEERKKCLKDLPKDLQKKVLAKESLKAYKDCVSRARNEKEKQECEKLLTPEAKKLLEEAKKSLKAYKDCVSRARNEK
EKQECEKLLTPEAKKLLEEAKKSLKAYKDCVSRARNEKEKQECEKLLTPEAKNLLEQQALDCLKSAKTEAEKKRCVKDLP
KDLQKKVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCLSQARNEEERRACEKLLTPEARK
LLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKQVLSCLEKARNEEERKACLKNIPKDLQKNVLAKESLK
AYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKELQQKDKAI
KDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQEIQNKRTQNKQNQLSK
TERLHQASECLDNLDDPTDQEAIEQCLEGLSDSERALILGIKRQADEVDLIYSDLRNRKTFDNMAAKGYPLLPMDFKNGG
DIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKDKNLEAKLAKALGGNKKDDDKEKSKKSTAEAKVESNKIDK
DVAETAKNISEIALKNKKEKSGEFVDENGNPIDDKKKAETQDETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDATLT
GIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVN
NHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYK
NEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGGN
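The protein sequence above can be checked against the DNA sequence by translating the coding strand codon by codon. The sketch below is an illustration only: it assumes the standard amino acid assignments (shared by the bacterial genetic code, NCBI translation table 11), and the names `CODON_TABLE` and `translate` are hypothetical. It is applied to the first and last codons of the DNA sequence rather than to the full 3771 bp.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position),
# paired with the standard amino acid assignments; '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAGAACGAAAAAAAATGT"))  # first 7 codons -> MKNEKKC
print(translate("GGTGGCAATTAA"))           # last 4 codons  -> GGN
```

Both results match the start (MKNEKKC) and end (GGN) of the protein sequence listed above.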

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
HP0527 | BAD13998.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 90
HP0527 | BAD13943.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 89
cag-Y | AAF80198.1 | Cag-Y | Virulence | cag PAI | Protein | 0.0 | 88
HP0527 | BAD13806.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 88
orf13/14 | NP_223194.1 | cag island protein | Virulence | cag PAI | Protein | 0.0 | 83