Name : K747_11675 (K747_11675) Accession : YP_008357563.1 Strain : Helicobacter pylori UM032 Genome accession: NC_021215 Putative virulence/resistance : Virulence Product : hypothetical protein Function : - COG functional category : - COG ID : - EC number : - Position : 921336 - 925106 bp Length : 3771 bp Strand : + Note : Derived by automated computational analysis using gene prediction method: GeneMarkS+. DNA sequence : ATGAAGAACGAAAAAAAATGTTTGAAAAATATTCCCAAAGACTTGCAAAAAGAACTACTAGCTGATATGAGCGTCAAGGC TTACAAGGATTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAAAAATTGCTCACGCCTGAAGCGAAAA AAAAGTTAGAACAACAGGTTCTAGATTGTTTAAAAAACGCTAAAACTGATGAAGAACGAAAAAAATGTTTGAAAGATCTC CCTAAAGACTTACAAAGCGATATTTTAGCTAAAAAGAGCCTGAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGA AAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAAGAAGCCAAAGAGAGCGTTA AGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAGTTACTCACGCCTGAAGCG AAAAAAAAGTTAGAAGAAGCCAAAAAAAGCGTTAAAGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAA AAAAGAATGCGAGAAGTTACTCACGCCTGAAGCGAAAAAGCTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAGTGCTA AAACTGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCCAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGTCTT AAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGC GAAAAAGCTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTG AGAAAAAAGAATGCGAGAAGTTACTCACGCCTGAAGCGAAAAAGCTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAGT GCTAAAACTGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAG TCTTAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTG AAGCGAAAAAACTTTTAGAAGAAGCTAAAAAGAGCCTGAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAA GAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAAAGAGCCTGAAAGCTTA TAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAATC TTTTAGAACAACAAGCGCTAGATTGTTTGAAAAGTGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCT AAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAATGAAAA AGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAAGCCAAAGAGAGCCTGAAAGCTT ATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTACTCACCCCTGAAGCGAGGAAA CTTTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGATTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACA AGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTTTTAGCGAAGCAAGTGCTAAGTTGTTTGGAAAAAGCTAGAA ATGAAGAAGAAAGAAAAGCATGTCTTAAAAATATCCCTAAAGACTTACAGAAAAATGTTTTAGCTAAAGAGAGCCTGAAA GCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTACTCACCCCTGAAGCGAG GAAACTTTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGATTGCGTTTCAAGAGCTAGGAATGAAAAAGAGA AACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTTTTAGCGAAAGAACTCCAACAAAAAGATAAAGCGATC AAAGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCGGCTATTATGAAGTGTTTGGATGGTTTGAGCGATGAAGAGAA ACTCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAGGACCGATGAAGAAAAAAGGA AATGTCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAATAAACAAAATCAATTGAGTAAA ACAGAAAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGATCAAGAAGCCATAGAGCAATGTTT AGAAGGCTTGAGCGATAGTGAAAGGGCGCTAATTCTAGGAATTAAACGACAAGCTGATGAAGTGGATCTGATTTATAGCG ATCTAAGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCATTGTTGCCAATGGATTTCAAAAATGGCGGC GATATTGCCACTATTAACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATAATCCTATTTATGCTTCCATAGAGCC TGACATTACCAAGCAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAAGCTAAATTAGCTAAGGCTTTAGGTG GCAATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACCGCAGAAGCTAAAGTAGAAAGCAATAAGATAGACAAA GATGTCGCAGAAACTGCCAAAAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAGTGGGGAATTTGTAGATGA AAATGGTAATCCCATTGATGACAAAAAGAAAGCAGAAACACAAGATGAAACAAGCCCTGTCAAACAGGCCTTTATAGGCA AGAGTGATCCCACATTTGTTTTAGCGCAATACACCCCTATTGAAATCACTCTGACTTCTAAAGTAGATGCCACTCTCACA GGTATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCACTATGATCTTACTAGACAAAGGCACTAAGGT GTATGGGAATTACCAAAGCGTGAAAGGTGGCACACCCATTATGACACGCTTAATGATAGTCTTTACTAAAGCCATTACGC CTGATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGCATGCTGGGTGAAGCAGGGGTAGATGGCTATGTGAAT AACCACTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTGCAAACTGCACCTATCATAGC TCTAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAATTACGCTTTGGGTCAAGCTATCA ATGGTAGTATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCCCCAAGTTTTTACAAA AATGAGGGCGATAGTATTAAGATTCTCACAATGGACGATATTGATTTTAGTGGCGTGTATGATGTTAAAATCACCAACAA ATCTGTGGTGGATGAAATTATCAAACAAAGCACTAAAACTTTGTCTAGAGAACATGAAGAAATCACCACAAGCCCCAAAG GTGGCAATTAA Protein sequence : MKNEKKCLKNIPKDLQKELLADMSVKAYKDCVSKARNEKEKKECEKLLTPEAKKKLEQQVLDCLKNAKTDEERKKCLKDL PKDLQSDILAKKSLKAYKDCVSRARNEKEKQECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEA KKKLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEQQALDCLKSAKTDEERKKCLKDLPKDLQKKVLAKESL KAYKDCVSRARNEKEKQECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEQQALDCLKS AKTDEERKKCLKDLPKDLQKKVLAKESLKAYKDCVSRARNEKEKQECEKLLTPEAKKLLEEAKKSLKAYKDCVSRARNEK EKQECEKLLTPEAKKLLEEAKKSLKAYKDCVSRARNEKEKQECEKLLTPEAKNLLEQQALDCLKSAKTEAEKKRCVKDLP KDLQKKVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCLSQARNEEERRACEKLLTPEARK LLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKQVLSCLEKARNEEERKACLKNIPKDLQKNVLAKESLK AYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKELQQKDKAI KDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQEIQNKRTQNKQNQLSK TERLHQASECLDNLDDPTDQEAIEQCLEGLSDSERALILGIKRQADEVDLIYSDLRNRKTFDNMAAKGYPLLPMDFKNGG DIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKDKNLEAKLAKALGGNKKDDDKEKSKKSTAEAKVESNKIDK DVAETAKNISEIALKNKKEKSGEFVDENGNPIDDKKKAETQDETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDATLT GIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVN NHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYK NEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGGN |
Gene | GenBank Accn | Product | Virulance or Resistance | PAI or REI | Alignment Type | E-val | Identity |
HP0527 | BAD13998.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 90 |
HP0527 | BAD13943.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 89 |
cag-Y | AAF80198.1 | Cag-Y | Virulence | cag PAI | Protein | 0.0 | 88 |
HP0527 | BAD13806.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 88 |
orf13/14 | NP_223194.1 | cag island protein | Virulence | cag PAI | Protein | 0.0 | 83 |