Gene Information

Name : K749_01335 (K749_01335)
Accession : YP_007980064.1
Strain : Helicobacter pylori UM299
Genome accession: NC_021216
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 468870 - 474425 bp
Length : 5556 bp
Strand : +
Note : Derived by automated computational analysis using gene prediction method: GeneMarkS+.

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAAATTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGC
AACAGAAGCCAATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACCCCACAGAAA
CTAAAACCAATTTTGATGGATACAAGTCAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCT
AGCAATGGCAATCTAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATGATAAAAGACCTTTCACTCAGCAAAA
GATTTTAGATGAAGAAATCCAAGAACCGAACGAAGAAGACGATCAAGAAAGTAATGGGTATCAAGAAGAAACTCAAATGG
ACTTAATTGATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCAACAGAAGCC
AATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCACCATCTTGACAACCCCACAGAAACTAAAACCAA
TTTTGATGGATACAAGTCAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCTAGCAATGGCA
ATCTAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATGATAAAAGACCTTTCACTCAGCAAAAGATTTTAGAT
GAAGAAATCCAAGAACCGAACGAAGAAGACGATCAAGAAAGTAATGGGTATCAAGAAGAAACTCAAATGGACTTAATTGA
TGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCAACAGAAGCCAATCACTTTG
AAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCACCATCTTGACAACCCCACAGAAACTAAAACCAATTTTGATGAA
TACGAGTCAGAAGAAATAACTAACGACTCTAACGATCAAGAAATTATCAAAGGAAGCAAAAAGAAATACATTATTGGTGG
CATTGTAGTCGCTGTTCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTCATACCTTTGGAAGATAAAA
GTTCTCGTTTTAGCAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTGAAA
GAACGGAATGAAAAAGGCAATATGATCGATAAAAATCTTTTCTTCAATGACGATCCCAATAGAACCCTATACAACTATTT
GAATATTGCAGAAATTGAGGACAAAAACCCATTGAGGGCCTTTTATGAATGTATTAGTAATGGTGGCAACTATGAAGAAT
GTTTGAAGCTTATCAAAGACAAAAAACTTCAAGATCAAATGAAAAGGACTTTAGAGGCTTATAATGACTGCATCAAAAAT
GCCAAAACTGAAGAAGAAAGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACA
AAAAGTTCAAGTGGCGCTGGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCCTAAAACTCATAAATG
ACCCTGAGATTAGAGAGAAATTCCGTAAGGAATTAGGGCTTCAAAAAGAACTTCAAGAGTATAAGGATTGTATCAAAAAC
GCTAAAACAGAAGCTGAGAAAAACGAATGCTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAACAGCAAGCGCT
AGATTGTTTGAAAAGTGCTAAAACTGATGAAGAACGAAAAAAATGTTTGAAAAATATTCCCAAAGACTTGCAAAAAGAAC
TACTAGCTGATATGAGCGTCAAGGCTTACAAGGATTGCGTATCAAAAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAA
AAATTGCTCACGCCTGAAGCGAAAAAAAAGTTAGAACAACAGGTTCTAGATTGTTTAAAAAACGCTAAAACTGATGAAGA
ACGAAAAAAATGTTTGAAAGATCTCCCTAAAGACTTACAAAGCGATATTTTAGCTAAAAAGAGCCTGAAAGCTTATAAAG
ACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAGCTTTTA
GAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATG
CGAGAAGTTACTCACGCCTGAAGCGAAAAAAAAGTTAGAAGAAGCCAAAAAAAGCGTTAAAGCTTATTTGGATTGCGTAT
CTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAGTTACTCACGCCTGAAGCGAAAAAGCTTTTAGAGCAACAA
GCGCTAGATTGTTTGAAAAGTGCTAAAACTGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCCAAAGACTTGCAGAA
AAAGGTTTTAGCCAAAGAGAGTCTTAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAAT
GCGAGAAATTACTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTATTTGGATTGC
GTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAGTTACTCACGCCTGAAGCGAAAAAGCTTTTAGAGCA
ACAAGCGCTAGATTGTTTGAAAAGTGCTAAAACTGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTGC
AGAAAAAGGTTTTAGCCAAAGAGAGTCTTAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAA
GAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGCTAAAAAGAGCCTGAAAGCTTATAAAGACTG
CGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAAG
AAGCTAAAAAGAGCCTGAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAA
TTACTCACGCCTGAAGCGAAAAATCTTTTAGAACAACAAGCGCTAGATTGTTTGAAAAGTGCTAAAACCGAAGCTGAGAA
AAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGACT
GCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAGCTTTTAGAA
GAAGCCAAAGAGAGCCTGAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAA
ATTACTCACCCCTGAAGCGAGGAAACTTTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGATTGCGTTTCAA
GAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTTTTAGCGAAGCAAGTG
CTAAGTTGTTTGGAAAAAGCTAGAAATGAAGAAGAAAGAAAAGCATGTCTTAAAAATATCCCTAAAGACTTACAGAAAAA
TGTTTTAGCTAAAGAGAGCCTGAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCG
AGAAATTACTCACCCCTGAAGCGAGGAAACTTTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGATTGCGTT
TCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTTTTAGCGAAAGA
ACTCCAACAAAAAGATAAAGCGATCAAAGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCGGCTATTATGAAGTGTT
TGGATGGTTTGAGCGATGAAGAGAAACTCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACG
GCTAGGACCGATGAAGAAAAAAGGAAATGTCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACA
AAATAAACAAAATCAATTGAGTAAAACAGAAAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTG
ATCAAGAAGCCATAGAGCAATGTTTAGAAGGCTTGAGCGATAGTGAAAGGGCGCTAATTCTAGGAATTAAACGACAAGCT
GATGAAGTGGATCTGATTTATAGCGATCTAAGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCATTGTT
GCCAATGGATTTCAAAAATGGCGGCGATATTGCCACTATTAACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATA
ATCCTATTTATGCTTCCATAGAGCCTGACATTACCAAGCAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAA
GCTAAATTAGCTAAGGCTTTAGGTGGCAATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACCGCAGAAGCTAA
AGTAGAAAGCAATAAGATAGACAAAGATGTCGCAGAAACTGCCAAAAATATCAGTGAAATCGCTCTTAAGAACAAAAAAG
AAAAGAGTGGGGAATTTGTAGATGAAAATGGTAATCCCATTGATGACAAAAAGAAAGCAGAAACACAAGATGAAACAAGC
CCTGTCAAACAGGCCTTTATAGGCAAGAGTGATCCCACATTTGTTTTAGCGCAATACACCCCTATTGAAATCACTCTGAC
TTCTAAAGTAGATGCCACTCTCACAGGTATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCACTATGA
TCTTACTAGACAAAGGCACTAAGGTGTATGGGAATTACCAAAGCGTGAAAGGTGGCACACCCATTATGACACGCTTAATG
ATAGTCTTTACTAAAGCCATTACGCCTGATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGCATGCTGGGTGA
AGCAGGGGTAGATGGCTATGTGAATAACCACTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCT
TCTTGCAAACTGCACCTATCATAGCTCTAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTT
AATTACGCTTTGGGTCAAGCTATCAATGGTAGTATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGAT
GAATATCCCCCCAAGTTTTTACAAAAATGAGGGCGATAGTATTAAGATTCTCACAATGGACGATATTGATTTTAGTGGCG
TGTATGATGTTAAAATCACCAACAAATCTGTGGTGGATGAAATTATCAAACAAAGCACTAAAACTTTGTCTAGAGAACAT
GAAGAAATCACCACAAGCCCCAAAGGTGGCAATTAA

Protein sequence :
MNEENDKLENSKKTQQHSPQDLSNEEATEANHFEDSSKESKESSDHHLDNPTETKTNFDGYKSEETQTQMDSGGNETSES
SNGNLADKLFKKARKLVDDKRPFTQQKILDEEIQEPNEEDDQESNGYQEETQMDLIDDETSKKTQQHSPQDLSNEEATEA
NHFEDSSKESKESSDHHLDNPTETKTNFDGYKSEETQTQMDSGGNETSESSNGNLADKLFKKARKLVDDKRPFTQQKILD
EEIQEPNEEDDQESNGYQEETQMDLIDDETSKKTQQHSPQDLSNEEATEANHFEDSSKESKESSDHHLDNPTETKTNFDE
YESEEITNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFIPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLK
ERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKRTLEAYNDCIKN
AKTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLKLINDPEIREKFRKELGLQKELQEYKDCIKN
AKTEAEKNECLKGLSKEAIERLKQQALDCLKSAKTDEERKKCLKNIPKDLQKELLADMSVKAYKDCVSKARNEKEKKECE
KLLTPEAKKKLEQQVLDCLKNAKTDEERKKCLKDLPKDLQSDILAKKSLKAYKDCVSRARNEKEKQECEKLLTPEAKKLL
EEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEQQ
ALDCLKSAKTDEERKKCLKDLPKDLQKKVLAKESLKAYKDCVSRARNEKEKQECEKLLTPEAKKLLEEEAKESVKAYLDC
VSQAKTEAEKKECEKLLTPEAKKLLEQQALDCLKSAKTDEERKKCLKDLPKDLQKKVLAKESLKAYKDCVSRARNEKEKQ
ECEKLLTPEAKKLLEEAKKSLKAYKDCVSRARNEKEKQECEKLLTPEAKKLLEEAKKSLKAYKDCVSRARNEKEKQECEK
LLTPEAKNLLEQQALDCLKSAKTEAEKKRCVKDLPKDLQKKVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLE
EAKESLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKQV
LSCLEKARNEEERKACLKNIPKDLQKNVLAKESLKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCV
SRARNEKEKQECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKT
ARTDEEKRKCQNLYSDLIQEIQNKRTQNKQNQLSKTERLHQASECLDNLDDPTDQEAIEQCLEGLSDSERALILGIKRQA
DEVDLIYSDLRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKDKNLE
AKLAKALGGNKKDDDKEKSKKSTAEAKVESNKIDKDVAETAKNISEIALKNKKEKSGEFVDENGNPIDDKKKAETQDETS
PVKQAFIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLM
IVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEF
NYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREH
EEITTSPKGGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
HP0527 BAD13998.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 92
HP0527 BAD13943.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 91
cagY AGC69789.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 91
cagY AGC69786.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 90
cag-Y AAF80198.1 Cag-Y Virulence cag PAI Protein 0.0 88
cagY AGC69785.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 88
cagY AGC69788.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 88
HP0527 BAD14052.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 87
HP0527 BAD13915.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 87
cagY YP_005777271.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 87
HP0527 BAD13833.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 86
HP0527 BAD14026.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 85
cagY YP_005774542.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 84
HP0527 NP_207323.1 cag pathogenicity island protein (cag7) Virulence cag PAI Protein 0.0 84
cagY YP_005775730.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 84
HP0527 BAD13860.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 84
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 83
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 83
cagY AGC69792.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 83
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 82
HP0527 BAD13779.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 79
cagY YP_005779063.1 cag island protein Virulence cag PAI Protein 0.0 79

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
K749_01335 YP_007980064.1 hypothetical protein VFG0287 Protein 0.0 84