Gene Information

Name : K751_03260 (K751_03260)
Accession : YP_007983632.1
Strain : Helicobacter pylori UM066
Genome accession: NC_021218
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 984307 - 989700 bp
Length : 5394 bp
Strand : +
Note : Derived by automated computational analysis using gene prediction method: GeneMarkS+.
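
The Position and Length fields above are consistent with each other if the coordinates are read as 1-based and inclusive (an assumption; the record does not state its coordinate convention). A minimal sketch of that check follows; the function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Number of residues encoded by a CDS that ends with one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # subtract the terminal stop codon


# Values taken from the Position field of this record.
start, end = 984307, 989700
length = gene_length(start, end)
print(length)                  # 5394, matching the Length field
print(protein_length(length))  # 1797 residues expected in the protein
```

The second value gives the protein length implied by a 5394 bp coding sequence that ends in a single stop codon.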

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAAC
AACAGAAGCCAATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACCCCACAGAAA
CTAAAACCAATTTTGATGAATACGAGTCAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCT
AGCAATGGCAGTCTAGCAGACAAGTTATTCAAAAAAGCTAGAAAATTAGTTGATAATAAAAAACCTTTCACTCAGCAAAA
GAATTTAGATGAAGAAATCCAAGAACCGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAATGG
ACTTAATTGATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAGCAACAGAAGCC
AATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCTCAGATCATCATCTTGACAACCCCACAGAAGCTAAAACCAA
TTTTGATGAATACGAGTCAGAAGAAATAACTAACGATTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACA
TTATTGGTGGCATTGTAGTCGCTGTTCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTCATACCTTTG
GAAGATAAAAGCTCTCGTTTTAGCAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCG
ATTGCTGAAAGAACGGAATGAAAAAGGCAATATGATCGATAAGAATCTTTTTTTCAATGACGATCCCAATAGAACCTTAT
ACAACTATTTGAATATTGCAGAAATTGAGGACAAAAACCCATTGAGAGCCTTTTATGAATGTATTAGTAATGGTGGCAAC
TATGAAGAATGTTTGAAGCTTATCAAAGATAAAAAACTTCAAGATCAAATGAAAAAGACTTTAGAGGCTTATAATGACTG
CATCAAAAATGCCAAAACTGAAGAAGAAAGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTAC
TGAACCAACAAAAAGTTCAAGTAGCACTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCCTAAAA
CTCATAAATGACCCTGAGATTAGAGAGAAATTCCGTAAGGAATTAGGGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTG
TATCAAAAACGCCAAAACAGAAGATGAGAAAAACGAATGCTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAAC
AGCAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAAAAAGTGTTTGAAAAATATTCCCCAAGACTTG
CAAAAAGAACTACTAGCTGATATGAGCGTCAAGGCTTACAAGGATTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAA
AGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAAAAGTTAGAACAACAGGTTCTAGATTGTTTGAAAAACGCTAAAA
CTGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTACAAAGCGATATTTTAGCTAAAGAGAGTCTTAAA
GCTTATAAAGACTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAA
AAAGCTCTTAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTACTTGGATTGCGTATCTCAAGCTAAAACTGAAGCTGAGA
AAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAAGCTTACTTG
GATTGCGTGTCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACGCCTGAAGCGAAAAAACTTTT
AGAACAACAAGCGCTAGATTGTTTGAAAAGTGCTAAAACTGATGAAGAACGAAAAAAGTGTTTGAAAGATCTCCCTAAAG
ACTTGCAGAAAAAGGTTTTAGCTAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTATCTCAAGCCAAAACTGAAGCTGAG
AAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAGCTTTTAGAAGAAGCTAAAGAGAGCCTGAAAGCTTATAA
AGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAGTTACTCACCCCTGAAGCGAAAAAGCTTT
TAGAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTACTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAA
TGCGAGAAATTGCTCACCCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTATTTGGATTGCGT
ATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTACTCACCCCTGAAGCGAAAAATCTTTTAGAGCAAC
AAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAATCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAG
AAAAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGA
ATGCGAGAAGTTACTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCG
TATCTCAAGCCAAAACTGAAGATGAGAAAAAAGAATGCGAGAAGTTACTCACGCCTGAAGCGAAAAAGCTTTTAGAAGAA
GCTAAAGAGAGCCTAAAAGCTTATAAAGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATT
GCTCACGCCTGAAGCGAAAAATCTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAATCTGAGAAAA
AAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGC
GTTTCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAAGA
AGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAAT
TACTCACCCCTGAAGCAAAAAAACTTTTAGAGCAAGAAGTTAAGAAGAGTGTTAAGGCTTATTTGGACTGCGTTTCAAGA
GCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTTTTAGCGAAGCAAGTGCT
AAGTTGTTTGGAAAAAGCTAGAAATGAGGAAGAAAGAAAAGCATGTCTTAAAAATATCCCTAAAGACTTACAGAAAAATG
TTTTAGCTAAAGAGAGTCTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAG
AAATTACTCACCCCTGAAGCGAGAAAACTCTTAGAGCAAGAAGTTAAGAAGAGTGTTAAGGCTTATTTGGACTGCGTTTC
AAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGAAAATTTTTAGCGAAAGAAC
TCCAGCAAAAAGATAAAGCGATCAAAGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCGGCTATTATGAAGTGTTTG
GATGGTTTGAGCGATGAAGAGAAGCTCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGC
TAGGACCGATGAAGAAAAAAGGAAATGTCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAA
GCAAACAAAATCAATTGAGTAAAACAGAAAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGAT
CAAGAAGCCATAGAGCAATGTTTAGAGGGCTTGAGCGATAGTGAAAGGGCACTAATTCTAGGAATTAAACGACAAGCTGA
TGAAGTGGATCTGATTTATAGCGATCTAAGAAACCGCAAAACCTTTGATAACATGGCAGCTAAAGGTTATCCATTGTTGC
CAATGGATTTCAAAAATGGCGGCGATATTGCCACTATTAACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATAAT
CCTATTTATGCTTCCATAGAGCCTGACATTACCAAGCAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAAGC
TAAATTAGCTAAGGCTTTAGGTGGTAATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACCACAGAAGCTAAAA
CAGAAAGCAATAAGATAGACAAAGATGTCGCAGAAACTGCCAAGAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAA
AAGAGTGGGGAATTTGTAGATGAAAATGGTAATCCCATTGATGACAAAAAGAAAACAGAAACACAAGATGAAACAAGCCC
TGTCAAACAGGCCTTTATAGGCAAGAGTGATCCCACATTTGTTTTAGCGCAATACACCCCTATTGAAATCACTCTGACTT
CTAAAGTAGATGCCACTCTCACAGGTATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCACTATGATC
TTACTAGACAAAGGCACTAAGGTGTATGGGAATTACCAAAGCGTGAAAGGTGGCACACCCATTATGACACGCTTAATGAT
AGTCTTTACTAAAGCCATTACGCCTGATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGCATGCTGGGTGAAG
CAGGGGTAGATGGCTATGTGAATAATCACTTTATGAAACGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTC
TTGCAAACTGCACCTATCATAGCTCTAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAA
TTACGCTTTGGGTCAAGCTATCAATGGTAGTATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGA
ATATCCCCCCAAGTTTTTACAAAAATGAGGGCGATAGTATTAAGATTCTCACAATGGACGATATTGATTTTAGTGGCGTG
TATGATGTTAAAATTACCAACAAATCTGTGGTAGATGAAATTATCAAACAAAGCACTAAAACTTTGTCTAGAGAGCATGA
AGAAATCACCACAAGCCCCAAAGGTGGCAATTAA

Protein sequence :
MNEENDKLETSKKTQQHSPQDLSNEETTEANHFEDSSKESKESSDHHLDNPTETKTNFDEYESEETQTQMDSGGNETSES
SNGSLADKLFKKARKLVDNKKPFTQQKNLDEEIQEPNEEDDQENNGYQEETQMDLIDDETSKKTQQHSPQDLSNEEATEA
NHFEDSSKESKESSDHHLDNPTEAKTNFDEYESEEITNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFIPL
EDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGN
YEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLK
LINDPEIREKFRKELGLQKELQEYKDCIKNAKTEDEKNECLKGLSKEAIERLKQQALDCLKNAKTDEERKKCLKNIPQDL
QKELLADMSVKAYKDCVSRARNEKEKKECEKLLTPEAKKKLEQQVLDCLKNAKTDEERKKCLKDLPKDLQSDILAKESLK
AYKDCVSQAKTEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKKLEEAKKSVKAYL
DCVSQAKTEAEKKECEKLLTPEAKKLLEQQALDCLKSAKTDEERKKCLKDLPKDLQKKVLAKESVKAYLDCVSQAKTEAE
KKECEKLLTPEARKLLEEAKESLKAYKDCVSRARNEKEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKE
CEKLLTPEAKKKLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEAKNLLEQQALDCLKNAKTESEKKRCVKDLPKDLQ
KKVLAKESVKAYLDCVSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCVSQAKTEDEKKECEKLLTPEAKKLLEE
AKESLKAYKDCVSRARNEKEKQECEKLLTPEAKNLLEQQALDCLKNAKTESEKKRCVKDLPKDLQKKVLAKESVKAYLDC
VSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYKDCLSQARNEEERRACEKLLTPEAKKLLEQEVKKSVKAYLDCVSR
ARNEKEKQECEKLLTPEARKFLAKQVLSCLEKARNEEERKACLKNIPKDLQKNVLAKESLKAYKDCLSQARNEEERRACE
KLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKELQQKDKAIKDCLKNADPNDRAAIMKCL
DGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQEIQNKRTQSKQNQLSKTERLHQASECLDNLDDPTD
QEAIEQCLEGLSDSERALILGIKRQADEVDLIYSDLRNRKTFDNMAAKGYPLLPMDFKNGGDIATINATNVDADKIASDN
PIYASIEPDITKQYETEKTIKDKNLEAKLAKALGGNKKDDDKEKSKKSTTEAKTESNKIDKDVAETAKNISEIALKNKKE
KSGEFVDENGNPIDDKKKTETQDETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMI
LLDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSF
LQTAPIIALDKLIGLGKGRSERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGV
YDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGGN
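
The relation between the DNA and protein sequences above can be illustrated with a small translation sketch. It assumes the standard codon-to-amino-acid assignments (which the bacterial genetic code, translation table 11, shares for sense codons) and only handles upper-case A/C/G/T input; `translate` is a hypothetical helper, not part of the database.

```python
from itertools import product

# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First eight codons of the DNA sequence in this record.
print(translate("ATGAATGAAGAAAACGATAAACTT"))  # MNEENDKL
```

The output agrees with the first eight residues of the protein sequence listed above.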

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-val | Identity (%)
HP0527 | BAD13833.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 98
cagY | YP_005777271.1 | cag pathogenicity island protein Y VirB10-like protein | Virulence | cag PAI | Protein | 0.0 | 98
cagY | YP_005774542.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 98
HP0527 | BAD14052.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 98
cagY | AGC69792.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 97
HP0527 | BAD13970.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 96
HP0527 | NP_207323.1 | cag pathogenicity island protein (cag7) | Virulence | cag PAI | Protein | 0.0 | 96
cagY | AGC69786.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 95
HP0527 | BAD14026.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 94
cagY | AGC69789.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 94
HP0527 | BAD13888.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 93
HP0527 | BAD13998.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 92
cagY | AGC69787.1 | cag pathogenicity island protein Y | Virulence | cag PAI | Protein | 0.0 | 91
HP0527 | BAD13860.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 89
cagY | YP_005775730.1 | cag pathogenicity island protein Y VirB10-like protein | Virulence | cag PAI | Protein | 0.0 | 89
HP0527 | BAD13806.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 87
HP0527 | BAD13915.1 | cag pathogenicity island protein | Virulence | cag PAI | Protein | 0.0 | 85
orf13/14 | NP_223194.1 | cag island protein | Virulence | cag PAI | Protein | 0.0 | 84

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-val | Identity (%)
K751_03260 | YP_007983632.1 | hypothetical protein | VFG0287 | Protein | 0.0 | 96