Gene Information

Name : HPKB_0815 (HPKB_0815)
Accession : YP_005762323.1
Strain : Helicobacter pylori 52
Genome accession: NC_017354
Putative virulence/resistance : Virulence
Product : conjugation TrbI family protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 855278 - 860974 bp
Length : 5697 bp
Strand : +
Note : Consensus COG by CONSORF: COG2948

DNA sequence :
ATGAATGAAGAAAACGATAAATTTGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAAC
AACAGAAGCCAATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCCCAGATCATCATCTTGACAACCCCACAGAAA
CTAAAACCAATTTTGATGAATACGAGTCAGAAGAAACCCAAACTCAAATGGATTTTGGAGGTAATGAAACTTCAGAATCT
AGCAATGGCAGTCTAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATGATAAAAGACCTTTCACTCAACAAAA
GAATTTAGATGAAGAAATCCAAGAACCGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAATGG
ACTTAATTGATGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAACAACAGAAGCC
AATCACTTTGAAGATTCTTCAAAAGAATCCAAAGAAAGCCCAGATCATCATCTTGACAACCCCACAGAAACTAAAACCAA
TTTTGATGAATACGAGTCAGAAGAAACCCAAACTCAAATGGATTTTGGAGGTAATGAAACTTCAGAATCTAGCAATGGCA
GTCTAGCAGACAAGTTATTCAAAAAAGCCAGAAAATTAGTTGATGATAAAAGACCTTTCACTCAACAAAAGAATTTAGAT
GAAGAAATCCAAGAACCGAACGAAGAAGACGATCAAGAAAATAATGGGTATCAAGAAGAAACTCAAATGGACTTAATTGA
TGATGAAACTTCTAAAAAAACCCAACAACATTCACCCCAAGATTTATCCAATGAAGAAACAACAGAAGCCAATCACTTTG
AAGATTCTTCAAAAGAATCCAAAGAAAGCCCAGATCATCATCTTGACAACCCCACAGAAACTAAAACCAATTTTGATGAA
TACGAGTCAGAAGAAATAACTAACGATTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACATTATTGGTGG
CATTGTAGTCGCTGTTCTTATCGTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTCATGCCTTTGGAAGATAAAA
GCTCTCGTTTTAGCAAAGACAGGAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCAATTGCTGAAA
GAACGGAATGAAAAAGGCAATATGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTATTT
AAATATTGCAGAAATTGAGGACAAAAACCCATTGAGAGCCTTTTATGAATGTATCAGTAATGGTGGCAACTATGAAGAAT
GTTTGAAGCTTATCAAAGACAAAAAACTTCAAGATCAGATGAAAAAGACTCTAGAGGCTTATAAAGACTGCATCAAAAAT
GCCAAAACTGAAGAAGAAAGGATCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACA
AAAAGTTCAAGTAGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAGTGCCTAAAACTCATAAATG
ACCCTGAGATTAGAGAGAAATTCCGTAAGGAATTAGGGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAAAAC
GCCAAAACAGAAGCTGAGAAAAACGAATGCTTGAAAGGCTTGTCTAAAGAAGCTATAGAAAGATTGAAACAGCAAGCGCT
AGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGAAACGAATGCTTGAAAAATATTCCCCAAGACTTGCAAAAAGAAC
TACTAGCTGATATGAGCGTCAAGGCTTACAAGGATTGCGTATCAAGAGCTAGGAATGAAAAAGAAAAAAAAGAATGCGAA
AAATTGCTCACGCCTGAAGCGAAAAAAAAGTTAGAACAACAGGTTCTAGATTGTTTGAAAAACGCTAAAACTGATGAAGA
ACGAAAAAAGTGTTTGAAAGATCTCCCTAAAGACTTACAAAGCGATATTTTAGCTAAAGAGAGCGTTAAGGCTTATAAAG
ACTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACGCCCGAAGCGAAAAAACTTTTA
GAAGAAGAAGCCAAAGAGAGCGTTAAGGCTTACTTGGATTGCGTATCTCAAGCTAGAACTGAAGCTGAGAAAAAAGAATG
CGAGAAATTGCTCACCCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTACTTGGATTGCGTAT
CTCAAGCTAGAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAACTTTTAGAGCAACAA
GTGCTAGATTGTTTGAAAAATGCTAAAACCGATGAAGAACGAAAAAAATGTTTGAAAGATCTCCCTAAAGACTTGCAGAA
AAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTACTTGGATTGCGTATCTCAAGCCAAAACTGAAGCTGAGAAAAAAGAAT
GCGAAAAATTACTCACCCCTGAAGCGAAAAAACTCTTAGAAGAAGCCAAAGAGAGTCTGAAAGCTTATAAAGACTGCGTT
TCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAAAAAACTTTTAGAAGAAGA
AGCCAAAGAGAGCGTTAAGGCTTACTTGGATTGCGTATCTCAAGCTAGAACTGAAGCTGAGAAAAAAGAATGCGAGAAAT
TGCTCACCCCTGAAGCGAAAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTACTTGGATTGCGTATCTCAAGCT
AGAACTGAAGCTGAGAAAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGGAAACTCTTAGAGCAAGAAGTTAAGAA
GAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAATGAAAAAGAAAGAAAAGCTTGCGAGAAACTACTCACCC
CTGAAGCGAGAAAAAAGTTAGAAGAAGCTAAAAAAAGCGTTAAGGCTTACTTGGATTGCGTATCTCAAGCTAGAACTGAA
GCTGAGAAAAAAGAATGCGAGAAACTACTCACCCCTGAAGCGAGGAAACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAA
GGCTTATTTGGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGA
AGAAATTTTTAGAGCAACAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAACTGAGAAAAAAAGGTGTGTCAAAGAT
CTCCCTAAAGACTTGCAGAAAAAGGTTTTAGCCAAAGAGAGCGTTAAGGCTTATTTGGACTGCGTTTCAAGAGCTAGGAA
TGAAAAAGAAAGAAAAGCTTGCGAGAAACTACTCACCCCTGAAGCGAAAAAGCTTTTAGAAGAAGCTAAAGAGAGTCTTA
AAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGAAAAGCTTGCGAGAAATTACTCACGCCTGAAGCG
AGGAAACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGATTGCGTTTCAAGAGCTAGAAATGAAAAAGA
GAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTCTTAGCGAAGCAAGTGCTAAATTGTTTGGAAAAAG
CTGGAAATGAAGAAGAAAGAAAAGCATGTCTTAAAAATCTCCCTAAAGACTTACAGGAAAATGTTTTAGCCAAAGAGAGT
CTTAAAGCTTATAAAGACTGCCTCTCTCAAGCTAGAAATGAAGAAGAAAGGAGAGCTTGCGAGAAATTACTCACGCCTGA
AGCGAGGAAACTCTTAGAGCAAGAAGTTAAGAAGAGCGTTAAGGCTTATTTGGATTGCGTTTCAAGAGCTAGAAATGAAA
AAGAGAAACAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAATTCTTAGCGAAAGAACTCCAACAAAAAGATAAA
GCGATCAAAGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCGGCCATTATGAAGTGTTTGGATGGTTTGAGCGATGA
AGAGAAGCTCAAATACCTGCAAGAAGCTAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAGGACCGATGAAGAAA
AAAGAAAATGTCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAGGACACAAAGCAAACAAAATCAATTG
AGTAAAACAGAAAGATTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGATCAAGAAGCCATAGAGCA
ATGTTTAGAGGGCTTGAGCGATAGTGAAAGGGCGCTAATTCTAGGAATTAAACGACAAGCTGATGAAGTGGATCTGATTT
ATAGCGATCTAAGAAACCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCGTTGTTGCCAATGGATTTCAAAAAT
GGCGGCGATATTGCCACTATTAACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATAATCCTATTTATGCTTCCAT
AGAGCCTGACATTACTAAGCAATACGAAACAGAAAAAACCATTAAGGATAAGAATTTAGAAGCTAAATTAGCTAAGGCTT
TAGGTGGCAATAAAAAAGATGACGATAAAGAAAAAAGTAAAAAATCCACAGCAGAATCTAAAGTAGAAAGCAATAAGATA
GACAAAGATGTCGCAGAAACTGCCAAGAATATCAGTGAAATCGCTCTTAAGAACAAAAAAGAAAAGAGTGGGGAATTTGT
AGATGAAAATGGTAATCCCATTGATGACAAAAAGAAAGCAGAAAAACAAGATGAAACAAGCCCTGTCAAACAGGCCTTTA
TAGGCAAGAGTGATCCCACATTTGTTTTAGCGCAATACACCCCTATTGAAATCACTCTGACTTCTAAAGTAGATGCCACT
CTCACAGGTATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCACTATGATCTTACTAGACAAAGGCAC
TAAGGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACACCCATTATGACACGCTTAATGATAGTCTTTACTAAAGCCA
TTACGCCTGATGGTGTGATAATACCTCTAGCAAACGCTCAAGCAGCAGGTATGCTGGGTGAAGCAGGGGTAGATGGCTAT
GTGAATAATCACTTTATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTACAAACTGCGCCTAT
CATAGCTCTAGATAAACTCATAGGCCTTGGCAAAGGTAGAAGTGAAAGGACACCTGAATTTAATTACGCTTTGGGTCAAG
CTATCAATGGTAGTATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCCCCAAGTTTT
TACAAAAACGAGGGCGATAGTATTAAGATTCTCACAATGGACGATATTGATTTTAGCGGTGTGTATGATGTTAAAATTAC
CAACAAATCTGTGGTAGATGAAATTATCAAACAAAGCACTAAAACTTTGTCTAGAGAACATGAAGAAATCACCACAAGCC
CCAAAGGTGGCAATTAA

Protein sequence :
MNEENDKFETSKKTQQHSPQDLSNEETTEANHFEDSSKESKESPDHHLDNPTETKTNFDEYESEETQTQMDFGGNETSES
SNGSLADKLFKKARKLVDDKRPFTQQKNLDEEIQEPNEEDDQENNGYQEETQMDLIDDETSKKTQQHSPQDLSNEETTEA
NHFEDSSKESKESPDHHLDNPTETKTNFDEYESEETQTQMDFGGNETSESSNGSLADKLFKKARKLVDDKRPFTQQKNLD
EEIQEPNEEDDQENNGYQEETQMDLIDDETSKKTQQHSPQDLSNEETTEANHFEDSSKESKESPDHHLDNPTETKTNFDE
YESEEITNDSNDQEIIKGSKKKYIIGGIVVAVLIVIILFSRSIFHYFMPLEDKSSRFSKDRNLYVNDEIQIRQEYNQLLK
ERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYKDCIKN
AKTEEERIKCLDLIKDENLKKSLLNQQKVQVALDCLKNAKTDEERNECLKLINDPEIREKFRKELGLQKELQEYKDCIKN
AKTEAEKNECLKGLSKEAIERLKQQALDCLKNAKTDEERNECLKNIPQDLQKELLADMSVKAYKDCVSRARNEKEKKECE
KLLTPEAKKKLEQQVLDCLKNAKTDEERKKCLKDLPKDLQSDILAKESVKAYKDCVSQAKTEAEKKECEKLLTPEAKKLL
EEEAKESVKAYLDCVSQARTEAEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQARTEAEKKECEKLLTPEAKKLLEQQ
VLDCLKNAKTDEERKKCLKDLPKDLQKKVLAKESVKAYLDCVSQAKTEAEKKECEKLLTPEAKKLLEEAKESLKAYKDCV
SRARNEKEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQARTEAEKKECEKLLTPEAKKKLEEAKKSVKAYLDCVSQA
RTEAEKKECEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKERKACEKLLTPEARKKLEEAKKSVKAYLDCVSQARTE
AEKKECEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEAKKFLEQQALDCLKNAKTETEKKRCVKD
LPKDLQKKVLAKESVKAYLDCVSRARNEKERKACEKLLTPEAKKLLEEAKESLKAYKDCLSQARNEEERKACEKLLTPEA
RKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKQVLNCLEKAGNEEERKACLKNLPKDLQENVLAKES
LKAYKDCLSQARNEEERRACEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLAKELQQKDK
AIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQEIQNKRTQSKQNQL
SKTERLHQASECLDNLDDPTDQEAIEQCLEGLSDSERALILGIKRQADEVDLIYSDLRNRKTFDNMAAKGYPLLPMDFKN
GGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKDKNLEAKLAKALGGNKKDDDKEKSKKSTAESKVESNKI
DKDVAETAKNISEIALKNKKEKSGEFVDENGNPIDDKKKAEKQDETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDAT
LTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDGY
VNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPSF
YKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
HP0527 BAD13998.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 98
HP0527 BAD14052.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 95
cagY YP_005774542.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 95
HP0527 BAD13833.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 95
cagY YP_005777271.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 95
cagY AGC69789.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 94
cagY AGC69792.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 94
HP0527 NP_207323.1 cag pathogenicity island protein (cag7) Virulence cag PAI Protein 0.0 93
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 93
HP0527 BAD14026.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 92
cagY AGC69786.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 92
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 91
cagY AGC69787.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 88
HP0527 BAD13860.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 86
cagY YP_005775730.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 86
HP0527 BAD13806.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 85
cagY YP_003728737.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 83
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 81
HP0527 BAD13779.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 81
cagY YP_005779063.1 cag island protein Virulence cag PAI Protein 0.0 81
cagY AGC69785.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 80
cagY AGC69788.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 80

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
HPKB_0815 YP_005762323.1 conjugation TrbI family protein VFG0287 Protein 0.0 93