Gene Information

Name : orf13/14 (jhp0476)
Accession : NP_223194.1
Strain : Helicobacter pylori J99
Genome accession: NC_000921
Putative virulence/resistance : Virulence
Product : cag island protein
Function : -
COG functional category : U : Intracellular trafficking, secretion and vesicular transport
COG ID : COG2948
EC number : -
Position : 518074 - 523533 bp
Length : 5460 bp
Strand : -
Note : similar to H. pylori 26695 gene HP0527

DNA sequence :
ATGAATGAAGAAAACGATAAACTTGAAACTTCTAAAAAAGCCCAACAAGATTCACCCCAAGATCTATCTAATGAAGAAGC
AACAGAAGCCAATCATTTTGAAGATCTTTTAAAAGAATCCACAGAAAGCTCAGACAATCATCTTGACAACCCCACAGAAA
CTAAAACCCATTTTGATGAAGACAAGCTAGAAGAAACCCAAACTCAAATGGATTCTGGAGGTAATGAAACTTCAGAATCT
AGCAATGGCAGTCTAGCAGACAAGTTATTCAAGAAAGCCAGAAAATTAGTTGATGATAAAAGACCTTTCACTCAGCAAAA
GAGTTTAGATGAAGAAACCCAAAAACTGAACGAAGAAGACGATCAAGAAAATAATGAGCATCAAGAAGAAACTCAAACGG
ACTTGATTGATGATGAAACTTCTGAAAAAACCCAACAAGATTCACCCCAAGATCTATCTAATGAAGAAGCAACAGAAGCC
AATCATTTTGAAGATCTTTTAAAAGAATCCACAGAAAGTTCAGATAATCATCTTGACAACCCCACAGAAAGCTCAGACAA
TCATCTTGACAACCCCACAGAAACTAAAACCCAAGAAACTAAAACCCATTTTGATGAAGACAAGCCAGAAGAAATAACTG
ACGACTCTAACGATCAAGAGATTATCAAAGGAAGCAAAAAGAAATACATTATTGGCGGCATTGTAGTCGCTGTTCTTATC
GTGATTATTTTATTTTCTAGAAGCATTTTTCACTACTTTGTACCTTTGGAAGATAAAAGCTCTCGTTTTAGCAAAGACAG
GAATCTTTATGTCAATGATGAAATCCAAATAAGGCAAGAGTATAACCGATTGCTGAAAGAACGGAATGAAAAAGGCAATA
TGATCGATAAGAATCTTTTCTTCAATGACGATCCCAATAGAACCTTATACAACTATTTGAATATTGCAGAAATTGAGGAC
AAAAACCCATTGAGAGCCTTTTATGAATGTATTAGTAATGGTGGCAACTATGAAGAATGTTTGAAGCTTATCAAAGACAA
AAAACTTCAAGATCAGATGAAAAAGACTCTAGAGGCTTATAATGACTGCATCAAAAATGCCAAAACTGAAGAAGAAAGGA
TCAAGTGTTTAGATTTAATCAAAGATGAAAACCTGAAAAAAAGCTTACTGAACCAACAAAAAGTTCAAGTGGCTCTAGAT
TGTTTGAAAAACGCTAAAACCGATGAAGAACGGAAAGAGTGCCTAAAACTCATAAATGACCCTGAGATTAGAGAGAAATT
CCGTAAGGAATTAGAGCTTCAAAAAGAGCTTCAAGAGTATAAGGATTGTATCAAAAACGCCAAAACAGAAGCTGAGAAAA
ACGAATGCTTGAAAGGCTTGTCTAAAGAAGCCATAGAAAGATTGAAACAACAAGCGCTAGATTGTTTGAAAAACGCTAAA
ACCGATGAAGAACGGAAAGAGTGCTTAAAAAATATTCCCCAAGACTTGCAAAAAGAACTACTAGCTGATATGAGCGTTAA
GGCTTACAAAGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGA
AAAAACTATTAGAGAATCAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGATGAAGAACGGAAAGAGTGCTTGAAAAAT
CTCCCTAAAGACTTACAAAGCGATATTTTAGCTAAAGAGAGCCTGAAAGCTTATAAAGACTGCGCATCTCAAGCCAAAAC
TGAAGCTGAAAAAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTTTTAGAAGAAGAAGCCAAAGAGAGCG
TTAAGGCTTATTTGGATTGCGTATCTCAGGCCAAAACTGAAGCTGAAAAAAAAGAATGCGAGAAATTGCTCACACCTGAA
GCGAAAAAAAAGTTAGAAGAAGCCAAAAAGAGCGTTAGAGCTTATTTGGATTGCGTATCAAAAGCCAAAAACGAAGCTGA
AAGAAAAGAATGCGAGAAATTACTCACGCCTGAAGCGAAAAAACTATTAGAGAATCAAGCGCTAGATTGTTTGAAAAACG
CTAAAACCGATGAAGAACGGAAAGAGTGCTTGAAAGATCTCCCTAAAGACTTACAGAAAAAGGTTTTAGCCAAAGAGAGT
GTTAGGGTTTATTTGGATTGCGTATCAAAAGCCAAAAACGAAGCTGAAAGAAAAGAATGCGAGAAATTGCTCACCCCTGA
AGCGAGAAAGCTATTAGAAGAAGCTAAGAAGAGCGTTAAGGCTTACAAAGACTGCGTTTTAAGAGCTAGGAATGAAAAAG
AGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAGGAAACTATTAGAAGAATCTAAAAAAAGCGTTAAGGCTTAT
TTGGATTGCGTATCAAAAGCCAAAAACGAAGCTGAAAGAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAGCT
ATTAGAAGAAGCTAAAGAGAGTGTTAAGGCTTACAAAGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAACAAGAAT
GCGAGAAATTACTCACGCCTGAAGCGAGGAAACTATTAGAAGAATCTAAAAAAAGCGTTAAGGCTTATTTGGATTGCGTA
TCAAAAGCCAAAAACGAAGCTGAAAGAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAGCTATTAGAAGAAGC
TAAAGAGAGTGTTAAGGCTTACAAAGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTAC
TCACGCCTGAAGCGAAAAAACTATTAGAGAATCAAGCGCTAGATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAA
AGGTGTGTCAAAGATCTCCCTAAAGACTTACAGAAAAAGGTTTTAGCCAAAGAGAGTGTTAGGGTTTATTTGGATTGCGT
ATCAAAAGCCAAAAACGAAGCTGAAAGAAAAGAATGCGAGAAATTGCTCACCCCTGAAGCGAGAAAGCTATTAGAAGAAG
CTAAAGAGAGTGTTAAGGCTTACAAAGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTA
CTCACCCCTGAAGCTAGGAAACTATTAGAGCAAGAAGTTAAAAAGAGCGTTAAGGCTTATTTAGACTGCGTTTCAAGAGC
TAGGAATGAAAAAGAGAAACAAGAATGCGAGAAATTACTCACGCCTGAAGCGAGGAAACTTTTAGAGAATCAAGCGCTAG
ATTGTTTGAAAAACGCTAAAACCGAAGCTGAGAAAAAAAGGTGTGTCAAAGATCTCCCTAAAGACTTGCAGAAAAAGGTT
TTAGCTAAAGAGAGCGTTAAGGCTTATTTAGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAAAAAGAATGCGAGAA
GTTGCTCACGCCTGAAGCGAGAAAACTATTAGAAGAATCTAAAAAAAGCGTTAAGGCTTATTTGGATTGCGTATCAAAAG
CCAAAAACGAAGCTGAAAAAAAAGAATGCGAGAAATTGCTCACACCTGAAGCGAGAAAGCTATTAGAAGAAGCTAAAGAG
AGTGTTAAGGCTTACAAAGACTGCGTATCAAGAGCTAGGAATGAAAAAGAGAAACAAGAATGCGAGAAACTACTCACCCC
TGAAGCGAGGAAACTATTAGAGCAAGAAGTTAAAAAGAGCGTTAAGGCTTATTTAGACTGTGTATCAAGAGCTAGGAATG
AAAAAGAGAAACAAGAATGCGAGAAATTACTCACCCCTGAAGCGAGGAAATTTTTAGAGAAACAGCGCCAACAAAAAGAT
AAAGCGATAAAGGATTGCTTGAAAAACGCCGATCCTAACGACAGAGCGGCTATCATGAAGTGTTTGGATGGTTTGAGCGA
TGAAGAGAAGCTCAAATACCTGCAAGAAGCCAGAGAAAAGGCTGTCTTGGATTGTTTGAAAACGGCTAGGACCGATGAAG
AAAAAAGGAAATGCCAAAACCTTTATAGCGATTTGATCCAAGAAATCCAAAATAAAAAGGCACAGAACAAACAAAATCAA
TTGAGTAAAACAGAAAGGTTGCATCAAGCAAGCGAGTGCTTGGATAACTTAGATGACCCTACTGATCAAGAAGCCATAGA
GCAATGTTTAGAGGGATTGAGCGATAGCGAAAGGGCGCTAATTCTAGGAATCAAACGACAAGCTGATGAAGTGGATCGGA
TTTATAGCGATCTAAGAAGCCGCAAAACCTTTGATAACATGGCGGCTAAAGGTTATCCATTGTTACCAATGGATTTTAAA
AATGGTGGCGATATTGCTACTATTAACGCCACTAATGTTGATGCGGACAAAATAGCTAGCGATAATCCTATTTATGCTTC
CATAGAGCCTGATATTACTAAGCAATACGAAACAGAGAAAACCATTAAGGATAAGAGTTTAGAAGCTAAATTAGCTAAGG
CTTTAGGTGGCGATAAGAAAGATGACGATAAAGAAAAAGGTAAAAAACCCACAGCAGAAACTAAAGCAGAAAGCAATAAG
ATAGACAAAGATGTCGCAGAAACTGCCAAAAATATCAGCGAAATCGCTCTTAAGAACAAAAAAGAAAAGAGTGGGGATTT
TGTAGATGAAAATGGTAATCCCATTGATGATAAAAAGAAAGAAGAAAAACAAGATGAAACAAGCCCTGTCAAACAGGCCT
TTATAGGCAAGAGTGATCCCACATTTGTTTTAGCACAATACACTCCCATTGAAATCACTCTGACTTCTAAAGTAGATGCC
ACTCTCACGGGTATAGTGAGTGGGGTTGTAGCCAAAGATGTATGGAACATGAACGGCACTATGATCTTATTAGACAAAGG
CACTAAGGTGTATGGGAATTATCAAAGCGTGAAAGGTGGCACGCCTATTATGACTCGTTTAATGATAGTCTTTACTAAAG
CTATTACGCCTGATGGGGTGATTATACCTCTAGCAAACGCTCAAGCAGCAGGCATGCTGGGTGAAGCAGGGGTAGATGGC
TATGTGAATAATCACTTCATGAAGCGCATAGGCTTTGCTGTGATAGCAAGCGTGGTTAATAGCTTCTTGCAAACTGCGCC
TATCATAGCCCTAGATAAACTCATAGGTCTTGGCAAAGGCAGAAGTGAAAGGACACCTGAATTTAATTACGCTTTGGGTC
AAGCTATCAATGGCAGTATGCAAAGTTCAGCTCAGATGTCTAATCAAATTCTAGGGCAACTGATGAATATCCCCCCAAGT
TTTTACAAAAATGAGGGCGATAGCATTAAGATTCTCACCATGGACGATATTGATTTTAGTGGCGTGTATGATGTTAAAAT
TACCAACAAATCTGTGGTAGATGAAATTATCAAACAAAGCACCAAAACTTTGTCTAGAGAACATGAAGAAATCACCACAA
GCCCCAAAGGTGGCAATTGA

Protein sequence :
MNEENDKLETSKKAQQDSPQDLSNEEATEANHFEDLLKESTESSDNHLDNPTETKTHFDEDKLEETQTQMDSGGNETSES
SNGSLADKLFKKARKLVDDKRPFTQQKSLDEETQKLNEEDDQENNEHQEETQTDLIDDETSEKTQQDSPQDLSNEEATEA
NHFEDLLKESTESSDNHLDNPTESSDNHLDNPTETKTQETKTHFDEDKPEEITDDSNDQEIIKGSKKKYIIGGIVVAVLI
VIILFSRSIFHYFVPLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIED
KNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSLLNQQKVQVALD
CLKNAKTDEERKECLKLINDPEIREKFRKELELQKELQEYKDCIKNAKTEAEKNECLKGLSKEAIERLKQQALDCLKNAK
TDEERKECLKNIPQDLQKELLADMSVKAYKDCVSRARNEKEKQECEKLLTPEAKKLLENQALDCLKNAKTDEERKECLKN
LPKDLQSDILAKESLKAYKDCASQAKTEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQAKTEAEKKECEKLLTPE
AKKKLEEAKKSVRAYLDCVSKAKNEAERKECEKLLTPEAKKLLENQALDCLKNAKTDEERKECLKDLPKDLQKKVLAKES
VRVYLDCVSKAKNEAERKECEKLLTPEARKLLEEAKKSVKAYKDCVLRARNEKEKQECEKLLTPEARKLLEESKKSVKAY
LDCVSKAKNEAERKECEKLLTPEARKLLEEAKESVKAYKDCVSRARNEKEKQECEKLLTPEARKLLEESKKSVKAYLDCV
SKAKNEAERKECEKLLTPEARKLLEEAKESVKAYKDCVSRARNEKEKQECEKLLTPEAKKLLENQALDCLKNAKTEAEKK
RCVKDLPKDLQKKVLAKESVRVYLDCVSKAKNEAERKECEKLLTPEARKLLEEAKESVKAYKDCVSRARNEKEKQECEKL
LTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKLLENQALDCLKNAKTEAEKKRCVKDLPKDLQKKV
LAKESVKAYLDCVSRARNEKEKKECEKLLTPEARKLLEESKKSVKAYLDCVSKAKNEAEKKECEKLLTPEARKLLEEAKE
SVKAYKDCVSRARNEKEKQECEKLLTPEARKLLEQEVKKSVKAYLDCVSRARNEKEKQECEKLLTPEARKFLEKQRQQKD
KAIKDCLKNADPNDRAAIMKCLDGLSDEEKLKYLQEAREKAVLDCLKTARTDEEKRKCQNLYSDLIQEIQNKKAQNKQNQ
LSKTERLHQASECLDNLDDPTDQEAIEQCLEGLSDSERALILGIKRQADEVDRIYSDLRSRKTFDNMAAKGYPLLPMDFK
NGGDIATINATNVDADKIASDNPIYASIEPDITKQYETEKTIKDKSLEAKLAKALGGDKKDDDKEKGKKPTAETKAESNK
IDKDVAETAKNISEIALKNKKEKSGDFVDENGNPIDDKKKEEKQDETSPVKQAFIGKSDPTFVLAQYTPIEITLTSKVDA
TLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQSVKGGTPIMTRLMIVFTKAITPDGVIIPLANAQAAGMLGEAGVDG
YVNNHFMKRIGFAVIASVVNSFLQTAPIIALDKLIGLGKGRSERTPEFNYALGQAINGSMQSSAQMSNQILGQLMNIPPS
FYKNEGDSIKILTMDDIDFSGVYDVKITNKSVVDEIIKQSTKTLSREHEEITTSPKGGN

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
orf13/14 NP_223194.1 cag island protein Virulence cag PAI Protein 0.0 100
HP0527 BAD13915.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 86
HP0527 BAD13860.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 84
cagY YP_005775730.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 84
HP0527 BAD13888.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 82
HP0527 BAD13998.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 81
HP0527 BAD13970.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 78
cagY YP_003728737.1 cag pathogenicity island protein Y Virulence cag PAI Protein 0.0 78
HP0527 BAD14052.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 78
HP0527 NP_207323.1 cag pathogenicity island protein (cag7) Virulence cag PAI Protein 0.0 78
HP0527 BAD13833.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 77
cagY YP_005777271.1 cag pathogenicity island protein Y VirB10-like protein Virulence cag PAI Protein 0.0 77
HP0527 BAD14026.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 77
cag-Y AAF80198.1 Cag-Y Virulence cag PAI Protein 0.0 77
cagY YP_005774542.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 77
HP0527 BAD13779.1 cag pathogenicity island protein Virulence cag PAI Protein 0.0 76
cagY YP_005779063.1 cag island protein Virulence cag PAI Protein 0.0 76

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
orf13/14 NP_223194.1 cag island protein VFG0287 Protein 0.0 78