Gene Information

Name : CLH_3055 (CLH_3055)
Accession : YP_001922432.1
Strain : Clostridium botulinum Alaska E43
Genome accession: NC_010723
Putative virulence/resistance : Virulence
Product : type IV pilus assembly protein TapB
Function : -
COG functional category : N : Cell motility
COG ID : COG2804
EC number : -
Position : 3287792 - 3289480 bp
Length : 1689 bp
Strand : -
Note : identified by match to protein family HMM PF00437; match to protein family HMM PF05157

DNA sequence :
ATGGCAATTGAGAAAAAAAGATTAGGAAATATTCTTATAAATGCTGGTAAAATAAATAGTTATCAATTACAAGAAGCTCT
AAAATCTCAAAAAATTCTTGGTAAAAAATTGGGTGAAATATTAGTAGATAGTAATATCATTACAGAAGAAGAGATAATAG
AATCTATTGAACAACAAACAGGAATAAAGAAGGTAGATTTAAATACAATAACTTTTGATAATAAATCAATAGCTATAATA
CCTAAAAACTTATGTAGCAAATATTCACTAATACCTTTCGGATTTGATAATAATAAAATAAAAGTAGCTATGTCAGATCC
TCTTAATATTTATGCAATAGATGATGTTGCAATTTCAACAGGTTTTGAAATAGAAACCTTCATATCTAAAAAGAATGACA
TAAAAAAATTCATAGAAATATATTATAGTAGTCAACAAGTTAGTATGGCTGCACAACAGCTAGCTAAAGAAAGTTCAGAG
TCAAAAAGAAATAATATTGTAAATATTGAAGAGATTGATGATGTAAAGAATGCACCCGTTGTAAAGATGGTTGAATATCT
ATTTAAAAATTCAATAGAAATGAATGCATCAGACATACATATAGAGCCATTTGAGAATGAGATAAGGATAAGATATAGAA
TAGATGGACAACTACAACCAGTAAATATTTTAAATATAGATAGCTTAGGTCCATTAATAACAAGAATAAAAATTCTTGCA
GGGCTTAATATAGCAGAAAAAAGAATACCACAGGATGGTAGGATAATAGTTAACATTGGAGAAAAAGATGTAGACCTTAG
AGTATCTGTATTACCAGTAGTACATGGAGAAAAGGTAGTTATAAGAATATTAAACACATCTAATTATAATGTAAGTAAGG
ACAAATTAGGCATAAATGAAAAAAACTTAAAAAAGATAGATAAGATAATTTCAAATCCATATGGAATAGTATTAGTTACA
GGGCCAACAGGAAGTGGAAAATCAACAACATTATACAGCATTTTAAGTGAATTGAATTCTAATAATGTCAATATAGTTAC
AGTAGAAGATCCTGTTGAATATACACTTCCAGGGGTTAATCAAGTTAATGTAAATACAAAAGCTGGACTTACATTTGCAA
GTGGACTTAGAAGTATATTAAGGCAAGATCCTGATATAGTTATGATTGGTGAAATAAGAGATAATGAAACAGCTGAAATA
GCAATAAAAGCAGCAATTACAGGACATTTAGTATTAAGTACACTTCACACTAATGATGCTCCATCATCAATAATAAGACT
TATAGATATGGGTATAAAACCTTATTTAGTGTCTACCTCAGTTGTTGGAATATTAGCTCAAAGACTAGTCAGAAAAGTAT
GTAATAAGTGCAAAGAATCTTATGAAGCTAGTAAATATGAAAAAGAAATATTAGGTATAGATGAAAATGAAGATTTAAAA
TTATATAAATCTTCAGGATGTGGTTACTGTAATAATACAGGTTATTTAGGGAGAATTGGTGTTTATGAAGTTATGGAAAT
GACAAGAGATCATAGAGAAGCAATAAATGCTGGGGCTAATTCCGATATATTAAAAGATATTTCATTAAAAAACGGAATGA
CAACATTAGGTATGGAATGTAGAGAATTAGTTGTTAAAGGGGTAACTACTATAACAGAACTTGCCACAATAAGTTTATTA
AAGGATTAA

Protein sequence :
MAIEKKRLGNILINAGKINSYQLQEALKSQKILGKKLGEILVDSNIITEEEIIESIEQQTGIKKVDLNTITFDNKSIAII
PKNLCSKYSLIPFGFDNNKIKVAMSDPLNIYAIDDVAISTGFEIETFISKKNDIKKFIEIYYSSQQVSMAAQQLAKESSE
SKRNNIVNIEEIDDVKNAPVVKMVEYLFKNSIEMNASDIHIEPFENEIRIRYRIDGQLQPVNILNIDSLGPLITRIKILA
GLNIAEKRIPQDGRIIVNIGEKDVDLRVSVLPVVHGEKVVIRILNTSNYNVSKDKLGINEKNLKKIDKIISNPYGIVLVT
GPTGSGKSTTLYSILSELNSNNVNIVTVEDPVEYTLPGVNQVNVNTKAGLTFASGLRSILRQDPDIVMIGEIRDNETAEI
AIKAAITGHLVLSTLHTNDAPSSIIRLIDMGIKPYLVSTSVVGILAQRLVRKVCNKCKESYEASKYEKEILGIDENEDLK
LYKSSGCGYCNNTGYLGRIGVYEVMEMTRDHREAINAGANSDILKDISLKNGMTTLGMECRELVVKGVTTITELATISLL
KD

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
gspE YP_854406.1 type II secretion protein GspE Not tested PAI I APEC-O1 Protein 5e-78 41
gspE CAE85233.1 GspE, hypothetical type II secretion protein Not tested PAI V 536 Protein 4e-78 41

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
CLH_3055 YP_001922432.1 type IV pilus assembly protein TapB VFG0233 Protein 3e-78 47
CLH_3055 YP_001922432.1 type IV pilus assembly protein TapB VFG1876 Protein 2e-84 45
CLH_3055 YP_001922432.1 type IV pilus assembly protein TapB VFG0182 Protein 2e-81 43
CLH_3055 YP_001922432.1 type IV pilus assembly protein TapB VFG2048 Protein 6e-80 41