Gene Information

Name : spaC1 (CDHC04_1883)
Accession : YP_005141192.1
Strain : Corynebacterium diphtheriae HC04
Genome accession : NC_016788
Putative virulence/resistance : Virulence
Product : putative surface-anchored fimbrial subunit
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2007721 - 2013330 bp
Length : 5610 bp
Strand : -
Note : Predicted outer membrane protein
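
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive at both ends; the record does not state this, but the assumption reproduces the listed Length. The function names `cds_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
def cds_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming it ends in one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # the final codon is the stop codon


length = cds_length(2007721, 2013330)
print(length)                  # 5610
print(protein_length(length))  # 1869
```

Under these assumptions the 5610 bp gene corresponds to 1870 codons, that is, 1869 amino acids plus a stop codon.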

DNA sequence :
GTGAGTAACCTACGTACCATCAAGAAACGAGCGTCTATCCCCGCAGCACTCGTCGCCATCCTCGCAATGGTCATGAGCGT
AGTACTCGTGCCGTTAATTGCAGCGCCATCAGCGAATGCGGAGCCACTGCCGAAAAAAGAGTTTGAAACCTGTGGCGGTT
CTGTTGCGATTTCCTTTGACTTGTCCAATTCCCTGAGCGCTTCGGACGTGGAAAAATCTAAGCAAGCAGCGTTGGAGCTG
GTCAAGAGCTTGAAAGGATCTCCCTATCGTTTTGGTATTTATACCTTTGCTTCACACTCGCCTGCTGCTGGAAACAAAAA
TTTCACGCCAGTAAGTCTTGCTAATGATGACGGATACAACAAAGTTGTTGCCGCTATCAATGACATCCAGATGCCAGCGA
TCCGAGAGAACAAAAAGGGTTCTCCCAACGGTGGTACCAACTGGGAGGGCGGGCTCCAAGCAATCGCGAATGACATAGAC
AGAGGCATCAAGTATGACGCCGTTTACTTCATCACTGATGGTCAACCAACTTGGGATAACAATGGGAGAAATTGGTTGGG
AACCACCACCGAGGTTGTGGAATTAGAAAATGCCGTTACCCAAGCCAAACTTATTTCTGATAAAGGCGCAAAACTTATTC
CGGTGGGTATTGGCCAGCTTTCTGATGATAAGCCGTTTGATCTCTATAAACCGATTCTTCCTTCCGAAGACGATTACTAT
TGGTCGCGTTATCCATGGAAAATAGATCGTTCCCTGACCGGCAAACAAATGCTGGAGAAGATAACCTCACCGGGCCTAGA
GCCAATTATTTTGCCCGACTATTCCACATTGCCGCAGCGAATGGGACAACAGATTTTTACCGGATGTTTCCAAATCGCTA
AAAACATTATTGATGCAGACGGAAACGTGATAGAAAATCCAGCTGGCTGGAATTTCGATATTACAGCGGCTGGTGTGCAA
GGCATTCCTCCGTCGATCGAGACAGATAAAAATGGCCAAGACACCTTTGCTATGAAGTCGATTAATAAGGAATCCTTTAA
GATCACTATCACTGAGCGACCTACCGGAGATCAAAAACAAAACTTCCGGTTTAAAAACGCACGCTGCCAGCGCTACTCCT
ATGGGCAAGCACCTACTGATATTCCGATCAAAACTAGCGATACTTCAATTACTTTGACTGCAGACACCAAAAGCTTGATC
TCTTGCTTTTTTAACAACCTGCCAGTTGTACCGGTTTCTGTTTCGAAAAAAGTAAACGTAAATACGCCACAACTTTTAGA
AGAGCTCAACAATCAAACCTTTGATTTCACCTATAGCTGTGAAAAAGGAGCTAATGAAAAAGAAATCAAGGGAAAAATTG
AGGGCGTCCGCAATGGAGAATCCAAAGAGATCGGAAAAGTTGCTGTTGGAACTCAGTGCGAAATCAAGGAAGTGACTCCC
AATGTCGACGATTCTCGGATGAAGCTTTCCACCACTTGGAGCAGCGAAAGTACTGCTGCAGTATCTAATGAGGCTGACGG
TACATATCGCTTTAAAGCTGGCATCGATGCGTTTAAAAACAAGAAAACAGTTCTAGCTACAGCAGAAAATAACTATGAAG
CTAAAACAGCCACTATTAAGCTGACCAAGAGCATCATTAACCGTGACAAAATTCCAGCAGCAAAACTGCCTAAGGAGTTT
CCTGTCACTTACACCTGTCGTTACTTACCACATCCTCATGCTCGTCCCGAACATGGTGGGCTCCCAGAAACCAATCCGTA
TTTTGTAGACTCTAAAACCGTTGTTGTTCCGCGTGATGGAATTATAGAAATCGGACCTTTTCCAGTGGGAACGCAGTGCA
GCTTTGAAGAAACTGCACGGCTTGATTCGAATGTTCAAGCAGACGCTAAAGTTCCTGGTTTTAGTTTGAAAACCGAGTGG
AAGTCCAACATCTGTTTCGGCAACACCACCGATAATAATTCTCAAGATTGTTCTACTAACTCAGTATGGATCCCTAAGCC
AGGCCAATACTCGATCAACGTAGAAAACACATACACACGTGAGCATGCGAGCGTGGAGATCGAAAAGAAGGTGAGCGGCG
ATGCCTCTGACCTCACGAATTCACACGAGTTTTCTTTTAACCTTCGATGTGAAGATTCCGGAGTAGAAGTCTATTCACAA
GACAATATTGTGGTGAAAAAAGACGGACGCCAAGTCATCGAAGACATTCCTGTTGATGCCAACTGTACGTTGATCGAGAA
ACAGCCTGAGCAAAAAGGCGTGGATTTTGTGGTCCCCGCGCCGTTCCGTTTACGTGCTTCAACTGCCGGCGAAATTGTCA
AAGTGGTTGTAGATAACACCGCAAAACGTCAGGTAGCTCCTATTTCAGTACAGAAAAAGGTAAATAAAAAAGACACATTT
TCTCCTGAAATTTCTGCAGCAATCGATGCATTAACCTACAACGTGGTGGCAGAATGTACGGTTCCTGGCGAAGAAACGCC
TCGAAAAGTTCTAAAAACAGTAAGTGATAATCAAACTGTTGACTTCGGAAGCTTTCCAGTAGGAACTACTTGTAGCTTTA
GCGAGCTCACCGAAGCCCCTGCCGGAACCGAAATGAATTATAAATTCGCGGATGGTCCAGAGGTGACAATTGAAGACTCC
ACTCCTATAAATAAGGTGCTGACGAATACGTTTGAAAACGCACATGGAGAGCTAACAGTAACCAAAAAGGTAATCCATGG
TGATATGCCTCAAGCATTAGTAGACCAGATTCCATCGAGCTTTACAGTCAACGTCGTATGCTCAATCACTGGTAATCATT
CCATCACTTTGCAAAAAGATGAGCAGAAAAGCGTACCAGGGATTGTTGCAGGTGATAGCTGCACATTAAGTGAGGAGGTA
ACTCCTATAATTGGTGCTATCCATCACAAGCACTGGATTAACGGCGAGCTGCATGAAGTTGCAGATTCTACAAACATCAC
GATTGACCCTAATGGTAGTAACGCAATTCGCTTGGAAAACCATTACGAAGCCGATGATGTGCCTTTGGAACTTACCAAAC
GTGTTCGGGTCATAGACCACACCGGAAATGACGTCAACTCGGAACTAAAAAATGCCATTGTCCAACCAGACCAGTCATTC
CTATTCCGATACCGTTGTGAAATCAATGGTCAAGTAGTTGCAGAAAATACCTTAAGTGCCGGAGAGATTAACGCTGGTTC
CACTAAGGTGCCACGAGGATCTACTTGCACGGTAGAAGAAGATACCTCTTCGGTGGAGCTACCCAATGCATCGTTATCTC
GTGTTGAGTTCTCCGTTGACGGGACAAACACGAATGATAAGGCATCGATAGCAATAAATTCGGATCAAAACCGACTAGAT
GCTACTAATACTTTCACGTTGAAGACTGGCTCATTTAACCTGAAAAAGAAGGTCGATGGTGAAGGCGTATCTACTATTCA
TAAGGATCGACGCTTTGAACTTGCGTATCGGTGCACCTTAGGTGATTGGAAGAAGGAAGGCTCTATTACGCTGGGACGTT
TTGATAGTGCCGAATCGCATTTTGTTAAAGACATTCCCGTGGGTGCATCATGTGAGATTATTGAGGACTCTGTAAAAGCC
CAAGAGCCAAACGCACAAGTAACAGCTCGTTGGACGCATACAGACAGCACGAATGGCTGGGGCGATACCGAAGCAGCATG
CGAAAATCATGCAGCCTGCGAGGTTGATCCAAAAAATGAGTTTGCAACTACAGTGATTATTACTGGAAATGAGAAAGAGA
ATTTCCAAGGAACCTTTGTTGTATGGAACACCTACACTTACGATAAAACAAAGGTAGAGATCAACAAGGTGTTGACGAAT
GATGGTCCAGAACTTGCTGGTAAAGATAACTTTGCCTTCACTTTGAAATGTACTGATCCTCGTTTTGCAGGAAGTGATTT
GGCAGATAAGAATTTCATTCCAGACCCCACAATTACAGTTGCGTTAAATGCTAAAGGCCAAAGCCGAGCGTCGTACCAAG
TTGCGGGCGAACGGCACGATAGTGTTGAGGTTCCTGTTGGGTATGACTGCACTGTGACCGAAAATCCGATTGCACTTTAC
GATGCCAAAGCAACGACCCAATTCAGTGGCCCGGAAGTGGTGGAAAATACAGCCGTGCAACGCACAGCATCAAACTCCGC
CTCGGCTCGTTTTGTCACGGTGAAGCAAGAAGATAATGGCACTCAAAAAATTCAGGTAACTAATGATTACATTCGTCCGC
GTGCCGATGTCACGGTGCATAAGACAATCAAAAAACCAAAGCACTCGGTAGATCCTTGGCTGCTTAGCACTACTTATCGC
ATCACTTATGTGTGCAACGATTCATACATCAAGGATCGTTCCTATTCAGGACACGTAGATGTAAAAGCAGATGCGGAACA
ACCAACGACAATCGTGGCGGATCCGATTACTGGCGTAAAAATTCCTGCGTCGGCAGTATGTACTTTCAGTGAAAACACCG
AAGGGCATTTACCAGATGAGGTAAAAGGCGTAGTGGATGAAACGAATAAAGTTGTTGAATTCGCTGGGGAACATGAAAAG
CGCTCCTATTTCACCCCAGAAATTGAAGACGTTGTCTTGTCGGAATCTGAACCAACACGAATTGAATTCACCAATTCGTA
CGTGATGCCTCAACGAATTTTGAGCCTACAAAAATATGTTGAGGGCGACCCCGGCCATGCTGTGATTACTCCAGAAGAAA
CATTTGAATTCTCCTACACCTGCACCATGCCGCATCTATTCCCAAATCAACCCAATCCTATGTCGCAGGAAGTAGGAAAC
AAGGTTGCACGTGGCGTCATTAAGATTCGAGAAGGTGAGACATGGCGATCTCCCGAGGTCCCTATTGGTACGTCCTGCAC
GATCAAAGAAGAAGACGACCCCGCCTTGCACACCAAGTTGGAAAACAATGCGCTGCGCATGGTGCCTACCTACTTTTTCC
CCACGGAGCGTGCAGGCGCTGCTAGTGCGCCAGTGATTCCGCCGTTGACAGGCCGCCCGATTTATAACGGCACGGAGCCT
CGCCTCCAGATGCCAGAATCAGGCATTGAGCTTAACGACGCCCACTCGCACACCGTGGTGATCAACAACGTGTACACCAC
TGACGCTGAGATCAACATTGCCAAGGTGAATGCCAATAACTCTCCGCTTCCCGGCGCGCACTTCGCCGTCTATGGGATAG
GGGAGAACGGTCAGCGTAAAGAGTCGGCTGAGGTTGCGGATGTGCCGGCGAAGTCGGTGGAGCAGGCGTTGTTTGCGGTG
CGCTTGCGCCCTGGTAGCTACGAGTTGGTGGAGACTCAGGCTCCTCAGGGTGCGCAGTTGTTGCCTAAGCCGTGGCGTTT
TGATGTCAAGGCTGCGAATGCGGGTGCGATGGGTGACCTTGAGGTGACCTTGGATAACTATGATGCTGATTCGGGGTTGA
TCACGGTGGAGCACCCGCAGGGTAAGCCGTGGTTGATCAAGGTGGCTAATGTGTCGGCATCCACACTGCCGTTGACTGGT
TCGAATGGGTATGTGCGGTGGCTGTTGGCCGGTGCTGTGGGCCTGTTGGTGGCTGCAGCATTGTGGTTAGTGGCGCGTCG
TAAGCGTTAG

Protein sequence :
MSNLRTIKKRASIPAALVAILAMVMSVVLVPLIAAPSANAEPLPKKEFETCGGSVAISFDLSNSLSASDVEKSKQAALEL
VKSLKGSPYRFGIYTFASHSPAAGNKNFTPVSLANDDGYNKVVAAINDIQMPAIRENKKGSPNGGTNWEGGLQAIANDID
RGIKYDAVYFITDGQPTWDNNGRNWLGTTTEVVELENAVTQAKLISDKGAKLIPVGIGQLSDDKPFDLYKPILPSEDDYY
WSRYPWKIDRSLTGKQMLEKITSPGLEPIILPDYSTLPQRMGQQIFTGCFQIAKNIIDADGNVIENPAGWNFDITAAGVQ
GIPPSIETDKNGQDTFAMKSINKESFKITITERPTGDQKQNFRFKNARCQRYSYGQAPTDIPIKTSDTSITLTADTKSLI
SCFFNNLPVVPVSVSKKVNVNTPQLLEELNNQTFDFTYSCEKGANEKEIKGKIEGVRNGESKEIGKVAVGTQCEIKEVTP
NVDDSRMKLSTTWSSESTAAVSNEADGTYRFKAGIDAFKNKKTVLATAENNYEAKTATIKLTKSIINRDKIPAAKLPKEF
PVTYTCRYLPHPHARPEHGGLPETNPYFVDSKTVVVPRDGIIEIGPFPVGTQCSFEETARLDSNVQADAKVPGFSLKTEW
KSNICFGNTTDNNSQDCSTNSVWIPKPGQYSINVENTYTREHASVEIEKKVSGDASDLTNSHEFSFNLRCEDSGVEVYSQ
DNIVVKKDGRQVIEDIPVDANCTLIEKQPEQKGVDFVVPAPFRLRASTAGEIVKVVVDNTAKRQVAPISVQKKVNKKDTF
SPEISAAIDALTYNVVAECTVPGEETPRKVLKTVSDNQTVDFGSFPVGTTCSFSELTEAPAGTEMNYKFADGPEVTIEDS
TPINKVLTNTFENAHGELTVTKKVIHGDMPQALVDQIPSSFTVNVVCSITGNHSITLQKDEQKSVPGIVAGDSCTLSEEV
TPIIGAIHHKHWINGELHEVADSTNITIDPNGSNAIRLENHYEADDVPLELTKRVRVIDHTGNDVNSELKNAIVQPDQSF
LFRYRCEINGQVVAENTLSAGEINAGSTKVPRGSTCTVEEDTSSVELPNASLSRVEFSVDGTNTNDKASIAINSDQNRLD
ATNTFTLKTGSFNLKKKVDGEGVSTIHKDRRFELAYRCTLGDWKKEGSITLGRFDSAESHFVKDIPVGASCEIIEDSVKA
QEPNAQVTARWTHTDSTNGWGDTEAACENHAACEVDPKNEFATTVIITGNEKENFQGTFVVWNTYTYDKTKVEINKVLTN
DGPELAGKDNFAFTLKCTDPRFAGSDLADKNFIPDPTITVALNAKGQSRASYQVAGERHDSVEVPVGYDCTVTENPIALY
DAKATTQFSGPEVVENTAVQRTASNSASARFVTVKQEDNGTQKIQVTNDYIRPRADVTVHKTIKKPKHSVDPWLLSTTYR
ITYVCNDSYIKDRSYSGHVDVKADAEQPTTIVADPITGVKIPASAVCTFSENTEGHLPDEVKGVVDETNKVVEFAGEHEK
RSYFTPEIEDVVLSESEPTRIEFTNSYVMPQRILSLQKYVEGDPGHAVITPEETFEFSYTCTMPHLFPNQPNPMSQEVGN
KVARGVIKIREGETWRSPEVPIGTSCTIKEEDDPALHTKLENNALRMVPTYFFPTERAGAASAPVIPPLTGRPIYNGTEP
RLQMPESGIELNDAHSHTVVINNVYTTDAEINIAKVNANNSPLPGAHFAVYGIGENGQRKESAEVADVPAKSVEQALFAV
RLRPGSYELVETQAPQGAQLLPKPWRFDVKAANAGAMGDLEVTLDNYDADSGLITVEHPQGKPWLIKVANVSASTLPLTG
SNGYVRWLLAGAVGLLVAAALWLVARRKR
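
The DNA sequence above starts with GTG, whereas the protein starts with M. In bacteria, GTG can serve as an alternative start codon that is read as methionine at the initiation position. The following minimal sketch illustrates this on the first and last codons of the record. It assumes the standard genetic code for internal codons and treats ATG, GTG and TTG as start codons (a simplifying assumption); `translate` and its `at_start` parameter are hypothetical names introduced here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: common bacterial start codons, all read as Met when initiating.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, at_start: bool = True) -> str:
    """Translate an in-frame DNA fragment; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("fragment length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    # An initiating GTG/TTG is decoded as methionine, not valine/leucine.
    if at_start and codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


print(translate("GTGAGTAACCTACGT"))                     # first 5 codons of the gene
print(translate("GCGCGTCGTAAGCGTTAG", at_start=False))  # last 6 codons of the gene
```

The first call yields MSNLR and the second ARRKR*, matching the beginning and end of the protein sequence listed above.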

• Homologs from PAI DB

Gene | GenBank Accn | Product | Virulence or Resistance | PAI or REI | Alignment Type | E-value | Identity (%)
DIP2010 | NP_940341.1 | surface-anchored membrane protein | Not tested | Not named | Protein | 0.0 | 93

• Homologs from VFDB (virulence genes)

Gene | GenBank Accn | Product | ID of source DB | Alignment Type | E-value | Identity (%)
spaC1 | YP_005141192.1 | putative surface-anchored fimbrial subunit | VFG2199 | Protein | 0.0 | 93