Gene Information

Name : spaC (CDHC01_1907)
Accession : YP_005136699.1
Strain : Corynebacterium diphtheriae HC01
Genome accession: NC_016786
Putative virulence/resistance : Virulence
Product : fimbrial associated sortase
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2017267 - 2022873 bp
Length : 5607 bp
Strand : -
Note : Predicted outer membrane protein

DNA sequence :
GTGAGCAACCTACGCACGATCAAAAAACGAGCATCTATCCCCGCAGCACTCATCGCCATCATCGCAATGGTCATGAGCGT
AGTACTCGTGCCGTTAATTGCAGCGCCATCAGCGAATGCGGAGCCACTGCCGAAAAAAGAGTTTGAAACCTGTGGCGGTT
CTGTTGCGATTTCCTTTGACTTGTCCAATTCCCTGAGCGCTTCGGACGTGGAAAAATCTAAGCAAGCAGCGTTGGATCTG
GTCAAGAGCTTGAAAGGATCTCCCTATCGCTTTGGTATTTATACCTTTGCTTCACACTCGCCTGCTGCTGGAAACAAAAA
TTTCACGCCAGTAAGTCTTGCTAATGAAGACGGATACAACAAAGTTGTTGCCGCTATCAACGACATCCAGATGCCAGCGA
TCCGAGAGAACAAAAAGGGTTCTCCCAACGGTGGTACCAACTGGGAGGGTGGACTCCAAGCAATCGCGAATGACATAGAC
AGCGGCATCAAGTATGACGCCGTTTACTTCATCACTGATGGTCAACCAACTTGGGATAACAATGGGCGAAATTGGTTGGG
AAACACCACCGAGGTTGTGGAATTAGAAAATGCCGTTACCCAAGCCAATCGTATTTCTGATAAAGGTGCAAAACTTATTC
CGGTGGGTATCGGCCAGCTTTCTGATGATAAGCCGTTTGATCTCTATAAACCGATTCTTCCTTCCGAATACTATTATCCG
CCGCGTGATCCATGGAAAATAGATCGTTCCCTGACCGGCAAACAAATGCTGGAGAAGATAACCTCACCAGGTCTAGCGCC
AATTATTTTGCCCGACTATTCCACATTGCCGGAGCAAATGGGGCAACAGATTTTTACCGGATGTTTCCAAATCGCTAAAA
ACATTATTGATGCAGACGGAAACGTGATAGAAAATCCAGCTGGCTGGAATTTCGATATTAAAGCGGCTCGCGAGCAAGGC
ATTCCTACGTCGATCGTGACCGATAAAAACGGTCAAGATACCTTTGCTATGAAGTCGATTAATAAAAAATCCTTCGAGGT
CACTATCACTGAACGACCTACCGGAGATCAAGAACAAAACTTCCGGTTTAAAGACGCACGCTGCCAGCGCTACTCCTATG
GGCAAGCACCTACTGATATTCCGATCAAAACTAGCGATACTTCAATTACTTTGACTGCAGACACCAAAAGCTTGATCTCT
TGCTTTTTTAACAACCTGCCAGTTGTACCGGTTTCTGTTTCGAAAAAAGTAAACGTAAATACGCCACAACTTTTAGAAGA
GCTCAACAATCAAACCTTTGATTTCACCTATAGCTGTGAAAAAGGAGCTAATGAAAAAGAAATCAAGGGAAAAATTGAGG
GCGTCCGCAATGGAGAATCCAAAGAGATCGGAAAAGTTGCTGTTGGAACTCAGTGCGAAATCAAGGAAGTGACTCCCAAT
GTCGACGATTCTCGGATGAAGCTTTCCACCACTTGGAGCAGCGAAAGTACTGCTGCAGTATCTAATGAGGCTGACGGTAC
ATATCGCTTTAAAGCTGGCATCGATGCGTTTAAAAACAAGAAAACAGTTCTAGCTACAGCAGAAAATAACTATGAGGCTA
AAACAGCCACTATTAAGCTGACCAAGAGCATCATTAACCGTGACAAAATTCCAGCAGCAAAACTGCCTAAGGAGTTTCCT
GTCACTTACACCTGTCGTTACTTACCACATCCTTATGCTCGTCCCGAACATGGTGGGCTCCCAGAAACCAATCCGTATTT
TGTAGACTCTAAAACCGTTGTTGTTCCGCGTGATGGAATTATAGAAATCGGACCTTTTCCAGTGGGAACGCAGTGCAGCT
TTGAAGAAACTGCACGGCTTGATTCGAATGTTCAAGCAGACGCTAAAGTTCCTGGTTTTAGTTTGAAAACCGAGTGGAAG
TCCAACATCTGTTTCGGCAACACCACCGATAATAATTCTCAAGATTGTTCTACTAACTCAGTATGGATCCCTAAGCCAGG
CCAATACTCGATCAACGTAGAAAACACATACACACGTGAGCATGCGAGCGTGGAGATCGAAAAGAAGGTGAGCGGCGATG
CCTCTGACCTCACGAATTCACACGAGTTTTCTTTTAACCTTCGATGTGAAGATTCCGGAGTAGAAGTCTATTCACAAGAC
AATATTGTGGTGAAAAAAGACGGACGCCAAGTCATCGAAGACATTCCTGTTGATGCCAACTGTACGTTGATCGAGAAACA
GCCTGAGCAAAAAGGCGTGGATTTTGTGGTCCCCGCGCCGTTCCGTTTACGTGCTTCAACTGCCGGCGAAATTGTCAAAG
TGGTTGTAGATAACACCGCAAAACGTCAGGTAGCTCCTATTTCAGTACAGAAAAAGGTAAATAAAAAAGACACATTTTCT
TCTGAAATTTCTGACTCAATCGATGCATTAACTTACAGGGTGGTGGCAGAATGTACGGTTCCTGGTGAAGAAACGCCTCG
AGAAGTCATACAAACAGTAAGTGATAATCAAATTGTTAACTTCGGAAGCTTTCCAGTAGGAACTACTTGTAGTTTTAGAG
AGCTCACCGAAGCCCCTGCCGGAACCGCAATGAGTTATGAATTCGCGGATGGTCCAGAGGTGACAATTGAGGACTCCACA
CCTATAAATAAGGTGCTGACGAATACGTTTGAAAACGCACGTGGCGAGCTAACAGTAACCAAAAAAGTACTCGATGGTGA
TATGCCTCAAGCATTAGTAGACCAGATTCCATCGAGCTTTACTGTCAACGTCGCATGCTCAATCACTGGTAATCATTCCA
TCACTTTGCAAAAAGATGAGCAGAAAAGCGTACCAGGGATTGTTGCAGGTGATAGCTGCACATTAAGTGAGGAGGTAACT
CCTATAACTGGTGCTATCCATCACAAGCACTGGATTAACGGCGAGCTGCATGAAGTTGCAGATTCTACAGACATCACGAT
TGACCCTAATGGTAGTAACGCAATTCGCTTGGAAAACCATTACGAAGCCGATGATGTGCCTTTGGAACTTACCAAACGTG
TTCGGGTCATAGACCACACCGGAAATGACGTCAACTCGGAACTAAAAAATGCCATTGTTCAACCAGACCAGTCATTCCTA
TTCCGATACCGTTGTGAAATCAATGGTCAAGTAGTTGCAGAAAATACCTTAAGTGCCGGAGAGATTAACGCTGGTTCCAC
TAAGGTGCCACGAGGATCTACTTGCACGGTAGAAGAAGATACCTCTTCGGTGGAGCTACCCAATGCATCGTTATCTCGTG
TTGAGTTCTCCGTTGACGGGACAAAGACGAATGATAAGGCATCGATATCAATAAATTCGGATCAAAACCGACTAGAGGCT
ACTAATACTTTCACGTTGAAGACTGGCTCATTTAACCTGAAAAAGAAGGTCGATGGTGAAGGCGTATCTACTATTCATGA
GGATCGACGCTTTGAACTTAAGTATCGGTGCACCTTAGGTGACTGGAAGAAGGACGGCCCCATTACGCTGGGACGTTTTG
ATAGTGCCGAATCGCATTCTGTTAAAGACATTCCCGTGGGTGCATCATGTGAGATTATTGAGGACTCTGGAAAAGCCCAA
GAGCCAAACGCACAAGTGACAGCTCGTTGGACGCATACAGACAGCACGAATGGCTGGGGCGATACCGAAGCAGCATGCGA
AAATCATGCAGCCTGCGAGGTTAATCCAGAAAATGAGTTTGCAACCACAGTGATGATTACTGGAAATGAGAAAGAGAATT
TCCAAGGAACCTTTGTTGTATGGAACACCTACACTTACGATAAAACAAAGGTAGAGATCAACAAGGTGTTGACGAATGAT
GGCCCAGAACTTGCTGGTAAAGATGACTTTGCCTTCACCTTGAAATGTACTGATCCTCGTTTTGCAGAAAGTGATTTGGC
AGATAAACATTCCATTCCAGACCCCACAATTACAGTTGCATTAAATGCTAAAGGCCAAAGCCGAGCGTCGTACCAAGATG
CGAACGAACGGCACGATAGCGTTGAGGTTCCTGTTGGGTATAACTGCACTGTGACCGAAAACCCGATTGCACTTTACGAT
GCCAAGGCGACGACCCAATTCAGTGGTCCGGCAGTGGTGGAAAATACAGCCGTGCAACGCACAGCATCAAACTCCGCCTC
GGCTCGTTTTGTCACGGAGAAGCAAGAAAATAATGGCACTCAAAAAATTCAGGTAACTAATGATTACATTCGTCCGCGTG
CCGATGTCATGGTGCATAAGACAATCGCAAAACCAGAACACTCGGTACATCATTGGTTGCCTAACACTACATACAGCATC
ACTTATAAGTGCGACGATCCATACATCAAGGATCGTTCCTATTCAAACGACGTAGATGTACAAGCTGATGCAGCAGAACC
AACGCCAATTTTCGCTGATCCTACGGCTCACGTAAAAATTCCTGCGTCGGCAGTATGTACTTTCAGTGAAAACACCGAAG
GGCATTTACCAGATGAGGTAAAAGGCGTAGTGGATGAAACGAATGAAGTTGCTGAATTCGCTGGGGAACATGAAAAGCGC
TCCTATTTCACCCCAGAAATTAAAGATGTTGTCTTGTCGGAATCTGAACCAACACGAATTGAATTCACCAATTCGTACGT
GATGCCTCAACGAATTTTGAGCCTACAAAAATATGTTGAGGGCGACCCCGGCCATGCTGTGATTGCTCCAGAAGAAGCAT
TTGAATTCTCCTACACCTGCACCATGCCGCATCTATTCCCAAATCAACCCAATCCTATGTCGCAGGAAGTAGGAAACAAG
GTTGCACGTGACGTCATTAAGATTCGAGAAGGTGAGACATGGCGATCTCCTGAAGTGCCCATCGGCACGTCTTGCACGAT
CAAAGAAGAAACAGATGCCACCTTGCATGCCAAGTTGGAAACTAACGCGCTGCGGATGGTGCCTACCTATTTGTTCCCCA
CAGAACGTGCAGGTGGTTCTACTACGCCGGTAGCACCGAATATGACTGGACGTCCTGCCTACAACGGCACAGAGGCCCGC
CACCAGATGCCAGAATCAGGCATTGAGCTTAACGACGCCCACTCGCACACCGTGGTGATCAACAACGTGTACACCACCGA
CGCTGAGATCAACATTGCCAAGGTGAATGCCGATAACTCTCCGCTCCCCGGCGCGCACTTCGCCGTCTATGGGATAGGGG
AGAATGGCCAGCGTAATGAGTTGGCTGAGGTTGCGGATGTGCCGGCGCAGTCTGTGGAGCAGGCGTTGTTTGCGGTGCGC
TTGCGCCCTGGTAGTTACGAGTTGGTGGAGACTCAGGCTCCTCAGGGTGCGCAGTTGTTGCCTAAGCCCTGGCGTTTTGA
TGTCAAGGCTGCGAATGCGGGTGCGATGGGTGATCTTGAGGTGACCTTGGATAACTATGATGCTGATTCTGGTTTGATCA
CAGTGGAGCGCCCGCAGGGTAAGCCGTGGTTGATCAAGGTGGCTAATGTGTCGGCATCCACACTGCCATTGACTGGTTCG
AATGGTTACTTGCGGTGGCTGTTGGCCGGTGCTGCGGGCCTGTTGGTGGCAGCAGCACTGTGGCTAGTGGCGCGTCGTAA
GCGTTAG

Protein sequence :
MSNLRTIKKRASIPAALIAIIAMVMSVVLVPLIAAPSANAEPLPKKEFETCGGSVAISFDLSNSLSASDVEKSKQAALDL
VKSLKGSPYRFGIYTFASHSPAAGNKNFTPVSLANEDGYNKVVAAINDIQMPAIRENKKGSPNGGTNWEGGLQAIANDID
SGIKYDAVYFITDGQPTWDNNGRNWLGNTTEVVELENAVTQANRISDKGAKLIPVGIGQLSDDKPFDLYKPILPSEYYYP
PRDPWKIDRSLTGKQMLEKITSPGLAPIILPDYSTLPEQMGQQIFTGCFQIAKNIIDADGNVIENPAGWNFDIKAAREQG
IPTSIVTDKNGQDTFAMKSINKKSFEVTITERPTGDQEQNFRFKDARCQRYSYGQAPTDIPIKTSDTSITLTADTKSLIS
CFFNNLPVVPVSVSKKVNVNTPQLLEELNNQTFDFTYSCEKGANEKEIKGKIEGVRNGESKEIGKVAVGTQCEIKEVTPN
VDDSRMKLSTTWSSESTAAVSNEADGTYRFKAGIDAFKNKKTVLATAENNYEAKTATIKLTKSIINRDKIPAAKLPKEFP
VTYTCRYLPHPYARPEHGGLPETNPYFVDSKTVVVPRDGIIEIGPFPVGTQCSFEETARLDSNVQADAKVPGFSLKTEWK
SNICFGNTTDNNSQDCSTNSVWIPKPGQYSINVENTYTREHASVEIEKKVSGDASDLTNSHEFSFNLRCEDSGVEVYSQD
NIVVKKDGRQVIEDIPVDANCTLIEKQPEQKGVDFVVPAPFRLRASTAGEIVKVVVDNTAKRQVAPISVQKKVNKKDTFS
SEISDSIDALTYRVVAECTVPGEETPREVIQTVSDNQIVNFGSFPVGTTCSFRELTEAPAGTAMSYEFADGPEVTIEDST
PINKVLTNTFENARGELTVTKKVLDGDMPQALVDQIPSSFTVNVACSITGNHSITLQKDEQKSVPGIVAGDSCTLSEEVT
PITGAIHHKHWINGELHEVADSTDITIDPNGSNAIRLENHYEADDVPLELTKRVRVIDHTGNDVNSELKNAIVQPDQSFL
FRYRCEINGQVVAENTLSAGEINAGSTKVPRGSTCTVEEDTSSVELPNASLSRVEFSVDGTKTNDKASISINSDQNRLEA
TNTFTLKTGSFNLKKKVDGEGVSTIHEDRRFELKYRCTLGDWKKDGPITLGRFDSAESHSVKDIPVGASCEIIEDSGKAQ
EPNAQVTARWTHTDSTNGWGDTEAACENHAACEVNPENEFATTVMITGNEKENFQGTFVVWNTYTYDKTKVEINKVLTND
GPELAGKDDFAFTLKCTDPRFAESDLADKHSIPDPTITVALNAKGQSRASYQDANERHDSVEVPVGYNCTVTENPIALYD
AKATTQFSGPAVVENTAVQRTASNSASARFVTEKQENNGTQKIQVTNDYIRPRADVMVHKTIAKPEHSVHHWLPNTTYSI
TYKCDDPYIKDRSYSNDVDVQADAAEPTPIFADPTAHVKIPASAVCTFSENTEGHLPDEVKGVVDETNEVAEFAGEHEKR
SYFTPEIKDVVLSESEPTRIEFTNSYVMPQRILSLQKYVEGDPGHAVIAPEEAFEFSYTCTMPHLFPNQPNPMSQEVGNK
VARDVIKIREGETWRSPEVPIGTSCTIKEETDATLHAKLETNALRMVPTYLFPTERAGGSTTPVAPNMTGRPAYNGTEAR
HQMPESGIELNDAHSHTVVINNVYTTDAEINIAKVNADNSPLPGAHFAVYGIGENGQRNELAEVADVPAQSVEQALFAVR
LRPGSYELVETQAPQGAQLLPKPWRFDVKAANAGAMGDLEVTLDNYDADSGLITVERPQGKPWLIKVANVSASTLPLTGS
NGYLRWLLAGAAGLLVAAALWLVARRKR

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
DIP2010 NP_940341.1 surface-anchored membrane protein Not tested Not named Protein 0.0 92

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
spaC YP_005136699.1 fimbrial associated sortase VFG2199 Protein 0.0 92