Gene Information

Name : SDE12394_04640 (SDE12394_04640)
Accession : YP_006013004.1
Strain : Streptococcus dysgalactiae ATCC 12394
Genome accession: NC_017567
Putative virulence/resistance : Unknown
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 896310 - 899111 bp
Length : 2802 bp
Strand : +
Note : COG3942 Surface antigen
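
The Position and Length fields above are related by simple inclusive-coordinate arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates (the usual convention for records of this kind, but not stated in the entry). The function name `gene_length` is hypothetical and used only for illustration.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


# Values copied from the Position field of this record.
START, END = 896310, 899111

length_bp = gene_length(START, END)   # 2802, matching the Length field
codons = length_bp // 3               # 934 codons, including the stop codon
residues = codons - 1                 # 933 amino acids expected in the protein

print(length_bp, codons, residues)
```

A length that is an exact multiple of three is consistent with a complete open reading frame on the + strand.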

DNA sequence :
ATGAAAGAGGATAAAAAGCTAGTCAAACAGGCTAGGCAAAACTTTCGAAGCAACTTAAAATCAGCTCGTATGCATTATCG
GAAAGAAGTTAGGACCTTAAAACAGACTGTTCCTAAAAAAGGAAGATTTCGAAAATCAGCCAAAAATTCTCTGTTTCAGG
AAAAGAAAACAGAGTTAAAAGGAAATCTACTCAGTAGCCAAAAGGAAGCAGAAGAGAAGTTCATAAAGGAAATTACCTAT
GTTTCTCCTAGACTGTTGAAGGCAAAGGAAATCAAGAACTATCGACTTCCGCAAGCTCAAGAACGTTTGCGGACGGCAAG
AAAACATCTGTCAGAAGTGAAACTAAGTGAGAAACAGCAGGTAGTTAATCCCAAGTTTACTTTCCAAAAAGAAAATCCTT
CCCTGAAGTCTCGCTTTCAGTTTCACCAAGAAAAATCATTTGATCGGCTAAGTGCAGAAAAAGACGTAAGTTCTGCCAAG
CGTGAGGTTAAACAACTCAAGAAAGTCCAAAAGACTAAGAAAAACTCTACCAAAGTCAAAGCTGGATTAGGCTTGGCTGC
ATCTGAATCCCTTGACCTGATAGCACAGGAGGATGATTTAGATGGTCTAAGAACCATGAAGGATACTAGTTTGAAAGCCA
GACGCTATGGCAGGTTTACTTATCAAGCAGGTAAGGTGGCAGTAAAAAGTGGACAGACTGGTGTGCGGTTTACCAAAACA
AAATTTTCTCATGGAAAGGAACGATTCCAGAACTTCAAAAAGGGAAAAGGATTCACACGCCAGAAACCTCTTAAACCAAG
AAGACGCTACCAAACCTTTTTAAATCATGTCAGAAAACAAAGTGTCGCAGGTTTTAAAGGAATCGTCCAAGCCATTAAGG
GAAGTATGACTTTCTTTTCTGTCCTTGCGGGAAATCCTTTGACTTGGATTGTCTCAGGCATTCTCTTGATACTCCTTTTG
ATGATGAGCTTTTTCATGAGTGTTTCAGGTAGTAGTGTCATACAACAAGATGAAATAGAATTGAGTAAGAGTTATACCCA
CATGACGTGGGAAGATGCGGAACATACAAGAACCAACGATAAGGGAATTACCTTTTACACTAAGATTGATGAGGTCATGG
TTTACATGAATCATCAATACCAAGACTACAAGTTAGATGATTTCATAGAGACAAGCGGAACAACCTACAAAGAATTTCTC
AGTCAACTGTGGTCGGACTTAAACGGTGGAGATTCTATTAAATCCATGTCTGACTTATATAAAGAACCTGCTTACAAGCT
GTCTGATGAGGACCAAGAAGAACTAAACGAATTGACCGAAGAAGGCAACTATTTAGCCCTTCAAGAATTGGACAATCCCT
TTCAGGGACAGACCGATGAGGATAGTTTAAGCATGACCTACCGATATGGTTATGAGGTCATTGATGAGAAACCAACGCTT
CATCATCATATCATCTTAGAAGCAAAAGAAGGTCAAGTCATTGTAGCTCCGATGGATGGCAAGGTATCTCTTGATGGAGA
GAACGTTGTTTTGACATCTGGTAAGGGAGTGAATAAGACTAAACTAACCTTGTTTGGCATTCATTCAGGCCGAGTGAGTG
AAAATCAACAAGTCTTGGCAGGAGATATTATTGGTCAGACCAAAGACGGAACGGGTCTAAAAGTCACCTATCAAAAGGTT
GATGATGACACGGATAAGTTGGTCTATGTCAATCCAGCATTCTACTTTCCAAAAGTGATTCAGGTTCAGACCACCATTCT
TCCAACTATCGGTCAGTTTGGCGGCGATGAGTTCGAGAGAGCCAAGGCAATTTATGACTATCTTAAAAGCAAAGGTGCGA
CCAATCAAGCTATCGCAGCCATTCTAGGTAACTGGTCGGTAGAATCCTCCATTAATCCAAAACGAGCTGAAGGCGACTAT
TTATCTCCACCTGTTGGTGCGACAGATAGTTCGTGGGATGATGAGGGCTGGCTCACACTTAATGGTCCAACTATATACAA
TGGACGTTACCCAAATATTCTCAAACGTGGTTTAGGCTTAGGCCAATGGACAGATACCGCGGATGGTTCACGCAGACATA
CTTTATTGTTGGAATATGCCAAAGGAAAACATCAAAAATGGTATGACTTAGGCTTACAACTGGATTTCATGTTGTATGGG
GATAGTCCTTATTACACCAACTGGTTAAAGGACTTCTTCAAAAATTCAGGCAGTCCAGCTAGTCTTGCCCAGCTCTTTCT
CATTTACTGGGAAGGAAATAGTGGTGATAAGCTACTTGAACGTCAAACACGAGCCAGTGAGTGGTATTACCAAATTGAAA
AAGGCTTTAGTCAACCTAACGGTGGGACAGCACAAAGCGATCCAAAAGCACTTGAAGCTGTACGAGAAGACCTTTTTGAA
AACTCTATTCCAGGAGGTGGTGACGGTATGGGTTACGCTTACGGCCAATGTACTTGGGGAGTCGCAGCCCGTATCAATCA
ACTGGGTCTAAAACTCAAAGGTAAAAACGGTGAGAAGATTCCAATTATCAGTACCATGGGCAATGGCCAAGATTGGGTAC
GAACAGCCGCAAGTCTCGGTGGAGAGACAGGGACAAGTCCACAAGAAGGAGCTATCCTTTCCTTTGCGGGAGGAGGACAT
GGCACCCCAACAGAATACGGACATGTGGCTTTTGTGGAGAAAGTCTACCCAGACGGTTCATTCCTTATCTCAGAAACCAA
CTACAATGGCAATCCAAACTATACCTTCCGTAAATTATCTGGAGTGGATAGTAGCTTGAGTTTTGCTTATACGACGAAAT
AA

Protein sequence :
MKEDKKLVKQARQNFRSNLKSARMHYRKEVRTLKQTVPKKGRFRKSAKNSLFQEKKTELKGNLLSSQKEAEEKFIKEITY
VSPRLLKAKEIKNYRLPQAQERLRTARKHLSEVKLSEKQQVVNPKFTFQKENPSLKSRFQFHQEKSFDRLSAEKDVSSAK
REVKQLKKVQKTKKNSTKVKAGLGLAASESLDLIAQEDDLDGLRTMKDTSLKARRYGRFTYQAGKVAVKSGQTGVRFTKT
KFSHGKERFQNFKKGKGFTRQKPLKPRRRYQTFLNHVRKQSVAGFKGIVQAIKGSMTFFSVLAGNPLTWIVSGILLILLL
MMSFFMSVSGSSVIQQDEIELSKSYTHMTWEDAEHTRTNDKGITFYTKIDEVMVYMNHQYQDYKLDDFIETSGTTYKEFL
SQLWSDLNGGDSIKSMSDLYKEPAYKLSDEDQEELNELTEEGNYLALQELDNPFQGQTDEDSLSMTYRYGYEVIDEKPTL
HHHIILEAKEGQVIVAPMDGKVSLDGENVVLTSGKGVNKTKLTLFGIHSGRVSENQQVLAGDIIGQTKDGTGLKVTYQKV
DDDTDKLVYVNPAFYFPKVIQVQTTILPTIGQFGGDEFERAKAIYDYLKSKGATNQAIAAILGNWSVESSINPKRAEGDY
LSPPVGATDSSWDDEGWLTLNGPTIYNGRYPNILKRGLGLGQWTDTADGSRRHTLLLEYAKGKHQKWYDLGLQLDFMLYG
DSPYYTNWLKDFFKNSGSPASLAQLFLIYWEGNSGDKLLERQTRASEWYYQIEKGFSQPNGGTAQSDPKALEAVREDLFE
NSIPGGGDGMGYAYGQCTWGVAARINQLGLKLKGKNGEKIPIISTMGNGQDWVRTAASLGGETGTSPQEGAILSFAGGGH
GTPTEYGHVAFVEKVYPDGSFLISETNYNGNPNYTFRKLSGVDSSLSFAYTTK
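
The protein sequence can be cross-checked against the DNA sequence by translating codons. The sketch below assumes the standard amino acid assignments (shared by NCBI translation tables 1 and 11); the record itself does not state which table was used. It translates only the first 30 and last 15 nucleotides copied from the DNA sequence above, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code in NCBI order: each base position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 15 nt of the DNA sequence in this record.
head = "ATGAAAGAGGATAAAAAGCTAGTCAAACAG"
tail = "TATACGACGAAATAA"

print(translate(head))  # MKEDKKLVKQ
print(translate(tail))  # YTTK*
```

Both fragments agree with the start (MKEDKKLVKQ) and end (YTTK) of the protein sequence listed above.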

• Homologs from PAI DB

Gene        GenBank Accn    Product                            Virulence or Resistance  PAI or REI  Alignment Type  E-val  Identity (%)
SSU98_0981  YP_001200539.1  Tn5252, Orf28                      Not tested               89K         Protein         0.0    96
SSU05_0968  YP_001198334.1  Tn5252, Orf28                      Not tested               89K         Protein         0.0    96
Spaf_1124   YP_006309926.1  putative conjugal transfer protein Not tested               FWisland_1  Protein         0.0    60