Gene Information

Name : SPG_1272 (SPG_1272)
Accession : YP_002037980.1
Strain : Streptococcus pneumoniae G54
Genome accession: NC_011072
Putative virulence/resistance : Unknown
Product : Tn5253 SNF2-related: helicase
Function : -
COG functional category : K : Transcription
COG ID : COG4646
EC number : -
Position : 1232076 - 1238300 bp
Length : 6225 bp
Strand : -
Note : identified by match to protein family HMM PF00176; match to protein family HMM PF00271; match to protein family HMM PF02384; match to protein family HMM PF04851

DNA sequence :
ATGGAAGTAATACAATTATTGGCTATGTTTCGTGGAACAATTCCAAAAGATAGGGATAAAATGGACCTATTTCTTCGCTA
TCAGGCGCAACATTTTGATGAGAAATGGCAGGACTTGGTAGAGAGTTTTTTGACTGAAGAGGAGAAGATAGAAGAGATAC
CTCATGTCTATTCGTTTCATCAAGACATTGTTTCTTTCCTAGAGGCCAGTTCTGAAAATAATGACCAAGATCTAGAAAGT
TACACAAGAAATTTTGGACAAGCAGGACTAGATAAATTATCTCAATTAAGTAATTTTGAGAAAAATTTGGTGCTAAAAGT
CGCAAGCTATAACCTTTCCACTCGATTTTACATCCAATCTGAAAAAGAGAAACTAACACCATTAAGTGAGCTTGTATTTC
ATCAGAATCAGGATGTTAATTTAGTCAATGTCTATCGGGTTGCGAATAATCTATCTGACCGTATTAGTAGAGATATAGAG
GAATTTCTTCTAATGGTTGACTCAAAAGAGGTTAAAAAAGAAGTTCCTGAGATTCATTTTGAAGAAAAAGAGGGAGATGT
TCTAGCCTATTTGGGTTCAGAATTGATGGCTACTTTAGATATCGTTACGGATCTTGTCCATCATGAAGAAAACTACATAC
AACTCCCACTGACACAAAAGCTGAAGATTATTACTCATTTTGATGAAGTAAAAGCTAAAAGCGAAAAGTCTAATCAAGTA
GAGAAAATTTTATCTCCTTCAAGTGATATTGAACAGGAAACGGAAGAAACTAACTCCTTTTCTAATGTCGATAAAATTGT
AGAAGAAGCTTTGAGGGAATATCCAATCGGTTCACAAGTAAGTTATAAAGGACAAGTATTTCAGTTGGTTTCGATTGAAA
ATGCGCAGTTAAATGACTTAGTTCGCCTAGAGCTATTCAATGATTCCAACCAGTTATTTGAAGAGAATCCTATCTTATAC
TTGAACAGTTTGGAAGAGATTGAACAAGTATTGTCTCATTTAGAACTTGAAAAAGAAGATTCAGAGATTGAGATTGATTC
ATCAAGTGAAAGTCAGGAAATAGATTTGTTTTCATACCTGGAAGAAGAAAATGTAAATGAAAAGGATAAGGATAAGGAAA
CAAAATCTTTAATTTCAGGTATAGAAGAGACGGATGTCCCTGTTCTAGATTTTGTTTTTCCAGATGATTTAGAGGACTTT
TATCCTAAGACAAATCGAGAAAAGATTGAAACGAATATCGCCGCAATTGAACTTGTTAAAAGATTAGAAAAAGAGAGACG
ACAAGCGAATCCAGAAGAACAAGAGCTACTCGCCAAGTATGTCGGCTGGGGCGGTCTTGCCAATGAATTTTTCGATGAAC
TCAATCCAAAGTATGAAACAGAACGTTTAACTCTTAAGAGCTTAGTAAGTAAATCAGAATACTCGACCATAAAACAAAGT
TCTCTCACAGCCTATTATACAGACCCAATGATTATTCGCCAGATTTGGCAAAAATTACTGGATGATGGTTTTGAGGGAGG
AAGGATATTAGATCCTTCTATGGGGACTGGGAACTTCTTTGCGGCGATGCCTAGAAGTATACGAGAGAAATCAGAACTCT
ATGGGGTTGAATTAGACAGTGTGACTGGTGCAATCGCAAAACAACTCCACCCCAATACCCATATTGAAGTGCGAGGATTT
GAAGAAGTTCCCTATCAAAATAATAGTTTTGATTTAGTCTTAACGAATGTTCCTTTTGGAAATTTTCGCATTGCCGATAA
AAACTATGATAAACCTTATATGATTCATGATTACTTTGTCAAACACTCACTTGATTTAGTAAGAGACGGAGGACAAGTGT
CGATTATCTCATCTATTGGGACAATGGATAAGCGGACAGATAATGTCTTACAAGAGATTAAATCCAATACTCATTTTTTA
GGGGGAGTTCGGTTGCCGGATACGGCTTTTAAAAAGATTGCAGGTACCCGAGTGACCACAGATCTCCTATTCTTTCAAAA
GGATCAAGCAAAGAATCTTAATGAGGAGGAACTTGTCTTTAGTGGCTCTGTTCCCTTTGAGGAGGATAAGCGTGTCTGGA
TCAATCCTTATTTTGATGGGAAATACAATACACAAGTTTTGGGTGAATATGAGGTACGTAATTTTAATGGAGGAACCCTC
AATGTTAAGGGGGAATCAGAAACATTAGCTACTGACATAATGAAAGCATTAGAGAATGTAGAAGCACCTAAACAAATTGA
CAATTCTTTGAAAGCACCTGTTTTTATCCAAGAAGAAGTGGATAATTCTATCCCAAGTCATATACGTGAGAACTTAGCGC
TCTATTCTTTTGGATATGAGGGAAATCAAATTTATTATCGAGATACGCATGGCATTCGGAGAAGTTCAAAAGTAGACGAA
ATTAGTTATTATGTAGATGAGAAGGGAGATTTTAAAGCTTGGGACAGTTCCTTGTCTGAACATAAAATAGATCGATTCGT
GCAACTTCATTTGACAGATGAGGAAGCACTAGATGTATACAAGTCAGAAGAAGCGAGTAAAAGAGGGAAATATAAGGGCT
TGTTCAAAAAAACGGTCTTTTATGAAAGCCCCTTATCGGATAAGGATATTAGTCGCATTAAGGGCATGGTTGATTTGAGA
GAGACCTATCAAGCCTTAATTGAAATTCAACGCCATCCAGATTATAGTCGGACAGATTTTCAGGTATTACTTAGTAAACT
CAATCGTGACTATGATCGGTTTGTAAGCCAATTTGGATACTTGAATGTCTCAGTCAATCGAAACTTATTTGATAGTGACG
ATAAGTATTCTTTACTAGCAAGTTTAGAAGATGAATACATTGATTCTAAAGATCAGAAAGTAAAATATAAAAAATCTTTA
GCCTTTGAGAAAGCATTGGTTAGGCCGGAGAGAGTGATTACAAGAGTTTCAACGGCTTTAGATGCCTTAAACTCCAGTTT
ATCAGATGGTAGAGGGGTTGATATAGACTTTATGGTATCAATTTACCCTGAACATAGCCAAGCAGCTATTTTAGATGAGT
TGGGTGACCAGATCTTAATGGATCCAGAAAGCTATTTAAGAGGGGAAAGAAATTATCTTTCTAAGAACCAGTTTTTGTCA
GGAGATATTCTCAACAAGATAGAAGTAGTTCAACTATTAGTAGAGGAAAACAACCAAGAATGTGATTGGACTCATGCGTT
AGATTTGTTAGAATCTGTTCGCCCTCCAAGGATTCATCTGGCAGATATTGAGTTTAAAATAGGGTCACGTTGGATTCCTC
AATCCGTTTATGGTAAATTTGCCTTTGAATGTTTTACCAACCGTGAATTTGAATTGTCTTCGCCTGATGTTGAACAAGTC
ATTGAAGTGAATCCTGTCGATGGGCAGGTTCATTTAAGGACACCATTTGCTTATCGCTATCCAAGTGCCAAAGATAGTAG
TCTTGGAGTCAGTGGCTCACGTTATGATACAGGAAGAAAGATTTTTGAGAATTTACTTAATTCGAACCAACCGACTATTA
CTATGACTGTTACGGAAGGGGAAAAGAAAAAGACCATCACAGATTTGGAAAAAACCTCTGTTCTAAGAGCAAAAGAGCAG
CATTTACAAGAACTCTTTCAAGACTTTGTCTCACGGTATCCAGAAGTTCAACAAGTCATTGAAGAAAGTTATAATCGTCT
TTATAATCGAACGGTTAGTCGAGAGTATGACGGTAGCCATTTAGTCATTGATGGCTTGGCACAAAACATCAGTCTTCGTC
CTCATCAAGAGAATGCCATTCAAAGAATTGTAGAAGAAAAAAGAGCCTTGTTAGCTCATGAGGTAGGTTCAGGAAAGACC
TTGACCATGCTTGGTGCTGGCTTTAAATTAAAGGAGTTGGGGATGGTTCATAAGCCCTTGTATGTGGTGCCCTCTAGTTT
GTCTGCTCAGTTTGGCCAAGAAATCATGAAATTTTTCCCTACTAAAAAAGTCTTTGTGACCACTAAGAAAGATTTTGTGA
AGGCAAGAAGAAAACAGTTTGTGTCACGTATTATTACAGGAGATTACGATGCCATTGTCATTGGGGATTCTCAATTTGAA
AAAATCCCTGTCAGTAAGGAAAGACAGATGAATTATATCGAGGATAAACTCAATGAACTACGAGAGATTAAAACACATTC
TGAAAATAAGTACACCGTTAAAGAAGCAGAGCAATCAATAAGTGGTCTTGAGAAACAATTGGAAGAACTCCAACGCTTTA
ATCGTGATAGTTTTATTGATTTTGAGAACTTAGGAATTGATTTTCTCTTTGTGGATGAAGCACATCACTTTAAAAATATA
CGTCCAATTACTGGACTTGGAAATGTAGCAGGGATTACCAATACAACGTCTAAGAAGAATGTGGATATGGAAATGAAGGT
TCGACAGATTCAGGAAGAACATGATTTTAAAAATATTGTCTTTGCGACAGGAACACCTGTTTCAAATTCAATTAGTGAGC
TGTATACTATGATGAACTACATTCAACCGGATATCTTAAAACGCTATCAAGTTGATTATTTTGACTCTTGGGTAGGTGCT
TTTGGAGAAATTCAAAACTCTATGGAATTAGCTCCTACAGGGGATAAGTACCAGCCTAAGAAACGATTTAAAAAGTTTGT
CAATCTACCTGAGTTGATGAAAATCTATAAAGAAACAGCCGACATTCAAACACAAGATATGTTGGATTTACCTGTACCAG
AAGCCCATATTATCCCTATTGAGAGTGAGTTAACTGAAAACCAGAAACTCTATCTAGAAGAATTGGTTATGAGATCAGAT
GCGGTCAAATGTGGAACAGTTGATCCAAGTCAGGATAACATGTTAAAAATTACGGGTGAGGCACGAAAATTAGCTATTGA
TATGCGTTTATTGGACTCTAGTTATAGTCTAGCAGACAATCATAAACTGCTTCAGGTAGTGGATAATGTTGAAAGAATTT
ATCGTGAGGGAATGGAAAATAAGGCTACTCAGATGATTTTTTCAGATATTGGCACACCTAAGAAAAAAGATAATGGCTTT
GATGTTTATTCTGAGATTAAGGCTTTATTAGTTGATAGAGGAATCCCTAGTATGGAAATTGCCTTTGTACATGATGCCAA
TAGTGATGAAAAGAAGAATAGCTTGTCTCGAAAGGTTAATGCAGGAGAGGTTCGGGTTCTTCTTGCCTCAACTGAAAAAG
GAGGAACAGGTTTAAATGTTCAGAGCAAGATGAAAGCAGTTCACCATCTGGATGTACCGTGGAGACCAAGTGACATTCAG
CAACGTAATGGACGTATTATCCGACAGGGAAATGAAAACAAGGAAGTGGATATTTACCACTATATTACCAAAGGTTCGTT
TGATAATTATCTATGGGCAACTCAGGAGAACAAACTCCGTTATATTAAGCAGATTATGACTTCTAAGGAGCCGATTCGTG
CTGCGGAAGATATTGATGAACAGACTATGACAGCTTCTGATTTTAAGGCACTAGCAACAGGTAATCCTTATCTCAAATAT
AAGATGGAACTAGAGAATGATCTAACTCTATTAGAAAATCAAAGACGCGCCTTTCAACGCAGCAAGGATCACTATCGTCA
TACAATCTCTTACTGTGAAGAAAATATGCCCATTCTTGAGAAACGATTAAGCAAGTATGAAGGCGACATTCAACAGTCTG
AAATGTCGAAAGACCAATCATTTTCTATGACGATAGGTAAACAAGTTTTTGAGCAACGAGCTGAAGCAGGTGAATCCCTA
CACCGTCTTATCCGTCATAATCAATCTGACAGCAAAGAATTTCGAACCCTAGCAAGTTATCGAGGATTTGACATTAAAAT
GCTTAGTCTTGCAACAAATCAACCTCTTCCTGAAACCTTCTCTGTTAAGATTGTAGGAGAAAACCAATATTCTGTCAGTT
TAGATTTGTATTCTCCTTTGGGGACAATTCAAAGGCTTCAGCATACGATAGACCACATTAAAGATGACCAAGTGAAAACT
CAGAACTTATTGGATGAATTAAAGGATAAATGGACTACTGCTAAGGTAGAAATTGAGAAAAATTTTCCAAAGGAAGAGGA
TTATCAAACTAAAAAGGCCGAATACGATGTACTCGCGCCATTGATTGAAACAGAAACGGATTTAGATATTATTGATCAGG
CCTTACGACAATTCCACGAAAAAGGAAAAGAAAAGCAAGAACAACTTTCTTTTGAATTAGATTAA

Protein sequence :
MEVIQLLAMFRGTIPKDRDKMDLFLRYQAQHFDEKWQDLVESFLTEEEKIEEIPHVYSFHQDIVSFLEASSENNDQDLES
YTRNFGQAGLDKLSQLSNFEKNLVLKVASYNLSTRFYIQSEKEKLTPLSELVFHQNQDVNLVNVYRVANNLSDRISRDIE
EFLLMVDSKEVKKEVPEIHFEEKEGDVLAYLGSELMATLDIVTDLVHHEENYIQLPLTQKLKIITHFDEVKAKSEKSNQV
EKILSPSSDIEQETEETNSFSNVDKIVEEALREYPIGSQVSYKGQVFQLVSIENAQLNDLVRLELFNDSNQLFEENPILY
LNSLEEIEQVLSHLELEKEDSEIEIDSSSESQEIDLFSYLEEENVNEKDKDKETKSLISGIEETDVPVLDFVFPDDLEDF
YPKTNREKIETNIAAIELVKRLEKERRQANPEEQELLAKYVGWGGLANEFFDELNPKYETERLTLKSLVSKSEYSTIKQS
SLTAYYTDPMIIRQIWQKLLDDGFEGGRILDPSMGTGNFFAAMPRSIREKSELYGVELDSVTGAIAKQLHPNTHIEVRGF
EEVPYQNNSFDLVLTNVPFGNFRIADKNYDKPYMIHDYFVKHSLDLVRDGGQVSIISSIGTMDKRTDNVLQEIKSNTHFL
GGVRLPDTAFKKIAGTRVTTDLLFFQKDQAKNLNEEELVFSGSVPFEEDKRVWINPYFDGKYNTQVLGEYEVRNFNGGTL
NVKGESETLATDIMKALENVEAPKQIDNSLKAPVFIQEEVDNSIPSHIRENLALYSFGYEGNQIYYRDTHGIRRSSKVDE
ISYYVDEKGDFKAWDSSLSEHKIDRFVQLHLTDEEALDVYKSEEASKRGKYKGLFKKTVFYESPLSDKDISRIKGMVDLR
ETYQALIEIQRHPDYSRTDFQVLLSKLNRDYDRFVSQFGYLNVSVNRNLFDSDDKYSLLASLEDEYIDSKDQKVKYKKSL
AFEKALVRPERVITRVSTALDALNSSLSDGRGVDIDFMVSIYPEHSQAAILDELGDQILMDPESYLRGERNYLSKNQFLS
GDILNKIEVVQLLVEENNQECDWTHALDLLESVRPPRIHLADIEFKIGSRWIPQSVYGKFAFECFTNREFELSSPDVEQV
IEVNPVDGQVHLRTPFAYRYPSAKDSSLGVSGSRYDTGRKIFENLLNSNQPTITMTVTEGEKKKTITDLEKTSVLRAKEQ
HLQELFQDFVSRYPEVQQVIEESYNRLYNRTVSREYDGSHLVIDGLAQNISLRPHQENAIQRIVEEKRALLAHEVGSGKT
LTMLGAGFKLKELGMVHKPLYVVPSSLSAQFGQEIMKFFPTKKVFVTTKKDFVKARRKQFVSRIITGDYDAIVIGDSQFE
KIPVSKERQMNYIEDKLNELREIKTHSENKYTVKEAEQSISGLEKQLEELQRFNRDSFIDFENLGIDFLFVDEAHHFKNI
RPITGLGNVAGITNTTSKKNVDMEMKVRQIQEEHDFKNIVFATGTPVSNSISELYTMMNYIQPDILKRYQVDYFDSWVGA
FGEIQNSMELAPTGDKYQPKKRFKKFVNLPELMKIYKETADIQTQDMLDLPVPEAHIIPIESELTENQKLYLEELVMRSD
AVKCGTVDPSQDNMLKITGEARKLAIDMRLLDSSYSLADNHKLLQVVDNVERIYREGMENKATQMIFSDIGTPKKKDNGF
DVYSEIKALLVDRGIPSMEIAFVHDANSDEKKNSLSRKVNAGEVRVLLASTEKGGTGLNVQSKMKAVHHLDVPWRPSDIQ
QRNGRIIRQGNENKEVDIYHYITKGSFDNYLWATQENKLRYIKQIMTSKEPIRAAEDIDEQTMTASDFKALATGNPYLKY
KMELENDLTLLENQRRAFQRSKDHYRHTISYCEENMPILEKRLSKYEGDIQQSEMSKDQSFSMTIGKQVFEQRAEAGESL
HRLIRHNQSDSKEFRTLASYRGFDIKMLSLATNQPLPETFSVKIVGENQYSVSLDLYSPLGTIQRLQHTIDHIKDDQVKT
QNLLDELKDKWTTAKVEIEKNFPKEEDYQTKKAEYDVLAPLIETETDLDIIDQALRQFHEKGKEKQEQLSFELD

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
Spaf_1117 YP_006309919.1 SNF2 family protein Not tested FWisland_1 Protein 0.0 96
SSU05_0962 YP_001198328.1 SNF2 family protein Not tested 89K Protein 0.0 71