Gene Information

Name : SbBS512_E2143 (SbBS512_E2143)
Accession : YP_001880666.1
Strain : Shigella boydii CDC 3083-94
Genome accession: NC_010658
Putative virulence/resistance : Unknown
Product : IS66 family element, transposase
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG3436
EC number : -
Position : 1949452 - 1951005 bp
Length : 1554 bp
Strand : +
Note : identified by match to protein family HMM PF03050

DNA sequence :
ATGCCAGCCCGTCAGAAAGACCAGCAACTGAGTCTGGTTGAAGAGACGGAGGCCTTCCTGCGCTCTGCACTGGCCCGCGC
CGAAGAAAAGATCGAAGAAGATGAACGGGAAATAGAACATCTGCGGGCTCAGATAGAAAAACTGCGCCGGATGCTGTTCG
GTACCCGTTCTGAAAAACTGCGTCGTGAAGTTGAACAGGCTGAGGCCCTGCTGAAACAACGCGAACAGGACAGTGATCGT
TACAGTGGGCGGGAAGACGATCCGCAGGTTCCCCGCCAGTTGCGACAGTCTCGTCATCGTCGCCCGTTACCGGAGCATCT
GCCCCGCGAAATAAATCGCCTGGAGCCAGAAGAAAGCTGTTGCCCGGAGTGTGGCGGTGAGCTGGATTATCTGGGGGAAG
TCAGCGCAGAACAACTGGAACTGGTGAGCAGCGCTCTGAAAGTGATCCGCACAGAACGGGTAAAAAAAGCCTGTACAAAA
TGTGACTGCATCGTTGAAGCACCGGCACCATCCCGTCCGATAGAGCGTGGTATCGCGGGCCCGGGGTTACTTGCCCGCGT
GTTAACGGGAAAATACTGCGAACACCTGCCACTGTATCGTCAGAGTGAAATTTTTGCCCGTCAGGGTGTCGAACTGAGCC
GTGCATTACTCTCCAACTGGGTTGACGCGTGCTGCCAGTTAATGACGCCGCTGAATGATGCTCTGTACCGTTATGTGATG
AACAGCCGCAAAGTTCACACTGATGACACACCAGTAAAAGTGCTGGCACCGGGCAGGAAGAAGGCGAAAACAGGATATAT
CTGGACGTATGTCCGGGATGACAGGAATGCCGGTTCGCCAGAGCCTCCGGCGGTCTGGTTCGCCTACTCACCGGACCATC
AGGGTAAACATCCGGAGCAGCACCTTAGTCCCTTCCGGGGTATCCTGCAGGCAGATGCGTTTAATGGTTACGATCGGCTG
TTCAGTGCCGAACGAGAAGGCGGCGCGTTGACGGAAGCAGGATGCTGGGCTCATGCGCGGCGCAAAGTCCACGATGTATA
TATCAGTACCAAAAGCGCGACAGCGGAAGAAGCCCTGAAACTAATCGGTGAGCTGTACGCCATCGAGCACGAAATACGCG
GGTTGCCGGTGTCTGAACGCCTGGCGGTCAGGCAAATGCAGAGTAAACCGCTACTGACTTCCCTGTATAAGCTGATGCAG
GAGAAAGAACACACGTTATCGAAAAAATGCCGTCTGAGAGATGCGTTCCGGTATATCAGGAAGCACTGGGTTGCGTTGTG
CAACTTCAGTGATGATGGTCTGGCTGAGGCGGATAATAATGCCGCGGAAAGAGCGCTTCGTGCAGTCTGTCTCGGAAAGA
AAAACTTTATGTTCTTCGGCAGCGATCACGGTGGAGAGCGTGGTGCGCTACTGTACGGGCTGATCGGCACCTGCCGACTG
AACGGTATCGATCCGGAAGCGTATCTGCGCTATATCCTGAGCGTACTGCCGGAATGGCCTTCCAACCGTGTTGACGAACT
CCTGCCATGGAACGTAGCACTCACCAATAAATAA

Protein sequence :
MPARQKDQQLSLVEETEAFLRSALARAEEKIEEDEREIEHLRAQIEKLRRMLFGTRSEKLRREVEQAEALLKQREQDSDR
YSGREDDPQVPRQLRQSRHRRPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIRTERVKKACTK
CDCIVEAPAPSRPIERGIAGPGLLARVLTGKYCEHLPLYRQSEIFARQGVELSRALLSNWVDACCQLMTPLNDALYRYVM
NSRKVHTDDTPVKVLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVWFAYSPDHQGKHPEQHLSPFRGILQADAFNGYDRL
FSAEREGGALTEAGCWAHARRKVHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSERLAVRQMQSKPLLTSLYKLMQ
EKEHTLSKKCRLRDAFRYIRKHWVALCNFSDDGLAEADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRL
NGIDPEAYLRYILSVLPEWPSNRVDELLPWNVALTNK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
Z1161 NP_286696.1 hypothetical protein Not tested TAI Protein 0.0 91
Z1600 NP_287104.1 hypothetical protein Not tested TAI Protein 0.0 91
Z4340 NP_289564.1 hypothetical protein Not tested OI-122 Protein 0.0 90
tnp AEA34687.1 IS66 family transposase Not tested Not named Protein 3e-133 86
c3563 NP_755438.1 hypothetical protein Not tested PAI I CFT073 Protein 5e-122 66
st57 CAC81895.1 ST57 protein Not tested LEE II Protein 2e-131 63
unnamed CAC39285.1 hypothetical protein Not tested LPA Protein 4e-137 63
ECs4547 NP_312574.1 hypothetical protein Not tested LEE Protein 3e-136 63
unnamed AAC31494.1 L0015 Not tested LEE Protein 3e-137 63
Z1131 NP_286666.1 hypothetical protein Not tested TAI Protein 2e-136 63
Z1570 NP_287074.1 hypothetical protein Not tested TAI Protein 2e-136 63
Z4337 NP_289562.1 hypothetical protein Not tested OI-122 Protein 5e-137 63
tnp AEA34686.1 transposase Not tested Not named Protein 2e-136 63
Z5098 NP_290249.1 prophage-associated protein Not tested LEE Protein 5e-137 63
unnamed ACU09439.1 IS66 family element transposase Not tested LEE Protein 2e-136 63
unnamed CAI43806.1 hypothetical protein Not tested LEE Protein 2e-136 63
unnamed AAL57570.1 unknown Not tested LEE Protein 5e-137 63
unnamed AAL08460.1 unknown Not tested SRL Protein 2e-97 63
aec53 AAW51736.1 Aec53 Not tested AGI-3 Protein 1e-105 54
BCAM0248 YP_002232880.1 putative transposase Not tested BcenGI11 Protein 1e-97 48
ECO103_3554 YP_003223421.1 hypothetical protein Not tested LEE Protein 7e-88 47
Z4317 NP_289543.1 hypothetical protein Not tested OI-122 Protein 5e-88 47

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
SbBS512_E2143 YP_001880666.1 IS66 family element, transposase VFG1736 Protein 0.0 98
SbBS512_E2143 YP_001880666.1 IS66 family element, transposase VFG1700 Protein 2e-122 66
SbBS512_E2143 YP_001880666.1 IS66 family element, transposase VFG0793 Protein 2e-137 63
SbBS512_E2143 YP_001880666.1 IS66 family element, transposase VFG1051 Protein 1e-97 63