Gene Information

Name : SBO_1920 (SBO_1920)
Accession : YP_408341.1
Strain : Shigella boydii Sb227
Genome accession: NC_007613
Putative virulence/resistance : Unknown
Product : protein encoded within IS
Function : -
COG functional category : L : Replication, recombination and repair
COG ID : COG3436
EC number : -
Position : 1899701 - 1900975 bp
Length : 1275 bp
Strand : +
Note : Code: L; COG: COG3436

DNA sequence :
TTGCGACAGTCTCGTCATCGTCGCCCGTTACCGGAGCATCTGCCCCGCGAAATAAATCGCCTGGAGCCAGAAGAAAGCTG
TTGCCCGGAGTGTGGCGGTGAGCTGGATTATCTGGGGGAAGTCAGCGCAGAACAACTGGAACTGGTGAGCAGCGCTCTGA
AAGTGATCCGCACAGAACGGGTAAAAAAAGCCTGTACAAAATGTGACTGCATCGTTGAAGCACCGGCACCATCCCGTCCG
ATAGAGCGTGGTATCGCGGGCCCGGGGTTACTTGCCCGCGTGTTAACGGGAAAATACTGCGAACACCTGCCACTGTATCG
TCAGAGTGAAATTTTTGCCCGTCAGGGTGTCGAACTGAGCCGTGCATTACTCTCCAACTTGGTTGACGCGTGCTGCCAGT
TAATGACGCCGCTGAATGATGCTCTGTACCGTTATGTAATGAACAGCCGCAAAGTTCACACTGATGACACACCAGTAAAA
GTGCTGGCACCGGGCAGGAAGAAGGCGAAAACAGGATATATCTGGACGTATGTCCGGGATGACAGGAATGCCGGTTCGCC
AGAGCCTCCGGCGGTCTGGTTCGCCTACTCACCGGACCATCAGGGTAAACATCCGGAGCAGCACCTTAGTCCCTTCCGGG
GTATCCTGCAGGCAGATGCGTTTAATGGTTACGATCGGCTGTTCAGTGCCGAACGAGAAGGCGGCGCGTTGACGGAAGCA
GGATGCTGGGCTCATGCGCGGCGCAAAGTCCACGATGTATATATCAGTACCAAAAGCGCGACAGCGGAAGAAGCCCTGAA
ACTAATCGGTGAGCTGTACGCCATCGAGCACGAAATACGCGGGTTGCCGGTGTCTGAACGCCTGGCGGTCAGGCAAATGC
AGAGTAAACCGCTACTGACTTCCCTGTATAAGCTGATGCAGGAGAAAGAACACACGTTATCGAAAAAATGCCGTCTGAGA
GATGCGTTCCGGTATATCAGGAAGCACTGGGTTGCGTTGTGCAACTTCAGTGATGATGGTCTGGCTGAGGCGGATAATAA
TGCCGCGGAAAGAGCGCTTCGTGCAGTCTGTCTCGGAAAGAAAAACTTTATGTTCTTCGGCAGCGATCACGGTGGAGAGC
GTGGTGCGCTACTGTACGGGCTGATCGGCACCTGCCGACTGAACGGTATCGATCCGGAAGCGTATCTGCGCTATATCCTG
AGCGTACTGCCGGAATGGCCTTCCAACCGTGTTGACGAACTCCTGCCATGGAACGTAGCACTCACCAATAAATAA

Protein sequence :
MRQSRHRRPLPEHLPREINRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIRTERVKKACTKCDCIVEAPAPSRP
IERGIAGPGLLARVLTGKYCEHLPLYRQSEIFARQGVELSRALLSNLVDACCQLMTPLNDALYRYVMNSRKVHTDDTPVK
VLAPGRKKAKTGYIWTYVRDDRNAGSPEPPAVWFAYSPDHQGKHPEQHLSPFRGILQADAFNGYDRLFSAEREGGALTEA
GCWAHARRKVHDVYISTKSATAEEALKLIGELYAIEHEIRGLPVSERLAVRQMQSKPLLTSLYKLMQEKEHTLSKKCRLR
DAFRYIRKHWVALCNFSDDGLAEADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRYIL
SVLPEWPSNRVDELLPWNVALTNK

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
Z1161 NP_286696.1 hypothetical protein Not tested TAI Protein 1e-160 89
Z1600 NP_287104.1 hypothetical protein Not tested TAI Protein 1e-160 89
Z4340 NP_289564.1 hypothetical protein Not tested OI-122 Protein 2e-160 88
c3563 NP_755438.1 hypothetical protein Not tested PAI I CFT073 Protein 3e-120 66
st57 CAC81895.1 ST57 protein Not tested LEE II Protein 4e-123 65
Z1131 NP_286666.1 hypothetical protein Not tested TAI Protein 2e-123 65
Z1570 NP_287074.1 hypothetical protein Not tested TAI Protein 2e-123 65
Z4337 NP_289562.1 hypothetical protein Not tested OI-122 Protein 4e-124 65
unnamed ACU09439.1 IS66 family element transposase Not tested LEE Protein 2e-123 65
Z5098 NP_290249.1 prophage-associated protein Not tested LEE Protein 4e-124 65
tnp AEA34686.1 transposase Not tested Not named Protein 2e-123 65
unnamed AAC31494.1 L0015 Not tested LEE Protein 3e-124 65
unnamed CAC39285.1 hypothetical protein Not tested LPA Protein 3e-124 65
ECs4547 NP_312574.1 hypothetical protein Not tested LEE Protein 3e-123 65
unnamed AAL57570.1 unknown Not tested LEE Protein 3e-124 65
unnamed CAI43806.1 hypothetical protein Not tested LEE Protein 6e-124 65
unnamed AAK00463.1 unknown Not tested SHI-1 Protein 6e-80 56
SF2972 NP_708746.1 hypothetical protein Not tested SHI-1 Protein 2e-77 56
aec53 AAW51736.1 Aec53 Not tested AGI-3 Protein 1e-98 56
s0025 CAD33772.1 IS66-like transposase Not tested PAI I 536 Protein 1e-74 54
BCAM0248 YP_002232880.1 putative transposase Not tested BcenGI11 Protein 3e-93 53
ECO103_3554 YP_003223421.1 hypothetical protein Not tested LEE Protein 1e-85 53
Z4317 NP_289543.1 hypothetical protein Not tested OI-122 Protein 1e-85 53

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
SBO_1920 YP_408341.1 protein encoded within IS VFG1700 Protein 1e-120 66
SBO_1920 YP_408341.1 protein encoded within IS VFG0793 Protein 2e-124 65
SBO_1920 YP_408341.1 protein encoded within IS VFG0634 Protein 9e-78 56
SBO_1920 YP_408341.1 protein encoded within IS VFG1513 Protein 7e-75 54