Gene Information

Name : SCE1572_10375 (SCE1572_10375)
Accession : YP_008148521.1
Strain : Sorangium cellulosum So0157-2
Genome accession: NC_021658
Putative virulence/resistance : Virulence
Product : hypothetical protein
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 2881948 - 2883669 bp
Length : 1722 bp
Strand : +
Note : Derived by automated computational analysis using gene prediction method: GeneMarkS+.

DNA sequence :
GTGAGCCAGCAAAAAGACCGACCGAACGACAAGAAGAGCGAGCGGCTAGCGAAGGAGGCCGCCGAGGCCGCGCGCGCCGA
GGAGACGCTGATCGCCACCCGCAAGGCGAAGGCGGCCCGCCTCCGCGCCCGCGGCGAGAACCCGTTCGCCAACGACGTCA
CGACGGGCGAGCCGCTCACCGAGCTCGCCGCCGTCCGCGCCCGCTTCGACGGGGCGCGGGCCGCGGCCGAGCCCCGGCCG
GAGGGCGCGGCGCCGGACCGGCCCGCCGCGCCGACGGTGGATCGCTACGACCCGGCGCGCGTCGAGCCGATCCCGTTCCG
CGTCGCGGGCCGGGTCCTCTTCGTGCGCTCGTTCGGCGGCGTCACCTTCGTGCGGCTGCGCGACCGGACCGGCGAGCTCC
AGCTCTACTGCGAGCAGGGGGGCCTCGCCGATTTCGAGCGGCTGGAGGACATCGATCTCGGCGATTTCGTCGAGGCCCAC
GGCGTCGCGATGGCCACGCAGAAGGGGGAGCTCTCGATCCGCGCGGAGCGCCTCCGGCTGCTCACGAAGGCCTACCGCCC
GCTGCCGACCAAGACGAGCTTCAAGGACGTCGAGTCGCGCTACCGCATGCGGTACGTCGACCTCGTGGCCAACCCGCACG
TGGCCGGGGTCTTCCGCGCCCGCAGCGCGATCGTCGCGGCGCTGCGCGAGTTCCTCGACGCGCGCGGCTTCCTCGAGGTC
GAGACCCCGACGATGCACACGCTCGTCGGCGGCGCGGCCGCGAAGCCGTTCAAGACCCACCACAACGCGCTCGATCTCGA
GCTCTTCATGCGCATCGCGCCCGAGCTCTTCCTGAAGCGGCTCGTGGTCGGCGGCTTCGAGCGCGTCTACGAGATCGCGC
GCTGCTACCGGAACGAGGGGCTGTCGACGCGGCACAACCCCGAGTTCACGATGCTCGAGTACTACCAGGCGTACGCGACG
TACGAGACCCTGATGGACGCGACCGAGGCGATGCTCCGCCACGTCGACGCCCGCCTCGCCGAGCGCCTCCCGGCCGAGCA
CGCGGCCTGGGCCTCCCAGCGCGCGTTCTCGCTCGAGCGCTTCGCGCGCGTCCCCATGGCCGAGGGGCTCGCCCGCGCCC
TGGAAAAGGCGGGGCTCCCCGCCGACGTCCCGCAGCGCGTCGCCGCGGACGACGCGCCGATCAAGGAGTGGGCGAAGGCC
GCCAAGGCGAGCGGGCGCGAGATCGACTGGACGAACTTCCGCGCGGGCGCGAAGAAGTGCGAGTCGCCCGGCGAGCTCGT
CTTCTGCGCCTACGAGTACGTGGTCGAGCCGTTCCTCACGAAGGACTACCGCACCGAGGCGGGCGACAAGAGCGTCCCCG
TCTTCATCATCGACTATCCCTTCGAGGTCTCCCCGCTCGCGCGCAAGAAGGACAGCGATCCCTCGCTGGTCGACCGCTTC
GAGCTCTTCATCGAGGGCCGCGAGCTCTGCAACGCCTTCAGCGAGCTCAACGACCCCGAGGACCAGGACGCGCGCTTCCG
CGCGCAGGTCGCCAAGAAGTCGCGGGGCGAGGAGGAGACGATGGATTACGACGCCGACTACGTCCGCGCCTTGGAGCACG
GCCTCCCGCCGACGGCGGGCTTCGGCATGGGGATCGATCGCCTGACGATGCTCCTCACCGGCGCGACATCCATCCGCGAC
GTCATCCTGTTCCCCCTCCTGCGCCCCGAGACCGACGCGTGA

Protein sequence :
MSQQKDRPNDKKSERLAKEAAEAARAEETLIATRKAKAARLRARGENPFANDVTTGEPLTELAAVRARFDGARAAAEPRP
EGAAPDRPAAPTVDRYDPARVEPIPFRVAGRVLFVRSFGGVTFVRLRDRTGELQLYCEQGGLADFERLEDIDLGDFVEAH
GVAMATQKGELSIRAERLRLLTKAYRPLPTKTSFKDVESRYRMRYVDLVANPHVAGVFRARSAIVAALREFLDARGFLEV
ETPTMHTLVGGAAAKPFKTHHNALDLELFMRIAPELFLKRLVVGGFERVYEIARCYRNEGLSTRHNPEFTMLEYYQAYAT
YETLMDATEAMLRHVDARLAERLPAEHAAWASQRAFSLERFARVPMAEGLARALEKAGLPADVPQRVAADDAPIKEWAKA
AKASGREIDWTNFRAGAKKCESPGELVFCAYEYVVEPFLTKDYRTEAGDKSVPVFIIDYPFEVSPLARKKDSDPSLVDRF
ELFIEGRELCNAFSELNDPEDQDARFRAQVAKKSRGEEETMDYDADYVRALEHGLPPTAGFGMGIDRLTMLLTGATSIRD
VILFPLLRPETDA

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
unnamed CAD66193.1 putative lysil-tRNA synthetase LysU Not tested PAI III 536 Protein 6e-82 43

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
SCE1572_10375 YP_008148521.1 hypothetical protein VFG1668 Protein 3e-82 43