Gene Information

Name : SCE1572_30350 (SCE1572_30350)
Accession : YP_008152458.1
Strain : Sorangium cellulosum So0157-2
Genome accession: NC_021658
Putative virulence/resistance : Virulence
Product : ATPase AAA
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 8623627 - 8626326 bp
Length : 2700 bp
Strand : -
Note : Derived by automated computational analysis using gene prediction method: Protein Homology.
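
The Position and Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a stop codon; the function names `feature_length` and `expected_residues` are hypothetical helpers introduced here for illustration, not part of any database tool.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_residues(cds_length: int) -> int:
    """Number of amino acids encoded by a CDS that includes one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # subtract the stop codon


length = feature_length(8623627, 8626326)
print(length)                     # 2700
print(expected_residues(length))  # 899
```

Under these assumptions the coordinates agree with the stated length of 2700 bp, and the predicted protein length (899 residues) matches the protein sequence listed in this record.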

DNA sequence :
ATGCCTCGCATGCAGCTCGTCGACACCAAGGCCATCGTCAAGCGACTCACCAAGAACTGCACTTCAGCCCTCGAGGCGGC
GATTGGCCAGTGTGTCAACGCGCGCCACTACGAGGTCACGCTCGAGCACCTGCTCCTCGCGCTGCTCGACGACGCGAACT
CCGACATCGCGTTTCTCGTATCCCACTATGATCTCGACCCGTCGCACCTCCGCGCCGCGCTCCAGCGCAGCCTGGAGGAG
CTGAGGAAGGGGAACGCCGGGAGGCCGGTGCTCTCTCCGACGATGCTCGAGTGGATGCAGGACGCCTATCTCGTCGGATC
CATGGAGTATGGCTACCAGCGGGTGCGGAGCGGCGTGCTCTTCCAGCGCCTCGTCCAGCAGCCGACCCGGTACTCGGTGA
GCGGCATCGGCGCTTACCTCGAGGGCATCTCGAAAGACGATCTCAAGACCAACCTCTCCAAGATCGTCTCCGGCTCGAAG
GAAGAGGCCGAGGCCGCCGCGGCGTCCGCGGGCGCGCCCGGGGCGGGGAAGCTGCCGGCCGGCATGGCGAGCGCCGGGCC
CGACTCGGCCCTGGCGAAGTTCTGCGTCGACTACACAGGCAAGGCGCGCGCGGGGCAGATCGATCCGATCTTCGGCCGCG
AGCGGGAGATCCGGCAGGTCATCGACATCCTCGCGCGGCGCCGCAAGAACAACCCGATCATCGTCGGCGACGCGGGCGTC
GGCAAGACGGCGCTCGTCGAGGGGCTCGCGCTGCTCATCGTCGAGAGCTCGCCGGAGAACCCGAAGGTGCCGCCGCTCCT
GCAAGGGGTCGACATCCTCGGGCTCGACATGGGCCTGCTCCAGGCGGGCGCCGGCGTGAAGGGCGAGTTCGAGAACCGCA
TGAAGCAGGTGATCGCCGAGGTGAAGGGGTCGTCGAAGCCGATCATCCTCTTCATCGACGAGGCGCACACGCTCATCGGC
GCGGGCGGCCAGCAGGGCGGCGGCGACGCGGCCAACCTGCTCAAGCCGGCGCTCGCCCGCGGCGAGCTCCGGACGATCGC
GGCCACGACGTGGAGCGAGTACAAGAAGTACTTCGAGAAGGACGCGGCGCTGGAGCGGCGCTTCCAGCCCGTCAAGGTCG
ACGAGCCGAGCGAGCCGGCCGCCGTCGTGATGCTCCGCGGGCTCCGGCCGAAGTTCGAGCAGGCCCACAACGTCATCATC
CAGGACGAGGCGGTGACCGCCGCCGTGCGGCTCTCGGCGCGCTACCTGACCGGCAGGCAGCTGCCGGACAAGGCGGTCGA
CCTGCTCGACACGTGCGCGGCGCGGGTGAAGGTCGCGCTGCAGCAGCGCCCCGCCGCGGTCGAGGACGCCGAGATCCTCA
TCGTGAACACCGAGACCGAGCTCAAGGCGCTCGAGCGCGATCGGGACAAGGGCGTGCACATCGACGCCGAGCGGGTCGCC
GAGCTCAAGGAGCGGCTCGCCAAGGCGAAGGCCGAGCTCGAGGAGGTGCGCGCCGCCTACGCGAAGGAGACGGCCGGCAC
GCAGAAGGTCATCGACGCCCGCAAGAAGATGGACGAGGCGAAGTCGAACGAGGAGCGCGACGCCGCCCGCCGCGAGGTGG
TGAAGGCCCTCGACGAGCTCAAGCAGAGCCAGGGCGAGGTGCCGCTGATCCGGCCGGACGTCGACGAGGCCATGGTGGCG
AGCGTCGTGGCGGCGTGGACGGGCATCCCCGTCGGCAAGATGGTCCAGGACGACGTGAAGGCGCTCCTCGAGATGGAGGA
CCGGCTGACCCGGCGGATCAAGGGGCAGACGCACGGCATCGTGACCATCTCGAAGGAGCTCCGGAGCGCGCGCGCGGGGC
TGAAGCCGCTGAGCACGCCGCAGGGCGTGTTCCTGCTCGTCGGGCCGAGCGGCGTCGGCAAGACGGAGACCGCGCTCGGG
ATCGCCGACCTCATGTTCGGCGGCGAGCGGATGATGACCGTCATCAACATGTCCGAGTTCCAGGAGAAGCACACGGTCTC
CCGGCTCATCGGCTCGCCGCCCGGCTACGTCGGCTACGGCGAGGGCGGCATGCTCACCGAGGCCGTGCGCCAGCGGCCCT
ACACGGTCGTGCTCCTCGACGAGGTCGAGAAGGCCGATCCCGACGTGCTGAACCTGTTCTACCAGGTGTTCGACAAGGGC
ATGCTGAGCGACGGCGAGGGGCGGCTCGTCGACTTCAAGAACACGGTCATCATCCTGACGAGCAACCTCGCGACCGACAA
GATCACGAACATGACGATCGCCGCGCGGGAAGAGGACCCGGCGCGGCGGCTCGACGCCGACGGCGAGTTCATCAAGGAGG
TGGTCGAGGCGATCAAGCCGACGCTCTCGGCGCACTTCAAGCCGGCGCTGCTCGCCCGCATGACGACGGTGCCGTACCTG
CCGATCTCGCCCGACGCGCTCGGCGAGATCACCCGGCTCAAGCTGGACGCGCTCGTCGATCGCCTGCGCAAGAGCCAGCG
GATCGAGGCGTCGTACTCGGACGCGCTCGTCGACACGATCGCCGCGCGCTGCACCGAGGTCGACACGGGCGCGCGCAACA
TCGATCACATCCTCCGGGCGTCGCTCCTGCCGCAGCTCTCGGTCGCCGTCCTCGAGAAGATGGCGGAGGGCCCGCTCCCG
AAGCGGCTCCAGATCGGGGTCGACGCCGAGAAGAACTTCACCGTCTCGTTCTCGGACTAA

Protein sequence :
MPRMQLVDTKAIVKRLTKNCTSALEAAIGQCVNARHYEVTLEHLLLALLDDANSDIAFLVSHYDLDPSHLRAALQRSLEE
LRKGNAGRPVLSPTMLEWMQDAYLVGSMEYGYQRVRSGVLFQRLVQQPTRYSVSGIGAYLEGISKDDLKTNLSKIVSGSK
EEAEAAAASAGAPGAGKLPAGMASAGPDSALAKFCVDYTGKARAGQIDPIFGREREIRQVIDILARRRKNNPIIVGDAGV
GKTALVEGLALLIVESSPENPKVPPLLQGVDILGLDMGLLQAGAGVKGEFENRMKQVIAEVKGSSKPIILFIDEAHTLIG
AGGQQGGGDAANLLKPALARGELRTIAATTWSEYKKYFEKDAALERRFQPVKVDEPSEPAAVVMLRGLRPKFEQAHNVII
QDEAVTAAVRLSARYLTGRQLPDKAVDLLDTCAARVKVALQQRPAAVEDAEILIVNTETELKALERDRDKGVHIDAERVA
ELKERLAKAKAELEEVRAAYAKETAGTQKVIDARKKMDEAKSNEERDAARREVVKALDELKQSQGEVPLIRPDVDEAMVA
SVVAAWTGIPVGKMVQDDVKALLEMEDRLTRRIKGQTHGIVTISKELRSARAGLKPLSTPQGVFLLVGPSGVGKTETALG
IADLMFGGERMMTVINMSEFQEKHTVSRLIGSPPGYVGYGEGGMLTEAVRQRPYTVVLLDEVEKADPDVLNLFYQVFDKG
MLSDGEGRLVDFKNTVIILTSNLATDKITNMTIAAREEDPARRLDADGEFIKEVVEAIKPTLSAHFKPALLARMTTVPYL
PISPDALGEITRLKLDALVDRLRKSQRIEASYSDALVDTIAARCTEVDTGARNIDHILRASLLPQLSVAVLEKMAEGPLP
KRLQIGVDAEKNFTVSFSD
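
The relationship between the DNA and protein sequences above can be illustrated with a short translation sketch. It assumes the listed DNA is already the coding strand (it begins with ATG and ends with TAA even though the gene lies on the minus strand) and uses the standard codon assignments; `translate` is a hypothetical helper written for this example, and only short excerpts of the sequence are used.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First six codons and last three codons of the DNA sequence in this record.
print(translate("ATGCCTCGCATGCAGCTC"))  # MPRMQL
print(translate("TCGGACTAA"))           # SD
```

The two excerpts reproduce the start (MPRMQL) and end (SD) of the listed protein sequence. For a full-length check, the complete DNA sequence can be passed to the same function with its line breaks left in place.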

• Homologs from PAI DB

Gene     GenBank Accn    Product                                         Virulence or Resistance  PAI or REI  Alignment Type  E-val   Identity (%)
STY0294  NP_454876.1     ClpB-like protein                               Not tested               SPI-6       Protein         8e-140  45
clpC     YP_005163377.1  ATP-dependent Clp protease ATP-binding subunit  Not tested               Not named   Protein         2e-109  41

• Homologs from VFDB (virulence genes)

Gene           GenBank Accn    Product     ID of source DB  Alignment Type  E-val   Identity (%)
SCE1572_30350  YP_008152458.1  ATPase AAA  VFG2084          Protein         7e-154  45
SCE1572_30350  YP_008152458.1  ATPase AAA  VFG2076          Protein         4e-159  45
SCE1572_30350  YP_008152458.1  ATPase AAA  VFG0079          Protein         2e-112  41