Gene Information

Name : SSON53_01235 (SSON53_01235)
Accession : YP_005454830.1
Strain : Shigella sonnei 53G
Genome accession: NC_016822
Putative virulence/resistance : Virulence
Product : ATP-dependent Clp proteinase ATP-binding chain
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 259988 - 262777 bp
Length : 2790 bp
Strand : -
Note : 'COG0542 ATPases with chaperone activity, ATP-binding subunit'

DNA sequence :
ATGATCCAGATTGATCTTCCCACGCTGGTAAAACGGCTGAACCTGTTCTCCCGCCAGGCGCTGGAGATGGCCGCCTCTGA
ATGTATGAGTCAGCAGGCAGCGGAAATTACCGTCAGCCATGTGCTCATTCAGATGCTCGCCATGCCACGCAGTGACCTGC
GGGTTATTACCCGCCAGGGCGATATTGGCATGGAAGAGTTGCGCCAGGCGCTGACGGTAGAGAACTACACAACCGCCCGT
TCTGCGGACAGCTACCCGGCGTTTTCCCCGATGCTGGTTGAGTGGCTTAAAGAGGGCTGGCTGCTGGCGTCGGCTGAGAT
GCAGCACAGCGAACTGCGCGGCGGCGTGTTGCTGCTGGCCCTGCTGCATTCGCCGCTGCGTTATATACCGCCTGCTGCCG
CCCGGCTGTTGACCGGCATTAACCGTGACCGTCTGCAACAGGACTTTGTGCAGTGGACACAGGAGTCGGCGGAATCAGTC
GTGCCGGATGCAGACGGTAAAGGCGCAGGCACAATGACGGACGCCTCTGACACCCTGCTTGCCCGCTATGCCAAAAACAT
GACCGCAGACGCCCGTAACGGCAGGCTTGACCCGGTACTGTGCCGCGACCACGAAATCGACCTGATGATCGACATTCTCT
GCCGCCGCCGTAAAAACAACCCGGTGGTGGTGGGCGAAGCGGGCGTGGGCAAAAGCGCACTGATTGAAGGGCTGGCGCTG
CGCATCGTGGCAGGCCAGGTGCCGGACAAGCTGAAAAACACCGATATCATGACCCTTGACTTGGGCGCATTGCAGGCCGG
GGCGTCGGTGAAGGGTGAATTCGAAAAACGTTTCAAAGGGCTGATGGCGGAGGTCATTTCCTCCCCGGTGCCGGTCATTC
TGTTTATCGACGAAGCACATACCCTGATTGGCGCGGGCAACCAGCAGGGCGGGCTGGATATCTCCAACCTGCTCAAACCG
GCGCTGGCGCGCGGCGAGCTGAAAACCATCGCCGCCACCACCTGGAGCGAGTACAAAAAATACTTCGAAAAAGATGCCGC
CCTGTCGCGCCGCTTCCAGTTGGTGAAGGTCAGCGAACCCAACGCTGCCGAAGCCACCATTATTCTGCGCGGTCTGTCGG
CGGTCTATGAACAGTCTCACGGCGTGCTGATTGATGATGACGCCTTGCAGGCCGCTGCGACATTAAGCGAGCGTTATCTC
TCCGGGCGTCAGTTACCGGACAAAGCGATTGATGTGCTGGATACCGCCTGCGCCCGTGTGGCCATCAACCTGTCGTCGCC
GCCGAAGCAAATCTCGGCGCTGACCACTCTGAGCCACCAGCAGGAGGCGGAAATTCGCCAGCTTGAGCGCGAGCTTCGCA
TCGGACTGCGTACCGACACATCACGGATGACCGAGGTGCTGGTGCAGTATGATGAAACGCTGACGGCGCTGGATGAACTG
GAAGCGGCCTGGCACCAGCAGCAGACGCTGGTCCGGGAGATTATCGCGCTGCGCCAGCAGTTACTGGGCGTGGCAGAGGA
CGATGCGGCGCCGTTGCCGGACGCAGATACCGTGGAGGATACGCAGCCAGAGTCAGAGTCAGAGTCAGAGTCAGAGTCAG
AGTCAGAGCAGGATAATACCGGTGCCGAACCGGCTGATGAGGCCGACAGAGAACAACCGGAAGAGACCGCTGAAACAGTT
TCCCCGGTACAGCGGCTGGCCCAGCTCACTGCCGAACTGGACGCCCTGCATAACGACCGGTTGCTGGTCTCCCCGCACGT
CGATAAAAAACAGATTGCGGCGGTGATTGCCGAATGGACCGGCGTGCCGCTCAACCGCCTGTCGCAGAATGAAATGTCGG
TCATCACCGACCTGACGAAATGGCTGGGTGACACCATCAAAGGCCAGGACCTGGCGATTGCCAGCCTGCATAAGCATCTA
CTGACCGCACGCGCCGACCTGCGTCGTCCGGGACGCCCGCTCGGTGCGTTCCTGCTGGCTGGCCCCAGCGGTGTGGGTAA
AACCGAAACCGTCCTGCAACTGGCAGAGCTGCTCTACGGCGGTCGCCAGTACCTGACCACCATCAATATGTCCGAATTCC
AGGAGAAACACACCGTCTCGCGGCTGATTGGTTCGCCTCCGGGCTACGTTGGCTACGGTGAAGGCGGCGTACTGACCGAA
GCGATTCGCCAGAAACCCTACTCGGTAGTACTGCTCGATGAAGTGGAAAAAGCGCACCCGGATGTGCTCAACCTGTTCTA
CCAGGCGTTCGATAAGGGCGAAATGGCAGACGGTGAAGGCCGCCTGATTGACTGCAAAAATATCGTCTTCTTCCTGACGT
CCAACCTCGGCTACCAGGTAATAGTCGAGCATGCCGATGACCCGGAAACCATGCAGGAAGTACTGTATCCGGTGCTGGCC
GACTTCTTCAAACCTGCCCTGCTGGCGCGTATGGAAGTGGTGCCGTATCTGCCGCTGTCGAAAGAGACGCTCGCCACCAT
TATCGCCGGAAAACTGGCCCGTCTGGATAACGTGCTGCGCAGTCGCTTTGGTGTAGAAGTGGTCATTGAACCGGAAGTGA
CGGACGAAATCATGAGCCGCGTCACCCGCGCGGAAAACGGCGCGAGGATGCTGGAATCGGTCATCGATGGCGACATGCTA
CCGCCGCTCTCGCTGCTGCTGTTGCAGAAAATGGCGGCTAACACGGCGATTGCCCGGATTCGGTTGTCGGCAGTGGACGG
CGCATTTACGGCAGACGTGGAAGATGCTCAGAACGACGAGTCCGTCACAAAGGATGAAACGGTTTTATGA

Protein sequence :
MIQIDLPTLVKRLNLFSRQALEMAASECMSQQAAEITVSHVLIQMLAMPRSDLRVITRQGDIGMEELRQALTVENYTTAR
SADSYPAFSPMLVEWLKEGWLLASAEMQHSELRGGVLLLALLHSPLRYIPPAAARLLTGINRDRLQQDFVQWTQESAESV
VPDADGKGAGTMTDASDTLLARYAKNMTADARNGRLDPVLCRDHEIDLMIDILCRRRKNNPVVVGEAGVGKSALIEGLAL
RIVAGQVPDKLKNTDIMTLDLGALQAGASVKGEFEKRFKGLMAEVISSPVPVILFIDEAHTLIGAGNQQGGLDISNLLKP
ALARGELKTIAATTWSEYKKYFEKDAALSRRFQLVKVSEPNAAEATIILRGLSAVYEQSHGVLIDDDALQAAATLSERYL
SGRQLPDKAIDVLDTACARVAINLSSPPKQISALTTLSHQQEAEIRQLERELRIGLRTDTSRMTEVLVQYDETLTALDEL
EAAWHQQQTLVREIIALRQQLLGVAEDDAAPLPDADTVEDTQPESESESESESESEQDNTGAEPADEADREQPEETAETV
SPVQRLAQLTAELDALHNDRLLVSPHVDKKQIAAVIAEWTGVPLNRLSQNEMSVITDLTKWLGDTIKGQDLAIASLHKHL
LTARADLRRPGRPLGAFLLAGPSGVGKTETVLQLAELLYGGRQYLTTINMSEFQEKHTVSRLIGSPPGYVGYGEGGVLTE
AIRQKPYSVVLLDEVEKAHPDVLNLFYQAFDKGEMADGEGRLIDCKNIVFFLTSNLGYQVIVEHADDPETMQEVLYPVLA
DFFKPALLARMEVVPYLPLSKETLATIIAGKLARLDNVLRSRFGVEVVIEPEVTDEIMSRVTRAENGARMLESVIDGDML
PPLSLLLLQKMAANTAIARIRLSAVDGAFTADVEDAQNDESVTKDETVL

• Homologs from PAI DB

GeneGenBank Accn Product Virulance or Resistance PAI or REI Alignment Type E-val Identity
aec27 AAQ96721.1 Aec27 Not tested AGI-1 Protein 0.0 90
aec27 YP_851418.1 ATPase Not tested PAI II APEC-O1 Protein 0.0 90

• Homologs from VFDB (virulence genes)

GeneGenBank Accn Product ID of source DB Alignment Type E-val Identity
SSON53_01235 YP_005454830.1 ATP-dependent Clp proteinase ATP-binding chain VFG2076 Protein 1e-136 42