Gene Information

Name : SSON53_08320 (SSON53_08320)
Accession : YP_005456214.1
Strain : Shigella sonnei 53G
Genome accession: NC_016822
Putative virulence/resistance : Unknown
Product : replication protein for prophage CP-933T
Function : -
COG functional category : -
COG ID : -
EC number : -
Position : 1575627 - 1578449 bp
Length : 2823 bp
Strand : -
Note : -
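
The Position and Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is the usual convention for genome annotations but is not stated in this record; the variable names are illustrative only.

```python
# Values copied from the Position and Length fields of this record.
start, end = 1575627, 1578449
reported_length = 2823

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end - start + 1

# A complete coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length, codons, remainder, protein_length)
# prints: 2823 941 0 940
```

Under these assumptions the span matches the reported 2823 bp, and the 941 codons correspond to a 940-residue protein plus one stop codon.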

DNA sequence :
ATGACGGCAGAGTACATCAGGGACTGGCAACAACCGCGCCACGCAGTGGGGCGTGAAGGAACGGGGATCCCCGCTCCTGA
ATCCGCGCTTTCCTCCTGGCTGGATGCCCACCGGGCAGAGAACGAGCGCCGCCAGGAAATGGCTGATGCGGCGTTCTCCG
CCACGCCACTGGGCAACCTGATTAATAAAAGCCTGGACGCACAGGAAAAACAGGACAAAACCATCACACTGGCAGGAGAC
GCCAGAAAACAGGCACGCGGCGCGGTGGATGAAGCCATGGCCTCGCTGCGCCTGCTGCCGTCCTATCTGCGCGATCCGCT
TATTCGCCACCTCTCCTTCCTGCGCAAAAAACAGGAAGCCGATCGCCGGAAAGGCAAAAAGAGCTGGCAGGCGGAACGCT
ATGCACGCGGAACCCTGCGCAAAATATTCGAACGTCTGGATCGCACTGACGGACACTGGCTGACACCGGGTTATCGCTCC
CTTGCCGGACGTGAACGCCTGGACGATTTGCTTTACCTGCCGCAACTCAACAAACACCAGATACAGACGCTGGCCACCAT
GACGGCGGCGATGTTCAGCAGCACCTTCGAAAAACTCTGCGATGGTTTTGGCGCGACTGATGGCGAGCTGACCATGGATG
TAACGCTGAAGGCGTATCAGATGCTGGCCCGCATGGCGTTACACTTACACGCCATGCCTCCGCATTATGACGCACTGACA
ACAGACAAAGACCGGAGGCACGAACCGGACACAGAACTGCTGCCGGGCGCAATCCTTCGCCTGACCTGTGCGGAATGGTG
GAAACGCAAACTGTGGCTGTTACGTTGCGAGTGGAGAGAAGAACAACTTCGCGCCGCCTGTCTGGTTTCCAGAAAAACAT
CGCCCTATCTGAGCCAGGACGCGTTAAGCGAGTTTCGCGCACAGCGCGAGAAAACACGCGATTTCCTGAAAAGTTTCATG
CTGGAAAATGAAGACGGGTTCACGATTGATCTCGAGACGGTGTATTACGCGGGAGTAAGTAACCCGGTTCACCGTAAGGC
AGAAATGATGGCCACCATGAAGGGGCTGGAACTTCTGGCCGAAGCCCGTGGCGACAAAGCGGTGTTTCTGACTGTCACCT
GCCCGTCAAAATACCACGCCACAACGGAGAACGGTCATCCGAATCCCAAATGGAACGGGGCCACAATGCGCGACTCCAGC
GATTACCTGGTTAACACGTTTTTTGCGACGGTCCGCAAAAAACTGAACCGCGACGGCCTGCGCTGGTATGGCATCCGCAC
GGCGGAGCCTCACCATGACGGCACCGTGCACTGGCATATGATGGTCTTTGCTCATCCGGAAGAAATCGACACCATTGTGT
CCCACACCCGCGATATTGCCATTCAGGAAGATCGTCACGAGCTGGGCGATGACATAACTCCGCGCTTTAAGGCGGAGTAT
GTAGACGGCTCAAAAGGCACGCCAACCAGCTACATCGCCACCTACATCGGGAAAAACCTGGACAGCCGCGCCGTGGATGG
CATCGACCCGAAAACGGGCAAGCCACGCGTTGACCACGAAACCGGAAAATCAATGGCCGAGAGCGTGGAACGCGCCATCG
GCTGGGCGCGCCTTCACCGGGTCCGCCAGTTCCAGTTCTTTGGCATCCCCTCCCGCCAGGTATGGCGTGAACTGCGCCGC
CTTGCCAGCCAGATGGCACGCAACCCGGAAGGCCCGCAACGGCTGAAGGATGACGCAATGGATGCGGTTCTTGCCGCCGC
TGATGCCGGATGTTTTGCCACCTACATAGAGAAACAGGGCGGCGTACTTGTTCCACGCAAAGACTACCTGATTCGCACCG
CCTACGACCTCGCCGATGAGCTGAACGATTACGGCGAACAGAGCGTACAGATTTACGGGATCTGGTCACCACTCATCGGG
GAATCCTCCCGTGTGTGCACGCATCCGGATAACTGGAAGCTGGTAAGACGCAAACCGGAAGCGGAAGACAGCGCCCGCGA
AAATGGTTTTGACCTTCAGGGCGGCCCTGCCGCCCCTTGGACTCGTGGCAATAACTGTCCCCGTGTACAGGAAACGGACA
ACAACGGGACAGAACAGCCGGAAGAACGGCCAGCACCGTGGCCGCAGCTCCCTGACGGCGTTGATGTGGATGAATGGATG
CGCTCACTGAAACGGCACGAACGCCGGGCGCTGATGCGTTCGCTTCGTGACAAACAGGCAAAAAACAGCAGTGATGAAAT
GCAGAGCTGGACACAGAGCCGCAAACAGCAGCGGCCTTTGCCTGATAACCACGAATTACTCGCTAAAGAATGGCGGGAGT
CTGCTGAATCTCTCGGCCTGCATATCGGTGAACAACAGATGCAGCACCTGTTACGGGGCGGCAGTCTGTACGTTGACGGC
AGCATCATTGCACCGCAGGGATTTGAAATTGTACGCAAACCGGATACCCGCCCGGACAGCCGAATCACGCAGCTCTGGCA
GCGCCTGAGCCGTAATCATGGCGTAAGCAGCACGGAGATCCGCCATAACCCGGTCGCCAGCTATCTGACACAGCTGGGGG
CATCAGACCCTGAAGCCGCCGCACGCCTGGCATCCACACTTCAGCAGGACCAGAACACCATGAAAACACCCGTTACCGTG
CTTTCTGACATGCTGCGCGCCATCCGCGACGCAGAGCACGCACAGAGAATCAGTGAAACCACTGAACGCGCCCGCCGCAA
AGCAGACCTGCTGCGGGGTGGCCTGACCAGTGGAAACAAAAAACAGACAGAAACGGGACTCACGAATCCCGTAAATGAGC
AAAAAACGCGCAGCGATATATGA

Protein sequence :
MTAEYIRDWQQPRHAVGREGTGIPAPESALSSWLDAHRAENERRQEMADAAFSATPLGNLINKSLDAQEKQDKTITLAGD
ARKQARGAVDEAMASLRLLPSYLRDPLIRHLSFLRKKQEADRRKGKKSWQAERYARGTLRKIFERLDRTDGHWLTPGYRS
LAGRERLDDLLYLPQLNKHQIQTLATMTAAMFSSTFEKLCDGFGATDGELTMDVTLKAYQMLARMALHLHAMPPHYDALT
TDKDRRHEPDTELLPGAILRLTCAEWWKRKLWLLRCEWREEQLRAACLVSRKTSPYLSQDALSEFRAQREKTRDFLKSFM
LENEDGFTIDLETVYYAGVSNPVHRKAEMMATMKGLELLAEARGDKAVFLTVTCPSKYHATTENGHPNPKWNGATMRDSS
DYLVNTFFATVRKKLNRDGLRWYGIRTAEPHHDGTVHWHMMVFAHPEEIDTIVSHTRDIAIQEDRHELGDDITPRFKAEY
VDGSKGTPTSYIATYIGKNLDSRAVDGIDPKTGKPRVDHETGKSMAESVERAIGWARLHRVRQFQFFGIPSRQVWRELRR
LASQMARNPEGPQRLKDDAMDAVLAAADAGCFATYIEKQGGVLVPRKDYLIRTAYDLADELNDYGEQSVQIYGIWSPLIG
ESSRVCTHPDNWKLVRRKPEAEDSARENGFDLQGGPAAPWTRGNNCPRVQETDNNGTEQPEERPAPWPQLPDGVDVDEWM
RSLKRHERRALMRSLRDKQAKNSSDEMQSWTQSRKQQRPLPDNHELLAKEWRESAESLGLHIGEQQMQHLLRGGSLYVDG
SIIAPQGFEIVRKPDTRPDSRITQLWQRLSRNHGVSSTEIRHNPVASYLTQLGASDPEAAARLASTLQQDQNTMKTPVTV
LSDMLRAIRDAEHAQRISETTERARRKADLLRGGLTSGNKKQTETGLTNPVNEQKTRSDI
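
The protein sequence above should be the translation of the DNA sequence listed in this record. A minimal sketch of that check follows, assuming the DNA is already given in coding orientation (the record lists the strand as "-" but the sequence starts with ATG and ends with TGA) and that the standard codon assignments apply. The `translate` helper and the table construction are illustrative, not part of the database.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G order
# (third base varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, dropping one trailing stop."""
    # Remove line breaks and spaces so multi-line sequences can be pasted in.
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First 30 nucleotides and last 24 nucleotides of the DNA sequence above.
head = translate("ATGACGGCAGAGTACATCAGGGACTGGCAA")
tail = translate("CAAAAAACGCGCAGCGATATATGA")
print(head, tail)
# prints: MTAEYIRDWQ QKTRSDI
```

Both fragments agree with the start (MTAEYIRDWQ) and end (QKTRSDI) of the listed protein. Passing the full 2823 bp sequence to the same function would allow a complete comparison.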

• Homologs from PAI DB

Gene      GenBank Accn   Product                          Virulence or Resistance   PAI or REI   Alignment Type   E-val    Identity
STY4635   NP_458715.1    conserved hypothetical protein   Not tested                SPI-7        Protein          6e-136   42
t4328     NP_807922.1    hypothetical protein             Not tested                SPI-7        Protein          3e-136   42