Name : STY4521
Accession : NP_458616.1
PAI name : SPI-7
PAI accession : NC_003198_P9
Strain : Salmonella enterica RSK2980
Virulence or Resistance : Not determined
Product : hypothetical protein
Function : -
Note : Weakly similar to the C-terminus of several polysaccharide biosynthesis proteins e.g. Streptococcus pneumoniae capsular polysaccharide synthesis protein Cps14D TR:P72512 (EMBL:X85787) (227 aa) fasta scores: E(): 1.7e-06, 32.8% id in 189 aa
Homologs in the searched genomes : 51 hits (51 protein-level)
Publication : 
  -Parkhill,J., "Direct Submission", Submitted (25-OCT-2001) Submitted on behalf of the Salmonella sequencing team, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK.
  -Parkhill,J., Dougan,G., James,K.D., Thomson,N.R., Pickard,D., Wain,J., Churcher,C., Mungall,K.L., Bentley,S.D., Holden,M.T., Sebaihia,M., Baker,S., Basham,D., Brooks,K., Chillingworth,T., Connerton,P., Cronin,A., Davis,P., Davies,R.M., Dowd,L., White,N., "Complete genome sequence of a multiple drug resistant Salmonella enterica serovar Typhi CT18", Nature 413 (6858), 848-852 (2001) PUBMED 11677608.
  -Parkhill,J., Dougan,G., James,K.D., Thomson,N.R., Pickard,D., Wain,J., Churcher,C., Mungall,K.L., Bentley,S.D., Holden,M.T., Sebaihia,M., Baker,S., Basham,D., Brooks,K., Chillingworth,T., Connerton,P., Cronin,A., Davis,P., Davies,R.M., Dowd,L., White,N., "Direct Submission", Submitted (10-SEP-2013) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.
 
   
       
      
DNA sequence :
ATGCAAACTGTATCTCGCGTTAAGTGTTCCCCCCAGGTTTCAACCGGGGAGTCTGTAGAAAGAAAACCAGACGCCAGTAT
TCATTTGGCACAGCAGGTTTCACCGCATTCGTTGTTACTCCCTGATGAGCAACAATTCCCGGTAGTATTGCCAGTAGTAT
CCACTAAGGGCGGAGAAGGGAAGTCAACCAAGGCAGGCAATATTGCGGGTTACACCGCCGATGCTGGTCTGAAAACACTG
CTAATCGATGGTGATTATAATCAGCCAACAGCCAGCAGTATTTTTAAACTTCACTATGAAGCACCCTGCGGACTGTATGA
ATTACTCATGCAGACTGCTGACCTTAACAAACCTGACAGCATCATTTCCCGCACGGTTATTCCCAATCTTGACGTCATCA
TTTCCAACGATCCTGACGATCGTCTTTCCAATGATATGCTGCATGCAGCTGATGGCAGAATGCGTCTGCGTAATGTCCTG
CAGCATCCTCTTTTCAGACAATATGACGTCATAATCGTCGATTCCAAAGGCGCTGGCGGGGTGATGGTGGAGCTCGTGGT
GCTCGCTGCGACTCAAAGCGTCATGGGTGTTATTAAACCGATTTTACCCGATGTACGTGAGTTCCTACGCGGCACTGTAC
GTCTTTTATCCAAACTTCTGGTTCTGGAACCCTACGGTATCCATATTCCCGATATTCGAATTCTCGCCAACTGTGTTGAA
CCCACTGTACTGGATCGAAACACCCTCAACGAACTCAAGGCAATCGTGGATAAAGGTCAGTACCCCCAGTCAGACCGTAT
TGCCATATCAATGCTGAATACCGAAATAGAGCAACTGGAAGTCTACAAACGTGGGCATGCGTGCGGGCAGCCAGTACATC
GTCTCGAATATAAAACTGACCGGGTAAGCCTGCCGGCAGCGGAGTCCATGCACCACCTGGTCTGTGAGTTATTTCCTCAA
TGGAAAGATAAGTTTGATGCGGTTCTGGTTAACCGGCCTCAGCCCGGATTTGGTCAGGGGGCGGATGAGGTATGA
  
Protein sequence :
MQTVSRVKCSPQVSTGESVERKPDASIHLAQQVSPHSLLLPDEQQFPVVLPVVSTKGGEGKSTKAGNIAGYTADAGLKTL
LIDGDYNQPTASSIFKLHYEAPCGLYELLMQTADLNKPDSIISRTVIPNLDVIISNDPDDRLSNDMLHAADGRMRLRNVL
QHPLFRQYDVIIVDSKGAGGVMVELVVLAATQSVMGVIKPILPDVREFLRGTVRLLSKLLVLEPYGIHIPDIRILANCVE
PTVLDRNTLNELKAIVDKGQYPQSDRIAISMLNTEIEQLEVYKRGHACGQPVHRLEYKTDRVSLPAAESMHHLVCELFPQ
WKDKFDAVLVNRPQPGFGQGADEV
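
The protein sequence above reads as the translation of the DNA sequence in this record. A minimal Python sketch of that translation follows, assuming the standard genetic code and reading frame 1. The names `CODON_TABLE` and `translate` are illustrative and not part of this database; the sketch is not the tool used to produce the record.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G order
# (assumption: the standard code applies to this coding sequence).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted as-is.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons containing unexpected characters are reported as 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the DNA sequence in this record.
    print(translate("ATGCAAACTGTATCTCGCGTTAAGTGTTCC"))  # MQTVSRVKCS
```

Passing the full DNA sequence to `translate` allows the result to be compared with the protein sequence listed in this record.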
  