Name : STY4821
Accession : NP_458899.1
PAI name : SPI-10
PAI accession : NC_003198_P10
Strain : Salmonella enterica RSK2980
Virulence or Resistance: Not determined
Product : integrase
Function : -
Note : Similar to Bacteriophage P4 Integrase SW:VINT_BPP4 (P08320) (439 aa) fasta scores: E(): 0, 96.1% id in 410 aa, and to Escherichia coli P4 Integrase-like protein TR:Q9RPJ5 (EMBL:AF157599) (428 aa) fasta scores: E(): 0, 91.4% id in 419 aa, and to Escherichi
Homologs in the searched genomes : 458 hits ( 457 protein-level, 1 DNA-level )
Publication :
-Parkhill,J., "Direct Submission", Submitted (25-OCT-2001) Submitted on behalf of the Salmonalla sequencing team, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK.
-Parkhill,J., Dougan,G., James,K.D., Thomson,N.R., Pickard,D., Wain,J., Churcher,C., Mungall,K.L., Bentley,S.D., Holden,M.T., Sebaihia,M., Baker,S., Basham,D., Brooks,K., Chillingworth,T., Connerton,P., Cronin,A., Davis,P., Davies,R.M., Dowd,L., White,N., , "Complete genome sequence of a multiple drug resistant Salmonella enterica serovar Typhi CT18", Nature 413 (6858), 848-852 (2001) PUBMED 11677608.
-Parkhill,J., Dougan,G., James,K.D., Thomson,N.R., Pickard,D., Wain,J., Churcher,C., Mungall,K.L., Bentley,S.D., Holden,M.T., Sebaihia,M., Baker,S., Basham,D., Brooks,K., Chillingworth,T., Connerton,P., Cronin,A., Davis,P., Davies,R.M., Dowd,L., White,N., , "Direct Submission", Submitted (10-SEP-2013) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.
DNA sequence : | |
ATGAAGCTCAACGCCAGACAGGTCGAGACCGCAAAGCCAAAAGACAAAACCTACAAAATGGCCGATGGCGGCGGTTTGTA
CCTTGAGGTTTCGGCCAAGGGATCCAAATACTGGCGCATGAAATACAGACGCCCCTCTGACAAAAAAGAGGATCGCCTCG
CTTTTGGTGTTTGGCCTACTGTGACACTTGCTCAGGCAAGAGCAAAGCGCGATGAAGCCAAAAAGTTGCTAGTGCAGGGC
ATTGATCCAAAAACCGAACAGAAAGAGGCTCAGGCCGAGAACTCGGGGGCGTATACTTTCGAAACTATCGCTCGTGAATG
GCATGCCAGTAACAAGCGCTGGAGCGAAGACCATCGATCGCGCGTTCTTCGCTATCTTGAACTTTATATTTTCCCTCATA
TCGGTTCGTCCGACATTCGCCAGCTCAAAACCAGTCACCTGTTAGCCCCGATCAAAAAAGTTGATGCCAGTGGTAAACAC
GACGTCGCTCAGCGCCTGCAACAGCGCGTCACGGCTATTATGCGTTATGCCGTACAGAACGATTACATCGACTCTAATCC
AGCCAGTGACATGGCCGGTGCGTTATCGACTACCAAAGCACGACACTATCCAGCCTTACCTTCTAGCCGTTTCCCTGAGT
TTCTTGCTCGCCTTGCTGCATATCGTGGCCGTGTAATGACACGGATTGCTGTCGAGCTTTCCTTACTAACTTTTGTACGT
TCCAGCGAGTTACGTTTCGCGCGTTGGGACGAGTTCGACTTCGATAAATCCCTTTGGCGCGTACCGGCAAAACGAGAAGA
AATTAAGGGTGTGCGTTATTCGTACCGTGGCATGAAGATGAAAGAGGAGCATATCGTTCCTCTCAGTCGGCAGGCGATGG
TTTTGTTAGAGCAGCTCAAGCAGATTAGTGGTGATAAAGAGCTGTTGTTTCCGGGGGATCACGACGCAACTAAGGTTATG
AGCGAAAACACGGTAAATAGCGCATTGCGTGCGATGGGTTATGACACAAAAACCGAGGTCTGCGGGCATGGGTTTCGGAC
GATGGCGCGTGGTGCTTTGGGTGAGTCAGGATTATGGAGCGATGACGCGATAGAGCGTCAGTTAAGCCATTCTGAGCGTA
ACAATGTGCGTGCTGCCTATATTCACACTTCAGAGCATTTGGATGAGCGTCGTTTGATGGTGCAGTGGTGGGCTGATTAT
CTCGATATTATTCAGTATGAGCATATTACACCTTATGAATATGCAAGAGTCTGCAACATAAAATGA
|
Protein sequence : | |
MKLNARQVETAKPKDKTYKMADGGGLYLEVSAKGSKYWRMKYRRPSDKKEDRLAFGVWPTVTLAQARAKRDEAKKLLVQG
IDPKTEQKEAQAENSGAYTFETIAREWHASNKRWSEDHRSRVLRYLELYIFPHIGSSDIRQLKTSHLLAPIKKVDASGKH
DVAQRLQQRVTAIMRYAVQNDYIDSNPASDMAGALSTTKARHYPALPSSRFPEFLARLAAYRGRVMTRIAVELSLLTFVR
SSELRFARWDEFDFDKSLWRVPAKREEIKGVRYSYRGMKMKEEHIVPLSRQAMVLLEQLKQISGDKELLFPGDHDATKVM
SENTVNSALRAMGYDTKTEVCGHGFRTMARGALGESGLWSDDAIERQLSHSERNNVRAAYIHTSEHLDERRLMVQWWADY
LDIIQYEHITPYEYARVCNIK
|
|