Name : tnp
Accession : AEA34674.1
PAI name : Not named
PAI accession : HQ018801
Strain : Escherichia coli 042
Virulence or Resistance : Not determined
Product : IS10 transposase
Function : -
Note : similar to NP_052934.1 transposase in Plasmid R100 and NP_863365.1 hypothetical protein R64_p010 in Salmonella typhimurium
Homologs in the searched genomes : 266 hits ( 251 protein-level, 15 DNA-level )
Publication :
-Ziebell,K., Johnson,R.P., Kropinski,A.M., Ahmed,R., Gannon,V., Gilmour,M. and Boerlin,P., "Direct Submission", Submitted (04-AUG-2010) Laboratory for Foodborne Zoonoses, Public Health Agency of Canada, 110 Stone Road West, Guelph, Ontario N1G 3W4, Canada.
-Ziebell,K., Johnson,R.P., Kropinski,A.M., Reid-Smith,R., Ahmed,R., Gannon,V.P., Gilmour,M. and Boerlin,P., "Gene Cluster Conferring Streptomycin, Sulfonamide, and Tetracycline Resistance in Escherichia coli O157:H7 Phage Types 23, 45, and 67", Appl. Environ. Microbiol. 77 (5), 1900-1903 (2011) PUBMED 21239555.
DNA sequence :
ATGTGCGAACTCGATATTTTACACGACTCTCTTTACCAATTCTGCCCCGAATTACACTTAAAACGACTCAACAGCTTAAC
GTTGGCTTGCCACGCATTACTTGACTGTAAAACTCTCACTCTTACCGAACTTGGCCGTAACCTGCCAACCAAAGCGAGAA
CAAAACATAACATCAAACGAATCGACCGATTGTTAGGTAATCGTCACCTCCACAAAGAGCGACTCGCTGTATACCGTTGG
CATGCTAGCTTTATCTGTTCGGGCAATACGATGCCCATTGTACTTGTTGACTGGTCTGATATTAGTGAGCAAAAACGACT
TATGGTATTGCGAGCTTCAGTCGCACTACACGGTCGTTCTGTTACTCTTTATGAGAAAGCGTTCCCGCTTTCAGAGCAAT
GTTCAAAGAAAGCTCATGACCAATTTCTAGCCGACCTTGCGAGCATTCTACCGAGTAACACCACACCGCTCATTGTCAGT
GATGCTGGCTTTAAAGTGCCATGGTATAAATCCGTTGAGAAGCTGGGTTGGTACTGGTTAAGTCGAGTAAGAGGAAAAGT
ACAATATGCAGACCTAGGAGCGGAAAACTGGAAACCTATCAGCAACTTACATGATATGTCATCTAGTCACTCAAAGACTT
TAGGCTATAAGAGGCTGACTAAAAGCAATCCAATCTCATGCCAAATTCTATTGTATAAATCTCGCTCTAAAGGCCGAAAA
AATCAGCGCTCGACACGGACTCATTGTCACCACCCGTCACCTAAAATCTACTCAGCGTCGGCAAAGGAGCCATGGGTTCT
AGCAACTAACTTACCTGTTGAAATTCGAACACCCAAACAACTTGTTAATATCTATTCGAAGCGAATGCAGATTGAAGAAA
CCTTCCGAGACTTGAAAAGTCCTGCCTACGGACTAGGCCTACGCCATAGCCGAACGAGCAGCTCAGAGCGTTTTGATATC
ATGCTGCTAATCGCCCTGATGCTTCAACTAACATGTTGGCTTGCGGGCGTTCATGCTCAGAAACAAGGTTGGGACAAGCA
CTTCCAGGCTAACACAGTCAGAAATCGAAACGTACTCTCAACAGTTCGCTTAGGCATGGAAGTTTTGCGGCATTCTGGCT
ACACAATAACAAGGGAAGACTCACTCGTGGCTGCAACCCTGCTTACTCAAAATCTATTCACACATGGTTACGTTTTGGGG
AAATTATGA
Protein sequence :
MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRW
HASFICSGNTMPIVLVDWSDISEQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVS
DAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK
NQRSTRTHCHHPSPKIYSASAKEPWVLATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDI
MLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITREDSLVAATLLTQNLFTHGYVLG
KL
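The DNA and protein sequences in this record can be cross-checked by translating the coding sequence in frame 1. The sketch below is illustrative only and is not part of the database entry. It assumes the standard genetic code, reads codons from the first base, and stops at the first stop codon. The names `CODON_TABLE`, `translate`, `first_codons` and `last_codons` are hypothetical helpers introduced here, and only the opening and closing codons of the record are used to keep the example short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# ("*" marks a stop codon). Assumed here; the record does not state a table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored. Only A, C, G and T are handled;
    any other symbol raises KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First nine codons and last three codons of the DNA sequence in this record.
first_codons = "ATGTGCGAACTCGATATTTTACACGAC"
last_codons = "AAATTATGA"

print(translate(first_codons))  # MCELDILHD
print(translate(last_codons))   # KL
```

Passing the full 1209-nucleotide sequence to `translate` in the same way should give a 402-residue product to compare with the protein sequence listed above.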