PAI Gene Information


Name : tnp
Accession : AEA34687.1
PAI name : Not named
PAI accession : HQ018801
Strain : Escherichia coli 042
Virulence or Resistance : Not determined
Product : IS66 family transposase
Function : -
Note : similar to ZP_02814205.1 IS66 family element, transposase in Escherichia coli O157:H7 str. EC869 and YP_001458801.1 IS66 family transposase in Escherichia coli HS
Homologs in the searched genomes : 360 hits (359 protein-level, 1 DNA-level)
Publication :
    -Ziebell,K., Johnson,R.P., Kropinski,A.M., Ahmed,R., Gannon,V., Gilmour,M. and Boerlin,P., "Direct Submission", Submitted (04-AUG-2010) Laboratory for Foodborne Zoonoses, Public Health Agency of Canada, 110 Stone Road West, Guelph, Ontario N1G 3W4, Canada.

    -Ziebell,K., Johnson,R.P., Kropinski,A.M., Reid-Smith,R., Ahmed,R., Gannon,V.P., Gilmour,M. and Boerlin,P., "Gene Cluster Conferring Streptomycin, Sulfonamide, and Tetracycline Resistance in Escherichia coli O157:H7 Phage Types 23, 45, and 67", Appl. Environ. Microbiol. 77 (5), 1900-1903 (2011) PUBMED 21239555.


DNA sequence :
ATGAGTCAGAAATACCTCATTCGCATCGCAGAGCTGGAAAGGTTGCTCTCTGAGCAGGCTGAAGCCCTCCGTCAGAAAGA
CCAGCAACTGAGTCTGGTTGAAGAGACGGAAGCCTTCCTGCGCTCTGCACTGACACGTGCCGAAGAAAAGATCGAAGAAG
ATGAACGGGAAATAGAACATCTGCGGGCTCAGATAGAAAAACTGCGCCGGATGCTGTTCGGTACCCGTTCTGAAAAACTG
CGTCGTGAAGTTGAACTGGCTGAGGCCCTGCTGAAACAACGTGAACAGGACAGCGATCGTTACAGTGGGCGGGAAGACGA
TCCTCAGGTTCCCCGCCAGTTGCGACAGTCGCGCCATCGTCGTCCGTTACCGGCACACCTTCCCCGTGAAATACACCGCC
TGGAGCCAGAAGAAAGCTGTTGCCCGGAGTGTGGCGGTGAGCTGGATTATCTGGGGGAAGTCAGCGCTGAACAGCTGGAA
CTGGTGAGCAGTGCCCTGAAAGTGATCCGCACAGAACGGGTAAAAAAAGCCTGTACAAAATGTGACTGTATTGTTGAAGC
ACCGGCGCCGTCCCGCCCGATAGAGCGTGGTATCGCGGGCCCCGGATTACTTGCCCGCGTGTTAACGGGAAAATACTGCG
AACATCTGCCACTGTATCGTCAGAGTGAAATCTTTGCCCGCCAGGGCATCGAACTGAGCCGGGCATTACTCTCCAACTGG
GTTGACGCATGTTGTCAGTTAATGACTCCGCTGAATGATACCCTGTACCGTTACGTGATGAACACCCGCAAGGTTCACAC
TGACGACACACCAGTAAAAGTGCTGGCACCGGGCAGAAAAAAGGCAAAAACAGGACGCATCTGGACGTATGTCCGGGATG
ACCGGAATGCGGGCTCATCAGAGCCACCGGCGGTCTGGTTCGCCTACTCACCAGACAGGCAGGGAAAACATCCGGTACAA
CACCTTCGTCCCTTCCGGGGTATCCTGCAGGCGGATGCGTTCAGCGGTTACGATCGGCTGTTCAGTGCCGAACGTGAAGG
TGGTGCACTGACAGAAGTTGCGTGCTGGGCCCATGCCCGCAGGGGCTTCGCCGACCTGTATAAAATCAGTAAAGATCCAC
GGGCTGCCATAGCCGTGAAGAAAATCGCGGGGTTGTACCGTCTTGAGAAGAAGATCAGTAGCCGCCCCGTGGAAAAAATC
CGCCAGTGGCGACAGCGTTATGCCCGTCCGATACTGGAAGATCTGTGGTCATGGCTTGAAGAGCAGGAACCGCAATGTTC
TCCGGGAAGTAAGCGTACAGCCTGA

Protein sequence :
MSQKYLIRIAELERLLSEQAEALRQKDQQLSLVEETEAFLRSALTRAEEKIEEDEREIEHLRAQIEKLRRMLFGTRSEKL
RREVELAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHRRPLPAHLPREIHRLEPEESCCPECGGELDYLGEVSAEQLE
LVSSALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAGPGLLARVLTGKYCEHLPLYRQSEIFARQGIELSRALLSNW
VDACCQLMTPLNDTLYRYVMNTRKVHTDDTPVKVLAPGRKKAKTGRIWTYVRDDRNAGSSEPPAVWFAYSPDRQGKHPVQ
HLRPFRGILQADAFSGYDRLFSAEREGGALTEVACWAHARRGFADLYKISKDPRAAIAVKKIAGLYRLEKKISSRPVEKI
RQWRQRYARPILEDLWSWLEEQEPQCSPGSKRTA
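
The DNA and protein sequences above can be checked against each other by translating the coding sequence. The following is a minimal sketch, not part of the original record: it assumes the standard genetic code (whose sense-codon assignments match the bacterial translation table), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence block from the record can be pasted in directly.
    Codons containing unexpected characters are reported as 'X'.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the DNA sequence listed in this record.
print(translate("ATGAGTCAGAAATACCTCATTCGCATCGCA"))  # MSQKYLIRIA
```

Passing the full DNA sequence to `translate` and comparing the result with the listed protein sequence (with line breaks removed) is a quick way to confirm that the two fields of the record agree.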