PAI Gene Information


Name : tnp
Accession : AEA34664.1
PAI name : Not named
PAI accession : HQ018801
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : IS10 transposase
Function : -
Note : similar to NP_052934.1 transposase in Plasmid R100 and NP_863365.1 hypothetical protein R64_p010 in Salmonella typhimurium
Homologs in the searched genomes :   263 hits    ( 248 protein-level,   15 DNA-level )  
Publication :
    -Ziebell,K., Johnson,R.P., Kropinski,A.M., Ahmed,R., Gannon,V., Gilmour,M. and Boerlin,P., "Direct Submission", Submitted (04-AUG-2010) Laboratory for Foodborne Zoonoses, Public Health Agency of Canada, 110 Stone Road West, Guelph, Ontario N1G 3W4, Canada.

    -Ziebell,K., Johnson,R.P., Kropinski,A.M., Reid-Smith,R., Ahmed,R., Gannon,V.P., Gilmour,M. and Boerlin,P., "Gene Cluster Conferring Streptomycin, Sulfonamide, and Tetracycline Resistance in Escherichia coli O157:H7 Phage Types 23, 45, and 67", Appl. Environ. Microbiol. 77 (5), 1900-1903 (2011) PUBMED 21239555.


DNA sequence :
ATGTGCGAACTCGATATTTTACACGACTCTCTTTACCAATTCTGCCCCGAATTACACTTAAAACGACTCAACAGCTTAAC
GTTGGCTTGCCACGCATTACTTGACTGTAAAACTCTCACTCTTACCGAACTTGGCCGTAACCTGCCAACCAAAGCGAGAA
CAAAACATAACATCAAACGAATCGACCGATTGTTAGGTAATCGTCACCTCCACAAAGAGCGACTCGCTGTATACCGTTGG
CATGCTAGCTTTATCTGTTCGGGCAATACGATGCCCATTGTACTTGTTGACTGGTCTGATATTAGTGAGCAAAAACGACT
TATGGTATTGCGAGCTTCAGTCGCACTACACGGTCGTTCTGTTACTCTTTATGAGAAAGCGTTCCCGCTTTCAGAGCAAT
GTTCAAAGAAAGCTCATGACCAATTTCTAGCCGACCTTGCGAGCATTCTACCGAGTAACACCACACCGCTCATTGTCAGT
GATGCTGGCTTTAAAGTGCCATGGTATAAATCCGTTGAGAAGCTGGGTTGGTACTGGTTAAGTCGAGTAAGAGGAAAAGT
ACAATATGCAGACCTAGGAGCGGAAAACTGGAAACCTATCAGCAACTTACATGATATGTCATCTAGTCACTCAAAGACTT
TAGGCTATAAGAGGCTGACTAAAAGCAATCCAATCTCATGCCAAATTCTATTGTATAAATCTCGCTCTAAAGGCCGAAAA
AATCAGCGCTCGACACGGACTCATTGTCACCACCCGTCACCTAAAATCTACTCAGCGTCGGCAAAGGAGCCATGGGTTCT
AGCAACTAACTTACCTGTTGAAATTCGAACACCCAAACAACTTGTTAATATCTATTCGAAGCGAATGCAGATTGAAGAAA
CCTTCCGAGACTTGAAAAGTCCTGCCTACGGACTAGGCCTACGCCATAGCCGAACGAGCAGCTCAGAGCGTTTTGATATC
ATGCTGCTAATCGCCCTGATGCTTCAACTAACATGTTGGCTTGCGGGCGTTCATGCTCAGAAACAAGGTTGGGACAAGCA
CTTCCAGGCTAACACAGTCAGAAATCGAAACGTACTCTCAACAGTTCGCTTAGGCATGGAAGTTTTGCGGCATTCTGGCT
ACACAATAACAAGGGAAGACTTACTCGTGGCTGCAACCCTACTAGCTCAAAATTTATTCACACATGGTTACGCTTTGGGG
AAATTATGA

Protein sequence :
MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRW
HASFICSGNTMPIVLVDWSDISEQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVS
DAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRK
NQRSTRTHCHHPSPKIYSASAKEPWVLATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDI
MLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITREDLLVAATLLAQNLFTHGYALG
KL