| Name : api52 Accession : CAF28526.1
 PAI name :  YAPI
 PAI accession : AJ627388
 Strain : Yersinia pseudotuberculosis IP 31758
 Virulence or Resistance: Not determined
 Product : hsdr-like Type I restriction enzyme
 Function : -
 Note : -
 Homologs in the searched genomes :    184 hits    ( 184 protein-level )
 Publication :
 
-Collyn,F., "Direct Submission", Submitted (03-FEB-2004) Collyn F., E0364, Inserm, 1, rue du Professeur Calmette, Lille, 59021, FRANCE.
 -Collyn,F., Billault,A., Mullet,C., Simonet,M. and Marceau,M., "YAPI, a new Yersinia pseudotuberculosis pathogenicity island", Infect. Immun. 72 (8), 4784-4790 (2004) PUBMED 15271940.
 
 -Collyn,F., Lety,M.A., Nair,S., Escuyer,V., Ben Younes,A., Simonet,M. and Marceau,M., "Yersinia pseudotuberculosis harbors a type IV pilus gene cluster that contributes to pathogenicity", Infect. Immun. 70 (11), 6196-6205 (2002) PUBMED 12379698.
 
 
 
 
      | DNA sequence : |  |  | ATGCTGAGCGAAGACGATTTAGAGCAACTCAGTCTGGGTTGGTTTGCCGGACAAGGCTGGGAAGTGCTACACGGGCCGGA
TATTGCCCCCGATGGCAAGAACCCACTTCGCATTTCTTTCCATGATGTGTTTCTGCGCCCTATTTTTCGAAAGCAATTGG
AGACGCTGAATCCTCATCTTCCGGCCAGCCTGTTTGACGAGATAATAAGTCGAATCACCCGCCCAGAAAGCCCTGATATC
GTCGTCAGTAATAAAGCCTTCCACCATCTGCTGTTAAGCGGCGTTCCGGTTGAATATAAACGCGATGACAAAGTGATTCA
CGATACAGCTCTGTTGATGGATTTTAACCATCCGGAGAATAACCGTTTCACGGTGGTTAACCAGGTAGCTATCAGCGGCA
CTAAACAACTACGCCGCCCTGATGTAATTTGTTATATCAACGGCCTGCCGATAGCCGTAATTGAATTAAAAAGCCCGAGC
GATGTTAATGCCGATATTTGGGCGGCGTTTAATCAGCTTCAAACTTATAAAAATGAAATCAGCGACCTGTTTATTTGTAA
TGAGGCACTGGTGATAAGTGATGGCTATAACGCACGTATTGGTTCCCTCACCGCTGACGAAGAACGTTTTCTACCATGGA
AAACCATCAGCAACGAAGATGATAAACCGTTGCTGGAACACCAACTGGAAAACGTGGTTAACGGCTTCTTTAATCGTGAA
TTACTGCTCGATTACATCCGTTATTTCATCCTGTTCGAAAGTGACGGCAAACGCCTGATCAAAAAAATAGCCGCCTATCA
TCAGTTCCACGCGGTGCGTGAAGCGGTGAATGCAACCATCGTTGCCTCTACAGGTAAGTTTTTACCGCTGCGTAGTAACA
TCAAACCCGGCAGTAAGAAAGCCGGTGTGGTGTGGCATACACAAGGTTCGGGTAAGAGTATTTCAATGTGCTGCTATGCC
GGAAAGCTATTACAGCAGGCAGAGATGAACAACCCAACAATCGTCGTGGTGACTGACCGTAACGATCTGGATGGCCAGCT
CTATACTACGTTTTGCCAGGCAAAAGACCTATTGAAGCAAGAGCCTCAACAAGCCAGCAACCGCGACCAGCTACGTGAAA
TGCTGGCGGCTCGCGAATCCGGCGGCATCATCTTCACCACCGTGCAAAAATTCGCCCCGCTGGATGGCGAGCAAGCCCAT
CCGGCATTGAACCTACGCGACAATATCGTGGTAATTTCTGATGAAGCCCATCGCAGCCAGTACGGATTAAGTGCCACGCT
GGATAAGGATGGGGCCTATAAATACGGCTATGCCAAACATATGCGTGATGCACTGCCGAATGCTTCGTTTATGGGCTTCA
CTGGCACGCCTGTCTCTTCTGAAGATAAAGATACCCGCGCCGTGTTTGGTGATTACGTTTCTATTTACGATATTCAGGAT
GCGGTGGATGACGGCGCAACCGTGCCGATTTATTACGAATCCCGGCTGGCAAAGCTCGATCTGAATCATGAAGAACTAGA
AGCGTTATCCAGCCAGGTTGATGATCTGGTAGAAGATGAAGAGACTGACCAGAAAGAGAAAACTAAAGGTGATTGGAGCC
GTCTAGAAAAGCTGGTCGGTTCCGAACCGCGTATTAAACAGGTAGCGGCCGATCTGGTTAAACATTTTGCAGCCCGCAAC
GCAACGATGAATGGCAAAGCGATGATTGTCGCCATGAGTCGCGATATCTGCGTGCGTCTTTATAATGCGTTAGTTGCGCT
GCGCCCGGAATGGCATAGCGAGGATGTCGAAAAAGGTGAGATAAAAATCATCATGACCGGTTCAGCATCGGATAAAGAAC
ACCTTCAACCGCACATCTACAACAAGCAAACCAAAAAACGGCTCGAAGCACGGTTTAAAGACTTGAATGACCCGCTAAAA
ATCGTGATTGTCCGCGATATGTGGTTAACCGGTTTTGATGCACCCTGTTGCCACACCATGTATATCGATAAACCGATGCG
TGGGCATAACCTGATGCAGGCCATCGCCCGCGTGAACCGCGTGTTTAAAGATAAACCTGGTGGACTGGTGGTGGATTATA
TTGGTATTGCCAATGAGTTAAAGCAAGCGCTGAAAACCTATACCGACTCCAAAGGTAAGGGGCAGACCACCATTGATGCC
AGAGATGCCTTTGCGGTTTTGTTGGAAAAACTCGATGTTATTCATGGTATGTTCGCCAAAACGACTACTGATCCGGGCTT
CGACTATTCAGCGTTTGAAAATAATCCACAGCATGTTTTGCTTGATGCAGCGAACTATATTCTCGGATTGGATGACGGCA
AAAAACGCTATTTCGATGTGGTGCTGGCGCTAAATAAAGCCTGGTCGTTATGCAGCACGCTGGATGAAGCAAAACCGTTG
CAGAAAGAACTCGCCTTCCTGTCAGCGGTGAAAGTGGCAATAATCAAACTCACCACCACGGATAAAAAGTTCAGCCAGTC
CGAGAAGAACTCACTACTGAGCCGTATTTTGGATAACGCCATAGTAGCAACTGGCGTGGATGATGTATTTGCTCTGGCTG
GGTTGGATAAGCCCAATATTGGCCTACTGTCAGACGAATTTCTGGAAGAAGTACGTGAAATGCCGCAACGCAATCTGGCC
GTTGAGTTACTGGAGAAGCTGCTTAACGATGGTATTCATGCCCGTACAAATAATAACGTTGTGCAGGAGAAGAAGTATTC
TGACCGCTTGAGGGCCGTGCTGCTGAGATACAATAACCGCGCGATTGAAACCGCTCAAGTTATTGAAGAGCTGATCCAGA
TGGCGAAAGAGTTTCAGGCAGCCATGGCGCGTGATGATGCGCTCGGTCTGAATCCCGACGAAATTGCTTTCTACGATGCA
CTAGCGGAAAACGAAAGCGCAGTACGGGAATTAGGTGATGAAACCCTCAAAAAACTCGCCATTGAAGTAACGGCTCAGTT
ACGTAAATCTACGACGGTTGACTGGCAAGTGCGCGAAAGTGTGCGTGCACGGTTGCGTATTCTGGTGCGGAAAACGCTAC
TTAAATATAAATATCCGCCAGATAAAGCACTGGATGCGGTTGAGTTAATACTGAAGCAGGCTGAAGTGGTTTCTAATAGC
TGGACCGCTTAA
 
 |  | Protein sequence : |  |  | MLSEDDLEQLSLGWFAGQGWEVLHGPDIAPDGKNPLRISFHDVFLRPIFRKQLETLNPHLPASLFDEIISRITRPESPDI
VVSNKAFHHLLLSGVPVEYKRDDKVIHDTALLMDFNHPENNRFTVVNQVAISGTKQLRRPDVICYINGLPIAVIELKSPS
DVNADIWAAFNQLQTYKNEISDLFICNEALVISDGYNARIGSLTADEERFLPWKTISNEDDKPLLEHQLENVVNGFFNRE
LLLDYIRYFILFESDGKRLIKKIAAYHQFHAVREAVNATIVASTGKFLPLRSNIKPGSKKAGVVWHTQGSGKSISMCCYA
GKLLQQAEMNNPTIVVVTDRNDLDGQLYTTFCQAKDLLKQEPQQASNRDQLREMLAARESGGIIFTTVQKFAPLDGEQAH
PALNLRDNIVVISDEAHRSQYGLSATLDKDGAYKYGYAKHMRDALPNASFMGFTGTPVSSEDKDTRAVFGDYVSIYDIQD
AVDDGATVPIYYESRLAKLDLNHEELEALSSQVDDLVEDEETDQKEKTKGDWSRLEKLVGSEPRIKQVAADLVKHFAARN
ATMNGKAMIVAMSRDICVRLYNALVALRPEWHSEDVEKGEIKIIMTGSASDKEHLQPHIYNKQTKKRLEARFKDLNDPLK
IVIVRDMWLTGFDAPCCHTMYIDKPMRGHNLMQAIARVNRVFKDKPGGLVVDYIGIANELKQALKTYTDSKGKGQTTIDA
RDAFAVLLEKLDVIHGMFAKTTTDPGFDYSAFENNPQHVLLDAANYILGLDDGKKRYFDVVLALNKAWSLCSTLDEAKPL
QKELAFLSAVKVAIIKLTTTDKKFSQSEKNSLLSRILDNAIVATGVDDVFALAGLDKPNIGLLSDEFLEEVREMPQRNLA
VELLEKLLNDGIHARTNNNVVQEKKYSDRLRAVLLRYNNRAIETAQVIEELIQMAKEFQAAMARDDALGLNPDEIAFYDA
LAENESAVRELGDETLKKLAIEVTAQLRKSTTVDWQVRESVRARLRILVRKTLLKYKYPPDKALDAVELILKQAEVVSNS
WTA
 
 |  |