PAI Gene Information


Name : int
Accession : CAA21384.1
PAI name : HPI
PAI accession : AL031866_P1
Strain : Yersinia pestis A1122
Virulence or Resistance: Not determined
Product : -
Function : -
Note : label:int; Integrase gene, len=420 aa, highly similar to IntB E. coli prophage P4 integrase (396 aa), 54.1% identity in 390 aa overlap, Fasta scores: opt: 1482, E(): 0
Homologs in the searched genomes :   330 hits    ( 323 protein-level,   7 DNA-level )  
Publication :
    -Buchrieser,C., Rusniok,C., Couve,E., Frangeul,L., Billault,A., Kunst,F., Carniel,E. and Glaser,P., "DNA sequence of the 102 kbases unstable region of Yersinia pestis", Unpublished.

    -Glaser,P., Rusniok,C., Buchrieser,C., Couve,E., Frangeul,L., Billault,A., Kunst,F. and Carniel,E., "Direct Submission", Submitted (09-OCT-1998) P. Glaser, Institut Pasteur, Genomique des Microorganismes Pathogenes, 25 rue du Docteur Roux, 75724 Paris Cedex 15, FRANCE. E-mail: pglaser@pasteur.fr Phone: +33 1 45 68 89 96, Fax: +33 (0)1 45 68 87 46.


DNA sequence :
ATGTCCCTTACCGACGCAAAAATCCGCACCCTCAAGCCTTCTGATAAACCCTTTAAAGTCTCCGATTCTCACGGTCTGTA
TCTGCTGGTCAAGCCGGGTGGCTCCCGCCACTGGTATCTCAAATACCGTATTAGCGGTAAAGAATCCCGCATTGCGCTGG
GTGCCTATCCAGCCATCTCCCTGTCTGATGCGCGACAGCAACGTGAAGGTATCCGTAAAATGCTGGCGCTGAATATCAAC
CCGGTACAGCAGCGGGCTGCTGAACGTGGCTCACGAACACCGGAGAAAGTTTTTAAAAACGTGGCGCTGGCGTGGCATAA
AAGTAACAGGAAATGGTCGCAGAACACCGCCGACCGTCTGCTTGCCAGCCTGAACAATCACATCTTTCCGGTCATCGGGA
ACCTACCTGTATCAGAACTTAAACCCCGTCATTTCATTGACCTGCTGAAAGGGATCGAGGAAAAAGGTCTGCTGGAGGTT
GCGTCCCGCACACGGCAGCACCTGAGTAACATAATGCGCCATGCGGTCCATCAGGAGTTAATCGATACGAACCCTGCAGC
AAACCTTGGCGGCGTGACCACACCTCCTGTCAGACGGCACTATCCTGCCCTGCCGCTGGAGCGGCTGCCTGAACTGCTTG
AACGTATTGGGGCATATCATCAGGGCCGTGAACTGACCCGGCATGCCGTTCTGCTGATGCTGCATGTGTTCATTCGCTCC
AGTGAACTGCGTTTCGCCCGCTGGTCAGAGATTGATTTCACAAACCGAGTCTGGACGATACCCGCGACGCGAGAACCCAT
TATTGGCGTGCGTTATTCCGGCCGCGGGGCAAAAATGCGAATGCCGCATATCGTCCCCCTCTCAGAACAGTCCATCGCCA
TTCTGAAACAGATTAAGGATATCACCGGTAATAATGAACTGATCTTCCCCGGCGACCATAACCCGTATAAGCCAATGTGT
GAAAACACGGTCAATAAGGCACTGCGGGTGATGGGTTACGACACGAAAAAGGATATCTGCGGTCACGGCTTCCGGGCAAT
GGCATGCAGTGCGCTGATGGAATCGGGTTTATGGGCAAAGGACGCAGTAGAACGCCAGATGAGTCATCAGGAGCACAATA
CCGTGCGCATGGCTTATATTCATAAGGCAGAGCACCTAGAAGCCCGCAAAGCGATGATGCAGTGGTGGTCGGATTATCTG
GAAGCATGCCGAGAATCTTATGCACCGCCTTATACAATTGGTAAAAATAAGTTTATCCCATAG

Protein sequence :
MSLTDAKIRTLKPSDKPFKVSDSHGLYLLVKPGGSRHWYLKYRISGKESRIALGAYPAISLSDARQQREGIRKMLALNIN
PVQQRAAERGSRTPEKVFKNVALAWHKSNRKWSQNTADRLLASLNNHIFPVIGNLPVSELKPRHFIDLLKGIEEKGLLEV
ASRTRQHLSNIMRHAVHQELIDTNPAANLGGVTTPPVRRHYPALPLERLPELLERIGAYHQGRELTRHAVLLMLHVFIRS
SELRFARWSEIDFTNRVWTIPATREPIIGVRYSGRGAKMRMPHIVPLSEQSIAILKQIKDITGNNELIFPGDHNPYKPMC
ENTVNKALRVMGYDTKKDICGHGFRAMACSALMESGLWAKDAVERQMSHQEHNTVRMAYIHKAEHLEARKAMMQWWSDYL
EACRESYAPPYTIGKNKFIP