Name : intP4
Accession : CAE85151.1
PAI name : PAI V 536
PAI accession : AJ617685
Strain : Escherichia coli 536
Virulence or Resistance: Not determined
Product : CP4 integrase protein
Function : -
Note : ORF1
Homologs in the searched genomes : 507 hits (506 protein-level, 1 DNA-level)
Publication :
-Dobrindt,U., "Direct Submission", Submitted (11-NOV-2003) Dobrindt U., Inst. f. Molekulare Infektionsbiologie, Universitaet Wuerzburg, Roentgenring 11, 97070 Wuerzburg, GERMANY.
-Schneider,G., Dobrindt,U., Bruggemann,H., Nagy,G., Janke,B., Blum-Oehler,G., Buchrieser,C., Gottschalk,G., Emody,L. and Hacker,J., "The pathogenicity island-associated K15 capsule determinant exhibits a novel genetic structure and correlates with virulence in uropathogenic Escherichia coli strain 536", Infect. Immun. 72 (10), 5993-6001 (2004) PUBMED 15385503.
DNA sequence :
ATGGCACTGACTGACGCAAAAATCCGGGCAGCAAAGCCTACTGACAAGGCTTATAAACTCACTGACGGGGCTGGCATGTT
CCTGCTGGTACATCCTAATGGTTCCCGTTACTGGCGTCTCCGTTATCGTATTCTGGGTAAGGAGAAGACTCTGGCACTTG
GTGTGTATCCAGAAGTTTCTCTCTCCGAAGCTCGTACAAAACGGGATGAGGCCCGAAAACTGATTTCAGAGGGGATTGAC
CCTTGCGAACAGAAAAGAGTTAAAAAAGTAGTCCCTGATTTACAGCTCTCTTTTGAACATATTGCACGACGCTGGCATGC
CAGTAATAAACAATGGGCACAATCACACAGCGATAAAGTACTCAAAAGCCTCGAGACACACGTTTTCCCCTTTATCGGCA
ACCGGGATATCACAACTCTCAATACCCCGGACCTGCTTATCCCTGTTCGTGCTGCAGAAGCAAAACAAATTTATGAAATC
GCTAGTCGTCTGCAGCAAAGAATATCTGCTGTAATGCGTTACGCCGTACAGTCTGGCATCATCAGATATAACCCGGCTCT
GGATATGGCTGGTGCATTGACCACGGTAAAACGCCAGCATCGCCCCGCCCTGAATCTTTCACGCCTGCCTGAACTTCTGT
CGCGTATTGACGGTTATAAAGGCCAGCCTGTCACCCGGCTTGCCGTTATGCTGAATTTACTGGTTTTTATTCGTTCCAGT
GAACTCAGATATGCCCGCTGGTCTGAAATTGATATTGACAATGCCATGTGGACTATTCCAGCCGAACGCGAACCTCTGCC
TGGCGTAAAATTCTCACACCGGGGCTCCAGGATGCGGACACCACATCTTGTGCCACTCAGTAAACAGGCTGTAGCCATAC
TGACAGAACTTCAGACATGGGCAGGTGAAAATGGTCTGATATTTACGGGTGCACATGACCCGCGTAAACCAATCAGTGAA
AATACTGTAAATAAAGCCCTGAGGGTGATGGGGTATGACACAACCCAGGAAGTCTGTGGCCATGGATTCCGGGCGATGGC
GTGCAGTGCATTGATTGAATCAGGTTTGTGGTCCCGCGATGCTGTGGAACGTCAGATGAGCCATCAGGAACGTAACGGTG
TACGTGCTGCTTACATTCATAAAGCAGAACATCTGGAAGAACGCCGCTTGATGCTACAATGGTGGGCCGATTTTCTGGAT
GCAAACAGAGAAAGATTTATCAGTCCATTTGAATATGCAAAGATTAATAATCCATTAAAACTGTAA
Protein sequence :
MALTDAKIRAAKPTDKAYKLTDGAGMFLLVHPNGSRYWRLRYRILGKEKTLALGVYPEVSLSEARTKRDEARKLISEGID
PCEQKRVKKVVPDLQLSFEHIARRWHASNKQWAQSHSDKVLKSLETHVFPFIGNRDITTLNTPDLLIPVRAAEAKQIYEI
ASRLQQRISAVMRYAVQSGIIRYNPALDMAGALTTVKRQHRPALNLSRLPELLSRIDGYKGQPVTRLAVMLNLLVFIRSS
ELRYARWSEIDIDNAMWTIPAEREPLPGVKFSHRGSRMRTPHLVPLSKQAVAILTELQTWAGENGLIFTGAHDPRKPISE
NTVNKALRVMGYDTTQEVCGHGFRAMACSALIESGLWSRDAVERQMSHQERNGVRAAYIHKAEHLEERRLMLQWWADFLD
ANRERFISPFEYAKINNPLKL
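The DNA and protein sequences in this record can be checked against each other by translating the DNA in frame 1. The following is a minimal sketch, assuming the standard genetic code and a reading frame that starts at the first base; the names `CODON_TABLE` and `translate` are illustrative and not part of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the record's protein was derived with the standard codon assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1 and stop at the first stop codon.

    Whitespace and line breaks are ignored, so the sequence block can be
    pasted as it appears in the record. Unknown codons become 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the DNA sequence above.
print(translate("ATGGCACTGACTGACGCA"))  # prints MALTDA
```

Passing the full DNA block to `translate` and comparing the result with the protein block (line breaks removed) is one way to confirm that the two sequences correspond.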