Name : unnamed
Accession : AAP70298.1
PAI name : HPI
PAI accession : AY233333
Strain : Escherichia coli 042
Virulence or Resistance: Not determined
Product : VC0179-like protein
Function : -
Note : ORF24; similar to Vibrio cholerae hypothetical protein VC0179 deposited in GenBank Accession Number AAF93355
Homologs in the searched genomes : 7 hits ( 7 protein-level )
Publication :
-Schubert,S., Dufke,S., Sorsa,J. and Heesemann,J., "A novel integrative and conjugative element (ICE) of Escherichia coli: the putative progenitor of the Yersinia high-pathogenicity island", Mol. Microbiol. 51 (3), 837-848 (2004) PUBMED 14731283.
-Schubert,S., Dufke,S., Sorsa,J. and Heesemann,J., "Direct Submission", Submitted (11-FEB-2003) Department of Bacteriology, Max von Pettenkofer-Institut, Pettenkoferstr. 9a, Munchen 80336, Germany.
DNA sequence : | |
ATGCCTTGGGATTTTAACAATTACTATAGTCACAATATGGATGGCTTAATCAGTAAGCTCAAATTGAGCAAGACTGAATC
CGATAAACTCAAAGCACTTCGTCAGATCGTACGTGAAAGGACGAGAGATGTATTTCAGGAAGCTCGCCAAGTCGCAATTG
ACGTGAGAAGGCAAGCGCTGACACTTGAAAGTGTCAGATTAAAACTTGAGAAAACAAACGTTCGCTACCTCTCCCCCGAA
GAACGTGCTGATCTAGCGCGACTTATTTTTGAAATGGAAGATGAAGCACGCGATGACTTCATCAAATTCCAGCCTCGTTT
CTGGACTCAAGGAAGTTTTCAGTACGATACGTTAAACAGGCCTTTTCATCCGGGGCAGGAAATGGATATTGATGATGGCA
CCTACATGCCCATGACGGTGTTTGAATCCGAACCGAGCATTGGACACACTCTGCTTCTCCTTCTCGTGGATACATCACTG
AAATCACTAGAAGCTGAAAACGATGGCTGGGTATTTGAAGAAAAGAATACCTGCGGACGCATCAAAATCTATCGGGAGAA
AACACACATTGATGTACCGATGTATGCGATCCCTAAAGAACAATTCCAGAAAAAACAAACAGCAGCAGATTCAGCACACC
TCATAAAGTCAGATTCGGTGTTTGAATCTTTTGCATTGAACCGGGGGGGACGCGAGGCTTATGCCGTTGAGTCCGACAAA
GTGAACCTGGCACTTCGCGAAGGGGTCAGAAGATGGTCAGTCAGCGACCCCAAAATTGTTGAAGACTGGTTCAACGAAAG
CTGTAAACGTATCGGCGGGCATCTGCGTTCAGTTTGCCGGTTTATGAAGGCTTGGCGGGATGCACAATGGGAAGTTGGGG
GCCCTTCATCAATCAGTCTGATGACTGCAGTCGTCAACATCCTCGATAGAGAATCTCATAATGGCTCCGACCTCACCGGG
ACGATGAAACTTATTGCCAGGTTGCTGCCTGAGGAATTCAATCGCGGTGTGGAAAGTCCCGACGATACTGACGAAAAACC
ATTGTTCCCTGCGGAAAGTAACCATAACGTGCACCATAGAGCTATCGTTGAAACTATGGAAGGTCTGTACGGTATTTTAC
TTGCCGCTGAGCAATCAGAAAGTCGGGAAGAAGCGTTACGTAAAATCAACGAAGCATTTGGTAAACGTGTGACTAATGCC
CTATTAATCACGTCAAGTGCTGCAGCTCCGGCATTTCTCAATGCACCATCCAAAGAGCCATCATCTAAACCAATCAACAA
AACGATGGTAAGTGGCTGA
|
Protein sequence : | |
MPWDFNNYYSHNMDGLISKLKLSKTESDKLKALRQIVRERTRDVFQEARQVAIDVRRQALTLESVRLKLEKTNVRYLSPE
ERADLARLIFEMEDEARDDFIKFQPRFWTQGSFQYDTLNRPFHPGQEMDIDDGTYMPMTVFESEPSIGHTLLLLLVDTSL
KSLEAENDGWVFEEKNTCGRIKIYREKTHIDVPMYAIPKEQFQKKQTAADSAHLIKSDSVFESFALNRGGREAYAVESDK
VNLALREGVRRWSVSDPKIVEDWFNESCKRIGGHLRSVCRFMKAWRDAQWEVGGPSSISLMTAVVNILDRESHNGSDLTG
TMKLIARLLPEEFNRGVESPDDTDEKPLFPAESNHNVHHRAIVETMEGLYGILLAAEQSESREEALRKINEAFGKRVTNA
LLITSSAAAPAFLNAPSKEPSSKPINKTMVSG
|
|