Re: Is .zip compression lossless?
From: Michael Wojcik (mwojcik_at_newsguy.com)
Date: 03/15/04
- Next message: j0mbolar: "this problem is annoying"
- Previous message: Michael Wojcik: "Re: How to write something to a html textfield and send it?"
- In reply to: CBFalconer: "Re: Is .zip compression lossless?"
- Next in thread: Arthur J. O'Dwyer: "Re: Is .zip compression lossless?"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Date: 15 Mar 2004 17:28:38 GMT
In article <405505B2.D5A404CA@yahoo.com>, CBFalconer <cbfalconer@yahoo.com> writes:
>
> If your objective is maximal compression for archival storage,
> look into bzip2. I believe it has the highest compression of
> anything available today.
It appears that PPM (Prediction by Partial Match, which AIUI uses
Markov chains to build a statistical model of the uncompressed data)
generally outperforms BWT (Burrows-Wheeler Transform) implementations,
such as bzip2, in terms of total size reduction for arbitrary input
data. And it looks like CTW (Context Tree Weighting) may beat PPM
algorithms.
Additional pre-processing can improve things further for some kinds
of input (eg most large text corpuses). LIPT (Length Index Preserving
Transform), a kind of star encoding, resulted in ~5% improvements for
BWT and PPM in one set of tests, for example.
But bzip2 is convenient, free, widely used, and apparently robust, so
for most people it's a fine solution. The relatively small advantages
that a few other schemes have over it won't matter unless you're
compressing a *lot* of data, and it's data that the other schemes
actually compress better (eg plain text).
-- Michael Wojcik michael.wojcik@microfocus.com The guy who's fast in the mountain pass is the coolest. -- _Initial D: Second Stage_
- Next message: j0mbolar: "this problem is annoying"
- Previous message: Michael Wojcik: "Re: How to write something to a html textfield and send it?"
- In reply to: CBFalconer: "Re: Is .zip compression lossless?"
- Next in thread: Arthur J. O'Dwyer: "Re: Is .zip compression lossless?"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Relevant Pages
|