Re: Random data cannot be compressed
- From: Ron Garret <rNOSPAMon@xxxxxxxxxxx>
- Date: Mon, 10 Mar 2008 23:08:32 -0700
In article <13tbpulfs7tbvb7@xxxxxxxxxxxxxxxxxx>,
tim <TimJ@xxxxxxxxxxxx> wrote:
On Mon, 10 Mar 2008 10:27:07 -0800, Ron Garret wrote:
In article <13t9v8vlcivje0d@xxxxxxxxxxxxxxxxxx>,
tim <TimJ@xxxxxxxxxxxx> wrote:
Below is an algorithm that will compress *25%* of all randomly generated
bit strings of length > 1.
I think I have an algorithm that can get close to 50% but I haven't
written it out yet.
Of course 25% is a considerable improvement on the figure of
0.00000....00001 quoted above. Such strings are likely to appear well
before the heat death of the universe.
00XXX... -> 0XXX...
01XXX... -> 101XXX...
1XXXX... -> 11XXXX...
Tested code follows:
No. You are cheating in two ways.
I did not claim to compress random strings on average. In fact I clearly
stated in the *same* posting that is impossible. At best you will break
even.
Nor is this really cheating. The algorithm is fixed in size and can
compressive any of the 25% of random strings that meet its criteria.
It is cheating. You are cherry-picking which strings "count". The
result is that the data you are successfully compressing is no longer
random.
My point is that algorithms exist that, when given a completely random
string, can produce a shorter string in a large percentage of cases. This
was supposed to be impossible.
No, that is not what is impossible. What is impossible is producing an
algorithm so that the amount of compression you get when you apply the
algorithm to random data is more than the length of the algorithm. You
can choose how much random data you want to apply it to, but having
chosen the amount you do not get to cherry-pick the data you are
expected to compress. If you do then it's no longer random.
The only caveat is that it is possible to get lucky and happen to get a
random string that hits the sweet spot for your algorithm, and be able
to compress *that* random string by more than the length of your
algorithm. But the probability of that happening is vanishingly small.
rg
.
- References:
- Re: Random data cannot be compressed
- From: tim
- Re: Random data cannot be compressed
- From: Ron Garret
- Re: Random data cannot be compressed
- From: tim
- Re: Random data cannot be compressed
- Prev by Date: Re: do we need to 'nationalise' free open source?
- Next by Date: Re: Lisper's first look at Haskell
- Previous by thread: Re: Random data cannot be compressed
- Next by thread: Re: Random data cannot be compressed
- Index(es):
Relevant Pages
|