Re: Storing 20 million randomly accessible documents in compressed form



M.Barren@xxxxxxxxx wrote:
> I will give the BIG zip file a try to see if it compares anywhere near
> 200 MB (because I will have another 30 to 40 Million documents on the
> way). I might be expecting magic from compression technologies but I
> just want to make sure I will save as much space as I can since there's
> a little bit of competition to it as well.

If you're compressing small files then zip/gzip is fine. For larger files
(larger than bzip's block size), bzip is likely to be substantially better.

--
Dr Jon D Harrop, Flying Frog Consultancy
http://www.ffconsultancy.com
.