Re: Java text compression
- From: Chris <spam_me_not@xxxxxxxxxx>
- Date: Sun, 18 Nov 2007 16:35:19 -0600
> Bonus question for OP: what is the size of the data sets and how are they used? Especially, where are they stored?
Multi-terabyte in total, split across multiple machines. On a single machine, generally not more than a few hundred GB. One or two SATA disks per machine, no RAID.
At compression time, the data is streamed from an external source, transformed in memory, and written to disk.
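To make the write path concrete, here is a minimal sketch, not our actual code. It assumes each block is compressed independently with java.util.zip.Deflater and appended to a single file, with the start offset of every block kept in memory so a reader can seek straight to it later. The class name BlockWriter, the UTF-8 encoding, and the BEST_SPEED level are assumptions made for illustration.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.RandomAccessFile;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.Deflater;

/** Hypothetical writer: compresses each text block on its own and appends it to one file. */
public class BlockWriter implements AutoCloseable {
    private final RandomAccessFile file;
    // offsets.get(i) is where block i starts; the last entry marks the end of the final block.
    private final List<Long> offsets = new ArrayList<>();

    public BlockWriter(String path) throws IOException {
        file = new RandomAccessFile(path, "rw");
        file.setLength(0);   // start from an empty file
        offsets.add(0L);
    }

    /** Compresses one block in memory, appends it to the file, and returns its block number. */
    public int append(String text) throws IOException {
        byte[] compressed = deflate(text.getBytes(StandardCharsets.UTF_8));
        file.seek(file.length());
        file.write(compressed);
        offsets.add(file.length());
        return offsets.size() - 2;
    }

    public List<Long> offsets() {
        return offsets;
    }

    /** Deflates a byte array fully in memory. */
    static byte[] deflate(byte[] input) {
        Deflater deflater = new Deflater(Deflater.BEST_SPEED); // assumed level, favors throughput
        try {
            deflater.setInput(input);
            deflater.finish();
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            byte[] buf = new byte[8192];
            while (!deflater.finished()) {
                int n = deflater.deflate(buf);
                out.write(buf, 0, n);
            }
            return out.toByteArray();
        } finally {
            deflater.end(); // release native zlib memory
        }
    }

    @Override
    public void close() throws IOException {
        file.close();
    }
}
```

Compressing blocks independently costs some ratio compared with one long stream, but it is what allows a single block to be read back without touching its neighbors.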
At decompression time, the app seeks to the particular block of text of interest and decompresses it. Seek time dominates decompression time, *except* when we do heavy caching, in which case decompression becomes the bottleneck. Storing the decompressed text in memory takes up too much space, so it has to be cached in compressed form.
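A rough sketch of the caching side, under stated assumptions: an LRU map holds the blocks in compressed form, and every read inflates the block again, which is why decompression becomes the bottleneck once the disk seeks are cached away. The class name CompressedBlockCache and the diskLoader callback (standing in for the seek-and-read step) are hypothetical, and java.util.zip.Inflater is just one possible codec.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.IntFunction;
import java.util.zip.DataFormatException;
import java.util.zip.Inflater;

/** Hypothetical LRU cache that keeps blocks compressed and inflates them on every read. */
public class CompressedBlockCache {
    private final IntFunction<byte[]> diskLoader; // assumed: seeks to the block and returns its raw bytes
    private final LinkedHashMap<Integer, byte[]> cache;

    public CompressedBlockCache(final int capacity, IntFunction<byte[]> diskLoader) {
        this.diskLoader = diskLoader;
        // accessOrder = true makes iteration order least-recently-used first.
        this.cache = new LinkedHashMap<Integer, byte[]>(16, 0.75f, true) {
            @Override
            protected boolean removeEldestEntry(Map.Entry<Integer, byte[]> eldest) {
                return size() > capacity;
            }
        };
    }

    /** Returns the text of a block, paying the disk seek only on a cache miss. */
    public synchronized String getText(int blockId) throws DataFormatException {
        byte[] compressed = cache.get(blockId);
        if (compressed == null) {
            compressed = diskLoader.apply(blockId);
            cache.put(blockId, compressed);
        }
        return inflate(compressed); // CPU cost is paid on every call, hit or miss
    }

    /** Inflates one independently compressed block back to text. */
    static String inflate(byte[] compressed) throws DataFormatException {
        Inflater inflater = new Inflater();
        try {
            inflater.setInput(compressed);
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            byte[] buf = new byte[8192];
            while (!inflater.finished()) {
                int n = inflater.inflate(buf);
                if (n == 0 && !inflater.finished()
                        && (inflater.needsInput() || inflater.needsDictionary())) {
                    throw new DataFormatException("truncated or unsupported block");
                }
                out.write(buf, 0, n);
            }
            return new String(out.toByteArray(), StandardCharsets.UTF_8);
        } finally {
            inflater.end(); // release native zlib memory
        }
    }
}
```

The trade-off is the one described above: the cache holds more blocks per megabyte of heap, but a hit still costs a full inflate of the block.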