Re: 8088 LZSS+RLE decompression: Can this be improved?
- From: Terje Mathisen <spamtrap@xxxxxxxxxx>
- Date: Thu, 14 Sep 2006 14:43:16 +0200
Samuel Stearley wrote:
The original problem laid out is for it to beat memcpy(). The only way
to do this is with fast RLE
Well, duh! :-)
Of course RLE is needed to beat a single REP MOVS opcode, the problem is this: Given that there are enough runs (which ends up as REP STOS) to beat the naive version, what is the fastest way to implement it?
The answer is quite obviously a 'compiled sprite', i.e. a set of asm instructions to directly output the wanted result. This approach touches the minimum possible number of bytes, while avoiding all tests/jumps/branches.
Terje
--
- <Terje.Mathisen@xxxxxxxxxxxxx>
"almost all programming can be viewed as an exercise in caching"
.
- References:
- 8088 LZSS+RLE decompression: Can this be improved?
- From: Jim Leonard
- Re: 8088 LZSS+RLE decompression: Can this be improved?
- From: Terje Mathisen
- Re: 8088 LZSS+RLE decompression: Can this be improved?
- From: Jim Leonard
- Re: 8088 LZSS+RLE decompression: Can this be improved?
- From: Samuel Stearley
- 8088 LZSS+RLE decompression: Can this be improved?
- Prev by Date: Re: Hex to ascii
- Next by Date: Re: Hex to ascii
- Previous by thread: Re: 8088 LZSS+RLE decompression: Can this be improved?
- Next by thread: Re: 8088 LZSS+RLE decompression: Can this be improved?
- Index(es):