Re: MMX speedup for Floyd Steinberg error diffusion
- From: rep_movsd <spamtrap@xxxxxxxxxx>
- Date: Thu, 8 May 2008 07:23:41 -0700 (PDT)
I figured it out...
I do all my processing on 4 signed words per MMX register, and at the
end I use packuswb, which packs the signed words into unsigned bytes
with saturation.
The code behaves exactly like my C++ version did, and I get a speedup of
more than 2x! :)
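
For anyone following along, here is a rough scalar sketch in C++ of what that final pack step does per lane. It is only a model of the saturation behaviour, not the actual MMX code from this thread: the names saturate_u8 and pack_words are made up for illustration, and it handles just one group of four words, whereas the real packuswb packs eight bytes from two operands. Keeping the intermediate values as signed words matters because the diffused error can push a pixel below 0 or above 255 before the clamp.

```cpp
#include <array>
#include <cstdint>

// Scalar model of one packuswb lane: clamp a signed 16-bit value
// to the unsigned byte range 0..255 (unsigned saturation).
constexpr std::uint8_t saturate_u8(std::int16_t v) {
    return static_cast<std::uint8_t>(v < 0 ? 0 : (v > 255 ? 255 : v));
}

// Hypothetical helper: pack four signed words (one MMX register's worth
// of working pixels) into four unsigned bytes, saturating each lane.
inline std::array<std::uint8_t, 4>
pack_words(const std::array<std::int16_t, 4>& w) {
    return {{saturate_u8(w[0]), saturate_u8(w[1]),
             saturate_u8(w[2]), saturate_u8(w[3])}};
}
```

With this model, a working value of 300 comes out as 255 and a value of -20 comes out as 0, while in-range values pass through unchanged. In compiler intrinsics the corresponding MMX operation is _mm_packs_pu16 from <mmintrin.h>.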
Excellent!
Thanks all