SSE2 half as fast as it should be?
- From: spamtrap@xxxxxxxxxx
- Date: 20 Apr 2006 19:04:40 -0700
I haven't tried it yet, but from what I've read, simple integer SSE2
instructions on my 3.8 GHz Prescott CPU have a latency of 2 and a throughput
of 2. But the Prescott is supposed to have two simple ALUs that are
double-pumped, which should yield 2 x 2 x 32 bits = 128 bits per clock cycle
of simple integer instructions. So is the SSE2 manual wrong, or is Intel's
architecture holding back (they've got the hardware to be twice as fast)?
Does anybody know of workarounds? I am multiplying bit matrices, which
depends on these simple ALU instructions.
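
To make the workload concrete, below is a minimal sketch in C of what a bit-matrix multiply over GF(2) might look like with SSE2 intrinsics. It is an illustration under stated assumptions, not the actual code behind the question: the 128 x 128 size, the row-per-register layout, and the names row128 and bitmat_mul are all hypothetical. The inner loop uses only simple integer SSE2 operations (AND, XOR), the instruction class whose latency and throughput are being asked about.

```c
#include <emmintrin.h>  /* SSE2 intrinsics */
#include <stdint.h>

#define N 128  /* assumed matrix dimension: one row fills one XMM register */

/* Hypothetical layout: bit j of a row lives in w[j / 64] at position j % 64. */
typedef struct { uint64_t w[2]; } row128;

/* c = a * b over GF(2). Row i of c is the XOR of every row j of b
   for which bit j of row i of a is set. c must not alias a or b. */
void bitmat_mul(const row128 *a, const row128 *b, row128 *c)
{
    for (int i = 0; i < N; i++) {
        __m128i acc = _mm_setzero_si128();
        for (int j = 0; j < N; j++) {
            /* Extract bit j of row i and widen it to an all-ones or all-zeros mask. */
            uint64_t bit = (a[i].w[j / 64] >> (j % 64)) & 1u;
            __m128i mask = _mm_set1_epi32(-(int)bit);
            __m128i row  = _mm_loadu_si128((const __m128i *)&b[j]);
            /* Branchless conditional XOR: pand followed by pxor. */
            acc = _mm_xor_si128(acc, _mm_and_si128(row, mask));
        }
        _mm_storeu_si128((__m128i *)&c[i], acc);
    }
}
```

Each inner iteration issues one pand and one pxor on a full 128-bit register, so the loop's speed is bounded by how fast the core can retire those two instructions.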
Thanks,
AndrewF