Re: Using SSE 128 bit movs From One Memory Location To Another
- From: hopcode <hopcode@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx>
- Date: Fri, 18 Feb 2011 04:42:00 +0100
Il 17.02.2011 22:28, KA ha scritto:
Sorry, yes, you are right. I was doing some incorrect math in my code which I have now solved.
I am trying to beat a rep movsb on large objects by using SSE.
To measure this, I've been using GCC C++ where I use a for loop containing my inline assembly and iterate a billion times
some hints about the subject of this thread:
1) *avoiding* C and HLLs for speed-aimed-routines.
/solution/ create pure *ASM* object files to be linked
2) *avoiding* C and HL Languages to test speed-aimed-routines.
they insert too much overbloating code and neuro-handlers.
/solution/ test speed-aimed-routines *pure as they are*, possibly
without CRT/libc/SEH stub.
3) *avoiding* reinventing the wheel for such operations
(movs) on big bulks of datas.
/solution/ reading this could help something
Chapter 7.2, 7.3 could be a starting point.
x64 Assembly Lab
- Prev by Date: Re: Using SSE 128 bit movs From One Memory Location To Another
- Next by Date: Re: Yet another handmade POPCNT
- Previous by thread: Re: Using SSE 128 bit movs From One Memory Location To Another
- Next by thread: Re: Using SSE 128 bit movs From One Memory Location To Another