The concept of vectorizing code

I was wondering: is there a tutorial, or a simple outline of the rationale, available somewhere that describes how vectorizing code works?

I understand that it is basically about structuring your code so that the CPU can process multiple instructions simultaneously, and I've seen some examples in the archives, but I would really like to learn how it works and how to apply it properly.
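
To make the question concrete, here is a minimal sketch of the idea, assuming C and a compiler that can auto-vectorize simple loops (the post names no language, so C is an assumption, and the function names `running_sum` and `scale_add` are hypothetical). Strictly speaking, the hardware usually applies one instruction to several data elements at once (SIMD) rather than running several instructions at once. The first loop carries a dependency from one iteration to the next; the second has independent iterations, which is the shape a compiler can typically turn into SIMD code.

```c
#include <stddef.h>

/* Hard to vectorize as written: each iteration needs the result of the
   previous one (acc), so elements cannot simply be processed in groups. */
void running_sum(const float *x, float *out, size_t n) {
    float acc = 0.0f;
    for (size_t i = 0; i < n; i++) {
        acc += x[i];
        out[i] = acc;
    }
}

/* Easy to vectorize: every iteration is independent of the others, and
   `restrict` promises the compiler that the arrays do not overlap, so it
   may compute several out[i] values with a single SIMD instruction. */
void scale_add(const float *restrict x, const float *restrict y,
               float *restrict out, float a, size_t n) {
    for (size_t i = 0; i < n; i++) {
        out[i] = a * x[i] + y[i];
    }
}
```

Whether the second loop is actually vectorized depends on the compiler and its optimization flags; the point of the sketch is only the difference in loop structure.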

Thanks
Bram