Parallel programming on multi-core computers?



I would like to explore parallel programming on the
Intel multi-core architecture (Core 2 Duo, quad-core
Q6600, etc.) on Linux and maybe Windows.

I have a lot of parallel programming experience
from my grad school days with MPI, Java threads, and
Pthreads, but I was wondering what the state-of-the-art
parallel programming library is for these Intel architectures.

I've heard about Intel Threading Building Blocks (TBB)
but only in passing.
