Re: Parallelization on Dell Workstations



Ganesh <ganesh.iitm@xxxxxxxxx> writes:

Ok.. we are not planning to connect these machines together.. ! The
codes are going to be run on individual comps only !

So we're talking only about SMP parallelisation.

However.. I have had trouble understanding the architecture of these
computers. Take for e.g a Dual core machine (one that is commonly
available for desktops these days). Is the memory shared between these
cores ? How about the cache ?

The main memory (RAM) is shared among all cores. The cache sharing
depends on the architecture, Intel's Core2 Duo has per-core L1 caches,
but a shared L2 cache while the AMD dual-cores have separate cache
hierarchies.

Going further..! What happens in these high end workstations ? Take
for instance the Double Dual core, and the double Quad core.. ! The
link has i sent has all the info. However I am not able to comprehend
them fully.

What do you mean by »happens«? In the case of a double quad-core
machine, you would have two processing modules with four cores each; for
most of your work you treat them like a simple 8-way SMP system.

I have an MPI code working. Setting up OpenMP code is not that big a
deal I guess. A combo would involve some effort as well. I am ready to
put in the effort needed. However I need to know which one to put my
effort on ?

Good MPI implementations should realise intra-node communication via
shared memory, so you pay a bit of lost efficiency due to the messaging
overhead. For the moment, I would suggest to install an MPI library and
run your code; you should do profiling runs and determine the
bottlenecks (communication? computation?).


Sebastian
.



Relevant Pages

  • Re: GHZ vs Cores?
    ... Quite a few reason for dual core performs well than a single core at higher ... Higher or more Cache, usually shared between 2 cores so that your ... Applications itself can be made to use dual core technology extensively. ...
    (comp.sys.intel)
  • Re: new Itanium after Tukwila: Poulson
    ... >> Tukwila, perhaps IDF will give some light on it. ... > significant core changes appear in Montvale or Tukwila I'll be at ... The L3 cache helped atleast much in SpecFP, ... > small amounts of ultimate single-thread performance to allow them to ...
    (comp.os.vms)
  • Re: new Itanium after Tukwila: Poulson
    ... > Tukwila, perhaps IDF will give some light on it. ... significant core changes appear in Montvale or Tukwila I'll be at least ... MB of on-chip cache) slightly in SPECint (yes, ... complex due to its support for single-thread performance, ...
    (comp.os.vms)
  • Re: Dell vs. eMachines T6420
    ... I still maintain that Intel "Core Duo" chips are multicore, ... Allowing communication across the L2 cache limits the use ... Intel itself has admitted several things about its "Core Duo" philosophy: ... cannot see or communicate with each other...Intel itself admits it does ...
    (alt.sys.pc-clone.dell)
  • Re: a dozen cpus on a chip
    ... Neither does the core kernel for an operating system. ... They'd surround a shared cache. ... common memory model to its knees would be to set a few small tasks ... One thing I've always thought that CPUs should have is hardware task ...
    (sci.electronics.design)