Re: gfortran, g95, and dual-core
- From: Charles Russell <NOSPAM@xxxxxxxxxxxxx>
- Date: Sun, 30 Sep 2007 11:43:41 -0600
Janne Blomqvist wrote:
On 2007-09-30, Charles Russell <NOSPAM@xxxxxxxxxxxxx> wrote:In my real applications the bottleneck is generally the repeated evaluation of expensive functions. Since I keep changing the models, this must be done in a generic way, without customizing for a particular structure of the model. To minimize the number of function evaluations I routinely use the PORT version of NL2SOL for optimization. For Monte Carlo (actually, Metropolis algorithm) there is not much I can do other than to scale the problem as best as I can. Convergence to a static distribution is fastest with about 80% rejections, so I suppose I could get a fivefold gain by parallel evaluation before the rejection step.Richard Maine wrote:
For my present hardware, using a benchmark for "flops" that simply multiplies large dense matrices, I have:So computing time is not simply a matter of counting "flops"Nope. Hasn't been for decades.
2/2003 desktop 100 Mflop/2.4GHz = 0.04 flop/Hz g77 with -fbounds-check
4/2006 laptop 40 Mflop/1.7GHz = 0.02 flop/Hz g77 with -bounds-check
Timing was not very accurate, done by stopwatch for portability of the benchmark program, and these were short runs. The laptop, though newer, is less efficient, presumably due to the factors you mention. It was a low-end machine.
If the benchmark results above are representative of your real
application(s), you ought to be able to improve performance by more
than 1 order of magnitude without any new hardware. Dense matrix
multiplication of large matrices gets very close to the peak
performance of the processor, when using an optimized library, largely
due to better utilization of the cpu cache compared to a naive
implementation of the matrix multiplication algorithm. That is, your
2003 desktop should be able to do something close to 4 Gflops on
matmul.
.
- Follow-Ups:
- Re: gfortran, g95, and dual-core
- From: Charles Russell
- Re: gfortran, g95, and dual-core
- References:
- gfortran, g95, and dual-core
- From: Charles Russell
- Re: gfortran, g95, and dual-core
- From: Richard Maine
- Re: gfortran, g95, and dual-core
- From: Charles Russell
- Re: gfortran, g95, and dual-core
- From: Richard Maine
- Re: gfortran, g95, and dual-core
- From: Charles Russell
- Re: gfortran, g95, and dual-core
- From: Janne Blomqvist
- gfortran, g95, and dual-core
- Prev by Date: Re: gfortran, g95, and dual-core
- Next by Date: Re: gfortran, g95, and dual-core
- Previous by thread: Re: gfortran, g95, and dual-core
- Next by thread: Re: gfortran, g95, and dual-core
- Index(es):
Relevant Pages
|