Re: How to Make F77 Program Faster (g95 compiler) ??



On Nov 13, 7:46 am, Chip Coldwell <coldw...@xxxxxxxxxxxxx> wrote:

The latest (version 10) Intel Fortran compilers do auto-vectorization
of loops by default (where possible). I found that merely by
upgrading from version 9 to version 10 with no other changes my code
sped up by about 14%.

This is true of the 64-bit compiler only as it now assumes SSE2
instructions are available. The 32-bit compiler, at present, does not
do this. There, use the appropriate /Qx switch for your processor
type (/QxW or "higher") in order to enable vectorization.

Steve

.



Relevant Pages

  • Re: Rules for "colon matching" in array operations
    ... < "scalarization pass inside of the compiler". ... Similarly, Andy talks about loops, but not ... Any routine using ENTRY compiles as one function ...
    (comp.lang.fortran)
  • Re: Support for optimization for dual core proc in C++
    ... Does the C++ in VS2005 allow optimization to parallelize and vectorize ... automatically like the product IntelC++ Compiler Version 10.0 does? ... restructures and optimizes loops to ensure that auto-vectorization, ...
    (microsoft.public.vc.language)
  • Re: Why is C# 450% slower than C++ on nested loops ??
    ... A nested loop written the way it is in the benchmark is measuring nothing but a compiler's ability to optimize nested loops that do more or less nothing. ... C# compiler generates such code, and the C++/CLI compiler is able to do so. ... It is reported that 2005 does a much better job of optimization of .NET code, yet, only with the C++, not the C# compiler. ...
    (microsoft.public.dotnet.languages.csharp)
  • Re: File IO
    ... > (Actually, from your description, the implicit DO loops are working ... your problem is when you read array slices instead of implicit DO ... so the compiler might creat a temporary ... that's probably a bug in the I/O design. ...
    (comp.lang.fortran)
  • Poor performance of implied-DO loop with Intel Fortran
    ... Intel fortran compliers. ... I get very poor performance when using implied-DO loops to initialise ... I wasn't able to time v8.0 of the Intel compiler because it segfaulted ... unless I hid them in a subroutine. ...
    (comp.lang.fortran)