Re: Vectorizing HIRLAM 4: complicated access patterns examined.

2005-11-01 Thread Tobias . Schlueter
Quoting Jakub Jelinek <[EMAIL PROTECTED]>: > On Tue, Nov 01, 2005 at 02:01:43PM +0100, > [EMAIL PROTECTED] wrote: > > [ Bringing this back to fortran@, taking the optimizer guys out of CC: ] > > > > Quoting Toon Moene: > > > I still have to construct a bug report of something that confuses the > pa

Re: Vectorizing HIRLAM 4: complicated access patterns examined.

2005-11-01 Thread Jakub Jelinek
On Tue, Nov 01, 2005 at 02:01:43PM +0100, [EMAIL PROTECTED] wrote: > [ Bringing this back to fortran@, taking the optimizer guys out of CC: ] > > Quoting Toon Moene: > > I still have to construct a bug report of something that confuses the parser > > and that basically looks like this: > > > >

Re: Vectorizing HIRLAM 4: complicated access patterns examined.

2005-11-01 Thread Tobias . Schlueter
[ Bringing this back to fortran@, taking the optimizer guys out of CC: ] Quoting Toon Moene: > I still have to construct a bug report of something that confuses the parser > and that basically looks like this: > > IMPLICIT CHARACTER*8 (Y) > CHARACTER*11 Y1, Y2, Y3 > ... > YA =

Re: Vectorizing HIRLAM 4: complicated access patterns examined.

2005-11-01 Thread Toon Moene
> I just read your contribution to the 2005 gcc summit about gfortran > and HIRLAM. The two PRs(18283 and 21034) you wrote about are now > fixed. LOC is now available. That just leaves some of the extra > functionality of FLUSH(IOSTAT?), does it not? Would it compile > completely if I were to add

Re: Vectorizing HIRLAM 4: complicated access patterns examined.

2005-11-01 Thread Paul Thomas
Toon, I just read your contribution to the 2005 gcc summit about gfortran and HIRLAM. The two PRs(18283 and 21034) you wrote about are now fixed. LOC is now available. That just leaves some of the extra functionality of FLUSH(IOSTAT?), does it not? Would it compile completely if I were to

Re: Vectorizing HIRLAM 4: complicated access patterns examined.

2005-10-27 Thread Toon Moene
I wrote: I'm now going to try to compile HIRLAM with 32 bit pointers. And ... here are the results: With 64-bit pointers: 2685 out of 9799 loops vectorized. With 32-bit pointers: 3269 out of 8854 loops vectorized. Why there are more loops in the 64-bit case is beyond my understanding -

Re: Vectorizing HIRLAM 4: complicated access patterns examined.

2005-10-27 Thread Toon Moene
Dorit wrote: It looks like maybe a 64bit scalar-evolution issue - when I compile on powerpc-linux with -m64, I also get the "vect4.f:4: note: not consecutive access" message. This problem looks very similar to PR18403 which has been resolved a while

Re: Vectorizing HIRLAM 4: complicated access patterns examined.

2005-10-23 Thread Paul Brook
> When compiling for 64bit, there is an extra cast: > In the 64bit case however, the vectorizer dumps show that the > access-function returned for the index to array b is much more > compilcated > - the dataref analyzer doesn't seem to be able to extract the > evoluti

Re: Vectorizing HIRLAM 4: complicated access patterns examined.

2005-10-23 Thread Toon Moene
Dorit wrote: It looks like maybe a 64bit scalar-evolution issue - when I compile on powerpc-linux with -m64, I also get the "vect4.f:4: note: not consecutive access" message. This problem looks very similar to PR18403 which has been resolved a while

Re: Vectorizing HIRLAM 4: complicated access patterns examined.

2005-10-23 Thread Dorit Naishlos
It looks like maybe a 64bit scalar-evolution issue - when I compile on powerpc-linux with -m64, I also get the "vect4.f:4: note: not consecutive access" message. This problem looks very similar to PR18403 which has been resolved a while ago: When compiling for 32bit, we get the following repre

Re: Vectorizing HIRLAM 4: complicated access patterns examined.

2005-10-22 Thread Toon Moene
This one gets vectorized for me, on powerpc-linux: ~/mainline_cvs/bin/gfortran -O3 -ftree-vectorize -maltivec -ftree-vectorizer-verbose=4 -S hilaram4.f90 hilaram4.f90:4: note: Alignment of access forced using peeling. hilaram4.f90:4: note: Vectorizing an un

Re: Vectorizing HIRLAM 4: complicated access patterns examined.

2005-10-21 Thread Dorit Naishlos
This one gets vectorized for me, on powerpc-linux: ~/mainline_cvs/bin/gfortran -O3 -ftree-vectorize -maltivec -ftree-vectorizer-verbose=4 -S hilaram4.f90 hilaram4.f90:4: note: Alignment of access forced using peeling. hilaram4.f90:4: note: Vectorizing an unaligned access. hilaram4.f90:4: note

Vectorizing HIRLAM 4: complicated access patterns examined.

2005-10-21 Thread Toon Moene
L.S., This code: SUBROUTINE S(N) DIMENSION A(N), B(N) READ*,ISTART,ISTOP,B DO I = ISTART, ISTOP A(I) = B(I) ENDDO PRINT*,A END when compiled thusly: $ gfortran -g -S -O3 -ftree-vectorize -ftree-vectorizer-verbose=2 -msse2 vect4.f draws the fol