Re: Dealing with paradoxical subregs of memory?

2017-01-26 Thread Dominik Vogt
On Wed, Jan 25, 2017 at 04:45:23PM -0600, Segher Boessenkool wrote: > On Wed, Jan 25, 2017 at 06:36:04PM +0100, Dominik Vogt wrote: > > On the other hand, Combine > > does not know that they are "outlawed" and happily generates > > them. > > combine should not generate things that can never match.

Vectorization regression on s390x GCC6 vs GCC5

2017-01-26 Thread Robin Dapp
Hi, while analyzing a test case with a lot of nested loops (>7) and double floating point operations I noticed a performance regression of GCC 6/7 vs GCC 5 on s390x. It seems due to GCC 6 vectorizing something GCC 5 couldn't. Basically, each loop iterates over three dimensions, we fully unroll so

Re: Vectorization regression on s390x GCC6 vs GCC5

2017-01-26 Thread Bin.Cheng
On Thu, Jan 26, 2017 at 10:18 AM, Robin Dapp wrote: > Hi, > > while analyzing a test case with a lot of nested loops (>7) and double > floating point operations I noticed a performance regression of GCC 6/7 > vs GCC 5 on s390x. It seems due to GCC 6 vectorizing something GCC 5 > couldn't. > Basic

Re: What is the status of macOS PowerPC support?

2017-01-26 Thread Jonathan Wakely
On 25 January 2017 at 22:30, Segher Boessenkool wrote: > On Wed, Jan 25, 2017 at 04:36:13PM +0100, FX wrote: >> I am trying to determine what is the status of the powerpc-apple-darwin >> target for GCC. The last released version of GCC for which a successful >> build is reported is 4.9.1 >> (htt

Re: Vectorization regression on s390x GCC6 vs GCC5

2017-01-26 Thread Richard Biener
On Thu, Jan 26, 2017 at 11:36 AM, Bin.Cheng wrote: > On Thu, Jan 26, 2017 at 10:18 AM, Robin Dapp wrote: >> Hi, >> >> while analyzing a test case with a lot of nested loops (>7) and double >> floating point operations I noticed a performance regression of GCC 6/7 >> vs GCC 5 on s390x. It seems du

RE: [RFC] Further LRA subreg handling issues

2017-01-26 Thread Matthew Fortune
Matthew Fortune writes: ... > Pseudo 300 is assigned to memory and then LRA produces a simple DImode > load from the assigned stack slot. The only instruction to set pseudo > 300 is: > > (insn 247 212 389 3 (set (reg:SI 300) > (ne:SI (subreg/s/u:SI (reg/v:DI 231 [ taken ]) 0) >

Re: [patch, libgfortran RFC] Installation script for OpenCoarrays to enable multi-image gfortran

2017-01-26 Thread FX
Hi Jerry, A few questions: - why mpich? doesn’t opencoarrays support any MPI implementation? - I am a bit surprised by the complexity of the script… couldn’t we provide a Makefile for opencoarrays, to be compatible with our other build requirements? - do we want to work towards seamless im

Re: [RFC] Further LRA subreg handling issues

2017-01-26 Thread Eric Botcazou
> However in lra-constraints.c:simplify_operand_subreg it quite happily > performs a reload using the outer mode in this case and only drops down to > the inner mode if the outer mode reload would be slower than the inner. > > Presumably this is safe for non WORD_REGISTER_OPERATIONS targets as the

Re: [patch, libgfortran RFC] Installation script for OpenCoarrays to enable multi-image gfortran

2017-01-26 Thread Jerry DeLisle
On 01/26/2017 05:25 AM, FX wrote: > Hi Jerry, > > A few questions: > > - why mpich? doesn’t opencoarrays support any MPI implementation? We picked it as one that I had available and only as a starting point, we plan to add support for other libraries as we go. (OpenCoarrays itself does support

Re: [RFC] Further LRA subreg handling issues

2017-01-26 Thread David Malcolm
On Thu, 2017-01-26 at 13:00 +, Matthew Fortune wrote: > Matthew Fortune writes: > ... > > Pseudo 300 is assigned to memory and then LRA produces a simple > > DImode > > load from the assigned stack slot. The only instruction to set > > pseudo > > 300 is: > > > > (insn 247 212 389 3 (set (reg:

RE: [RFC] Further LRA subreg handling issues

2017-01-26 Thread Matthew Fortune
Eric Botcazou writes: > > However in lra-constraints.c:simplify_operand_subreg it quite happily > > performs a reload using the outer mode in this case and only drops > > down to the inner mode if the outer mode reload would be slower than > the inner. > > > > Presumably this is safe for non WORD_

Successful bootstrap and install of gcc (GCC) 6.3.0 on armv7l-unknown-linux-gnueabi

2017-01-26 Thread Aaro Koskinen
Hi, Here's a report of a successful build and install of GCC: $ gcc-6.3.0/config.guess armv7l-unknown-linux-gnueabi $ newcompiler/bin/gcc -v Using built-in specs. COLLECT_GCC=newcompiler/bin/gcc COLLECT_LTO_WRAPPER=/home/aaro/gcctest/newcompiler/libexec/gcc/arm-unknown-linux-gnueabi/6.3.0/lto-wr

Re: [RFC] Further LRA subreg handling issues

2017-01-26 Thread Eric Botcazou
> This part suggests to me that LRA should never be reloading the > paradoxical subreg meaning the whole SLOW_UNALIGNED_ACCESS checking code in > simplify_operand_subreg could be removed unconditionally. Why? For a little-endian target which is neither strict-alignment nor WORD_REGISTER_OPERATIO

gcc-6-20170126 is now available

2017-01-26 Thread gccadmin
Snapshot gcc-6-20170126 is now available on ftp://gcc.gnu.org/pub/gcc/snapshots/6-20170126/ and on various mirrors, see http://gcc.gnu.org/mirrors.html for details. This snapshot has been generated from the GCC 6 SVN branch with the following options: svn://gcc.gnu.org/svn/gcc/branches/gcc-6