[gomp] Reuse futex barrier implementation for RTEMS

2015-07-16 Thread Sebastian Huber
Hello, the libgomp configuration for RTEMS uses currently the POSIX implementation. Unfortunately the performance is unacceptable bad, so I work currently on a specialized RTEMS configuration. I would like to reuse the code of the Linux futex barrier. On RTEMS there is no kernel/user space se

Re: How to express this complicated constraint in md file

2015-07-16 Thread Jim Wilson
On 07/16/2015 01:32 PM, Dmitry Grinberg wrote: > WUMUL x, y which will multiply 32-bit register x by 32-bit > register y, and produce a 64-bit result, storing the high bits into > register x and low bits into register y You can rewrite the RTL to make this easier. You can use a parallel to do

How to express this complicated constraint in md file

2015-07-16 Thread Dmitry Grinberg
Hi, I have not been able to find an answer in the ".md" files i've read through, or the docs, so I wanted to try here. If we have an instruction on a machine that does "ADD x, y", where register x is both input and output, in the ".md" file one can express this easily (referencing it as "0" inste

Re: Reduction Pattern ( Vectorization or Parallelization)

2015-07-16 Thread Richard Biener
On July 16, 2015 8:03:03 PM GMT+02:00, Toon Moene wrote: >On 07/16/2015 12:53 PM, Richard Biener wrote: > >> On Sun, Jul 5, 2015 at 1:57 PM, Ajit Kumar Agarwal > >>> For the following code >>> For(j = 0; j <= N;j++) >>> { >>> y = d[j]; >>> For( I = 0 ; I <8 ; i++) >>> X(a[i]) =

Re: Reduction Pattern ( Vectorization or Parallelization)

2015-07-16 Thread Toon Moene
On 07/16/2015 12:53 PM, Richard Biener wrote: On Sun, Jul 5, 2015 at 1:57 PM, Ajit Kumar Agarwal For the following code For(j = 0; j <= N;j++) { y = d[j]; For( I = 0 ; I <8 ; i++) X(a[i]) = X(a[i]) + c[i] * y; } Fig(1). I think the issue here is dependences of X(A[i]) a

GCC 5.2 Released

2015-07-16 Thread Richard Biener
The GNU Compiler Collection version 5.2 has been released. GCC 5.2 is a bug-fix release from the GCC 5 branch containing important fixes for regressions and serious bugs in GCC 5.1 with more than 81 bugs fixed since the previous release. This release is available from the FTP servers listed at:

GCC 5.3 Status Report (2015-07-16)

2015-07-16 Thread Richard Biener
Status == The GCC 5 branch is open again for regression and documentation fixes. If nothing unusual happens you can expect GCC 5.3 in October, roughly when people should think of stage1 ending soon ;) Quality Data Priority # Change from last report -

Re: Reduction Pattern ( Vectorization or Parallelization)

2015-07-16 Thread Richard Biener
On Sun, Jul 5, 2015 at 1:57 PM, Ajit Kumar Agarwal wrote: > All: > > The scalar and array reduction patterns can be identified if the result of > commutative updates > Is applied to the same scalar or array variables on the LHS with +, *, Min or > Max. Thus the reduction pattern identified with

GCC 4.9.3 build on i386-apple-darwin9.8.0

2015-07-16 Thread Andras Salamon
Mac OS X 10.5 with XCode 3.1.4, bootstrapped via gcc-4.7.4. config.guess: 386-apple-darwin9.8.0 gcc -v: Using built-in specs. COLLECT_GCC=gcc COLLECT_LTO_WRAPPER=/usr/local/libexec/gcc/i386-apple-darwin9.8.0/4.9.3/lto-wrapper Target: i386-apple-darwin9.8.0 Configured with: ./configure --localsta

Re: Partition and subpartition Analysis that helps in further vectorization and parallelization

2015-07-16 Thread Richard Biener
On Tue, Jul 14, 2015 at 1:33 PM, Ajit Kumar Agarwal wrote: > All: > > I am trying the place the following Analysis in the vectorizer of GCC that > helps in improving the vectorizer to a great extent > For the unit stride, zero stride and non stride accesses of memory that helps > in vectorizer.