Hello,
the libgomp configuration for RTEMS uses currently the POSIX
implementation. Unfortunately the performance is unacceptable bad, so I
work currently on a specialized RTEMS configuration. I would like to
reuse the code of the Linux futex barrier. On RTEMS there is no
kernel/user space se
On 07/16/2015 01:32 PM, Dmitry Grinberg wrote:
> WUMUL x, y which will multiply 32-bit register x by 32-bit
> register y, and produce a 64-bit result, storing the high bits into
> register x and low bits into register y
You can rewrite the RTL to make this easier. You can use a parallel to
do
Hi,
I have not been able to find an answer in the ".md" files i've read
through, or the docs, so I wanted to try here.
If we have an instruction on a machine that does "ADD x, y", where
register x is both input and output, in the ".md" file one can express
this easily (referencing it as "0" inste
On July 16, 2015 8:03:03 PM GMT+02:00, Toon Moene wrote:
>On 07/16/2015 12:53 PM, Richard Biener wrote:
>
>> On Sun, Jul 5, 2015 at 1:57 PM, Ajit Kumar Agarwal
>
>>> For the following code
>>> For(j = 0; j <= N;j++)
>>> {
>>> y = d[j];
>>> For( I = 0 ; I <8 ; i++)
>>> X(a[i]) =
On 07/16/2015 12:53 PM, Richard Biener wrote:
On Sun, Jul 5, 2015 at 1:57 PM, Ajit Kumar Agarwal
For the following code
For(j = 0; j <= N;j++)
{
y = d[j];
For( I = 0 ; I <8 ; i++)
X(a[i]) = X(a[i]) + c[i] * y;
}
Fig(1).
I think the issue here is dependences of X(A[i]) a
The GNU Compiler Collection version 5.2 has been released.
GCC 5.2 is a bug-fix release from the GCC 5 branch
containing important fixes for regressions and serious bugs in
GCC 5.1 with more than 81 bugs fixed since the previous release.
This release is available from the FTP servers listed at:
Status
==
The GCC 5 branch is open again for regression and documentation fixes.
If nothing unusual happens you can expect GCC 5.3 in October, roughly
when people should think of stage1 ending soon ;)
Quality Data
Priority # Change from last report
-
On Sun, Jul 5, 2015 at 1:57 PM, Ajit Kumar Agarwal
wrote:
> All:
>
> The scalar and array reduction patterns can be identified if the result of
> commutative updates
> Is applied to the same scalar or array variables on the LHS with +, *, Min or
> Max. Thus the reduction pattern identified with
Mac OS X 10.5 with XCode 3.1.4, bootstrapped via gcc-4.7.4.
config.guess:
386-apple-darwin9.8.0
gcc -v:
Using built-in specs.
COLLECT_GCC=gcc
COLLECT_LTO_WRAPPER=/usr/local/libexec/gcc/i386-apple-darwin9.8.0/4.9.3/lto-wrapper
Target: i386-apple-darwin9.8.0
Configured with: ./configure --localsta
On Tue, Jul 14, 2015 at 1:33 PM, Ajit Kumar Agarwal
wrote:
> All:
>
> I am trying the place the following Analysis in the vectorizer of GCC that
> helps in improving the vectorizer to a great extent
> For the unit stride, zero stride and non stride accesses of memory that helps
> in vectorizer.
10 matches
Mail list logo