Re: [PATCH] [8/9/10/11 Regression] [OOP] PR fortran/86470 - ICE with OpenMP

2021-01-27 Thread Thomas Koenig via Gcc-patches
Hello Harald, OK for master / backports? OK. It is indeed fairly obvious, as you write. Should the testcase be moved to the gomp/ subdirectory? Yes. It's a compile-time test, and it will then only be run if the the compiler can do OpenMP. You will not need the +! { dg-options "-fopenmp"

[PATCH][X86] Enable X86_TUNE_AVX256_UNALIGNED_{LOAD,STORE}_OPTIMAL for generic tune [PR target/98172]

2021-01-27 Thread Hongtao Liu via Gcc-patches
Hi: GCC11 will be the system GCC 2 years from now, and for the processors then, they shouldn't even need to split a 256-bit vector into 2 128-bits vectors. .i.e. Test SPEC2017 with the below 2 options on Zen3/ICL show option B is better than Option A. Option A: -march=x86-64 -mtune=generic -m

Re: [RFC] test builtin ratio for loop distribution

2021-01-27 Thread Alexandre Oliva
On Jan 27, 2021, Richard Biener wrote: > That said, rather than not transforming the loop as you do I'd > say we want to re-inline small copies more forcefully during > loop distribution code-gen so we turn a loop that sets > 3 'short int' to zero into a 'int' store and a 'short' store for exampl

Re: [PATCH,rs6000] Fusion patterns for logical-logical

2021-01-27 Thread Segher Boessenkool
Hi! On Thu, Dec 10, 2020 at 08:41:11PM -0600, acsaw...@linux.ibm.com wrote: > This patch adds a new function to genfusion.pl to generate patterns for > logical-logical fusion. They are enabled by default for power10 and can > be disabled by -mno-power10-fusion-2logical or -mno-power10-fusion. > +

Re: [PATCH] RISC-V: Always define MULTILIB_DEFAULTS

2021-01-27 Thread Kito Cheng via Gcc-patches
Hi Sebastian: Thank for report this issue, I can reproduce that, I'll investigate what happened today :) Sebastian Huber 於 2021年1月26日 週二 14:13 寫道: > Hello Kito, > > On 20/11/2020 09:33, Kito Cheng wrote: > > - Define MULTILIB_DEFAULTS can reduce the total number of multilib if > > the def

Re: [PATCH] PowerPC: Map IEEE 128-bit long double built-ins.

2021-01-27 Thread will schmidt via Gcc-patches
On Wed, 2021-01-27 at 19:43 -0600, Segher Boessenkool wrote: > On Wed, Jan 27, 2021 at 01:06:46PM -0600, will schmidt wrote: > > On Thu, 2021-01-14 at 11:59 -0500, Michael Meissner via Gcc-patches > > wrote: > > > November 19th, 2020: > > > Message-ID: <20201119235814.ga...@ibm-toto.the-meissners.

Re: [PATCH] RISC-V: Fix -march option parsing when `p` extension exists.

2021-01-27 Thread Kito Cheng via Gcc-patches
Thanks! committed to master :) On Wed, Jan 27, 2021 at 1:58 PM Xing GUO via Gcc-patches wrote: > > Sorry, I forgot to remove the line '*explicit_version_p = true;' in my > previous patch. > > This is an updated patch. > > Thanks! > > --- > gcc/ChangeLog: > > * common/config/riscv/riscv-co

Re: [PATCH, rs6000] improve vec_ctf invalid parameter handling. (pr91903)

2021-01-27 Thread will schmidt via Gcc-patches
On Wed, 2021-01-27 at 18:24 -0600, Segher Boessenkool wrote: > Hi! > > On Mon, Oct 26, 2020 at 04:22:32PM -0500, will schmidt wrote: > > Per PR91903, GCC ICEs when we attempt to pass a variable > > (or out of range value) into the vec_ctf() builtin. Per > > investigation, the parameter checking

Re: [Ping] PowerPC: Add float128/Decimal conversions.

2021-01-27 Thread Segher Boessenkool
On Tue, Jan 26, 2021 at 06:43:06PM -0500, Michael Meissner wrote: > I posted this patch on January 14th, 2021: > https://gcc.gnu.org/pipermail/gcc-patches/2021-January/563498.html > > | Date: Thu, 14 Jan 2021 12:09:36 -0500 > | Subject: [PATCH] PowerPC: Add float128/Decimal conversions. > | Messag

Re: [PATCH] PowerPC: Map IEEE 128-bit long double built-ins.

2021-01-27 Thread Michael Meissner via Gcc-patches
Whoops, I thought I was replying to the second patch about Decimal and IEEE 128-bit conversion, not about built-in support. On Wed, Jan 27, 2021 at 10:01:38PM -0500, Michael Meissner wrote: > On Wed, Jan 27, 2021 at 07:43:56PM -0600, Segher Boessenkool wrote: > > On Wed, Jan 27, 2021 at 01:06:46PM

Re: [PATCH] PowerPC: Map IEEE 128-bit long double built-ins.

2021-01-27 Thread Michael Meissner via Gcc-patches
On Wed, Jan 27, 2021 at 07:43:56PM -0600, Segher Boessenkool wrote: > On Wed, Jan 27, 2021 at 01:06:46PM -0600, will schmidt wrote: > > On Thu, 2021-01-14 at 11:59 -0500, Michael Meissner via Gcc-patches wrote: > > > November 19th, 2020: > > > Message-ID: <20201119235814.ga...@ibm-toto.the-meissner

[PATCH PR97627]Avoid computing niters info for fake edges

2021-01-27 Thread bin.cheng via Gcc-patches
Hi, As described in commit message, we need to avoid computing niters info for fake edges. This simple patch does this by two changes. Bootstrap and test on X86_64, is it ok? Thanks, bin pr97627-20210128.patch Description: Binary data

Re: [PATCH] PowerPC: Map IEEE 128-bit long double built-ins.

2021-01-27 Thread Segher Boessenkool
On Wed, Jan 27, 2021 at 01:06:46PM -0600, will schmidt wrote: > On Thu, 2021-01-14 at 11:59 -0500, Michael Meissner via Gcc-patches wrote: > > November 19th, 2020: > > Message-ID: <20201119235814.ga...@ibm-toto.the-meissners.org> > > Subject and date should be sufficient Only if people pick good

Re: [Ping] PowerPC: Map IEEE 128-bit long double built-ins.

2021-01-27 Thread Segher Boessenkool
On Tue, Jan 26, 2021 at 06:39:22PM -0500, Michael Meissner wrote: > Ping https://gcc.gnu.org/pipermail/gcc-patches/2021-January/563496.html > > | Date: Thu, 14 Jan 2021 11:59:19 -0500 > | Subject: [PATCH] PowerPC: Map IEEE 128-bit long double built-ins. > | Message-ID: <20210114165919.ga1...@ibm-t

Re: [PATCH] PowerPC: Map IEEE 128-bit long double built-ins.

2021-01-27 Thread Segher Boessenkool
On Tue, Jan 19, 2021 at 12:24:51PM -0500, Michael Meissner wrote: > On Fri, Jan 15, 2021 at 03:43:13PM -0600, Segher Boessenkool wrote: > > Hi! > > > > On Thu, Jan 14, 2021 at 11:59:19AM -0500, Michael Meissner wrote: > > > >From 78435dee177447080434cdc08fc76b1029c7f576 Mon Sep 17 00:00:00 2001 >

Re: [PATCH] document BLOCK_ABSTRACT_ORIGIN et al.

2021-01-27 Thread Martin Sebor via Gcc-patches
Attached is an updated patch for both tree.h and the internals manual documenting the most important BLOCK_ macros and what they represent. On 1/21/21 2:52 PM, Martin Sebor wrote: On 1/18/21 6:25 AM, Richard Biener wrote: PS Here are my notes on the macros and the two related functions: BLOCK:

Re: [PATCH, rs6000] improve vec_ctf invalid parameter handling. (pr91903)

2021-01-27 Thread Segher Boessenkool
Hi! On Mon, Oct 26, 2020 at 04:22:32PM -0500, will schmidt wrote: > Per PR91903, GCC ICEs when we attempt to pass a variable > (or out of range value) into the vec_ctf() builtin. Per > investigation, the parameter checking exists for this > builtin with the int types, but was missing for > the

Re: [[C++ PATCH]] Implement C++2a P0330R2 - Literal Suffixes for ptrdiff_t and size_t

2021-01-27 Thread Ed Smith-Rowland via Gcc-patches
On 1/27/21 3:32 PM, Jakub Jelinek wrote: On Sun, Oct 21, 2018 at 04:39:30PM -0400, Ed Smith-Rowland wrote: This patch implements C++2a proposal P0330R2 Literal Suffixes for ptrdiff_t and size_t*.  It's not official yet but looks very likely to pass.  It is incomplete because I'm looking for some

[PATCH v3] clear VLA bounds in attribute access (PR 97172)

2021-01-27 Thread Martin Sebor via Gcc-patches
Attached is another attempt to fix the problem caused by allowing front-end trees representing nontrivial VLA bound expressions to stay in attribute access attached to functions. Since removing these trees seems to be everyone's preference this patch does that by extending the free_lang_data pass

Re: [PATCH Fortran] Re: PR fortran/93524 - rank >= 3 array stride incorrectly set in CFI_establish

2021-01-27 Thread Thomas Koenig via Gcc-patches
Hi Harris! OK for master? I do not have write access, so someone will need to commit this for me. Reviewed, regression-tested and committed as https://gcc.gnu.org/g:1cdca4261e88f4dc9c3293c6b3c2fff3071ca32b Thanks for your patch, and welcome aboard! Best regards Thomas

[PATCH] [8/9/10/11 Regression] [OOP] PR fortran/86470 - ICE with OpenMP

2021-01-27 Thread Harald Anlauf via Gcc-patches
Dear all, the fix for this ICE is obvious: make gfc_call_malloc behave as documented. Apparently the special case in question was not exercised in the testsuite. Regtested on x86_64-pc-linux-gnu. OK for master / backports? Should the testcase be moved to the gomp/ subdirectory? Thanks, Harald

[PATCH Fortran] Re: PR fortran/93524 - rank >= 3 array stride incorrectly set in CFI_establish

2021-01-27 Thread Harris Snyder
(re-sending with subject line tags) Hi all, Now that my copyright assignment is complete, I'm submitting this fix. Test cases are included. OK for master? I do not have write access, so someone will need to commit this for me. Regards, Harris libgfortran/ChangeLog: * runtime/ISO_Fortran_bi

[PATCH, revised, #2] PowerPC: Add float128/Decimal conversions.

2021-01-27 Thread Michael Meissner via Gcc-patches
>From 02b04aed77130f2ec9156d2f7ff89d4cc6b5a78b Mon Sep 17 00:00:00 2001 From: Michael Meissner Date: Thu, 21 Jan 2021 12:58:56 -0500 Subject: [PATCH, revised] PowerPC: Add float128/Decimal conversions. [PATCH, revised] PowerPC: Add float128/Decimal conversions. Unfortunately, the revision I just

Re: [PATCH, revised] PowerPC: Add float128/Decimal conversions.

2021-01-27 Thread Michael Meissner via Gcc-patches
[PATCH, revised] PowerPC: Add float128/Decimal conversions. This patch revises the patch on January 14th. The only change in this patch compared to the previous patch is to change the format string for converting IEEE 128-bit to string. This allows the c-c++-common/dfp/convert-bfp-6.c test now p

[committed] [PR97684] IRA: Recalculate pseudo classes if we added new pseduos since last calculation before updating equiv regs

2021-01-27 Thread Vladimir Makarov via Gcc-patches
The patch solves the following problem:   https://gcc.gnu.org/bugzilla/show_bug.cgi?id=97684 The patch was successfully bootstrapped and tested on x86-64. commit 238ea13cca75ad499f227b60a95c40174c6caf78 Author: Vladimir N. Makarov Date: Wed Jan 27 14:53:28 2021 -0500 [PR97684] IRA: Reca

Re: [PATCH] PowerPC: Map IEEE 128-bit long double built-ins.

2021-01-27 Thread Michael Meissner via Gcc-patches
On Wed, Jan 27, 2021 at 01:06:46PM -0600, will schmidt wrote: > On Thu, 2021-01-14 at 11:59 -0500, Michael Meissner via Gcc-patches wrote: > > From 78435dee177447080434cdc08fc76b1029c7f576 Mon Sep 17 00:00:00 2001 > > From: Michael Meissner > > Date: Wed, 13 Jan 2021 21:47:03 -0500 > > Subject: [P

Re: [PATCH] rs6000: Fix vec insert ilp32 ICE and test failures [PR98799]

2021-01-27 Thread David Edelsohn via Gcc-patches
This patch is okay with the removal of { target powerpc*-*-* } from the pr79251-run.c testcase directives. As I explained in the earlier email, I still believe that the testcase is not testing what you intend, but this patch is a definite improvement and removes the failures. We can correct the

[PATCH 16/16] Improve "find_first/last_set" for NEON

2021-01-27 Thread Matthias Kretz
From: yaozhongxiao find_first_set and find_last_set method is not optimal for neon, it need to be improved by synthesized with horizontal adds(vaddv) which will reduce the generated assembly code; in the following cases, vaddvq_s16 will generate 2 instructions but vpadd_s16 will generate 4 instru

[PATCH 15/16] Work around test failures using -mno-tree-vrp

2021-01-27 Thread Matthias Kretz
From: Matthias Kretz This is necessary to avoid failures resulting from PR98834. libstdc++-v3/ChangeLog: * testsuite/Makefile.am: Warn about the workaround. Add -fno-tree-vrp to CXXFLAGS passed to the check_simd script. Improve initial user feedback from make check-simd.

[PATCH 14/16] Implement hmin and hmax

2021-01-27 Thread Matthias Kretz
From: Matthias Kretz From 9.7.4 in Parallelism TS 2. For some reason I overlooked these two functions. Implement them via call to _S_reduce. libstdc++-v3/ChangeLog: * include/experimental/bits/simd.h: Add __detail::_Minimum and __detail::_Maximum to use them as _BinaryOperation t

[PATCH 13/16] Improve test codegen for interpreting assembly

2021-01-27 Thread Matthias Kretz
From: Matthias Kretz In many failure cases it is helpful to inspect the instructions leading up to the test failure. After this change the location is easier to find and the branch after failure is easier to find. libstdc++-v3/ChangeLog: * testsuite/experimental/simd/tests/bits/verify.h

[PATCH 12/16] Support timeout and timeout-factor options

2021-01-27 Thread Matthias Kretz
From: Matthias Kretz libstdc++-v3/ChangeLog: * testsuite/experimental/simd/driver.sh: Abstract reading test options into read_src_option function. Read skip, only, expensive, and xfail via read_src_option. Add timeout and timeout-factor options and adjust timeout v

[PATCH 11/16] Abort test after 1000 lines of output

2021-01-27 Thread Matthias Kretz
From: Matthias Kretz Handle overly large output by aborting the log and thus the test. This is a similar condition to a timeout. libstdc++-v3/ChangeLog: * testsuite/experimental/simd/driver.sh: When handling the pipe to log (and on verbose to stdout) count the lines. If it exceed

[PATCH 10/16] Skip testing hypot3 for long double on PPC

2021-01-27 Thread Matthias Kretz
From: Matthias Kretz std::hypot(a, b, c) is imprecise and makes this test fail even though the failure is unrelated to simd. libstdc++-v3/ChangeLog: * testsuite/experimental/simd/tests/hypot3_fma.cc: Add skip: markup for long double on powerpc64*. --- libstdc++-v3/testsuite/expe

[PATCH 09/16] Fix mask reduction of simd_mask on POWER7

2021-01-27 Thread Matthias Kretz
From: Matthias Kretz POWER7 does not support __vector long long reductions, making the generic _S_popcount implementation ill-formed. Specializing _S_popcount for PPC allows optimization and avoids the issue. libstdc++-v3/ChangeLog: * include/experimental/bits/simd.h: Add __have_power10v

[PATCH 08/16] Immediate feedback with -v

2021-01-27 Thread Matthias Kretz
From: Matthias Kretz libstdc++-v3/ChangeLog: * testsuite/experimental/simd/driver.sh: Remove executable on SIGINT. Process compiler and test executable output: In verbose mode print messages immediately, limited to 1000 lines and breaking long lines to below $COLUM

[PATCH 07/16] Fix incorrect display of old test summaries

2021-01-27 Thread Matthias Kretz
From: Matthias Kretz libstdc++-v3/ChangeLog: * testsuite/Makefile.am: Ensure .simd.summary is empty before collecting a new summary. * testsuite/Makefile.in: Regenerate. --- libstdc++-v3/testsuite/Makefile.am | 1 + libstdc++-v3/testsuite/Makefile.in | 1 + 2 files change

[PATCH 05/16] Fix several check-simd interaction issues

2021-01-27 Thread Matthias Kretz
From: Matthias Kretz libstdc++-v3/ChangeLog: * testsuite/experimental/simd/driver.sh (verify_test): Print test output on run xfail. Do not repeat lines from the log that were already printed on stdout. (test_selector): Make the compiler flags pattern usable as a

[PATCH 04/16] Fix simd_mask on POWER w/o POWER8

2021-01-27 Thread Matthias Kretz
From: Matthias Kretz libstdc++-v3/ChangeLog: * include/experimental/bits/simd.h: Remove unnecessary static assertion. Allow sizeof(8) integer __intrinsic_type to enable the necessary mask type. --- libstdc++-v3/include/experimental/bits/simd.h | 6 -- 1 file changed,

[PATCH 06/16] Fix DRIVEROPTS and TESTFLAGS processing

2021-01-27 Thread Matthias Kretz
From: Matthias Kretz libstdc++-v3/ChangeLog: * testsuite/experimental/simd/generate_makefile.sh: Use different variables internally than documented for user overrides. This makes internal append/prepend work as intended. --- .../testsuite/experimental/simd/generate_makefi

[PATCH 03/16] Support -mlong-double-64 on PPC

2021-01-27 Thread Matthias Kretz
From: Matthias Kretz libstdc++-v3/ChangeLog: * include/experimental/bits/simd.h: Let __intrinsic_type be valid if sizeof(long double) == sizeof(double) and use a __vector double as member type. --- libstdc++-v3/include/experimental/bits/simd.h | 11 --- 1 file changed, 8

[PATCH 02/16] Fix NEON intrinsic types usage

2021-01-27 Thread Matthias Kretz
From: Matthias Kretz Intrinsics types for NEON differ from gnu::vector_size types now. This requires explicit specializations for __intrinsic_type and a new __is_intrinsic_type trait. libstdc++-v3/ChangeLog: * include/experimental/bits/simd.h (__is_intrinsic_type): New internal t

[PATCH 01/16] Support skip, only, expensive, and xfail markers

2021-01-27 Thread Matthias Kretz
From: Matthias Kretz libstdc++-v3/ChangeLog: * testsuite/experimental/simd/driver.sh: Implement skip, only, expensive, and xfail markers. They can select on type, ABI tag subset number, target-triplet, and compiler flags. * testsuite/experimental/simd/generate_make

[PATCH 00/16] stdx::simd fixes and testsuite improvements

2021-01-27 Thread Matthias Kretz
As promised on IRC ... Matthias Kretz (15): Support skip, only, expensive, and xfail markers Fix NEON intrinsic types usage Support -mlong-double-64 on PPC Fix simd_mask on POWER w/o POWER8 Fix several check-simd interaction issues Fix DRIVEROPTS and TESTFLAGS processing Fix incorrec

Re: [[C++ PATCH]] Implement C++2a P0330R2 - Literal Suffixes for ptrdiff_t and size_t

2021-01-27 Thread Jakub Jelinek via Gcc-patches
On Sun, Oct 21, 2018 at 04:39:30PM -0400, Ed Smith-Rowland wrote: > This patch implements C++2a proposal P0330R2 Literal Suffixes for ptrdiff_t > and size_t*.  It's not official yet but looks very likely to pass.  It is > incomplete because I'm looking for some opinions. 9We also might wait 'till >

[pushed] c++: Dependent using enum [PR97874]

2021-01-27 Thread Jason Merrill via Gcc-patches
The handling of dependent scopes and unsuitable scopes in lookup_using_decl was a bit convoluted; I tweaked it for a while and then eventually reorganized much of the function to hopefully be clearer. Along the way I noticed a couple of ways we were mishandling inherited constructors. The local b

Re: [PATCH, rs6000] improve vec_ctf invalid parameter handling. (pr91903)

2021-01-27 Thread will schmidt via Gcc-patches
Ping! Thanks -Will On Mon, 2021-01-04 at 18:03 -0600, will schmidt via Gcc-patches wrote: > On Mon, 2020-10-26 at 16:22 -0500, will schmidt wrote: > > [PATCH, rs6000] improve vec_ctf invalid parameter handling. > > > > Hi, > > Per PR91903, GCC ICEs when we attempt to pass a variable > > (

Re: PR fortran/93524 - rank >= 3 array stride incorrectly set in CFI_establish

2021-01-27 Thread Harris Snyder
Hi all, Now that my copyright assignment is complete, I'm submitting this fix. Test cases are included. OK for master? I do not have write access, so someone will need to commit this for me. Regards, Harris libgfortran/ChangeLog: * runtime/ISO_Fortran_binding.c (CFI_establish): fixed strid

RE: [PATCH] aarch64: Fix up *aarch64_bfxilsi_uxtw [PR98853]

2021-01-27 Thread Kyrylo Tkachov via Gcc-patches
> -Original Message- > From: Jakub Jelinek > Sent: 27 January 2021 19:11 > To: Richard Earnshaw ; Richard Sandiford > ; Marcus Shawcroft > ; Kyrylo Tkachov > Cc: gcc-patches@gcc.gnu.org > Subject: [PATCH] aarch64: Fix up *aarch64_bfxilsi_uxtw [PR98853] > > Hi! > > The https://gcc.gnu

[PATCH v2] IBM Z: Fix usage of "f" constraint with long doubles

2021-01-27 Thread Ilya Leoshkevich via Gcc-patches
v1: https://gcc.gnu.org/pipermail/gcc-patches/2021-January/563799.html v1 -> v2: Handle constraint modifiers, use AR constraint instead of R, add testcases for & and %. After switching the s390 backend to store long doubles in vector registers, "f" constraint broke when used with the former: l

[PATCH] aarch64: Fix up *aarch64_bfxilsi_uxtw [PR98853]

2021-01-27 Thread Jakub Jelinek via Gcc-patches
Hi! The https://gcc.gnu.org/legacy-ml/gcc-patches/2018-07/msg01895.html patch that introduced this pattern claimed: Would generate: combine_balanced_int: bfxil w0, w1, 0, 16 uxtwx0, w0 ret But with this patch generates: combine_balanced_int: bfxil w0, w1,

Re: [PATCH] PowerPC: Map IEEE 128-bit long double built-ins.

2021-01-27 Thread will schmidt via Gcc-patches
On Thu, 2021-01-14 at 11:59 -0500, Michael Meissner via Gcc-patches wrote: > From 78435dee177447080434cdc08fc76b1029c7f576 Mon Sep 17 00:00:00 2001 > From: Michael Meissner > Date: Wed, 13 Jan 2021 21:47:03 -0500 > Subject: [PATCH] PowerPC: Map IEEE 128-bit long double built-ins. > > This patch r

Re: [PATCH 2/2] Add simd testsuite

2021-01-27 Thread Jonathan Wakely via Gcc-patches
On 27/01/21 17:54 +, Jonathan Wakely wrote: and add something to the release notes too. Also done. Pushed to wwwdocs. commit f948177c3d01d09cbc8035a75583d425a4dca46e Author: Jonathan Wakely Date: Wed Jan 27 18:30:00 2021 + Document simd additions to libstdc++ diff --git a/htd

Re: [PATCH 2/2] Add simd testsuite

2021-01-27 Thread Jonathan Wakely via Gcc-patches
On 27/01/21 16:45 +, Jonathan Wakely wrote: I'll regen the docs [...] Done. Regenerating the docs needed the attached fix. commit 3670dbe49059ab1746ac2e3b77940160c05db6c2 Author: Jonathan Wakely Date: Wed Jan 27 17:52:27 2021 libstdc++: Regenerate libstdc++ HTML docs libs

Re: [PATCH] aarch64: Use GCC vector extensions for FP ml[as]_n intrinsics

2021-01-27 Thread Richard Sandiford via Gcc-patches
Kyrylo Tkachov writes: > Hi Jonathan, > >> -Original Message- >> From: Jonathan Wright >> Sent: 27 January 2021 16:03 >> To: gcc-patches@gcc.gnu.org >> Cc: Kyrylo Tkachov >> Subject: [PATCH] aarch64: Use GCC vector extensions for FP ml[as]_n >> intrinsics >> >> Hi, >> >> As subject, this

RE: [PATCH] aarch64: Use RTL builtins for [su]mlal intrinsics

2021-01-27 Thread Kyrylo Tkachov via Gcc-patches
> -Original Message- > From: Jonathan Wright > Sent: 27 January 2021 16:28 > To: gcc-patches@gcc.gnu.org > Cc: Kyrylo Tkachov > Subject: [PATCH] aarch64: Use RTL builtins for [su]mlal intrinsics > > Hi, > > As subject, this patch rewrites [su]mlal Neon intrinsics to use RTL > builtin

Re: [PATCH 2/2] Add simd testsuite

2021-01-27 Thread Jonathan Wakely via Gcc-patches
On 18/12/20 16:49 +0100, Matthias Kretz wrote: Resending squashed patch after addressing Jonathan's comments. From: Matthias Kretz Add a new check-simd target to the testsuite. The new target creates a subdirectory, generates the necessary Makefiles, and spawns submakes to build and run the te

Re: [PATCH] rtl-optimization/80960 - avoid creating garbage RTL in DSE

2021-01-27 Thread Jakub Jelinek via Gcc-patches
On Wed, Jan 27, 2021 at 05:37:54PM +0100, Richard Biener wrote: > Sure, more micro-optimizing is possible, including passing a flag > to canon_true_dependence whether the addr RTX already had get_addr > called on it. And pass in the offset as poly-rtx-int and make > get_addr apply it if not zero.

Re: [PATCH 1/2] Add std::experimental::simd from the Parallelism TS 2

2021-01-27 Thread Jonathan Wakely via Gcc-patches
On 18/12/20 16:49 +0100, Matthias Kretz wrote: Resending this patch with proper commit message and rebased on master. From: Matthias Kretz Adds . This implements the simd and simd_mask class templates via [[gnu::vector_size(N)]] data members. It implements overloads for all of for simd. Expl

Re: [PATCH] rtl-optimization/80960 - avoid creating garbage RTL in DSE

2021-01-27 Thread Richard Biener
On Wed, 27 Jan 2021, Jakub Jelinek wrote: > On Wed, Jan 27, 2021 at 04:16:22PM +0100, Richard Biener wrote: > > I can check but all immediate first uses of mem_addr are in > > true_dependece_1 which does x_addr = get_addr (x_addr); as the > > first thing on it. So the concern would be that > > ge

[PATCH] aarch64: Use RTL builtins for [su]mlal intrinsics

2021-01-27 Thread Jonathan Wright via Gcc-patches
Hi, As subject, this patch rewrites [su]mlal Neon intrinsics to use RTL builtins rather than inline assembly code, allowing for better scheduling and optimization. Regression tested and bootstrapped on aarch64-none-linux-gnu - no issues. Ok for master? Thanks, Jonathan --- gcc/ChangeLog: 202

RE: [PATCH] aarch64: Use GCC vector extensions for FP ml[as]_n intrinsics

2021-01-27 Thread Kyrylo Tkachov via Gcc-patches
Hi Jonathan, > -Original Message- > From: Jonathan Wright > Sent: 27 January 2021 16:03 > To: gcc-patches@gcc.gnu.org > Cc: Kyrylo Tkachov > Subject: [PATCH] aarch64: Use GCC vector extensions for FP ml[as]_n > intrinsics > > Hi, > > As subject, this patch rewrites floating-point mla_n

[PATCH] aarch64: Use GCC vector extensions for FP ml[as]_n intrinsics

2021-01-27 Thread Jonathan Wright via Gcc-patches
Hi, As subject, this patch rewrites floating-point mla_n/mls_n intrinsics to use a + b * c / a - b * c rather than inline assembly code, allowing for better scheduling and optimization. Regression tested and bootstrapped on aarch64-none-linux-gnu - no issues. Ok for master? Thanks, Jonathan --

Re: [PATCH] rtl-optimization/80960 - avoid creating garbage RTL in DSE

2021-01-27 Thread Jakub Jelinek via Gcc-patches
On Wed, Jan 27, 2021 at 04:16:22PM +0100, Richard Biener wrote: > I can check but all immediate first uses of mem_addr are in > true_dependece_1 which does x_addr = get_addr (x_addr); as the > first thing on it. So the concern would be that > get_addr (get_addr (x_addr)) != get_addr (x_addr) which

Re: [PATCH] rs6000: Fix vec insert ilp32 ICE and test failures [PR98799]

2021-01-27 Thread David Edelsohn via Gcc-patches
On Tue, Jan 26, 2021 at 10:56 PM Xionghu Luo wrote: > > Hi, > > On 2021/1/27 03:00, David Edelsohn wrote: > > On Tue, Jan 26, 2021 at 2:46 AM Xionghu Luo wrote: > >> > >> From: "luo...@cn.ibm.com" > >> > >> UNSPEC_SI_FROM_SF is not supported when TARGET_DIRECT_MOVE_64BIT > >> is false for -m32,

Re: [PATCH] rtl-optimization/80960 - avoid creating garbage RTL in DSE

2021-01-27 Thread Richard Biener
On Wed, 27 Jan 2021, Jakub Jelinek wrote: > On Wed, Jan 27, 2021 at 03:40:38PM +0100, Richard Biener wrote: > > The following avoids repeatedly turning VALUE RTXen into > > sth useful and re-applying a constant offset through get_addr > > via DSE check_mem_read_rtx. Instead perform this once for

Re: [RFC] test builtin ratio for loop distribution

2021-01-27 Thread Richard Biener via Gcc-patches
On Wed, Jan 27, 2021 at 2:18 PM Alexandre Oliva wrote: > > > This patch attempts to fix a libgcc codegen regression introduced in > gcc-10, as -ftree-loop-distribute-patterns was enabled at -O2. > > > The ldist pass turns even very short loops into memset calls. E.g., > the TFmode emulation calls

RE: aarch64: Use RTL builtins for integer mls_n intrinsics

2021-01-27 Thread Kyrylo Tkachov via Gcc-patches
> -Original Message- > From: Jonathan Wright > Sent: 27 January 2021 15:08 > To: gcc-patches@gcc.gnu.org > Cc: Kyrylo Tkachov > Subject: aarch64: Use RTL builtins for integer mls_n intrinsics > > Hi, > > As subject, this patch rewrites integer mls_n Neon intrinsics to use RTL > builti

aarch64: Use RTL builtins for integer mls_n intrinsics

2021-01-27 Thread Jonathan Wright via Gcc-patches
Hi, As subject, this patch rewrites integer mls_n Neon intrinsics to use RTL builtins rather than inline assembly code, allowing for better scheduling and optimization. Regression tested and bootstrapped on aarch64-none-linux-gnu - no issues. Ok for master? Thanks, Jonathan --- gcc/ChangeLog:

Re: follow SSA defs for asan base

2021-01-27 Thread Richard Biener via Gcc-patches
On Wed, Jan 27, 2021 at 1:29 PM Alexandre Oliva wrote: > > On Jan 26, 2021, Richard Biener wrote: > > > So while I think it's safe let's look at if we can improve tree-nested.c, > > like I see (probably not the correct place): > > *nod*, it's just not the *only* place. > > > seeing how we adjust

RE: [PATCH] aarch64: Use GCC vector extensions for integer mls intrinsics

2021-01-27 Thread Kyrylo Tkachov via Gcc-patches
From: Jonathan Wright Sent: 27 January 2021 12:57 To: Richard Sandiford ; Kyrylo Tkachov Cc: gcc-patches@gcc.gnu.org; Richard Earnshaw Subject: Re: [PATCH] aarch64: Use GCC vector extensions for integer mls intrinsics I have re-written this to use RTL builtins - regression tested and boots

Re: [PATCH] rtl-optimization/80960 - avoid creating garbage RTL in DSE

2021-01-27 Thread Jakub Jelinek via Gcc-patches
On Wed, Jan 27, 2021 at 03:40:38PM +0100, Richard Biener wrote: > The following avoids repeatedly turning VALUE RTXen into > sth useful and re-applying a constant offset through get_addr > via DSE check_mem_read_rtx. Instead perform this once for > all stores to be visited in check_mem_read_rtx.

[PATCH] tree-optimization/98854 - avoid some PHI BB vectorization

2021-01-27 Thread Richard Biener
This avoids cases of PHI node vectorization that just causes us to insert vector CTORs inside loops for values only required outside of the loop. Bootstrap and regtest running on x86_64-unknown-linux-gnu. 2021-01-27 Richard Biener PR tree-optimization/98854 * tree-vect-slp.c (

Re: [PATCH] libstdc++: implement locale support for AIX

2021-01-27 Thread CHIGOT, CLEMENT via Gcc-patches
> * There are minor formatting issues: > >   Should the linebreak in the extern inline definitions of strtof_l be >   after the return type, not before, matching GNU coding standards?  It >   may well be that the C++ style is different, though. > > Unrelated whitespace changes in xpg7/ctype_memb

[PATCH] rtl-optimization/80960 - avoid creating garbage RTL in DSE

2021-01-27 Thread Richard Biener
The following avoids repeatedly turning VALUE RTXen into sth useful and re-applying a constant offset through get_addr via DSE check_mem_read_rtx. Instead perform this once for all stores to be visited in check_mem_read_rtx. This avoids allocating 1.6GB of garbage PLUS RTXen on the PR80960 testca

Re: [PATCH] libstdc++: implement locale support for AIX

2021-01-27 Thread Rainer Orth
Hi Clement, > Here is a better version of the patch.  > All tests are on Linux are passing. Few have been disabled as  > they are working only with GNU model.  > For AIX, few failures remains. I haven't XFAIL them yet, as I  > want to know if they AIX only or related to the model itself.  > > A fe

RE: arm: Adjust cost of vector of constant zero

2021-01-27 Thread Kyrylo Tkachov via Gcc-patches
> -Original Message- > From: Christophe Lyon > Sent: 27 January 2021 13:56 > To: Kyrylo Tkachov > Cc: Kyrylo Tkachov via Gcc-patches > Subject: Re: arm: Adjust cost of vector of constant zero > > On Wed, 27 Jan 2021 at 14:44, Kyrylo Tkachov > wrote: > > > > > > > > > -Original Me

Re: arm: Adjust cost of vector of constant zero

2021-01-27 Thread Christophe Lyon via Gcc-patches
On Wed, 27 Jan 2021 at 14:44, Kyrylo Tkachov wrote: > > > > > -Original Message- > > From: Christophe Lyon > > Sent: 27 January 2021 13:12 > > To: Kyrylo Tkachov > > Cc: Kyrylo Tkachov via Gcc-patches > > Subject: Re: arm: Adjust cost of vector of constant zero > > > > On Wed, 27 Jan 20

[committed] libstdc++: Optimize std::string_view::find [PR 66414]

2021-01-27 Thread Jonathan Wakely via Gcc-patches
This reuses the code from std::string::find, which was improved by r244225, but string_view was not changed to match. libstdc++-v3/ChangeLog: PR libstdc++/66414 * include/bits/string_view.tcc (basic_string_view::find(const CharT*, size_type, size_type)): Optimize.

RE: arm: Adjust cost of vector of constant zero

2021-01-27 Thread Kyrylo Tkachov via Gcc-patches
> -Original Message- > From: Christophe Lyon > Sent: 27 January 2021 13:12 > To: Kyrylo Tkachov > Cc: Kyrylo Tkachov via Gcc-patches > Subject: Re: arm: Adjust cost of vector of constant zero > > On Wed, 27 Jan 2021 at 10:15, Kyrylo Tkachov > wrote: > > > > Hi Christophe, > > > > > -

Re: c++: cross-module __cxa_atexit use [PR 98531]

2021-01-27 Thread Rainer Orth
Hi Nathan, > Solaris tickled this bug as it has some mutex/sync/something primitive with > a destructor, hence wanted to generate a __cxa_atexit call inside an > inline/template function. But the problem is not solaris-specific. > > I tested this bootstrapping both x86_64-linux and aarch64-linux.

Re: arm: Adjust cost of vector of constant zero

2021-01-27 Thread Christophe Lyon via Gcc-patches
On Wed, 27 Jan 2021 at 10:15, Kyrylo Tkachov wrote: > > Hi Christophe, > > > -Original Message- > > From: Gcc-patches On Behalf Of > > Christophe Lyon via Gcc-patches > > Sent: 26 January 2021 18:03 > > To: gcc Patches > > Subject: arm: Adjust cost of vector of constant zero > > > > Neon

Re: [PATCH] aarch64: Use GCC vector extensions for integer mls intrinsics

2021-01-27 Thread Jonathan Wright via Gcc-patches
I have re-written this to use RTL builtins - regression tested and bootstrapped on aarch64-none-linux-gnu with no issues: aarch64: Use RTL builtins for integer mls intrinsics Rewrite integer mls Neon intrinsics to use RTL builtins rather than inline assembly code, allowing for better scheduling

Re: [PATCH] libstdc++: implement locale support for AIX

2021-01-27 Thread CHIGOT, CLEMENT via Gcc-patches
Hi everyone,  Here is a better version of the patch.  All tests are on Linux are passing. Few have been disabled as  they are working only with GNU model.  For AIX, few failures remains. I haven't XFAIL them yet, as I  want to know if they AIX only or related to the model itself.  A few part stil

Re: [PATCH] IBM Z: Fix usage of "f" constraint with long doubles

2021-01-27 Thread Ilya Leoshkevich via Gcc-patches
On Wed, 2021-01-27 at 08:58 +0100, Andreas Krebbel wrote: > On 1/18/21 10:54 PM, Ilya Leoshkevich wrote: > ... > > > +static rtx_insn * > > +s390_md_asm_adjust (vec &outputs, vec &inputs, > > + vec &input_modes, > > + vec &constraints, vec & > > /*clobbers*/, > > +

[RFC] mask out mult expr ctz bits from nonzero bits

2021-01-27 Thread Alexandre Oliva
While looking into the possibility of introducing setmemM patterns on RISC-V to undo the transformation from loops of word writes into memset, I was disappointed to find out that get_nonzero_bits would take into account the range of the length passed to memset, but not the trivially-available obs

Re: [PATCH v2] libstdc++: C++23, implement WG21 P1679R3

2021-01-27 Thread Jonathan Wakely via Gcc-patches
On 27/01/21 12:40 +, Jonathan Wakely wrote: On 15/01/21 01:23 +, Paul Fee via Libstdc++ wrote: Add contains member function to basic_string_view and basic_string. The new method is enabled for -std=gnu++20, gnu++2b and c++2b. This allows users to access the method as a GNU extension to

Re: [PATCH v2] libstdc++: C++23, implement WG21 P1679R3

2021-01-27 Thread Jonathan Wakely via Gcc-patches
On 15/01/21 01:23 +, Paul Fee via Libstdc++ wrote: Add contains member function to basic_string_view and basic_string. The new method is enabled for -std=gnu++20, gnu++2b and c++2b. This allows users to access the method as a GNU extension to C++20. The conditional test may be reduced to "

[RFC] test builtin ratio for loop distribution

2021-01-27 Thread Alexandre Oliva
This patch attempts to fix a libgcc codegen regression introduced in gcc-10, as -ftree-loop-distribute-patterns was enabled at -O2. The ldist pass turns even very short loops into memset calls. E.g., the TFmode emulation calls end with a loop of up to 3 iterations, to zero out trailing words,

Re: follow SSA defs for asan base

2021-01-27 Thread Alexandre Oliva
On Jan 26, 2021, Richard Biener wrote: > So while I think it's safe let's look at if we can improve tree-nested.c, > like I see (probably not the correct place): *nod*, it's just not the *only* place. > seeing how we adjust current_function_decl around the > recompute_tree_invariant_for_addr_ex

Re: [Patch, fortran] PR98573 - Dynamic type lost on assignment

2021-01-27 Thread Thomas Koenig via Gcc-patches
Hi Paul, This is a relatively obvious patch. The chunk in trans-array.c is not part of the fix for the PR but does suppress some of the bad dtype's that arise from allocation of class objects. The part in trans-stmt.c provides vptrs for all class allocations if the expression3 is available. Reg

RE: [PATCH] aarch64: Use RTL builtins for integer mla_n intrinsics

2021-01-27 Thread Kyrylo Tkachov via Gcc-patches
> -Original Message- > From: Jonathan Wright > Sent: 26 January 2021 11:43 > To: gcc-patches@gcc.gnu.org > Cc: Kyrylo Tkachov ; Richard Sandiford > > Subject: [PATCH] aarch64: Use RTL builtins for integer mla_n intrinsics > > Hi, > > As subject, this patch rewrites integer mla_n Neon

[Patch, fortran] PR93924/5 - [OOP] ICE with procedure pointer

2021-01-27 Thread Paul Richard Thomas via Gcc-patches
This patch fixes PRs 93924/5. It is another 'obvious' patch, whose consequences are very limited. I am trying to slip in as many small ready-to-go patches as I can before we go too far into stage 4. It would be nice to have the patch for PR98573 (posted 23rd Jan) OK'd before the end of the week.

Re: [PATCH] i386: Add peephole2 for __atomic_sub_fetch (x, y, z) == 0 [PR98737]

2021-01-27 Thread Ulrich Drepper via Gcc-patches
On 1/27/21 11:37 AM, Jakub Jelinek wrote: > Would equality comparison against 0 handle the most common cases. > > The user can write it as > __atomic_sub_fetch (x, y, z) == 0 > or > __atomic_fetch_sub (x, y, z) - y == 0 > thouch, so the expansion code would need to be able to cope with both. Plea

[PATCH] Add SSA coalescing verification (disabled)

2021-01-27 Thread Richard Biener
This adds a helper to allow verifying of abnormal coalescing at pass boundaries. It helps debugging issues like PR98845 since it's not always obvious where invalid overlapping life ranges of abnormals were introduced. The verifier is expensive so I've added it in a #if 0 block in the usual places

Re: [PATCH] [x86] Fix ICE [PR target/98833]

2021-01-27 Thread Jakub Jelinek via Gcc-patches
On Wed, Jan 27, 2021 at 06:38:23PM +0800, Hongtao Liu wrote: > Yes, and update patch. Ok, thanks. Jakub

Re: [PATCH] [x86] Fix ICE [PR target/98833]

2021-01-27 Thread Hongtao Liu via Gcc-patches
On Wed, Jan 27, 2021 at 6:38 PM Hongtao Liu wrote: > > On Wed, Jan 27, 2021 at 5:03 PM Jakub Jelinek wrote: > > > > On Wed, Jan 27, 2021 at 03:22:45PM +0800, Hongtao Liu wrote: > > > Hi: > > > As desribed in PR, also remove the relevant and useless expanders > > > and builtins, the user can > >

Re: [PATCH] [x86] Fix ICE [PR target/98833]

2021-01-27 Thread Hongtao Liu via Gcc-patches
On Wed, Jan 27, 2021 at 5:03 PM Jakub Jelinek wrote: > > On Wed, Jan 27, 2021 at 03:22:45PM +0800, Hongtao Liu wrote: > > Hi: > > As desribed in PR, also remove the relevant and useless expanders > > and builtins, the user can > > directly use == and >, without calling the builtin function. > >

Re: [PATCH] i386: Add peephole2 for __atomic_sub_fetch (x, y, z) == 0 [PR98737]

2021-01-27 Thread Jakub Jelinek via Gcc-patches
On Wed, Jan 27, 2021 at 11:22:57AM +0100, Uros Bizjak wrote: > > Bootstrapped/regtested on x86_64-linux and i686-linux. Is this ok for trunk > > (as exception), or for GCC 12? > > If there is no urgent need, I'd rather see to obey stage-4 and wait > for gcc-12. There is PR98375 meta bug to track

Re: [PATCH] i386: Add peephole2 for __atomic_sub_fetch (x, y, z) == 0 [PR98737]

2021-01-27 Thread Uros Bizjak via Gcc-patches
On Wed, Jan 27, 2021 at 10:20 AM Jakub Jelinek wrote: > > Hi! > > This patch adds a peephole2 for the optimization requested in the PR, > namely that we emit awful code for __atomic_sub_fetch (x, y, z) == 0 > or __atomic_sub_fetch (x, y, z) != 0 when y is not constant. > This can't be done in the

  1   2   >