[PATCH v2] ipa-cp: Fix PGO regression caused by r278808

2019-12-30 Thread luoxhu
v2 Changes: 1. Enable proportion orig_sum to the new nodes for self recursive node: new_sum = (orig_sum + new_sum) \ * self_recursive_probability * (1 / param_ipa_cp_max_recursive_depth). 2. Add value range for param_ipa_cp_max_recursive_depth. The performance of exchange2 built with PGO wil

Re: [GCC][PATCH][AArch64]Add ACLE intrinsics for dot product (usdot - vector, dot - by element) for AArch64 AdvSIMD ARMv8.6 Extension

2019-12-30 Thread Stam Markianos-Wright
On 12/20/19 2:13 PM, Richard Sandiford wrote: > Stam Markianos-Wright writes: >> diff --git a/gcc/config/aarch64/aarch64-simd.md >> b/gcc/config/aarch64/aarch64-simd.md >> index >> ad4676bc167f08951e693916c7ef796e3501762a..eba71f004ef67af654f9c512b720aa6cfdd1d7fc >> 100644 >> --- a/gcc/config

Re: [GCC][PATCH][AArch64]Add ACLE intrinsics for bfdot for ARMv8.6 Extension

2019-12-30 Thread Stam Markianos-Wright
On 12/20/19 2:36 PM, Richard Sandiford wrote: > Stam Markianos-Wright writes: >> Hi all, >> >> This patch adds the ARMv8.6 Extension ACLE intrinsics for the bfloat bfdot >> operation. >> >> The functions are declared in arm_neon.h with the armv8.2-a+bf16 target >> option >> as required. >> >> R

Re: [GCC][PATCH][AArch64]Add ACLE intrinsics for dot product (usdot - vector, dot - by element) for AArch64 AdvSIMD ARMv8.6 Extension

2019-12-30 Thread Richard Sandiford
Stam Markianos-Wright writes: > On 12/20/19 2:13 PM, Richard Sandiford wrote: >> Stam Markianos-Wright writes: >>> +**... >>> +**ret >>> +*/ >>> +int32x2_t ufoo (int32x2_t r, uint8x8_t x, int8x8_t y) >>> +{ >>> + return vusdot_s32 (r, x, y); >>> +} >>> + >> >> If we're using check-function-bodie

Re: [GCC][PATCH][AArch64]Add ACLE intrinsics for bfdot for ARMv8.6 Extension

2019-12-30 Thread Richard Sandiford
Stam Markianos-Wright writes: > diff --git a/gcc/config/aarch64/aarch64-simd.md > b/gcc/config/aarch64/aarch64-simd.md > index > adfda96f077075ad53d4bea2919c4d3b326e49f5..7587bc46ba1c80389ea49fa83a0e6f8a489711e9 > 100644 > --- a/gcc/config/aarch64/aarch64-simd.md > +++ b/gcc/config/aarch64/aarc

[wwwdocs] Add GCC10 IPA/LTO changes

2019-12-30 Thread Jan Hubicka
Hi, here are some of changes of LTO/IPA done in GCC10. There is also recursive cloning and some other stuff I will add incrementally as well as some data on overall compile time/memory use improvements as we reported in past years. I am still running tests and fixing bugs in this area. Honza dif

Fix SSA update when vectorisation adds a vdef to a read-only loop

2019-12-30 Thread Richard Sandiford
This patch fixes an awkward corner case in which: (a) we apply if-conversion to a loop; (b) the original scalar loop doesn't have a vdef, and thus doesn't need a virtual phi; (c) the vectorised main loop does need a vdef and a virtual phi (see below); (d) we also vectorise the epilogue; and

Re: Fix SSA update when vectorisation adds a vdef to a read-only loop

2019-12-30 Thread Richard Biener
On December 30, 2019 3:15:14 PM GMT+01:00, Richard Sandiford wrote: >This patch fixes an awkward corner case in which: > >(a) we apply if-conversion to a loop; > >(b) the original scalar loop doesn't have a vdef, and thus doesn't >need a virtual phi; > >(c) the vectorised main loop does need

[PATCH] Fix PR libgcc/92988

2019-12-30 Thread John David Anglin
Tested on hppa64-hp-hpux11.11. Okay? Dave 2019-12-30 John David Anglin PR libgcc/92988 * crtstuff.c (__do_global_dtors_aux): Only call __cxa_finalize if DEFAULT_USE_CXA_ATEXIT is true. Index: crtstuff.c

Re: [PATCH] rs6000: Fix PR92923, __builtin_vec_xor() causes subregs to be used when not using V4SImode vectors

2019-12-30 Thread Peter Bergner
On 12/20/19 12:20 PM, Peter Bergner wrote: >> On what kind of system did you test? >> >> I'd like to see this tested on both BE and LE, and various processor >> generations -- but we'll see if it regresses anyway, and it is still >> stage 3. So, okay for trunk, just please keep an eye out for >> r

[PATCH] Fix ICE caused by swallowing a token in c_parser_consume_token

2019-12-30 Thread Kerem Kat
Hi, This patch fixes ICE on invalid code, specifically files that have conflict-marker-like signs before EOF. 2019-12-30 Kerem Kat PR c/92833 gcc/ * c-parser.c (c_parser_consume_token): Fix peeked token stack pop to support 4 available tokens. * c-c++-common/pr92833-*.c: New test

[committed] Fix compilation of libgomp/target.c on hppa*-*-hpux11.*

2019-12-30 Thread John David Anglin
A recent change to target.c introduced the UINTPTR_MAX define. This exposed the fact that defines for INTPTR_MAX and UINTPTR_MAX are incomplete, and the define for SIZE_MAX is missing on hpux11.11. See: https://community.hpe.com/t5/Languages-and-Scripting/SIZE-MAX-undefined/td-p/4798688# The a

Re: [PATCH v2] ipa-cp: Fix PGO regression caused by r278808

2019-12-30 Thread Feng Xue OS
One comment: it's better to allow zero value for param_ipa_cp_max_recursive_depth, this can be used to disable recursive cloning. Feng From: luoxhu Sent: Monday, December 30, 2019 4:11 PM To: Jan Hubicka; Martin Jambor Cc: Martin Liška; gcc-patches@gcc.gn

Re: [PATCH v2] ipa-cp: Fix PGO regression caused by r278808

2019-12-30 Thread luoxhu
On 2019/12/31 14:43, Feng Xue OS wrote: One comment: it's better to allow zero value for param_ipa_cp_max_recursive_depth, this can be used to disable recursive cloning. Thanks, "1" means no recursive cloning but only constant propagation from caller to callee in your code? ipa-cp.c, line 2