Re: [PATCH V5 1/2] Add overflow API for plus minus mult on range

2023-08-30 Thread guojiufu via Gcc-patches
On 2023-08-03 21:18, Andrew MacLeod wrote: This is OK. Thanks a lot! Committed via r14-3582. BR, Jeff (Jiufu Guo) On 8/2/23 22:18, Jiufu Guo wrote: Hi, I would like to have a ping on this patch. BR, Jeff (Jiufu Guo) Jiufu Guo writes: Hi, As discussed in previous reviews, adding

Ping^^ [PATCH V5 2/2] Optimize '(X - N * M) / N' to 'X / N - M' if valid

2023-08-22 Thread guojiufu via Gcc-patches
Hi, I would like to have a gentle ping... BR, Jeff (Jiufu Guo) On 2023-08-07 10:45, guojiufu via Gcc-patches wrote: Hi, Gentle ping... On 2023-07-18 22:05, Jiufu Guo wrote: Hi, Integer expression "(X - N * M) / N" can be optimized to "X / N - M" if there is no wrap/ov

Re: [PATCH V4 1/4] rs6000: build constant via li;rotldi

2023-08-17 Thread guojiufu via Gcc-patches
Hi Segher, As discussed on "~" vs. "-", "~" is correct for this patch. I updated the patch according to Kewen's comments. If ok, I would commit to trunk. BR, Jeff (Jiufu Guo) On 2023-07-04 11:28, Kewen.Lin wrote: Hi Jeff, on 2023/7/4 10:18, Jiufu Guo via Gcc-patches wrote: Hi, If a c

Re: [PATCH V5 2/2] Optimize '(X - N * M) / N' to 'X / N - M' if valid

2023-08-06 Thread guojiufu via Gcc-patches
Hi, Gentle ping... On 2023-07-18 22:05, Jiufu Guo wrote: Hi, Integer expression "(X - N * M) / N" can be optimized to "X / N - M" if there is no wrap/overflow/underflow and "X - N * M" has the same sign with "X". Compare the previous version: https://gcc.gnu.org/pipermail/gcc-patches/2023-J

Re: [RFC] light expander sra for parameters and returns

2023-08-02 Thread guojiufu via Gcc-patches
On 2023-08-02 20:41, Richard Biener wrote: On Tue, 1 Aug 2023, Jiufu Guo wrote: Hi, Richard Biener writes: > On Mon, 24 Jul 2023, Jiufu Guo wrote: > >> >> Hi Martin, >> >> Not sure about your current option about re-using the ipa-sra code >> in the light-expander-sra. And if anything I coul

Re: [PATCH 4/4] rs6000: build constant via li/lis;rldic

2023-06-15 Thread guojiufu via Gcc-patches
On 2023-06-13 17:18, Jiufu Guo via Gcc-patches wrote: Hi David, Thanks for your valuable comments! David Edelsohn writes: ... Do you have any measurement of how expensive it is to test all of these additional methods to generate a constant? How much does this affect the compile time?

Re: [PATCH] Make sure SCALAR_INT_MODE_P before invoke try_const_anchors

2023-06-09 Thread guojiufu via Gcc-patches
Hi, On 2023-06-09 16:00, Richard Biener wrote: On Fri, 9 Jun 2023, Jiufu Guo wrote: Hi, As checking the code, there is a "gcc_assert (SCALAR_INT_MODE_P (mode))" in "try_const_anchors". This assert seems correct because the function try_const_anchors cares about integer values currently, and

Re: [PATCH V5] Use reg mode to move sub blocks for parameters and returns

2023-06-06 Thread guojiufu via Gcc-patches
Hi, On 2023-06-05 00:59, Jeff Law wrote: On 5/9/23 07:43, Jiufu Guo wrote: Thanks for point out this! Yes, BLKmode rtx may not always be a MEM. MEM_SIZE is only ok for MEM after the it's known size is computed. Here MEM_SIZE is fine just because it is an stack rtx corresponding to the type of

Re: [PATCH V4 2/2] rs6000: use li;x?oris to build constant

2023-05-16 Thread guojiufu via Gcc-patches
Hi, On 2023-05-15 14:53, Kewen.Lin wrote: Hi Jeff, on 2022/12/12 09:38, Jiufu Guo wrote: Hi, For constant C: If '(c & 0xULL) == 0x' or say: 32(1) || 1(0) || 15(x) || 16(0), we could use "lis; xoris" to build. Here N(M) means N continuous bit M, x for M means i

Re: [PATCH V3] rs6000: Load high and low part of 64bit constant independently

2023-05-07 Thread guojiufu via Gcc-patches
Hi, On 2023-04-26 17:35, Kewen.Lin wrote: Hi Jeff, on 2023/1/4 14:51, Jiufu Guo wrote: Hi, Compare with previous version, this patch updates the comments only. https://gcc.gnu.org/pipermail/gcc-patches/2022-December/608293.html For a complicate 64bit constant, below is one instruction-sequen

Re: [PATCH V5] Use reg mode to move sub blocks for parameters and returns

2023-05-03 Thread guojiufu via Gcc-patches
Hi, On 2023-05-01 23:52, Segher Boessenkool wrote: Hi! On Fri, Mar 17, 2023 at 11:39:52AM +0800, Jiufu Guo wrote: gcc/testsuite/ChangeLog: * gcc.target/powerpc/pr65421-1.c: New test. * gcc.target/powerpc/pr65421.c: New test. Please name the tests something else? -1.c and -2

Re: [PATCH V5] Use reg mode to move sub blocks for parameters and returns

2023-05-03 Thread guojiufu via Gcc-patches
Hi, On 2023-05-01 03:00, Jeff Law wrote: On 3/16/23 21:39, Jiufu Guo wrote: Hi, When assigning a parameter to a variable, or assigning a variable to return value with struct type, and the parameter/return is passed through registers. For this kind of case, it would be better to use the nature

Re: [PATCH] PR testsuite/106879 FAIL: gcc.dg/vect/bb-slp-layout-19.c on powerpc64

2023-04-19 Thread guojiufu via Gcc-patches
Hi Kewen, On 2023-04-19 10:53, Kewen.Lin wrote: Hi Jeff, on 2023/4/19 10:03, Jiufu Guo wrote: Hi, On P7, option -mno-allow-movmisalign is added during testing, which prevents slp happen on the case. Like Like PR65484 and PR87306, this patch use vect_hw_misalig to guard Dup like...

Re: [PATCH] testsuite: update builtins-5-p9-runnable.c for BE

2023-04-16 Thread guojiufu via Gcc-patches
On 2023-04-14 17:09, Kewen.Lin wrote: Hi Jeff, on 2023/4/14 16:01, guojiufu wrote: On 2023-04-14 15:30, Jiufu Guo wrote: Hi, As PR108809 mentioned, vec_xl_len_r and vec_xst_len_r are tested in gcc.target/powerpc/builtins-5-p9-runnable.c. The vector operand of these two bifs are different from

Re: [PATCH] testsuite: update builtins-5-p9-runnable.c for BE

2023-04-14 Thread guojiufu via Gcc-patches
On 2023-04-14 15:30, Jiufu Guo wrote: Hi, As PR108809 mentioned, vec_xl_len_r and vec_xst_len_r are tested in gcc.target/powerpc/builtins-5-p9-runnable.c. The vector operand of these two bifs are different from the view of v16_int8 between BE and LE, even it is same from the view of 128bits(uint

Re: [PATCH] testsuite: filter out warning noise for CWE-1341 test

2023-04-13 Thread guojiufu via Gcc-patches
Hi, On 2023-04-13 20:08, Segher Boessenkool wrote: On Thu, Apr 13, 2023 at 07:39:01AM +, Richard Biener wrote: On Thu, 13 Apr 2023, Jiufu Guo wrote: I think this should be fixed in the analyzer, "stripping" malloc tracking from fopen/fclose since it does this manually. I've adjusted the bu

Re: [PATCH] testsuite: update requires for powerpc/float128-cmp2-runnable.c

2023-04-13 Thread guojiufu via Gcc-patches
Hi, On 2023-04-12 20:47, Kewen.Lin wrote: Hi Segher & Jeff, on 2023/4/11 23:13, Segher Boessenkool wrote: On Tue, Apr 11, 2023 at 05:40:09PM +0800, Kewen.Lin wrote: on 2023/4/11 17:14, guojiufu wrote: Thanks for raising this concern. The behavior to check about bif on FLOAT128_HW and emit an

Re: [PATCH] testsuite: update requires for powerpc/float128-cmp2-runnable.c

2023-04-12 Thread guojiufu via Gcc-patches
Hi Mike, On 2023-04-12 22:46, Michael Meissner wrote: On Wed, Apr 12, 2023 at 01:31:46PM +0800, Jiufu Guo wrote: I understand that QP insns (e.g. xscmpexpqp) is valid if the system meets ISA3.0, no matter BE/LE, 32-bit/64-bit. I think option -mfloat128-hardware is designed for QP insns. While

Re: [PATCH] testsuite: update requires for powerpc/float128-cmp2-runnable.c

2023-04-11 Thread guojiufu via Gcc-patches
Hi Kewen, Thanks a lot for your very helpful comments! On 2023-04-10 17:26, Kewen.Lin wrote: Hi Jeff, on 2023/4/10 10:09, Jiufu Guo via Gcc-patches wrote: Hi, In this test case (float128-cmp2-runnable.c), the instruction xscmpexpqp is used to support a few builtins e.g. __builtin_vsx_scalar_

Re: [PATCH] loading float member of parameter stored via int registers

2022-12-21 Thread guojiufu via Gcc-patches
Hi, On 2022-12-21 15:30, Richard Biener wrote: On Wed, 21 Dec 2022, Jiufu Guo wrote: Hi, This patch is fixing an issue about parameter accessing if the parameter is struct type and passed through integer registers, and there is floating member is accessed. Like below code: typedef struct DF

Re: [PATCH 2/3]rs6000: NFC use sext_hwi to replace ((v&0xf..f)^0x80..0) - 0x80..0

2022-12-01 Thread guojiufu via Gcc-patches
On 2022-12-01 15:10, Jiufu Guo via Gcc-patches wrote: Hi Kewen, 在 12/1/22 2:11 PM, Kewen.Lin 写道: on 2022/12/1 13:35, Jiufu Guo wrote: Hi Kewen, Thanks for your quick and insight review! 在 12/1/22 1:17 PM, Kewen.Lin 写道: Hi Jeff, on 2022/12/1 09:36, Jiufu Guo wrote: Hi, This patch just use

Re: [PATCH] Check if loading const from mem is faster

2022-02-23 Thread guojiufu via Gcc-patches
On 2/22/22 PM3:26, Richard Biener wrote: On Tue, 22 Feb 2022, Jiufu Guo wrote: Hi, For constants, there are some codes to check: if it is able to put to instruction as an immediate operand or it is profitable to load from mem. There are still some places that could be improved for platform

Re: [PATCH] Check if loading const from mem is faster

2022-02-22 Thread guojiufu via Gcc-patches
On 2022-02-23 01:30, Segher Boessenkool wrote: Hi Jiu Fu, On Tue, Feb 22, 2022 at 02:53:13PM +0800, Jiufu Guo wrote: static bool rs6000_cannot_force_const_mem (machine_mode mode ATTRIBUTE_UNUSED, rtx x) { - if (GET_CODE (x) == HIGH - && GET_CODE (XEXP (x, 0)) == UNSPEC) + if (GET_CO

Re: [PATCH] disable aggressive_loop_optimizations until niter ready

2022-01-13 Thread guojiufu via Gcc-patches
On 2022-01-03 22:30, Richard Biener wrote: On Wed, 22 Dec 2021, Jiufu Guo wrote: Hi, Normaly, estimate_numbers_of_iterations get/caculate niter first, and then invokes infer_loop_bounds_from_undefined. While in some case, after a few call stacks, estimate_numbers_of_iterations is invoked befo

Re: [RFC] Overflow check in simplifying exit cond comparing two IVs.

2021-10-27 Thread guojiufu via Gcc-patches
I just had a test on ppc64le, this patch pass bootstrap and regtest. Is this patch OK for trunk? Thanks for any comments. BR, Jiufu On 2021-10-18 21:37, Jiufu Guo wrote: With reference the discussions in: https://gcc.gnu.org/pipermail/gcc-patches/2021-July/574334.html https://gcc.gnu.org/pip

Re: [PATCH] Use fold_build2 instead fold_binary for TRUTH_AND

2021-10-19 Thread guojiufu via Gcc-patches
On 2021-10-20 10:44, Andrew Pinski wrote: On Tue, Oct 19, 2021 at 7:30 PM Jiufu Guo via Gcc-patches wrote: In tree_simplify_using_condition_1, there is code which should be logic: "op0 || op1"/"op0 && op1". When creating expression for TRUTH_OR_EXPR and TRUTH_AND_EXPR, fold_build2 would be

Re: [PATCH] testsuite: Fix gcc.dg/vect/pr101145* tests [PR101145]

2021-08-31 Thread guojiufu via Gcc-patches
On 2021-08-31 20:12, Jakub Jelinek wrote: Hi! I'm getting: FAIL: gcc.dg/vect/pr101145.c scan-tree-dump-times vect "vectorized 1 loops" 7 FAIL: gcc.dg/vect/pr101145_1.c scan-tree-dump-times vect "vectorized 1 loops" 2 FAIL: gcc.dg/vect/pr101145_2.c scan-tree-dump-times vect "vectorized 1 loops

Re: [PATCH] Set bound/cmp/control for until wrap loop.

2021-08-30 Thread guojiufu via Gcc-patches
On 2021-08-30 20:02, Richard Biener wrote: On Mon, 30 Aug 2021, guojiufu wrote: On 2021-08-30 14:15, Jiufu Guo wrote: > Hi, > > In patch r12-3136, niter->control, niter->bound and niter->cmp are > derived from number_of_iterations_lt. While for 'until wrap condition', > the calculation in numb

Re: [PATCH] Set bound/cmp/control for until wrap loop.

2021-08-29 Thread guojiufu via Gcc-patches
On 2021-08-30 14:15, Jiufu Guo wrote: Hi, In patch r12-3136, niter->control, niter->bound and niter->cmp are derived from number_of_iterations_lt. While for 'until wrap condition', the calculation in number_of_iterations_lt is not align the requirements on the define of them and requirements

Re: Ping: [PATCH v2] Analyze niter for until-wrap condition [PR101145]

2021-08-24 Thread guojiufu via Gcc-patches
On 2021-08-16 09:33, Bin.Cheng wrote: On Wed, Aug 4, 2021 at 10:42 AM guojiufu wrote: ... >> diff --git a/gcc/testsuite/gcc.dg/vect/pr101145.inc >> b/gcc/testsuite/gcc.dg/vect/pr101145.inc >> new file mode 100644 >> index 000..6eed3fa8aca >> --- /dev/null >> +++ b/gcc/testsuite/gcc.d

Ping: [PATCH v2] Analyze niter for until-wrap condition [PR101145]

2021-08-03 Thread guojiufu via Gcc-patches
Hi, I would like to have a ping on this. https://gcc.gnu.org/pipermail/gcc-patches/2021-July/574596.html BR, Jiufu On 2021-07-15 08:17, guojiufu via Gcc-patches wrote: Hi, I would like to have an early ping on this with more mail addresses. BR, Jiufu. On 2021-07-07 20:47, Jiufu Guo wrote

Re: [PATCH V3] Use preferred mode for doloop IV [PR61837]

2021-07-28 Thread guojiufu via Gcc-patches
On 2021-07-27 23:40, Jeff Law wrote: On 7/27/2021 12:27 AM, Richard Biener wrote: On Fri, 23 Jul 2021, Jeff Law wrote: On 7/15/2021 4:08 AM, Jiufu Guo via Gcc-patches wrote: Refine code for V2 according to review comments: * Use if check instead assert, and refine assert * Use better RE chec

Re: [RFC] more no-wrap conditions for IV analyzing and scev

2021-07-22 Thread guojiufu via Gcc-patches
On 2021-06-21 20:36, Richard Biener wrote: On Mon, 21 Jun 2021, guojiufu wrote: On 2021-06-21 14:19, guojiufu via Gcc-patches wrote: > On 2021-06-09 19:18, guojiufu wrote: >> On 2021-06-09 17:42, guojiufu via Gcc-patches wrote: >>> On 2021-06-08 18:13, Richard Biener wrote:

Re: [PATCH V2] Use preferred mode for doloop iv [PR61837].

2021-07-15 Thread guojiufu via Gcc-patches
On 2021-07-15 14:06, Richard Biener wrote: On Tue, 13 Jul 2021, Jiufu Guo wrote: Major changes from v1: * Add target hook to query preferred doloop mode. * Recompute doloop iv base from niter under preferred mode. Currently, doloop.xx variable is using the type as niter which may shorter than

Re: [PATCH V2] Use preferred mode for doloop iv [PR61837].

2021-07-14 Thread guojiufu via Gcc-patches
On 2021-07-15 02:04, Segher Boessenkool wrote: Hi! On Wed, Jul 14, 2021 at 06:26:28PM +0800, guojiufu wrote: PR target/61837 Wrong PR number? There is a patch optimize "add -1; zero_ext; add +1" to "zero_ext" already. Having this patch would help to avoid the left 'zero_ext', so, I

Re: [PATCH v2] Analyze niter for until-wrap condition [PR101145]

2021-07-14 Thread guojiufu via Gcc-patches
Hi, I would like to have an early ping on this with more mail addresses. BR, Jiufu. On 2021-07-07 20:47, Jiufu Guo wrote: Changes since v1: * Update assumptions for niter, add more test cases check * Use widest_int/wide_int instead mpz to do +-/ * Move some early check for quick return For co

Re: [PATCH V2] Use preferred mode for doloop iv [PR61837].

2021-07-14 Thread guojiufu via Gcc-patches
On 2021-07-14 12:40, guojiufu via Gcc-patches wrote: Updated the patch as below: Thanks for comments. gcc/ChangeLog: 2021-07-13 Jiufu Guo PR target/61837 * config/rs6000/rs6000.c (TARGET_PREFERRED_DOLOOP_MODE): New hook. (rs6000_preferred_doloop_mode): New hook

Re: [PATCH V2] Use preferred mode for doloop iv [PR61837].

2021-07-13 Thread guojiufu via Gcc-patches
On 2021-07-14 04:50, Segher Boessenkool wrote: Hi! On Tue, Jul 13, 2021 at 08:50:46PM +0800, Jiufu Guo wrote: * doc/tm.texi: Regenerated. Pet peeve: "Regenerate.", no "d". Ok, yeap. While, 'Regenerate and Regenerated' were used by commits somewhere :) +DEFHOOK +(preferred_dolo

Re: [PATCH] Check type size for doloop iv on BITS_PER_WORD [PR61837]

2021-07-13 Thread guojiufu via Gcc-patches
On 2021-07-13 23:38, Segher Boessenkool wrote: On Mon, Jul 12, 2021 at 08:20:14AM +0200, Richard Biener wrote: On Fri, 9 Jul 2021, Segher Boessenkool wrote: > Almost all targets just use Pmode, but there is no such guarantee I > think, and esp. some targets that do not have machine insns for thi

Re: [PATCH] Check type size for doloop iv on BITS_PER_WORD [PR61837]

2021-07-13 Thread guojiufu via Gcc-patches
On 2021-07-13 23:51, Segher Boessenkool wrote: On Tue, Jul 13, 2021 at 10:09:25AM +0800, guojiufu wrote: >For loop looks like: > do ; > while (n-- > 0); /* while (n-- > low); */ (This whole loop as written will be optimised away, but :-) ) At -O2, the loop is optimized away. At -O1, the lo

Re: [PATCH] Check type size for doloop iv on BITS_PER_WORD [PR61837]

2021-07-13 Thread guojiufu via Gcc-patches
On 2021-07-13 15:09, Richard Biener wrote: On Tue, 13 Jul 2021, guojiufu wrote: On 2021-07-12 23:53, guojiufu via Gcc-patches wrote: > On 2021-07-12 22:46, Richard Biener wrote: >> On Mon, 12 Jul 2021, guojiufu wrote: >> >>> On 2021-07-12 18:02, Richard Biener wrote: &

Re: [PATCH] Check type size for doloop iv on BITS_PER_WORD [PR61837]

2021-07-12 Thread guojiufu via Gcc-patches
On 2021-07-12 23:53, guojiufu via Gcc-patches wrote: On 2021-07-12 22:46, Richard Biener wrote: On Mon, 12 Jul 2021, guojiufu wrote: On 2021-07-12 18:02, Richard Biener wrote: > On Mon, 12 Jul 2021, guojiufu wrote: > >> On 2021-07-12 16:57, Richard Biener wrote: >> >

Re: [PATCH] Check type size for doloop iv on BITS_PER_WORD [PR61837]

2021-07-12 Thread guojiufu via Gcc-patches
On 2021-07-12 23:53, guojiufu via Gcc-patches wrote: On 2021-07-12 22:46, Richard Biener wrote: On Mon, 12 Jul 2021, guojiufu wrote: On 2021-07-12 18:02, Richard Biener wrote: > On Mon, 12 Jul 2021, guojiufu wrote: > >> On 2021-07-12 16:57, Richard Biener wrote: >> >

Re: [PATCH] Check type size for doloop iv on BITS_PER_WORD [PR61837]

2021-07-12 Thread guojiufu via Gcc-patches
On 2021-07-12 22:46, Richard Biener wrote: On Mon, 12 Jul 2021, guojiufu wrote: On 2021-07-12 18:02, Richard Biener wrote: > On Mon, 12 Jul 2021, guojiufu wrote: > >> On 2021-07-12 16:57, Richard Biener wrote: >> > On Mon, 12 Jul 2021, guojiufu wrote: >> > >> >> On 2021-07-12 14:20, Richard Bie

Re: [PATCH] Check type size for doloop iv on BITS_PER_WORD [PR61837]

2021-07-12 Thread guojiufu via Gcc-patches
On 2021-07-12 18:02, Richard Biener wrote: On Mon, 12 Jul 2021, guojiufu wrote: On 2021-07-12 16:57, Richard Biener wrote: > On Mon, 12 Jul 2021, guojiufu wrote: > >> On 2021-07-12 14:20, Richard Biener wrote: >> > On Fri, 9 Jul 2021, Segher Boessenkool wrote: >> > >> >> On Fri, Jul 09, 2021 at

Re: [PATCH] Check type size for doloop iv on BITS_PER_WORD [PR61837]

2021-07-12 Thread guojiufu via Gcc-patches
On 2021-07-12 16:57, Richard Biener wrote: On Mon, 12 Jul 2021, guojiufu wrote: On 2021-07-12 14:20, Richard Biener wrote: > On Fri, 9 Jul 2021, Segher Boessenkool wrote: > >> On Fri, Jul 09, 2021 at 08:43:59AM +0200, Richard Biener wrote: >> > I wonder if there's a way to query the target what

Re: [PATCH] Check type size for doloop iv on BITS_PER_WORD [PR61837]

2021-07-12 Thread guojiufu via Gcc-patches
On 2021-07-12 14:20, Richard Biener wrote: On Fri, 9 Jul 2021, Segher Boessenkool wrote: On Fri, Jul 09, 2021 at 08:43:59AM +0200, Richard Biener wrote: > I wonder if there's a way to query the target what modes the doloop > pattern can handle (not being too familiar with the doloop code). You

Re: Ping: [PATCH 1/2] correct BB frequencies after loop changed

2021-07-04 Thread guojiufu via Gcc-patches
scan-rtl-dump-times "internal loop alignment added" 1 "alignments"} } */ + -- 2.17.1 On 2021-06-18 16:24, guojiufu via Gcc-patches wrote: On 2021-06-15 12:57, guojiufu via Gcc-patches wrote: On 2021-06-14 17:16, Jan Hubicka wrote: On 5/6/2021 8:36 PM, guojiufu via Gcc

Re: [PATCH] Analyze niter for until-wrap condition [PR101145]

2021-07-02 Thread guojiufu via Gcc-patches
On 2021-07-01 20:35, Richard Biener wrote: On Thu, 1 Jul 2021, Jiufu Guo wrote: For code like: unsigned foo(unsigned val, unsigned start) { unsigned cnt = 0; for (unsigned i = start; i > val; ++i) cnt++; return cnt; } The number of iterations should be about UINT_MAX - start. For

Re: [PATCH] Analyze niter for until-wrap condition [PR101145]

2021-07-01 Thread guojiufu via Gcc-patches
On 2021-07-02 08:51, Bin.Cheng wrote: On Thu, Jul 1, 2021 at 10:15 PM guojiufu via Gcc-patches wrote: On 2021-07-01 20:35, Richard Biener wrote: > On Thu, 1 Jul 2021, Jiufu Guo wrote: > >> For code like: >> unsigned foo(unsigned val, unsigned start) >> { >>

Re: [PATCH] Analyze niter for until-wrap condition [PR101145]

2021-07-01 Thread guojiufu via Gcc-patches
On 2021-07-01 20:35, Richard Biener wrote: On Thu, 1 Jul 2021, Jiufu Guo wrote: For code like: unsigned foo(unsigned val, unsigned start) { unsigned cnt = 0; for (unsigned i = start; i > val; ++i) cnt++; return cnt; } The number of iterations should be about UINT_MAX - start. For

Re: [PATCH] Analyze niter for until-wrap condition [PR101145]

2021-07-01 Thread guojiufu via Gcc-patches
On 2021-07-01 15:22, Bin.Cheng wrote: On Thu, Jul 1, 2021 at 10:06 AM Jiufu Guo via Gcc-patches wrote: For code like: unsigned foo(unsigned val, unsigned start) { unsigned cnt = 0; for (unsigned i = start; i > val; ++i) cnt++; return cnt; } The number of iterations should be about U

Re: [PATCH V3] Split loop for NE condition.

2021-06-21 Thread guojiufu via Gcc-patches
On 2021-06-21 16:51, Richard Biener wrote: On Wed, 9 Jun 2021, guojiufu wrote: On 2021-06-09 17:42, guojiufu via Gcc-patches wrote: > On 2021-06-08 18:13, Richard Biener wrote: >> On Fri, 4 Jun 2021, Jiufu Guo wrote: >> > cut... >>> + gcond *cond = as_a (last)

[RFC] New idea to split loop based on no-wrap conditions

2021-06-21 Thread guojiufu via Gcc-patches
On 2021-06-21 14:19, guojiufu via Gcc-patches wrote: On 2021-06-09 19:18, guojiufu wrote: On 2021-06-09 17:42, guojiufu via Gcc-patches wrote: On 2021-06-08 18:13, Richard Biener wrote: On Fri, 4 Jun 2021, Jiufu Guo wrote: cut... cut... Besides the method in the previous mails,  I’m

Re: [PATCH V3] Split loop for NE condition.

2021-06-20 Thread guojiufu via Gcc-patches
On 2021-06-09 19:18, guojiufu wrote: On 2021-06-09 17:42, guojiufu via Gcc-patches wrote: On 2021-06-08 18:13, Richard Biener wrote: On Fri, 4 Jun 2021, Jiufu Guo wrote: cut... cut... Here is the updated patch, thanks for your time! Updates: . Enhance code to support negative step

Re: Ping: [PATCH 1/2] correct BB frequencies after loop changed

2021-06-18 Thread guojiufu via Gcc-patches
On 2021-06-15 12:57, guojiufu via Gcc-patches wrote: On 2021-06-14 17:16, Jan Hubicka wrote: On 5/6/2021 8:36 PM, guojiufu via Gcc-patches wrote: > Gentle ping. > > Original message: > https://gcc.gnu.org/pipermail/gcc-patches/2020-October/555871.html I think you need a more agg

Re: Ping: [PATCH 1/2] correct BB frequencies after loop changed

2021-06-16 Thread guojiufu via Gcc-patches
On 2021-06-15 12:57, guojiufu via Gcc-patches wrote: On 2021-06-14 17:16, Jan Hubicka wrote: On 5/6/2021 8:36 PM, guojiufu via Gcc-patches wrote: > Gentle ping. > > Original message: > https://gcc.gnu.org/pipermail/gcc-patches/2020-October/555871.html I think you need a more agg

Re: Ping: [PATCH 1/2] correct BB frequencies after loop changed

2021-06-14 Thread guojiufu via Gcc-patches
On 2021-06-14 17:16, Jan Hubicka wrote: On 5/6/2021 8:36 PM, guojiufu via Gcc-patches wrote: > Gentle ping. > > Original message: > https://gcc.gnu.org/pipermail/gcc-patches/2020-October/555871.html I think you need a more aggressive ping  :-) OK for the trunk.  Sorry for the lo

Re: [PATCH V3] Split loop for NE condition.

2021-06-09 Thread guojiufu via Gcc-patches
On 2021-06-09 17:42, guojiufu via Gcc-patches wrote: On 2021-06-08 18:13, Richard Biener wrote: On Fri, 4 Jun 2021, Jiufu Guo wrote: cut... + gcond *cond = as_a (last); + enum tree_code code = gimple_cond_code (cond); + if (!(code == NE_EXPR + || (code == EQ_EXPR

Re: [PATCH V3] Split loop for NE condition.

2021-06-09 Thread guojiufu via Gcc-patches
On 2021-06-08 18:13, Richard Biener wrote: On Fri, 4 Jun 2021, Jiufu Guo wrote: cut... + gcond *cond = as_a (last); + enum tree_code code = gimple_cond_code (cond); + if (!(code == NE_EXPR + || (code == EQ_EXPR && (e->flags & EDGE_TRUE_VALUE The NE_EXPR check mis

Re: Ping^2: [PATCH 1/2] correct BB frequencies after loop changed

2021-06-06 Thread guojiufu via Gcc-patches
Gentle ping ;) BR. Jiufu Guo On 2021-05-20 15:19, guojiufu via Gcc-patches wrote: Gentle ping^. On 2021-05-07 10:36, guojiufu via Gcc-patches wrote: Gentle ping. Original message: https://gcc.gnu.org/pipermail/gcc-patches/2020-October/555871.html Thanks, Jiufu Guo.

Re: [PATCH V2] Split loop for NE condition.

2021-06-01 Thread guojiufu via Gcc-patches
On 2021-06-01 11:28, guojiufu via Gcc-patches wrote: On 2021-05-26 17:50, Richard Biener wrote: On Mon, 17 May 2021, Jiufu Guo wrote: Or relax all this, of course. It would easy to handle the above cases: e->src before latch, or simple header. To relax this, we may need to p

Re: [PATCH V2] Split loop for NE condition.

2021-05-31 Thread guojiufu via Gcc-patches
On 2021-05-26 17:50, Richard Biener wrote: On Mon, 17 May 2021, Jiufu Guo wrote: ... while (++k > n) a[k] = b[k] + 1; then for the second loop, it could be optimized. Btw, I think even the first loop should be vectorized. I see we do not handle it in niter analysis: Analyzing loop

Re: [PATCH] go/100537 - Bootstrap-O3 and bootstrap-debug fail

2021-05-20 Thread guojiufu via Gcc-patches
On 2021-05-18 14:58, Richard Biener wrote: On Mon, 17 May 2021, Ian Lance Taylor wrote: On Mon, May 17, 2021 at 1:17 AM Richard Biener via Gcc-patches wrote: > > On Fri, May 14, 2021 at 11:19 AM guojiufu via Gcc-patches > wrote: > > > > On 2021-05-14 15:39, guojiufu

Ping^1: [PATCH 1/2] correct BB frequencies after loop changed

2021-05-20 Thread guojiufu via Gcc-patches
Gentle ping^. On 2021-05-07 10:36, guojiufu via Gcc-patches wrote: Gentle ping. Original message: https://gcc.gnu.org/pipermail/gcc-patches/2020-October/555871.html Thanks, Jiufu Guo.

Re: [PATCH] go/100537 - Bootstrap-O3 and bootstrap-debug fail

2021-05-18 Thread guojiufu via Gcc-patches
On 2021-05-18 14:58, Richard Biener wrote: On Mon, 17 May 2021, Ian Lance Taylor wrote: On Mon, May 17, 2021 at 1:17 AM Richard Biener via Gcc-patches wrote: > > On Fri, May 14, 2021 at 11:19 AM guojiufu via Gcc-patches > wrote: > > > > On 2021-05-14 15:39, guojiufu

Re: [PATCH V2] Split loop for NE condition.

2021-05-18 Thread guojiufu via Gcc-patches
On 2021-05-18 18:32, guojiufu wrote: On 2021-05-18 17:28, guojiufu via Gcc-patches wrote: On 2021-05-18 14:36, Bernd Edlinger wrote: On 5/17/21 4:01 AM, Jiufu Guo via Gcc-patches wrote: When there is the possibility that overflow/wrap may happen on the loop index, a few optimizations would

Re: [PATCH V2] Split loop for NE condition.

2021-05-18 Thread guojiufu via Gcc-patches
On 2021-05-18 17:28, guojiufu via Gcc-patches wrote: On 2021-05-18 14:36, Bernd Edlinger wrote: On 5/17/21 4:01 AM, Jiufu Guo via Gcc-patches wrote: When there is the possibility that overflow/wrap may happen on the loop index, a few optimizations would not happen. For example code: foo (int

Re: [PATCH V2] Split loop for NE condition.

2021-05-18 Thread guojiufu via Gcc-patches
On 2021-05-18 14:36, Bernd Edlinger wrote: On 5/17/21 4:01 AM, Jiufu Guo via Gcc-patches wrote: When there is the possibility that overflow/wrap may happen on the loop index, a few optimizations would not happen. For example code: foo (int *a, int *b, unsigned k, unsigned n) { while (++k !=

Re: [PATCH] go/100537 - Bootstrap-O3 and bootstrap-debug fail

2021-05-17 Thread guojiufu via Gcc-patches
On 2021-05-17 16:17, Richard Biener wrote: On Fri, May 14, 2021 at 11:19 AM guojiufu via Gcc-patches wrote: On 2021-05-14 15:39, guojiufu via Gcc-patches wrote: > On 2021-05-14 15:15, Richard Biener wrote: >> On May 14, 2021 4:52:56 AM GMT+02:00, Jiufu Guo >> wrote: >>

[RFC] split loop for NE condition.

2021-05-14 Thread guojiufu via Gcc-patches
I've refined the patch as below. This patch is checking "unsigned type" adn iv.no_overflow. While I'm thinking to use "number_of_iterations_exit (loop, e, &niter, false, false, NULL)" and "niter.control.no_overflow" to check overflow/wrap, which maybe more accurate, but relative "expensive". "n

Re: [PATCH] go/100537 - Bootstrap-O3 and bootstrap-debug fail

2021-05-14 Thread guojiufu via Gcc-patches
On 2021-05-14 15:39, guojiufu via Gcc-patches wrote: On 2021-05-14 15:15, Richard Biener wrote: On May 14, 2021 4:52:56 AM GMT+02:00, Jiufu Guo wrote: As discussed in the PR, Richard mentioned the method to figure out which VAR was not set TREE_ADDRESSABLE, and then cause this failure. It is

Re: [PATCH] go/100537 - Bootstrap-O3 and bootstrap-debug fail

2021-05-14 Thread guojiufu via Gcc-patches
On 2021-05-14 15:15, Richard Biener wrote: On May 14, 2021 4:52:56 AM GMT+02:00, Jiufu Guo wrote: As discussed in the PR, Richard mentioned the method to figure out which VAR was not set TREE_ADDRESSABLE, and then cause this failure. It is address_expression which build addr_expr (build_fold_a

Re: [PATCH] split loop for NE condition.

2021-05-07 Thread guojiufu via Gcc-patches
On 2021-05-06 16:27, Richard Biener wrote: On Thu, 6 May 2021, guojiufu wrote: On 2021-05-03 20:18, Richard Biener wrote: > On Thu, 29 Apr 2021, Jiufu Guo wrote: > >> When there is the possibility that overflow may happen on the loop index, >> a few optimizations would not happen. For example c

Ping: [PATCH 1/2] correct BB frequencies after loop changed

2021-05-06 Thread guojiufu via Gcc-patches
Gentle ping. Original message: https://gcc.gnu.org/pipermail/gcc-patches/2020-October/555871.html Thanks, Jiufu Guo.

Re: [PATCH] split loop for NE condition.

2021-05-06 Thread guojiufu via Gcc-patches
On 2021-05-03 20:18, Richard Biener wrote: On Thu, 29 Apr 2021, Jiufu Guo wrote: When there is the possibility that overflow may happen on the loop index, a few optimizations would not happen. For example code: foo (int *a, int *b, unsigned k, unsigned n) { while (++k != n) a[k] = b[k]

Re: [PATCH] split loop for NE condition.

2021-05-05 Thread guojiufu via Gcc-patches
On 2021-05-01 05:37, Segher Boessenkool wrote: Hi! On Thu, Apr 29, 2021 at 05:50:48PM +0800, Jiufu Guo wrote: When there is the possibility that overflow may happen on the loop index, a few optimizations would not happen. For example code: foo (int *a, int *b, unsigned k, unsigned n) { whil

Re: [PATCH] split loop for NE condition.

2021-05-05 Thread guojiufu via Gcc-patches
On 2021-05-01 00:27, Jeff Law wrote: On 4/29/2021 3:50 AM, Jiufu Guo via Gcc-patches wrote: When there is the possibility that overflow may happen on the loop index, a few optimizations would not happen. For example code: foo (int *a, int *b, unsigned k, unsigned n) { while (++k != n)

[PATCH] Clean up loop-closed PHIs at loopdone pass

2020-11-05 Thread guojiufu via Gcc-patches
In PR87473, there are discussions about loop-closed PHIs which are generated for loop optimization passes. It would be helpful to clean them up after loop optimization is done, then this may simplify some jobs of following passes. This patch introduces a cheaper way to propagate them out in pass_t

[PATCH] fold x << (n % C) to x << (n & C-1) if C meets power2

2020-10-15 Thread guojiufu via Gcc-patches
Hi, I just had a check on below patch for PR66552. https://gcc.gnu.org/pipermail/gcc-patches/2020-February/540930.html It seems this patch works fine now. This patch fixes PR66552 which request to optimizes (x shift (n mod C)) to (x shift (n bit_and (C - 1))) when C is a constant and power of two.

[PATCH 1/2] correct BB frequencies after loop changed

2020-10-09 Thread guojiufu via Gcc-patches
When investigating the issue from https://gcc.gnu.org/pipermail/gcc-patches/2020-July/549786.html I find the BB COUNTs of loop seems are not accurate in some case. For example: In below figure: COUNT:268435456 pre-header | | .

[PATCH 2/2] reset edge probibility and BB-count for peeled/unrolled loop

2020-10-09 Thread guojiufu via Gcc-patches
Hi, PR68212 mentioned that the COUNT of unrolled loop was not correct, and comments of this PR also mentioned that loop become 'cold'. This patch fixes the wrong COUNT/PROB of unrolled loop. And the patch handles the case where unrolling in unreliable count number can cause a loop to no longer lo

[RFC] update COUNTs of BB in loop.

2020-09-21 Thread guojiufu via Gcc-patches
Hi, When investigating the issue from https://gcc.gnu.org/pipermail/gcc-patches/2020-July/549786.html I find the BB COUNTs of loop seems are not accurate in some case. For example: In below figure: COUNT:268435456 pre-header | |

[PATCH] Check calls before loop unrolling

2020-08-19 Thread guojiufu via Gcc-patches
Hi, When unroll loops, if there are calls inside the loop, those calls may raise negative impacts for unrolling. This patch adds a param param_max_unrolled_calls, and checks if the number of calls inside the loop bigger than this param, loop is prevent from unrolling. This patch is checking the

[PATCH rs6000]: Refine RTL unroll hook for small loops

2020-07-13 Thread guojiufu via Gcc-patches
Hi, For very small loops (< 6 insns), it would be fine to unroll 4 times to run fast with less latency and better cache usage. Like below loops: while (i) a[--i] = NULL; while (p < e) *d++ = *p++; With this patch enhances, we could see some performance improvement for some workloads(e.g. SPE

[PATCH] rs6000: Refine RTL unroll adjust hook

2020-07-06 Thread guojiufu via Gcc-patches
For very small loops (< 6 insns), it would be fine to unroll 4 times to use cache line better. Like below loops: `while (i) a[--i] = NULL; while (p < e) *d++ = *p++;` And for very complex loops which may cause negative impacts: branch-miss or cache-miss. Like below loop: there are calls, earl

[PATCH 1/2] Introduce flag_cunroll_grow_size for cunroll

2020-05-28 Thread guojiufu via Gcc-patches
From: Jiufu Guo Currently GIMPLE complete unroller(cunroll) is checking flag_unroll_loops and flag_peel_loops to see if allow size growth. Beside affects curnoll, flag_unroll_loops also controls RTL unroler. To have more freedom to control cunroll and RTL unroller, this patch introduces flag_cunr

[PATCH 2/2] rs6000: allow cunroll to grow size according to -funroll-loop or -fpeel-loops

2020-05-28 Thread guojiufu via Gcc-patches
From: Jiufu Guo Previously, flag_unroll_loops was turned on at -O2 implicitly. It also turned on cunroll with allowance size increasing, and then cunroll will unroll/peel the loop even the loop is complex like code in PR95018. With this patch, size growth for cunroll is allowed if -funroll-loops

[PATCH 2/2] rs6000: Turn on -frtl-unroll-loops instead -funroll-loops at -O2

2020-05-25 Thread guojiufu via Gcc-patches
Previously, turning -funroll-loops on at -O2, which also turn on GIMPLE cunroll fully. While cunroll unrolls some complex loops. This patch turn on -frtl-unroll-loops at -O2 only, and continue to use previous tuned rs6000 heurisitics for small loops. While this patch does not turn on GIMPLE cunr

[PATCH 1/2] Seperate -funroll-loops for GIMPLE unroller and RTL unroller

2020-05-25 Thread guojiufu via Gcc-patches
Currently option -funroll-loops controls both GIMPLE unroler and RTL unroller. It is not able to control GIMPLE cunroller and RTL unroller independently. This patch introducing different flags to control them seperately, and this also provide more freedom to tune one of them without affecting anot

[PATCH,rs6000] enable -fweb for small loops unrolling

2020-04-20 Thread guojiufu via Gcc-patches
Hi, Previously -fweb was disabled if only unroll small loops. After that we find there is cases where it could help to rename pseudos and aovid some anti-dependence which may occur after unroll. Below is a patch to disable -fweb during loop unrolling. Bootstrap and regtest pass on powerpc, spec