Hi, Gentle ping this:
https://gcc.gnu.org/pipermail/gcc-patches/2021-June/572555.html BR, Kewen >>> on 2021/6/11 下午9:16, Kewen.Lin via Gcc-patches wrote: >>>> Hi Segher, >>>> >>>> Thanks for the review! >>>> >>>> on 2021/6/10 上午4:17, Segher Boessenkool wrote: >>>>> Hi! >>>>> >>>>> On Wed, Dec 16, 2020 at 04:49:49PM +0800, Kewen.Lin wrote: >>>>>> Currently we have the check: >>>>>> >>>>>> if (!insn >>>>>> || (value && rsp->last_set_table_tick >= label_tick_ebb_start)) >>>>>> rsp->last_set_invalid = 1; >>>>>> >>>>>> which means if we want to record some value for some reg and >>>>>> this reg got refered before in a valid scope, >>>>> >>>>> If we already know it is *set* in this same extended basic block. >>>>> Possibly by the same instruction btw. >>>>> >>>>>> we invalidate the >>>>>> set of reg (last_set_invalid to 1). It avoids to find the wrong >>>>>> set for one reg reference, such as the case like: >>>>>> >>>>>> ... op regX // this regX could find wrong last_set below >>>>>> regX = ... // if we think this set is valid >>>>>> ... op regX >>>>> >>>>> Yup, exactly. >>>>> >>>>>> But because of retry's existence, the last_set_table_tick could >>>>>> be set by some later reference insns, but we see it's set due >>>>>> to retry on the set (for that reg) insn again, such as: >>>>>> >>>>>> insn 1 >>>>>> insn 2 >>>>>> >>>>>> regX = ... --> (a) >>>>>> ... op regX --> (b) >>>>>> >>>>>> insn 3 >>>>>> >>>>>> // assume all in the same BB. >>>>>> >>>>>> Assuming we combine 1, 2 -> 3 sucessfully and replace them as two >>>>>> (3 insns -> 2 insns), >>>>> >>>>> This will delete insn 1 and write the combined result to insns 2 and 3. >>>>> >>>>>> retrying from insn1 or insn2 again: >>>>> >>>>> Always 2, but your point remains valid. >>>>> >>>>>> it will scan insn (a) again, the below condition holds for regX: >>>>>> >>>>>> (value && rsp->last_set_table_tick >= label_tick_ebb_start) >>>>>> >>>>>> it will mark this set as invalid set. But actually the >>>>>> last_set_table_tick here is set by insn (b) before retrying, so it >>>>>> should be safe to be taken as valid set. >>>>> >>>>> Yup. >>>>> >>>>>> This proposal is to check whether the last_set_table safely happens >>>>>> after the current set, make the set still valid if so. >>>>> >>>>>> Full SPEC2017 building shows this patch gets more sucessful combines >>>>>> from 1902208 to 1902243 (trivial though). >>>>> >>>>> Do you have some example, or maybe even a testcase? :-) >>>>> >>>> >>>> Sorry for the late reply, it took some time to get one reduced case. >>>> >>>> typedef struct SA *pa_t; >>>> >>>> struct SC { >>>> int h; >>>> pa_t elem[]; >>>> }; >>>> >>>> struct SD { >>>> struct SC *e; >>>> }; >>>> >>>> struct SA { >>>> struct { >>>> struct SD f[1]; >>>> } g; >>>> }; >>>> >>>> void foo(pa_t *k, char **m) { >>>> int l, i; >>>> pa_t a; >>>> l = (int)a->g.f[5].e; >>>> i = 0; >>>> for (; i < l; i++) { >>>> k[i] = a->g.f[5].e->elem[i]; >>>> m[i] = ""; >>>> } >>>> } >>>> >>>> Baseline is r12-0 and the option is "-O3 -mcpu=power9 >>>> -fno-strict-aliasing", >>>> with this patch, the generated assembly can save two rlwinm s. >>>> >>>>>> + /* Record the luid of the insn whose expression involving register n. >>>>>> */ >>>>>> + >>>>>> + int last_set_table_luid; >>>>> >>>>> "Record the luid of the insn for which last_set_table_tick was set", >>>>> right? >>>>> >>>> >>>> But it can be updated later to one smaller luid, how about the wording >>>> like: >>>> >>>> >>>> + /* Record the luid of the insn which uses register n, the insn should >>>> + be the first one using register n in that block of the insn which >>>> + last_set_table_tick was set for. */ >>>> >>>> >>>>>> -static void update_table_tick (rtx); >>>>>> +static void update_table_tick (rtx, int); >>>>> >>>>> Please remove this declaration instead, the function is not used until >>>>> after its actual definition :-) >>>>> >>>> >>>> Done. >>>> >>>>>> @@ -13243,7 +13247,21 @@ update_table_tick (rtx x) >>>>>> for (r = regno; r < endregno; r++) >>>>>> { >>>>>> reg_stat_type *rsp = ®_stat[r]; >>>>>> - rsp->last_set_table_tick = label_tick; >>>>>> + if (rsp->last_set_table_tick >= label_tick_ebb_start) >>>>>> + { >>>>>> + /* Later references should not have lower ticks. */ >>>>>> + gcc_assert (label_tick >= rsp->last_set_table_tick); >>>>> >>>>> This should be obvious, but checking it won't hurt, okay. >>>>> >>>>>> + /* Should pick up the lowest luid if the references >>>>>> + are in the same block. */ >>>>>> + if (label_tick == rsp->last_set_table_tick >>>>>> + && rsp->last_set_table_luid > insn_luid) >>>>>> + rsp->last_set_table_luid = insn_luid; >>>>> >>>>> Why? Is it conservative for the check you will do later? Please spell >>>>> this out, it is crucial! >>>>> >>>> >>>> Since later the combinations involving this insn probably make the >>>> register be used in one insn sitting ahead (which has smaller luid than >>>> the one which was recorded before). Yes, it's very conservative, this >>>> ensure that we always use the luid of the insn which is the first insn >>>> using this register in the block. The last_set invalidation is going >>>> to catch the case like: >>>> >>>> ... regX // avoid the set used here ... >>>> regX = ... >>>> ... >>>> >>>> Once we have the smallest luid one of all insns which use register X, >>>> any unsafe regX sets should be caught. >>>> >>>> I updated the comments to: >>>> >>>> + /* Since combination may generate some instructions >>>> + to replace some foregoing instructions with the >>>> + references to register r (using register r), we >>>> + need to make sure we record the first instruction >>>> + which is using register r, so always update with >>>> + the lowest luid here. If the given set happens >>>> + before this recorded earliest reference, the set >>>> + value should be safe to be used. */ >>>> >>>>>> @@ -13359,7 +13378,10 @@ record_value_for_reg (rtx reg, rtx_insn *insn, >>>>>> rtx value) >>>>>> >>>>>> /* Mark registers that are being referenced in this value. */ >>>>>> if (value) >>>>>> - update_table_tick (value); >>>>>> + { >>>>>> + gcc_assert (insn); >>>>>> + update_table_tick (value, DF_INSN_LUID (insn)); >>>>>> + } >>>>> >>>>> Don't add that assert please. If you really want one it should come >>>>> right at the start of the function, not 60 lines later :-) >>>>> >>>> >>>> Exactly, fixed. >>>> >>>>> Looks good if I understood this correctly :-) >>>>> >>>>> >>>> >>>> Thanks again, I also updated the comments in func record_value_for_reg, >>>> the new version is attached. >>>> >>>> BR, >>>> Kewen >>>> ----- >>>> gcc/ChangeLog: >>>> >>>> * combine.c (struct reg_stat_type): New member >>>> last_set_table_luid. >>>> (update_table_tick): Add one argument for insn luid and >>>> set last_set_table_luid with it, remove its declaration. >>>> (record_value_for_reg): Adjust the condition to set >>>> last_set_invalid nonzero. >>>>