on 2023/11/23 16:20, Richard Biener wrote: > On Thu, Nov 23, 2023 at 4:02 AM Kewen.Lin <li...@linux.ibm.com> wrote: >> >> on 2023/11/22 18:25, Richard Biener wrote: >>> On Wed, Nov 22, 2023 at 10:31 AM Kewen.Lin <li...@linux.ibm.com> wrote: >>>> >>>> on 2023/11/17 20:55, Alexander Monakov wrote: >>>>> >>>>> On Fri, 17 Nov 2023, Kewen.Lin wrote: >>>>>>> I don't think you can run cleanup_cfg after sched_init. I would suggest >>>>>>> to put it early in schedule_insns. >>>>>> >>>>>> Thanks for the suggestion, I placed it at the beginning of >>>>>> haifa_sched_init >>>>>> instead, since schedule_insns invokes haifa_sched_init, although the >>>>>> calls rgn_setup_common_sched_info and rgn_setup_sched_infos are executed >>>>>> ahead but they are all "setup" functions, shouldn't affect or be affected >>>>>> by this placement. >>>>> >>>>> I was worried because sched_init invokes df_analyze, and I'm not sure if >>>>> cfg_cleanup can invalidate it. >>>> >>>> Thanks for further explaining! By scanning cleanup_cfg, it seems that it >>>> considers df, like compact_blocks checks df, try_optimize_cfg invokes >>>> df_analyze etc., but I agree that moving cleanup_cfg before sched_init >>>> makes more sense. >>>> >>>>> >>>>>>> I suspect this may be caused by invoking cleanup_cfg too late. >>>>>> >>>>>> By looking into some failures, I found that although cleanup_cfg is >>>>>> executed >>>>>> there would be still some empty blocks left, by analyzing a few failures >>>>>> there >>>>>> are at least such cases: >>>>>> 1. empty function body >>>>>> 2. block holding a label for return. >>>>>> 3. block without any successor. >>>>>> 4. block which becomes empty after scheduling some other block. >>>>>> 5. block which looks mergeable with its always successor but left. >>>>>> ... >>>>>> >>>>>> For 1,2, there is one single successor EXIT block, I think they don't >>>>>> affect >>>>>> state transition, for 3, it's the same. For 4, it depends on if we can >>>>>> have >>>>>> the assumption this kind of empty block doesn't have the chance to have >>>>>> debug >>>>>> insn (like associated debug insn should be moved along), I'm not sure. >>>>>> For 5, >>>>>> a reduced test case is: >>>>> >>>>> Oh, I should have thought of cases like these, really sorry about the slip >>>>> of attention, and thanks for showing a testcase for item 5. As Richard as >>>>> saying in his response, cfg_cleanup cannot be a fix here. The thing to >>>>> check >>>>> would be changing no_real_insns_p to always return false, and see if the >>>>> situation looks recoverable (if it breaks bootstrap, regtest statistics of >>>>> a non-bootstrapped compiler are still informative). >>>> >>>> As you suggested, I forced no_real_insns_p to return false all the time, >>>> some >>>> issues got exposed, almost all of them are asserting NOTE_P insn shouldn't >>>> be >>>> encountered in those places, so the adjustments for most of them are just >>>> to >>>> consider NOTE_P or this kind of special block and so on. One draft patch >>>> is >>>> attached, it can be bootstrapped and regress-tested on ppc64{,le} and x86. >>>> btw, it's without the previous cfg_cleanup adjustment (hope it can get more >>>> empty blocks and expose more issues). The draft isn't qualified for code >>>> review but I hope it can provide some information on what kinds of changes >>>> are needed for the proposal. If this is the direction which we all agree >>>> on, >>>> I'll further refine it and post a formal patch. One thing I want to note >>>> is >>>> that this patch disable one assertion below: >>>> >>>> diff --git a/gcc/sched-rgn.cc b/gcc/sched-rgn.cc >>>> index e5964f54ead..abd334864fb 100644 >>>> --- a/gcc/sched-rgn.cc >>>> +++ b/gcc/sched-rgn.cc >>>> @@ -3219,7 +3219,7 @@ schedule_region (int rgn) >>>> } >>>> >>>> /* Sanity check: verify that all region insns were scheduled. */ >>>> - gcc_assert (sched_rgn_n_insns == rgn_n_insns); >>>> + // gcc_assert (sched_rgn_n_insns == rgn_n_insns); >>>> >>>> sched_finish_ready_list (); >>>> >>>> Some cases can cause this assertion to fail, it's due to the mismatch on >>>> to-be-scheduled and scheduled insn counts. The reason why it happens is >>>> that >>>> one block previously has only one INSN_P but while scheduling some other >>>> blocks >>>> it gets moved as well then we ends up with an empty block so that the only >>>> NOTE_P insn was counted then, but since this block isn't empty initially >>>> and >>>> NOTE_P gets skipped in a normal block, the count to-be-scheduled can't >>>> count >>>> it in. It can be fixed with special-casing this kind of block for counting >>>> like initially recording which block is empty and if a block isn't recorded >>>> before then fix up the count for it accordingly. I'm not sure if someone >>>> may >>>> have an argument that all the complication make this proposal beaten by >>>> previous special-casing debug insn approach, looking forward to more >>>> comments. >>> >>> Just a comment that the NOTE_P thing is odd - do we only ever have those for >>> otherwise empty BBs? How are they skipped otherwise (and why does that not >>> work for otherwise empty BBs)? >> >> Yes, previously (bypassing empty BBs) there is no chance to encounter NOTE_P >> when scheduling insns, as for notes in normal BBs, when setting up the head >> and tail, some are skipped (like get_ebb_head_tail), and there are also some >> special handlings remove_notes and unlink_bb_notes to guarantee they are >> gone. By disabling empty BB bypassing, all empty BBs will be actually >> uniformed as (head == tail && NOTE_P (head)), we have to deal with NOTE_P. > > I see. I expected most of them to be naturally part of another EBB. So it's > rather a limitation of the head/tail representation. > > I wonder if there's a more minimal fix though. But iff head or tail
Not sure I got the question correctly, if it is for this inconsistent states issue, I think what Maxim suggested seems to be the minimal fixes. > of an EBB then I > guess either head or tail has to point to a stmt in said block which > necessarily > then means either a debug or note. Yes, will only have NOTE_P for empty BB, if there is a debug insn then it wins (all NOTEs get dropped). remove_notes runs from head to tail, it's applied for EBB's special head and tail. BR, Kewen