On Fri, Aug 12, 2022 at 10:41 PM Roger Sayle <ro...@nextmovesoftware.com> wrote: > > > This patch fixes PR target/106577 which is a recent ICE on valid regression > caused by my introduction of a *testti_doubleword pre-reload splitter in > i386.md. During the split pass before reload, this converts the virtual > *testti_doubleword into an *andti3_doubleword and *cmpti_doubleword, > checking that any immediate operand is a valid "x86_64_hilo_general_operand" > and placing it into a TImode register using force_reg if it isn't. > > The unexpected behaviour (that caught me out) is that calling force_reg > may occasionally clobber the contents of the global operands array, or > more accurately recog_data.operand[0], which means that by the time > split_XXX calls gen_split_YYY the replacement insn's operands have been > corrupted. > > It's difficult to tell who (if anyone is at fault). The re-entrant > stack trace (for the attached PR) looks like: > > gen_split_203 (*testti_doubleword) calls > force_reg calls > emit_move_insn calls > emit_move_insn_1 calls > gen_movti calls > ix86_expand_move calls > ix86_convert_const_wide_int_to_broadcast calls > ix86_vector_duplicate_value calls > recog_memoized calls > recog. > > By far the simplest and possibly correct fix is rather than attempt > to push and pop recog_data, to simply (in pre-reload splits) save a > copy of any operands that will be needed after force_reg, and use > these copies afterwards. Many pre-reload splitters avoid this issue > using "[(clobber (const_int 0))]" and so avoid gen_split_YYY functions, > but in our case we still need to save a copy of operands[0] (even if we > call emit_insn or expand_* ourselves), so we might as well continue to > use the conveniently generated gen_split. > > This patch has been tested on x86_64-pc-linux-gnu with make bootstrap > and make -k check, both with and without --target_board=unix{-m32}, > with no new failures. Ok for mainline?
Why this obviously fixes the issue seen I wonder whether there's more of recog_data that might be used after control flow returns to recog_memoized and thus the fix would be there, not in any backend pattern triggering the issue like this? The "easiest" fix would maybe to add a in_recog flag and simply return FAIL from recog when recursing. Not sure what the effect on this particular pattern would be though? The better(?) fix might be to push/pop recog_data in 'recog', but of course give that recog_data is currently a global leakage in intermediate code can still happen. That said - does anybody know of similar fixes for this issue in other backends patterns? Thanks, Richard. > > > 2022-08-12 Roger Sayle <ro...@nextmovesoftware.com> > > gcc/ChangeLog > PR target/106577 > * config/i386/i386.md (*testti_doubleword): Preserve a copy of > operands[0], and move initialization of operands[2] later, as the > call to force_reg may clobber the contents of the operands array. > > gcc/testsuite/ChangeLog > PR target/106577 > * gcc.target/i386/pr106577.c: New test case. > > > Thanks, > Roger > -- >