[ Please don't post new patch series as replies to old ]

On Wed, Jul 22, 2020 at 12:02:33PM +0200, Andrea Corallo wrote:
> This first patch implements the addition of a new RTX instruction class
> FILLER_INSN, which has been white listed to allow placement of NOPs
> outside of a basic block.  This is to allow padding after unconditional
> branches.  This is favorable so that any performance gained from
> diluting branches is not paid straight back via excessive eating of
> nops.
> 
> It was deemed that a new RTX class was less invasive than modifying
> behavior in regards to standard UNSPEC nops.

Deemed, by whom?  There are several people very against it, too.  You
need to modify only one simple behaviour (maybe in a handful of places),
making a new RTX class for that is excessive.

>       * cfgbuild.c (inside_basic_block_p): Handle FILLER_INSN.
>       * cfgrtl.c (rtl_verify_bb_layout): Whitelist FILLER_INSN outside
>       basic blocks.
>       * coretypes.h: New rtx class.

coretypes.h is not a new RTX class?  :-)  Maybe:

        * coretypes.h (rtx_filler_insn): New struct.

> @@ -3033,7 +3034,20 @@ rtl_verify_bb_layout (void)
>             break;
>  
>           default:
> -           fatal_insn ("insn outside basic block", x);
> +           /* Allow nops after branches, via FILLER_INSN.  */
> +           bool fail = true;
> +           subrtx_iterator::array_type array;
> +           FOR_EACH_SUBRTX (iter, array, x, ALL)
> +             {
> +               const_rtx rtx = *iter;
> +               if (GET_CODE (rtx) == FILLER_INSN)
> +                 {
> +                   fail = false;
> +                   break;
> +                 }
> +             }
> +           if (fail)
> +             fatal_insn ("insn outside basic block", x);
>           }
>       }

It wouldn't be hard to allow some existing RTL here.  Maybe something
with CODE_FOR_filler_nop or similar (after you recog () it).

It still allows anything after leading filler insns, btw; you could get
rid of the "fail" variable altogether, just call fatal_insn as soon as
you see some unexpected RTX code.

> +  rtx_insn* i = make_insn_raw (pattern);

rtx_insn *i = ...


Segher

Reply via email to