On Thu, 19 Aug 2021, Qing Zhao wrote:

> 
> 
> > On Aug 19, 2021, at 4:00 AM, Richard Biener <rguent...@suse.de> wrote:
> > 
> > On Wed, 18 Aug 2021, Qing Zhao wrote:
> > 
> >> 
> >> 
> >>> On Aug 18, 2021, at 2:15 AM, Richard Biener <rguent...@suse.de> wrote:
> >>> 
> >>> On Tue, 17 Aug 2021, Qing Zhao wrote:
> >>> 
> >>>> 
> >>>> 
> >>>>> On Aug 17, 2021, at 9:50 AM, Qing Zhao via Gcc-patches 
> >>>>> <gcc-patches@gcc.gnu.org> wrote:
> >>>>> 
> >>>>> 
> >>>>> 
> >>>>>> On Aug 17, 2021, at 3:29 AM, Richard Biener <rguent...@suse.de> wrote:
> >>>>>> 
> >>>>>> On Mon, 16 Aug 2021, Qing Zhao wrote:
> >>>>>> 
> >>>>>>> My current code for expand_DEFERRED_INIT is like the following, could 
> >>>>>>> you check and see whether there is any issue for it:
> >>>>>>> 
> >>>>>>> #define INIT_PATTERN_VALUE  0xFE
> >>>>>>> static void
> >>>>>>> expand_DEFERRED_INIT (internal_fn, gcall *stmt)
> >>>>>>> {
> >>>>>>> tree lhs = gimple_call_lhs (stmt);
> >>>>>>> tree var_size = gimple_call_arg (stmt, 0);
> >>>>>>> enum auto_init_type init_type
> >>>>>>> = (enum auto_init_type) TREE_INT_CST_LOW (gimple_call_arg (stmt, 1));
> >>>>>>> bool is_vla = (bool) TREE_INT_CST_LOW (gimple_call_arg (stmt, 2));
> >>>>>>> 
> >>>>>>> tree var_type = TREE_TYPE (lhs);
> >>>>>>> gcc_assert (init_type > AUTO_INIT_UNINITIALIZED);
> >>>>>>> 
> >>>>>>> if (is_vla || (!use_register_for_decl (lhs)))
> >>>>>>> {
> >>>>>>>   if (TREE_CODE (lhs) == SSA_NAME)
> >>>>>>>     lhs = SSA_NAME_VAR (lhs);
> >>>>>> 
> >>>>>> this should not be necessary (in fact you shouldn't see a SSA_NAME
> >>>>>> here, if you do then using SSA_NAME_VAR is wrong)
> >>>>> You mean during RTL expansion phase, all SSA_NAMEs are gone already?
> >>>> 
> >>>> Actually, the lhs could be SSA_NAME here, 
> >>>> 
> >>>> Breakpoint 1, expand_DEFERRED_INIT (stmt=0x7fffe96ae348) at 
> >>>> ../../latest-gcc/gcc/internal-fn.c:3021
> >>>> 3021           mark_addressable (lhs);
> >>>> (gdb) call debug_tree(lhs)
> >>>> <ssa_name 0x7fffe9584e58
> >>>>   type <real_type 0x7fffe959b2a0 float sizes-gimplified SF
> >>>>       size <integer_cst 0x7fffe9579f48 constant 32>
> >>>>       unit-size <integer_cst 0x7fffe9579f60 constant 4>
> >>>>       align:32 warn_if_not_align:0 symtab:0 alias-set 2 canonical-type 
> >>>> 0x7fffe959b2a0 precision:32
> >>>>       pointer_to_this <pointer_type 0x7fffe959b7e0>>
> >>>>   visited var <var_decl 0x7ffff7ff7bd0 temp1>
> >>>>   def_stmt temp1_5 = .DEFERRED_INIT (4, 2, 0, &"temp1"[0]);
> >>>>   version:5>
> >>>> 
> >>>> when I deleted:
> >>>> 
> >>>> if (TREE_CODE (lhs) == SSA_NAME
> >>>>  lhs = SSA_NAME_VAR (lhs);
> >>> 
> >>> but then using SSA_NAME_VAR is broken.  I suspect use_register_for_decl
> >>> isn't the correct thing to look at.  I think we need to look at what
> >>> the LHS expanded to if it is a SSA_VAR_P (that includes SSA names
> >>> but also plain DECLs but not what we get from VLAs where we'd see
> >>> *ptr).  So sth like
> >>> 
> >>> bool reg_lhs;
> >>> if (SSA_VAR_P (lhs))
> >>>   {
> >>>     rtx tem = expand_expr (lhs, NULL_RTX, VOIDmode, EXPAND_WRITE);
> >>>     reg_lhs = !MEM_P (tem);
> >>>     /* If not MEM_P reg_lhs should be REG_P or SUBREG_P (but maybe
> >>>        also CONCAT or lowpart...?)  */
> >>>   }
> >>> else
> >>>   {
> >>>     gcc_assert (is_vla);
> >>>     reg_lhs = false;
> >>>   }
> >>> 
> >>> if (!reg_lhs)
> >>>   memset path
> >>> else
> >>>   expand_assignment path
> >> 
> >> After making the following change:
> >> 
> >> +  bool reg_lhs = true;
> >> 
> >>   tree var_type = TREE_TYPE (lhs);
> >>   gcc_assert (init_type > AUTO_INIT_UNINITIALIZED);
> >> 
> >> -  if (is_vla || (!use_register_for_decl (lhs)))
> >> +  if (SSA_VAR_P (lhs))
> >> +    {
> >> +      rtx tem = expand_expr (lhs, NULL_RTX, VOIDmode, EXPAND_WRITE);
> >> +      reg_lhs = !MEM_P (tem);
> >> +    }
> >> +  else
> >> +    {
> >> +      gcc_assert (is_vla);
> >> +      reg_lhs = false;
> >> +    }
> >> +
> >> +  if (!reg_lhs)
> >>     {
> >> 
> >> I got exactly the same internal error that failed at expr.c:
> >> 
> >> 8436   /* We must have made progress.  */
> >> 8437   gcc_assert (inner != exp);
> >> 
> >> 
> >> Looks like for the following code:
> >> 
> >> 3026   if (!reg_lhs)
> >> 3027     {
> >> 3028     /* If this is a VLA or the variable is not in register,
> >> 3029        expand to a memset to initialize it.  */
> >> 3030       mark_addressable (lhs);
> >> 3031       tree var_addr = build_fold_addr_expr (lhs);
> >> 3032 
> >> 3033       tree value = (init_type == AUTO_INIT_PATTERN) ?
> >> 3034                     build_int_cst (integer_type_node,
> >> 3035                                    INIT_PATTERN_VALUE) :
> >> 3036                     integer_zero_node;
> >> 3037       tree m_call = build_call_expr (builtin_decl_implicit 
> >> (BUILT_IN_MEMSET),
> >> 3038                                      3, var_addr, value, var_size);
> >> 3039       /* Expand this memset call.  */
> >> 3040       expand_builtin_memset (m_call, NULL_RTX, TYPE_MODE (var_type));
> >> 3041     }
> >> 
> >> At line 3030, “lhs” could be a SSA_NAME.
> >> 
> >> My questions are:
> >> 
> >> 1. Could the routine “mark_addressable” and “build_fold_addr_expr” be 
> >> applied on SSA_NAME?
> > 
> > No.
> > 
> >> 2. Could the routine “expand_builtin_memset” be applied on the memset call 
> >> whose “DEST” is
> >>    an address expression on SSA_NAME? 
> > 
> > No.
> > 
> >> 3. Within “expand_DEFERRED_INIT”, can I call “expand_builtin_memset” to 
> >> expand .DEFERRED_INIT?
> > 
> > Well, not with "invalid" GENERIC I fear (address of a SSA name).
> > 
> >> I suspect that one of the above 3 might be the issue, but not sure which 
> >> one?
> > 
> > All of the above ;)  So while reg_lhs is now precise as to how the
> > variable will end up (the SSA name will end up as a stack variable in this
> > case, for whatever reason), expansion via memcpy only works when
> > working on the RTL representation.  The usual "workaround" (ugh)
> > is to use make_tree (), so in the !reg_lhs path you'd do
> > 
> >  /* Get a new GENERIC representation for the RTL.  That's necesary
> >     in case LHS is an SSA name.  */
> >  lhs = make_tree (TREE_TYPE (lhs), tem);
> 
> This resolved the issue.
> 
> Another question,
> 
> Previously, I used
> 
>     if (TREE_CODE (lhs) == SSA_NAME)
>        lhs = SSA_NAME_VAR (lhs);
> 
> To resolve this issue. The purpose looks like the same as “make_tree”, just 
> get an generic tree for the LHS. 
> 
> Why you said using SSA_NAME_VAR is broken?  Is it because SSA_NAME_VAR will 
> not always return a valid TREE?

Because it's simply the wrong entity - I have no idea why that even
worked.  Ah, cfgexpand associates it with some DECL_RTL for the 
benefit of debug info.  But it's still wrong.

> I should use as following
> 
> 
>    If (TREE_CODE (lhs) == SSA_NAME) && SSA_NAME_VAR (lhs))
>       Lhs = SSA_NAME_VAR (lhs)
> 
> ?

No.  A SSA_NAME_VAR can have multiple SSA_NAMEs (obviously) and
they do not necessarily have to be allocated to the same variable
partition - that is, there's no 1:1 relationship between SSA_NAME
and stack slot or (pseudo) register.  You want to initialize the
storage associated with the SSA_NAME in the .DEFERRED_INIT call,
not some other storage.

> > 
> > alternatively you could maybe do
> > 
> >  if (DECL_P (lhs))
> >    {
> > +      rtx tem = expand_expr (lhs, NULL_RTX, VOIDmode, EXPAND_WRITE);
> > +      reg_lhs = !MEM_P (tem);
> >    }
> >  else if (TREE_CODE (lhs) == SSA_NAME)
> >    reg_lhs = true;
> >  else
> >    reg_lhs = false;
> > 
> > thus treat SSA names as register storage always (even if it will end
> > up on the stack).
> 
> My question here, for a complicate structure SSA_NAME, will expanding through 
> memset better than expand_asssignment? 

It depends.  In the end I'd consider it a missed-optimization bug on
the side that generates worse code - but I do expect cases will exist
for both.  Clearly memset will be worse when dealing with register
initialization (thus the !MEM_P check) and I expect memset to be OK
for stack where member-wise init esp. with non-zero might turn up
worse code.

Richard.

> Qing
> > 
> > Richard.
> > 
> >> Thanks a lot.
> >> 
> >> Qing
> >> 
> >> 
> >> 
> >>> bool reg_lhs;
> >>> if (SSA_VAR_P (lhs))
> >>>   {
> >>>     rtx tem = expand_expr (lhs, NULL_RTX, VOIDmode, EXPAND_WRITE);
> >>>     reg_lhs = !MEM_P (tem);
> >>>     /* If not MEM_P reg_lhs should be REG_P or SUBREG_P (but maybe
> >>>        also CONCAT or lowpart...?)  */
> >>>   }
> >>> else
> >>>   {
> >>>     gcc_assert (is_vla);
> >>>     reg_lhs = false;
> >>>   }
> >> 
> >> 
> >>> 
> >>>> Many testing cases failed with internal compiler error:
> >>>> 
> >>>> /home/opc/Work/GCC/latest-gcc/gcc/testsuite/c-c++-common/auto-init-3.c:9:9:
> >>>>  internal compiler error: in expand_expr_addr_expr_1, at expr.c:8437
> >>>> 0xe237aa expand_expr_addr_expr_1
> >>>>  ../../latest-gcc/gcc/expr.c:8437
> >>>> 0xe24059 expand_expr_addr_expr
> >>>>  ../../latest-gcc/gcc/expr.c:8525
> >>>> 0xe32b56 expand_expr_real_1(tree_node*, rtx_def*, machine_mode, 
> >>>> expand_modifier, rtx_def**, bool)
> >>>>  ../../latest-gcc/gcc/expr.c:11741
> >>>> 0xe2da52 expand_expr_real_1(tree_node*, rtx_def*, machine_mode, 
> >>>> expand_modifier, rtx_def**, bool)
> >>>>  ../../latest-gcc/gcc/expr.c:10777
> >>>> 0xe24706 expand_expr_real(tree_node*, rtx_def*, machine_mode, 
> >>>> expand_modifier, rtx_def**, bool)
> >>>>  ../../latest-gcc/gcc/expr.c:8713
> >>>> 0xc13f15 expand_expr
> >>>>  ../../latest-gcc/gcc/expr.h:301
> >>>> 0xc17acb get_memory_rtx
> >>>>  ../../latest-gcc/gcc/builtins.c:1370
> >>>> 0xc2223d expand_builtin_memset_args
> >>>>  ../../latest-gcc/gcc/builtins.c:4102
> >>>> 0xc21a20 expand_builtin_memset(tree_node*, rtx_def*, machine_mode)
> >>>>  ../../latest-gcc/gcc/builtins.c:3886
> >>>> 0xfb5c85 expand_DEFERRED_INIT
> >>>>  ../../latest-gcc/gcc/internal-fn.c:3031
> >>>> 
> >>>> 
> >>>> So, did I do anything wrong?
> >>>> 
> >>>> Qing
> >>> 
> >>> -- 
> >>> Richard Biener <rguent...@suse.de>
> >>> SUSE Software Solutions Germany GmbH, Maxfeldstrasse 5, 90409 Nuernberg,
> >>> Germany; GF: Felix Imendörffer; HRB 36809 (AG Nuernberg)
> >> 
> >> 
> > 
> > -- 
> > Richard Biener <rguent...@suse.de>
> > SUSE Software Solutions Germany GmbH, Maxfeldstrasse 5, 90409 Nuernberg,
> > Germany; GF: Felix Imendörffer; HRB 36809 (AG Nuernberg)
> 
> 

-- 
Richard Biener <rguent...@suse.de>
SUSE Software Solutions Germany GmbH, Maxfeldstrasse 5, 90409 Nuernberg,
Germany; GF: Felix Imendörffer; HRB 36809 (AG Nuernberg)

Reply via email to