Hi Kewen,

On 9/28/21 9:34 PM, Kewen.Lin wrote:
> Hi Bill,
>
> Thanks for your prompt comments!
>
> on 2021/9/29 上午3:24, Bill Schmidt wrote:
>> Hi Kewen,
>>
>> Although I agree that what we do now is tragically bad (and will be fixed in 
>> the builtin rewrite), this seems a little too cavalier to remove all 
>> checking during initialization without adding any checking somewhere else. 
>> :-)  We still need to check for invalid usage when the builtin is expanded, 
>> and I don't think the old code does this at all.
>>
> If I read the code right, there are some following places to check the 
> invalid usage or not.
>   1) for folding, rs6000_gimple_fold_builtin -> rs6000_builtin_is_supported_p 
> -> check mask
>                   -> defer to expand if invalid.
>   2) for expanding, obtain func_valid_p, error in rs6000_invalid_builtin.
>
> Both places seem to exist before the builtin rewrite, am I missing something?
>
> btw, I remembered I used one built gcc with my fix to compile one test case 
> which is supposed to fail
> due to its invalid usage builtin at option -flto, it failed (errored) as 
> expected but at LTRANS phase
> since it's the time to do expansion for no-fat-objs scenario.

OK.  If you are comfortable that this will be caught when the builtin is 
actually not valid, then I'll
withdraw my objection.  Can you test it?  I know that we've been trying to fix 
these cases piecemeal
in the old support, and as Peter says it's important to backport this, we need 
the solution.  I just
want to be sure we're not breaking something, and test coverage in this area is 
pretty terrible.

Thanks!
Bill

>
>> Unless you are planning to do a backport, I think the proper way forward 
>> here is to just wait for the new builtin support to land.  In the new code, 
>> we initialize all built-ins up front, and check properly at expansion time 
>> whether the builtin is enabled in the environment that obtains during expand.
> Good to know that!  Nice!  btw, for this issue itself, the current 
> implementation (without rewriting)
> also initializes the built-ins in the table since MMA built-ins guarded in 
> TARGET_EXTRA_BUILTINS,
> the root cause is the rs6000_builtin_mask can't set up (be switched) 
> expectedly since the checking
> time is too early right when the built-in function_decl being created.
>
> BR,
> Kewen
>
>> My two cents,
>> Bill
>>
>> On 9/28/21 3:13 AM, Kewen.Lin wrote:
>>> Hi,
>>>
>>> As the discussion in PR102347, currently builtin_decl is invoked so
>>> early, it's when making up the function_decl for builtin functions,
>>> at that time the rs6000_builtin_mask could be wrong for those
>>> builtins sitting in #pragma/attribute target functions, though it
>>> will be updated properly later when LTO processes all nodes.
>>>
>>> This patch is to align with the practice i386 port adopts, also
>>> align with r10-7462 by relaxing builtin mask checking in some places.
>>>
>>> Bootstrapped and regress-tested on powerpc64le-linux-gnu P9 and
>>> powerpc64-linux-gnu P8.
>>>
>>> Is it ok for trunk?
>>>
>>> BR,
>>> Kewen
>>> -----
>>> gcc/ChangeLog:
>>>
>>>     PR target/102347
>>>     * config/rs6000/rs6000-call.c (rs6000_builtin_decl): Remove builtin
>>>     mask check.
>>>
>>> gcc/testsuite/ChangeLog:
>>>
>>>     PR target/102347
>>>     * gcc.target/powerpc/pr102347.c: New test.
>>>
>>> ---
>>>  gcc/config/rs6000/rs6000-call.c             | 14 ++++----------
>>>  gcc/testsuite/gcc.target/powerpc/pr102347.c | 15 +++++++++++++++
>>>  2 files changed, 19 insertions(+), 10 deletions(-)
>>>  create mode 100644 gcc/testsuite/gcc.target/powerpc/pr102347.c
>>>
>>> diff --git a/gcc/config/rs6000/rs6000-call.c 
>>> b/gcc/config/rs6000/rs6000-call.c
>>> index fd7f24da818..15e0e09c07d 100644
>>> --- a/gcc/config/rs6000/rs6000-call.c
>>> +++ b/gcc/config/rs6000/rs6000-call.c
>>> @@ -13775,23 +13775,17 @@ rs6000_init_builtins (void)
>>>      }
>>>  }
>>>
>>> -/* Returns the rs6000 builtin decl for CODE.  */
>>> +/* Returns the rs6000 builtin decl for CODE.  Note that we don't check
>>> +   the builtin mask here since there could be some #pragma/attribute
>>> +   target functions and the rs6000_builtin_mask could be wrong when
>>> +   this checking happens, though it will be updated properly later.  */
>>>
>>>  tree
>>>  rs6000_builtin_decl (unsigned code, bool initialize_p ATTRIBUTE_UNUSED)
>>>  {
>>> -  HOST_WIDE_INT fnmask;
>>> -
>>>    if (code >= RS6000_BUILTIN_COUNT)
>>>      return error_mark_node;
>>>
>>> -  fnmask = rs6000_builtin_info[code].mask;
>>> -  if ((fnmask & rs6000_builtin_mask) != fnmask)
>>> -    {
>>> -      rs6000_invalid_builtin ((enum rs6000_builtins)code);
>>> -      return error_mark_node;
>>> -    }
>>> -
>>>    return rs6000_builtin_decls[code];
>>>  }
>>>
>>> diff --git a/gcc/testsuite/gcc.target/powerpc/pr102347.c 
>>> b/gcc/testsuite/gcc.target/powerpc/pr102347.c
>>> new file mode 100644
>>> index 00000000000..05c439a8dac
>>> --- /dev/null
>>> +++ b/gcc/testsuite/gcc.target/powerpc/pr102347.c
>>> @@ -0,0 +1,15 @@
>>> +/* { dg-do link } */
>>> +/* { dg-require-effective-target power10_ok } */
>>> +/* { dg-require-effective-target lto } */
>>> +/* { dg-options "-flto -mdejagnu-cpu=power9" } */
>>> +
>>> +/* Verify there are no error messages in LTO mode.  */
>>> +
>>> +#pragma GCC target "cpu=power10"
>>> +int main ()
>>> +{
>>> +  float *b;
>>> +  __vector_quad c;
>>> +  __builtin_mma_disassemble_acc (b, &c);
>>> +  return 0;
>>> +}
>>> --
>>> 2.27.0
>>>
>

Reply via email to