On Mon, Jul 29, 2024 at 10:11 AM Andrew Pinski <pins...@gmail.com> wrote: > > On Mon, Jul 29, 2024 at 12:57 AM Kugan Vivekanandarajah > <kugan...@gmail.com> wrote: > > > > On Thu, Jul 25, 2024 at 10:19 PM Richard Biener > > <richard.guent...@gmail.com> wrote: > > > > > > On Thu, Jul 25, 2024 at 4:42 AM Kugan Vivekanandarajah > > > <kugan...@gmail.com> wrote: > > > > > > > > On Tue, Jul 23, 2024 at 11:56 PM Richard Biener > > > > <richard.guent...@gmail.com> wrote: > > > > > > > > > > On Tue, Jul 23, 2024 at 10:27 AM Kugan Vivekanandarajah > > > > > <kugan...@gmail.com> wrote: > > > > > > > > > > > > On Tue, Jul 23, 2024 at 10:35 AM Andrew Pinski <pins...@gmail.com> > > > > > > wrote: > > > > > > > > > > > > > > On Mon, Jul 22, 2024 at 5:26 PM Kugan Vivekanandarajah > > > > > > > <kvivekana...@nvidia.com> wrote: > > > > > > > > > > > > > > > > Revised based on the comment and moved it into existing > > > > > > > > patterns as. > > > > > > > > > > > > > > > > gcc/ChangeLog: > > > > > > > > > > > > > > > > * match.pd: Extend A CMP 0 ? A : -A into (type)A CMP 0 ? A : -A. > > > > > > > > Extend A CMP 0 ? A : -A into (type) A CMP 0 ? A : -A. > > > > > > > > > > > > > > > > gcc/testsuite/ChangeLog: > > > > > > > > > > > > > > > > * gcc.dg/tree-ssa/absfloat16.c: New test. > > > > > > > > > > > > > > The testcase needs to make sure it runs only for targets that > > > > > > > support > > > > > > > float16 so like: > > > > > > > > > > > > > > /* { dg-require-effective-target float16 } */ > > > > > > > /* { dg-add-options float16 } */ > > > > > > Added in the attached version. > > > > > > > > > > + /* (type)A >=/> 0 ? A : -A same as abs (A) */ > > > > > (for cmp (ge gt) > > > > > (simplify > > > > > - (cnd (cmp @0 zerop) @1 (negate @1)) > > > > > - (if (!HONOR_SIGNED_ZEROS (TREE_TYPE(@0)) > > > > > - && !TYPE_UNSIGNED (TREE_TYPE(@0)) > > > > > - && bitwise_equal_p (@0, @1)) > > > > > + (cnd (cmp (convert?@0 @1) zerop) @2 (negate @2)) > > > > > + (if (!HONOR_SIGNED_ZEROS (TREE_TYPE (@1)) > > > > > + && !TYPE_UNSIGNED (TREE_TYPE (@1)) > > > > > + && ((VECTOR_TYPE_P (type) > > > > > + && tree_nop_conversion_p (TREE_TYPE (@0), TREE_TYPE > > > > > (@1))) > > > > > + || (!VECTOR_TYPE_P (type) > > > > > + && (TYPE_PRECISION (TREE_TYPE (@1)) > > > > > + <= TYPE_PRECISION (TREE_TYPE (@0))))) > > > > > + && bitwise_equal_p (@1, @2)) > > > > > > > > > > I wonder about the bitwise_equal_p which tests @1 against @2 now > > > > > with the convert still applied to @1 - that looks odd. You are > > > > > allowing > > > > > sign-changing conversions but doesn't that change ge/gt behavior? > > > > > Also why are sign/zero-extensions not OK for vector types? > > > > Thanks for the review. > > > > My main motivation here is for _Float16 as below. > > > > > > > > _Float16 absfloat16 (_Float16 x) > > > > { > > > > float _1; > > > > _Float16 _2; > > > > _Float16 _4; > > > > <bb 2> [local count: 1073741824]: > > > > _1 = (float) x_3(D); > > > > if (_1 < 0.0) > > > > goto <bb 3>; [41.00%] > > > > else > > > > goto <bb 4>; [59.00%] > > > > <bb 3> [local count: 440234144]:\ > > > > _4 = -x_3(D); > > > > <bb 4> [local count: 1073741824]: > > > > # _2 = PHI <_4(3), x_3(D)(2)> > > > > return _2; > > > > } > > > > > > > > This is why I added bitwise_equal_p test of @1 against @2 with > > > > TYPE_PRECISION checks. > > > > I agree that I will have to check for sign-changing conversions. > > > > > > > > Just to keep it simple, I disallowed vector types. I am not sure if > > > > this would hit vec types. I am happy to handle this if that is > > > > needed. > > > > > > I think with __builtin_convertvector you should be able to construct > > > a testcase that does > > Thanks. > > > > For the pattern, > > ``` > > /* A >=/> 0 ? A : -A same as abs (A) */ > > (for cmp (ge gt) > > (simplify > > (cnd (cmp @0 zerop) @1 (negate @1)) > > (if (!HONOR_SIGNED_ZEROS (TREE_TYPE(@0)) > > && !TYPE_UNSIGNED (TREE_TYPE(@0)) > > && bitwise_equal_p (@0, @1)) > > (if (TYPE_UNSIGNED (type)) > > (absu:type @0) > > (abs @0))))) > > ``` > > the vector type doesn't seem right. For example, if we have a 4 > > element vector with some negative and positive, I don't think it > > makes sense. Also, we dont seem to generate (cmp @0 zerop). Am I > > missing it completely? > > Looks like I missed adding some vector testcases anyways here is one > to get this, note it is C++ due to the C front-end not support `?:` > for vectors yet (there is a patch). > ``` > #define vect8 __attribute__((vector_size(8))) > vect8 int f(vect8 int a) > { > vect8 int na = -a; > return (a > 0) ? a : na; > } > ``` > At -O2 before forwprop1, we have: > ``` > vector(2) intD.9 a_2(D) = aD.2796; > vector(2) intD.9 naD.2799; > vector(2) <signed-boolean:32> _1; > vector(2) intD.9 _4; > > na_3 = -a_2(D); > _1 = a_2(D) > { 0, 0 }; > _4 = VEC_COND_EXPR <_1, a_2(D), na_3>; > ``` > And forwprop using match is able to do: > ``` > Applying pattern match.pd:6306, gimple-match-10.cc:19843 > gimple_simplified to _4 = ABS_EXPR <a_2(D)>; > Removing dead stmt:_1 = a_2(D) > { 0, 0 }; > Removing dead stmt:na_3 = -a_2(D); > ``` > (replace int with float and add -fno-signed-zeros you can get the ABS also). > > Note comparisons with vector types always generate a vector boolean > type. So cond_expr will never show up with a vector comparison; only > vec_cond.
Thanks for the information. I have revised the patch by adding test case for the vector type. I will look at removing the VEC_COND_EXPR as a follow up. This is still rejected by the type checker. v2hi absvect2 (v2hi x, int i) { v2hi neg = -x; v2si tmp = __builtin_convertvector (x, v2si); return (tmp > 0) ? x : neg; } as: error: incompatible vector types in conditional expression: '__vector(2) int', 'v2hi' {aka '__vector(2) short int'} and 'v2hi' {aka '__vector(2) short int'} 8 | return (tmp > 0) ? x : neg; | ~~~~~~~~~~^~~~~~~~~ Also - (absu:type @0) - (abs @0))))) + (absu:type @1) + (abs @1))))) Changing the @1 to @2 is resulting in failures. Existing tests like the following would be optimized away in this case. ``` unsigned abs_with_convert0 (int x) { unsigned int y = x; if (x < 0) y = -y; return y; } ``` This follows the same intent as the original pattern. Bootstrapped and regression tested on aarch64-linux-gnu. Is this OK for trunk. Thanks, Kugan gcc/ChangeLog: * match.pd: Extend A CMP 0 ? A : -A into (type)A CMP 0 ? A : -A. Extend A CMP 0 ? A : -A into (type) A CMP 0 ? A : -A. gcc/testsuite/ChangeLog: * g++.dg/absvect.C: New test. * gcc.dg/tree-ssa/absfloat16.c: New test. signed-off-by: Kugan Vivekanandarajah <kvivekana...@nvidia.com<mailto:kvivekana...@nvidia.com>> Note I think we should be able to merge VEC_COND_EXPR and COND_EXPR codes by instead looking whether the condition is vector or scalar. Fallout might be a bit too big, but it might be a thing to try. Richard. > Thanks, > Andrew Pinski > > > > > Thanks, > > Kugan > > > > > > > > > > > > > > > > > > > + (absu:type @1) > > > > > + (abs @1))))) > > > > > > > > > > I think this should use @2 now. > > > > I will change this. > > > > > > > > Thanks, > > > > Kugan > > > > > > > > > > > > > > > Thanks. > > > > > > Kugan > > > > > > > > > > > > > > (like what is in gcc.dg/c11-floatn-3.c and others). > > > > > > > > > > > > > > Other than that it looks good but I can't approve it. > > > > > > > > > > > > > > Thanks, > > > > > > > Andrew Pinski > > > > > > > > > > > > > > > > > > > > > > > Signed-off-by: Kugan Vivekanandarajah <kvivekana...@nvidia.com> > > > > > > > > > > > > > > > > Bootstrapped and regression test on aarch64-linux-gnu. Is this > > > > > > > > OK for trunk? > > > > > > > > Thanks, > > > > > > > > Kugan > > > > > > > > > > > > > > > > ________________________________ > > > > > > > > From: Andrew Pinski <pins...@gmail.com> > > > > > > > > Sent: Monday, 15 July 2024 5:30 AM > > > > > > > > To: Kugan Vivekanandarajah <kvivekana...@nvidia.com> > > > > > > > > Cc: gcc-patches@gcc.gnu.org <gcc-patches@gcc.gnu.org>; > > > > > > > > richard.guent...@gmail.com <richard.guent...@gmail.com> > > > > > > > > Subject: Re: [PATCH] MATCH: add abs support for half float > > > > > > > > > > > > > > > > External email: Use caution opening links or attachments > > > > > > > > > > > > > > > > > > > > > > > > On Sun, Jul 14, 2024 at 1:12 AM Kugan Vivekanandarajah > > > > > > > > <kvivekana...@nvidia.com> wrote: > > > > > > > > > > > > > > > > > > This patch extends abs detection in matched for half float. > > > > > > > > > > > > > > > > > > Bootstrapped and regression test on aarch64-linux-gnu. Is > > > > > > > > > this OK for trunk? > > > > > > > > > > > > > > > > This is basically this pattern: > > > > > > > > ``` > > > > > > > > /* A >=/> 0 ? A : -A same as abs (A) */ > > > > > > > > (for cmp (ge gt) > > > > > > > > (simplify > > > > > > > > (cnd (cmp @0 zerop) @1 (negate @1)) > > > > > > > > (if (!HONOR_SIGNED_ZEROS (TREE_TYPE(@0)) > > > > > > > > && !TYPE_UNSIGNED (TREE_TYPE(@0)) > > > > > > > > && bitwise_equal_p (@0, @1)) > > > > > > > > (if (TYPE_UNSIGNED (type)) > > > > > > > > (absu:type @0) > > > > > > > > (abs @0))))) > > > > > > > > ``` > > > > > > > > > > > > > > > > except extended to handle an optional convert. Why didn't you > > > > > > > > just > > > > > > > > extend the above pattern to handle the convert instead? Also I > > > > > > > > think > > > > > > > > you have an issue with unsigned types with the comparison. > > > > > > > > Also you should extend the -abs(A) pattern right below it in a > > > > > > > > similar fashion. > > > > > > > > > > > > > > > > Thanks, > > > > > > > > Andrew Pinski > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > gcc/ChangeLog: > > > > > > > > > > > > > > > > > > * match.pd: Add pattern to convert (type)A >=/> 0 ? A : -A > > > > > > > > > into abs (A) for half float. > > > > > > > > > > > > > > > > > > gcc/testsuite/ChangeLog: > > > > > > > > > > > > > > > > > > * gcc.dg/tree-ssa/absfloat16.c: New test. > > > > > > > > > > > > > > > > > > Signed-off-by: Kugan Vivekanandarajah > > > > > > > > > <kvivekana...@nvidia.com> > > > > > > > > >
0001-abshalffloat-v4.patch
Description: 0001-abshalffloat-v4.patch