Re: VEC_COND_EXPR optimizations v2

Marc Glisse Thu, 06 Aug 2020 11:08:11 -0700

On Thu, 6 Aug 2020, Christophe Lyon wrote:

Was I on the right track configuring with
--target=arm-none-linux-gnueabihf --with-cpu=cortex-a9
--with-fpu=neon-fp16
then compiling without any special option?


Maybe you also need --with-float=hard, I don't remember if it's
implied by the 'hf' target suffix


Thanks! That's what I was missing to reproduce the issue. Now I can
reproduce it with just

typedef unsigned int vec __attribute__((vector_size(16)));
typedef int vi __attribute__((vector_size(16)));
vi f(vec a,vec b){
    return a==5 | b==7;
}

with -fdisable-tree-forwprop1 -fdisable-tree-forwprop2 -fdisable-tree-forwprop3 
-O1

  _1 = a_5(D) == { 5, 5, 5, 5 };
  _3 = b_6(D) == { 7, 7, 7, 7 };
  _9 = _1 | _3;
  _7 = .VCOND (_9, { 0, 0, 0, 0 }, { -1, -1, -1, -1 }, { 0, 0, 0, 0 }, 107);

we fail to expand the equality comparison (expand_vec_cmp_expr_p returns
false), while with -fdisable-tree-forwprop4 we do manage to expand

  _2 = .VCONDU (a_5(D), { 5, 5, 5, 5 }, { -1, -1, -1, -1 }, { 0, 0, 0, 0 }, 
112);

It doesn't make much sense to me that we can expand the more complicated
form and not the simpler form of the same operation (both compare a to 5
and produce a vector of -1 or 0 of the same size), especially when the
target has an instruction (vceq) that does just what we want.

Introducing boolean vectors was fine, but I think they should be realtypes, that we can operate on, not be forced to appear only as the firstargument of a vcond.

I can think of 2 natural ways to improve things: either implement vectorcomparisons in the ARM backend (possibly by forwarding to their existingcode for vcond), or in the generic expansion code try using vcond if thedirect comparison opcode is not provided.

We can temporarily revert my patch, but I would like it to be temporary.Since aarch64 seems to handle the same code just fine, maybe someone whoknows arm could copy the relevant code over?


Does my message make sense, do people have comments?

--
Marc Glisse

Re: VEC_COND_EXPR optimizations v2

Reply via email to