(if-conversion could directly generate masked load/stores
  of course and not use a scratch-pad at all in that case).

[Abe wrote:]
IMO that`s a great idea, but I don`t know how to do it.
Hints would be welcome.  In particular, how does one
>> "generate masked load/stores" at the GIMPLE level?

[Richard Biener wrote:]
It already does that, see predicate_mem_writes.
> You should definitely preserve that path

Thanks.  Yes, we have not intentionally disabled that.


On what hardware did you test?

AMD64 arch., Intel implementation.  Nothing fancy AFAIK in the flags to make it 
super-specific,
e.g. "-march=nocona" or "-march=native".  Occasionally using AVX-2 flags as 
specified by some test cases.

Regards,

Abe

Reply via email to