Re: RFR: 8322768: Optimize non-subword vector compress and expand APIs for AVX2 target. [v3]

Jatin Bhateja Sun, 07 Jan 2024 22:29:26 -0800

On Fri, 5 Jan 2024 10:02:28 GMT, Emanuel Peter <[email protected]> wrote:


> Thanks for the updates!
> 
> One more idea: Your AVX2 solution has a lot of cost for converting the mask 
> to a permutation. Might it make sense to split this off into a separate 
> vector-node, so that it can float out of a loop if the mask is invariant?

CompressV / ExpandV accept only two inputs: the vector to be operated on and 
the mask under which the operation is performed. The permute-table-based 
implementation is specific to the x86 backend.
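
For readers following the thread, here is a minimal scalar sketch of the two 
steps under discussion: turning a mask into a permutation, then applying that 
permutation to get a compress. It is an illustrative model only, not the 
HotSpot code or the lookup table used by the patch. The names CompressSketch, 
maskToPermutation, permute and compress are hypothetical, and the -1 "zero 
this lane" marker is an assumption made for the sketch.

```java
// Scalar model of a mask-driven compress (illustrative only, not HotSpot code).
class CompressSketch {
    // Step 1: derive a permutation from the mask. Indices of the set mask
    // bits come first, in lane order. Unused slots get -1, which this sketch
    // treats as "zero this lane" (an assumption of the model).
    static int[] maskToPermutation(int mask, int lanes) {
        int[] perm = new int[lanes];
        int out = 0;
        for (int i = 0; i < lanes; i++) {
            if ((mask & (1 << i)) != 0) {
                perm[out++] = i;
            }
        }
        for (; out < lanes; out++) {
            perm[out] = -1;
        }
        return perm;
    }

    // Step 2: apply the permutation, zeroing lanes marked with -1.
    static int[] permute(int[] src, int[] perm) {
        int[] dst = new int[src.length];
        for (int i = 0; i < src.length; i++) {
            dst[i] = perm[i] < 0 ? 0 : src[perm[i]];
        }
        return dst;
    }

    // Compress takes only the vector and the mask, mirroring the two-input
    // shape of the node; the permutation is an internal detail.
    static int[] compress(int[] src, int mask) {
        return permute(src, maskToPermutation(mask, src.length));
    }
}
```

In this model only maskToPermutation depends solely on the mask, so it is the 
part that the suggestion above would hoist when the mask is loop-invariant.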

-------------

PR Comment: https://git.openjdk.org/jdk/pull/17261#issuecomment-1880430502
