On Mon, Aug 26, 2024 at 2:43 PM Haochen Jiang <haochen.ji...@intel.com> wrote:
>
> Hi all,
>
> I committed the AVX10.2 new instruction patches to trunk a few hours
> ago. The next and final part of the AVX10.2 upstreaming is to optimize
> code with the new AVX10.2 instructions.
>
> This patch series contains the following optimizations:
>
>   - Auto-vectorization with VNNI instructions (PATCH 1).
>   - Codegen optimization with new scalar comparison instructions to
>     eliminate redundant code (PATCH 2-3).
>   - Auto-vectorization with BF16 instructions (PATCH 4-8).
>
> This will finish the upstreaming of the AVX10.2 series.
>
> Afterwards, we may add V2BF/V4BF in another thread, just as we did
> for V2HF/V4HF when AVX512FP16 was upstreamed.
>
> Bootstrapped on x86_64-pc-linux-gnu. Ok for trunk?
Ok for all 8 patches.
>
> Thx,
> Haochen
>
>


-- 
BR,
Hongtao
