https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107413

--- Comment #13 from Rama Malladi <rvmallad at amazon dot com> ---
(In reply to CVS Commits from comment #12)
> The master branch has been updated by Wilco Dijkstra <wi...@gcc.gnu.org>:
> 
> https://gcc.gnu.org/g:0c1b0a23f1fe7db6a2e391b7cb78cff900377772
> 
> commit r13-4291-g0c1b0a23f1fe7db6a2e391b7cb78cff900377772
> Author: Wilco Dijkstra <wilco.dijks...@arm.com>
> Date:   Wed Nov 23 17:27:19 2022 +0000
> 
>     AArch64: Add fma_reassoc_width [PR107413]
>     
>     Add a reassocation width for FMA in per-CPU tuning structures. Keep
>     the existing setting of 1 for cores with 2 FMA pipes (this disables
>     reassociation), and use 4 for cores with 4 FMA pipes.  This improves
>     SPECFP2017 on Neoverse V1 by ~1.5%.
>     
>     gcc/
>             PR tree-optimization/107413
>             * config/aarch64/aarch64.cc (struct tune_params): Add
>             fma_reassoc_width to all CPU tuning structures.
>             (aarch64_reassociation_width): Use fma_reassoc_width.
>             * config/aarch64/aarch64-protos.h (struct tune_params): Add
>             fma_reassoc_width.

Thank you for this code change/ fix. I will attempt a run with this change.

Reply via email to