Re: RISC-V: Folding memory for FP + constant case

Jeff Law via Gcc-patches Fri, 14 Jul 2023 23:17:03 -0700



On 7/12/23 14:59, Jivan Hakobyan via Gcc-patches wrote:

Accessing local arrays element turned into load form (fp + (index <<
C1)) + C2 address. In the case when access is in the loop we got loop
invariant computation. For some reason, moving out that part cannot
be done in loop-invariant passes. But we can handle that in
target-specific hook (legitimize_address). That provides an
opportunity to rewrite memory access more suitable for the target
architecture.

This patch solves the mentioned case by rewriting mentioned case to
((fp + C2) + (index << C1)) I have evaluated it on SPEC2017 and got
an improvement on leela (over 7b instructions, .39% of the dynamic
count) and dwarfs the regression for gcc (14m instructions, .0012% of
the dynamic count).


gcc/ChangeLog: * config/riscv/riscv.cc (riscv_legitimize_address):
Handle folding. (mem_shadd_or_shadd_rtx_p): New predicate.

So I still need to give the new version a review. But a high levelquestion -- did you re-run the benchmarks with this version to verifythat we still saw the same nice improvement in leela?

The reason I ask is when I use this on Ventana's internal tree I don'tsee any notable differences in the dynamic instruction counts. Andprobably the most critical difference between the upstream tree andVentana's tree in this space is Ventana's internal tree has an earlierversion of the fold-mem-offsets work from Manolis.

It may ultimately be the case that this work and Manolis's f-m-o patchhave a lot of overlap in terms of their final effect on code generation.Manolis's pass runs much later (after register allocation), so it'snot going to address the loop-invariant-code-motion issue thatoriginally got us looking into this space. But his pass is genericenough that it helps other targets. So we may ultimately want both.

Anyway, just wanted to verify if this variant is still showing the niceimprovement on leela that the prior version did.


Jeff

ps.  I know you're on PTO.  No rush on responding -- enjoy the time off.

Re: RISC-V: Folding memory for FP + constant case

Reply via email to