Re: [RISC-V] Reorder the ready queue to avoid extraneous vsetvls

Jeff Law Mon, 04 Nov 2024 16:03:59 -0800



On 10/30/24 7:06 PM, Jeff Law wrote:

So this patch is a very conservative approach to eliminate more vsetvlinstructions.
As we know, scheduling can scramble the instruction stream based on avariety of factors and can easily result in an instruction sequencewhere we ping-pong between different vector configurations, thusresulting in more vsetvl instructions than if we had issued instructionsin a slightly different order.
Robin did some experiments with vsetvl aware scheduling several weeksago. I believe he was adjusting the priority or cost of the insns basedon their vsetvl needs. This experiment actually showed worseperformance on our design, which is a good indicator that scheduling forlatency and functional hazards is more important than eliminating vsetvlinstructions (not a surprise obviously).
That experiment greatly influenced this implementation. Specifically wedon't adjust cost/priority of any insns. Instead we use vectorconfiguration information to potentially swap two insns in thescheduler's ready queue iff swapping would result in avoiding a vsetvland the two insns being swapped in the ready queue have the samepriority/cost.
So it's quite conservative, essentially using vector configuration as asort key of last resort and only for the highest priority insns in theready queue and only when it's going to let us eliminate a vsetvl.
For something like SATD we eliminate a few gratuitous vsetvlinstructions. As expected it makes no performance difference in thepico benchmarks we've built on our design. The main goal here is toside step questions about the over-active changing of vectorconfigurations.
My BPI is busy, so I don't have performance data on that design, but itdoes result in a ~1% drop in dynamic instructions. On a highperformance design I wouldn't expect this to help performance in ansignificant way, but it does make the code look better.
Bootstrapped and regression tested on riscv64-linux-gnu, also regressiontested in my tester for rv32 and rv64 elf configurations.
I'll wait for the pre-commit tester to render a verdict *and* for otherswho know the vsetvl code to have time to chime in. Assuming clean frompre-commit and no negative comments the plan would be to commit nextweek sometime.

So I'm putting this on hold. I'm seeing unexpected performanceregressions when I ran this on the Ventana design. Not really surewhat's going on and given this was supposed to be neutral, something'sclearly not right. That would tend to indicate something is goofy inthe priorities.

The other possibility is the swap step. It would have been better toslide the entries into the hold vacated by the insn we want to move tothe head of the list -- that permutes the schedule a bit less. But ifthat's making a performance difference, again that would be a sign thatpriorities are goofy.

Either way, I'm putting this on ice for now. Hopefully I'll come backto it (or someone else can take a looksie) in the not terribly distantfuture.


jeff

Re: [RISC-V] Reorder the ready queue to avoid extraneous vsetvls

Reply via email to