Re: [PATCH v3] RISC-V: Fix wrong LMUL when only implict zve32f.

Robin Dapp Mon, 31 Mar 2025 23:32:49 -0700

Yeah...and I also don't like the magic "ceil(AVL / 2) ≤ vl ≤ VLMAX if
AVL < (2 * VLMAX)" rule...
+1, spec has some description about this but I am not sure if I really get the 
point.

From Spec:
"For example, this permits an implementation to set vl = ceil(AVL/ 2) for VLMAX < AVL < 2*VLMAX in order to evenlydistribute work over the last two iterations of a stripmine loop. Requirement2 ensures that the rst stripmine iteration of reductionloops uses the largest vector length of all iterations, even in the case ofAVL < 2*VLMAX. This allows software to avoid needing toexplicitly calculate a running maximum of vector lengths observedduring a stripmined loop. Requirement 2 also allows an
implementation to set vl to VLMAX for VLMAX < AVL < 2*VLMAX"


Yeah, that's very unfortunate.

The rule is something like

if AVL >= 2 * VLMAX

   vl = vsetvl = min (AVL, VLMAX)

 if VLMAX > AVL < 2 * VLMAX
   vl = vsetvl = "whatever" ;)

 if AVL <= VLMAX
   vl = vsetvl = min (AVL, VLMAX)

The idea of load balancing is alright I guess but it really complicates mattersin the compiler.

FWIW my plan for GCC 16 is to define a SELECT_VL_SANE (or any better name I cancome up with) that doesn't have this behavior and always only performs aminimum instead. This will allow us to perform scalar evolution on vsetvlrather than giving up as we do right now. Microarchitectures where vsetvlalways behaves like a minimum would then enable the corresponding expander/insnand others would fall back to the current behavior.


--
Regards
Robin

Re: [PATCH v3] RISC-V: Fix wrong LMUL when only implict zve32f.

Reply via email to