Some of these are not yet fully functional due to related bugs, but this the correct op mapping. The native ball/bany opcodes act on vec4's unconditionally. That said, both ball and bany have the nice property that duplicating an argument does not affect their output, so the default "hanging swizzles" allow us to implement 2/3-component opcodes correctly, implicitly lowering.
Signed-off-by: Alyssa Rosenzweig <aly...@rosenzweig.io> --- .../drivers/panfrost/midgard/midgard_compile.c | 11 +++++++++++ 1 file changed, 11 insertions(+) diff --git a/src/gallium/drivers/panfrost/midgard/midgard_compile.c b/src/gallium/drivers/panfrost/midgard/midgard_compile.c index 2be883fa3fc..fe779b154a8 100644 --- a/src/gallium/drivers/panfrost/midgard/midgard_compile.c +++ b/src/gallium/drivers/panfrost/midgard/midgard_compile.c @@ -1007,9 +1007,20 @@ emit_alu(compiler_context *ctx, nir_alu_instr *instr) ALU_CASE(ishr, iasr); ALU_CASE(ushr, ilsr); + ALU_CASE(ball_fequal2, fball_eq); + ALU_CASE(ball_fequal3, fball_eq); ALU_CASE(ball_fequal4, fball_eq); + + ALU_CASE(bany_fnequal2, fbany_neq); + ALU_CASE(bany_fnequal3, fbany_neq); ALU_CASE(bany_fnequal4, fbany_neq); + + ALU_CASE(ball_iequal2, iball_eq); + ALU_CASE(ball_iequal3, iball_eq); ALU_CASE(ball_iequal4, iball_eq); + + ALU_CASE(bany_inequal2, ibany_neq); + ALU_CASE(bany_inequal3, ibany_neq); ALU_CASE(bany_inequal4, ibany_neq); /* For greater-or-equal, we use less-or-equal and flip the -- 2.20.1 _______________________________________________ mesa-dev mailing list mesa-dev@lists.freedesktop.org https://lists.freedesktop.org/mailman/listinfo/mesa-dev