I see, indeed. For this case, the code should already be efficient enough. The temp will get narrowed into a size-1 buffer and then placed into a register during codegen.
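To illustrate what that narrowing amounts to, here is a rough plain-Python sketch (not TVM output, and not the actual pass). The function names `matmul_add_full_temp` and `matmul_add_scalar_temp` are made up for this example. It assumes the temp is produced and consumed inside the same `(i, j)` iteration, so only one element is ever live at a time:

```python
def matmul_add_full_temp(A, B, C):
    """Matmul into a full-size temp buffer, then add C in a second pass."""
    n, k, m = len(A), len(B), len(B[0])
    temp = [[0.0] * m for _ in range(n)]  # full n x m temporary
    for i in range(n):
        for j in range(m):
            for r in range(k):
                temp[i][j] += A[i][r] * B[r][j]
    return [[temp[i][j] + C[i][j] for j in range(m)] for i in range(n)]


def matmul_add_scalar_temp(A, B, C):
    """Same computation with the temp narrowed to one element per (i, j)."""
    n, k, m = len(A), len(B), len(B[0])
    out = [[0.0] * m for _ in range(n)]
    for i in range(n):
        for j in range(m):
            acc = 0.0  # stands in for the size-1 temp, i.e. a register
            for r in range(k):
                acc += A[i][r] * B[r][j]
            out[i][j] = acc + C[i][j]  # the addition consumes the temp right away
    return out
```

Both versions compute the same result. The second one just shows that once the temp only needs to hold a single element, a backend compiler can keep it in a register instead of going through memory.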