I don't think you should set `TVM_NUM_THREADS` on ARM, because of ARM's big.LITTLE architecture. Instead, I think you should call `runtime.config_thread_pool` to do the core binding. Another thing: we shouldn't let TVM worker threads run on CPUs with different frequencies (i.e., one worker thread on a big core and another on a LITTLE core), since that gives worse performance.
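
To make the core binding concrete, here is a rough sketch of what I mean. Assumptions to check against your TVM version: in the releases I have looked at, the packed function is registered as `runtime.config_threadpool` (no underscore between "thread" and "pool"), it takes `(mode, num_threads)`, and the mode is `1` for the big cluster and `-1` for the LITTLE cluster. The helper name `bind_workers_to_cluster` and its `get_global_func` parameter are made up for illustration.

```python
def bind_workers_to_cluster(get_global_func, use_big=True, num_threads=4):
    """Pin all TVM worker threads to one cluster of a big.LITTLE CPU.

    get_global_func: callable that looks up a TVM packed function by name,
        e.g. tvm.get_global_func locally or remote.get_function over RPC.
    use_big: True selects the big cluster, False the LITTLE cluster.
    num_threads: number of worker threads to run on that cluster.
    """
    # Assumed convention: 1 = big cores, -1 = LITTLE cores.
    mode = 1 if use_big else -1
    # Assumed registered name; verify it exists in your TVM build.
    config_threadpool = get_global_func("runtime.config_threadpool")
    config_threadpool(mode, num_threads)
    return mode, num_threads


# Typical use (needs TVM installed, so left as comments):
#   import tvm
#   bind_workers_to_cluster(tvm.get_global_func, use_big=True, num_threads=4)
# or on a device reached through an RPC session called `remote`:
#   bind_workers_to_cluster(remote.get_function, use_big=True, num_threads=4)
```

Set `num_threads` to at most the number of cores in the chosen cluster, so that no worker ends up on the other cluster.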