Hi @max1996 ,
Have a go at this : https://github.com/apache/tvm/pull/7938 The metadata.json is augmented to include peak memory usage in the PR. Please note if you are using crt graph runtime, it might copy the weights (~ (if link-params is used && load_params is called) ) and also maintain a copy of I/O tensors in the workspace. For those reasons, that size is also mentioned in the .json. Moreover, I've added instrumentation to see which relay primfunc and tir primfunc resulted in using that much of memory. In order to access this, in the PR, you could use relay.build(...)'s output which is lib : GraphExecutorFactory. (Use lib.function_metadata). --- [Visit Topic](https://discuss.tvm.apache.org/t/measure-memory-allocation-using-debug-executor/9679/5) to respond. You are receiving this because you enabled mailing list mode. To unsubscribe from these emails, [click here](https://discuss.tvm.apache.org/email/unsubscribe/20b216216e935bb1e2907428bcafd1840419a0316e656e7106252ea40df7f9aa).