Thanks, that makes sense. I was thinking that while calibration, you could use different attributes for `simulated_quantize` and `simulated_dequantize` ops. In the callback of calibrating an operator, one can simulate the affine space and argue about scales and zero points. But for capturing real values, you could use the passthrough feature of simulated ops to prevent any error. In this case, `qnn.simulated_quantize (passthrough)` --> `nn.conv2d` --> `qnn.simualted_dequantize (passthrough)` will work. But, I read your earlier RFC, and I think you are also maintaining the original graph to find the real tensor values without any error if needed. So, it makes sense to me.
--- [Visit Topic](https://discuss.tvm.apache.org/t/rfc-quantization-a-new-quantization-framework-in-tvm-initial-rfc-1-4/9775/25) to respond. You are receiving this because you enabled mailing list mode. To unsubscribe from these emails, [click here](https://discuss.tvm.apache.org/email/unsubscribe/8b9f22eb388a25c452dfcc452b49569cc5325d764172b6e8ddc2360af56a8c18).