Thanks, that makes sense. I was thinking that while calibration, you could use 
different attributes for `simulated_quantize` and `simulated_dequantize` ops. 
In the callback of calibrating an operator, one can simulate the affine space 
and argue about scales and zero points. But for capturing real values, you 
could use the passthrough feature of simulated ops to prevent any error. In 
this case, `qnn.simulated_quantize (passthrough)` --> `nn.conv2d` --> 
`qnn.simualted_dequantize (passthrough)` will work. But, I read your earlier 
RFC, and I think you are also maintaining the original graph to find the real 
tensor values without any error if needed. So, it makes sense to me.





---
[Visit 
Topic](https://discuss.tvm.apache.org/t/rfc-quantization-a-new-quantization-framework-in-tvm-initial-rfc-1-4/9775/25)
 to respond.

You are receiving this because you enabled mailing list mode.

To unsubscribe from these emails, [click 
here](https://discuss.tvm.apache.org/email/unsubscribe/8b9f22eb388a25c452dfcc452b49569cc5325d764172b6e8ddc2360af56a8c18).

Reply via email to