Also, as part of the standardization of QNN, we could ensure that all QNN 
"compute" ops go from `int8 -> int8` . I believe that `qnn.conv2d` is the only 
QNN op that outputs an accumulation dtype, so we could change `qnn.conv2d` to 
take in bias in addition to the data and weight.





---
[Visit 
Topic](https://discuss.tvm.apache.org/t/rfc-quantization-a-new-quantization-framework-in-tvm-initial-rfc-1-4/9775/24)
 to respond.

You are receiving this because you enabled mailing list mode.

To unsubscribe from these emails, [click 
here](https://discuss.tvm.apache.org/email/unsubscribe/1820bb9f1fb33b6b3299666fed911f862722188ff19ad44e856268522d3f8df2).

Reply via email to