First of all, Ansor is no good for int8, since it cannot use fast int8 hardware 
(VNNI, tensorcore) at all.

* How are you quantizing the model?
* What backends are you interested in? CPU or GPU?





---
[Visit Topic](https://discuss.tvm.apache.org/t/quantized-transformer/11850/2) 
to respond.

You are receiving this because you enabled mailing list mode.

To unsubscribe from these emails, [click 
here](https://discuss.tvm.apache.org/email/unsubscribe/137017a5eb5bde025d0285bea9e76973b1fda29a58caf76c6f66dba158e76825).

Reply via email to