First of all, Ansor is no good for int8, since it cannot use fast int8 hardware (VNNI, tensorcore) at all.
* How are you quantizing the model? * What backends are you interested in? CPU or GPU? --- [Visit Topic](https://discuss.tvm.apache.org/t/quantized-transformer/11850/2) to respond. You are receiving this because you enabled mailing list mode. To unsubscribe from these emails, [click here](https://discuss.tvm.apache.org/email/unsubscribe/137017a5eb5bde025d0285bea9e76973b1fda29a58caf76c6f66dba158e76825).