Yes, this is the common issue that hardware vendor meet when integrating there tool chain through BYOC. You can refer the method used by tensorRT, 1. parttion any op your acc supported, then maybe you get multiple your acc subgraph. 2. reinline the subgraphs that haven't big computation into relay main function.
The code of tensorRT already in the TVM repo. --- [Visit Topic](https://discuss.tvm.apache.org/t/byoc-how-to-partition-specific-region-of-a-relay-graph-to-cpu/11758/4) to respond. You are receiving this because you enabled mailing list mode. To unsubscribe from these emails, [click here](https://discuss.tvm.apache.org/email/unsubscribe/5b77fdb21494960b36c6ad5414287762d2f903b3ba261d8844482f682bcc45e1).