Yes, this is a common issue that hardware vendors run into when integrating their 
toolchains through BYOC. 
You can follow the approach used by the TensorRT integration:
1. Partition every op your accelerator supports; this may leave you with several 
accelerator subgraphs.
2. Inline the subgraphs that don't contain much computation back into the Relay main function.

The TensorRT code is already in the TVM repo.
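
To make the second step concrete, here is a rough sketch of the selection logic only. It is a toy model in plain Python and does not call TVM: the `Subgraph` class, the `mac_count` field, the `split_by_compute` helper, and the `min_macs` threshold are all hypothetical names I made up for illustration, and the real TensorRT integration may use a different criterion.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Subgraph:
    """Hypothetical stand-in for one partitioned accelerator subgraph."""
    name: str
    ops: List[str]
    mac_count: int  # assumed estimate of multiply-accumulate operations


def split_by_compute(
    subgraphs: List[Subgraph], min_macs: int
) -> Tuple[List[str], List[str]]:
    """Return (names to keep offloaded, names to inline back into main).

    A subgraph stays on the accelerator only if its estimated compute
    reaches the (assumed) min_macs threshold.
    """
    keep: List[str] = []
    inline: List[str] = []
    for sg in subgraphs:
        if sg.mac_count >= min_macs:
            keep.append(sg.name)
        else:
            inline.append(sg.name)
    return keep, inline


# Step 1 is assumed to have produced these three subgraphs.
subgraphs = [
    Subgraph("acc_0", ["nn.conv2d", "nn.relu"], 1_000_000),
    Subgraph("acc_1", ["reshape"], 0),
    Subgraph("acc_2", ["nn.dense"], 4096),
]

# Step 2: anything with no real computation goes back to the main function.
keep, inline = split_by_compute(subgraphs, min_macs=1)
print("keep offloaded:", keep)
print("inline into main:", inline)
```

In a real flow, the names in `inline` would be the partitioned functions you replace with their bodies in the Relay main function, so that only the compute-heavy regions pay the cost of crossing into the accelerator runtime.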





---
[Visit 
Topic](https://discuss.tvm.apache.org/t/byoc-how-to-partition-specific-region-of-a-relay-graph-to-cpu/11758/4)
 to respond.
