Hi,

I think the runtime support here 
(https://github.com/apache/incubator-tvm/pull/3554) is for uop and instructions 
sync via PCIe. However, if we want to run a full network (e.g., Resnet), we're 
still missing layer-wise synchronization/device_copy if two adjacent layers are 
resident in different devices.

![image|392x498, 75%](upload://c48rnK2nlR7Xhh1mZOzHJz8qs79.png) 

For example, in the above figure, we have to auto-insert a *device_copy* op 
between maxpool and conv2d.





---
[Visit 
Topic](https://discuss.tvm.ai/t/rfc-vta-support-for-cloud-devices-opencl-compatible/6676/22)
 to respond.

You are receiving this because you enabled mailing list mode.

To unsubscribe from these emails, [click 
here](https://discuss.tvm.ai/email/unsubscribe/e42b1966b3e313c63d9bc539f9d5a2b7bc6c880565d9e0e928256312d758e328).

Reply via email to