This series of patches reworks the way that OpenACC kernels regions are processed by GCC. Instead of relying on the parloops pass for auto-parallelisation of the kernel region, the contents of the region are transformed into a sequence of offloaded regions, which are then processed individually.

Tested on an x86_64 host, with offloading to a Nvidia Tesla K20c card.

Okay for trunk?

Thanks

Kwok

Reply via email to