This series of patches reworks the way that OpenACC kernels regions are
processed by GCC. Instead of relying on the parloops pass for
auto-parallelisation of the kernel region, the contents of the region are
transformed into a sequence of offloaded regions, which are then processed
individually.
Tested on an x86_64 host, with offloading to a Nvidia Tesla K20c card.
Okay for trunk?
Thanks
Kwok