Hi Bill, a DataSet is just a logical concept in Flink. DataSets are often not persisted and just streamed along operators. At the moment, there is no way to access an intermediate DataSet of a Flink program directly (this might change in the future).
You can process data in another function by implementing a Java user function (for example a MapPartition function) and sending the data through JNI to a C function (if you need the full data set, you must set the parallelism to 1). Flink's Python API follows a similar approach to ship data from Flink to an external Python process. Best, Fabian 2015-07-06 9:30 GMT+02:00 Bill Sparks <jspa...@cray.com>: > Just a question if there was some prior-art here. Just say someone > wanted to use flink for processing, but at some point they wanted to call > another function via say JNI/C which doesn't understand DataSet's. How > would one go about this ... I'm assuming the code would have to convert the > data to a common format prior to calling the function. > > > > > > Regards, > > Bill. >