If you want to write a single file, you need to write it with one task. So, you can run a program with parallelism 100 and just set the sink operator to parallelism 1.
You can set the parallelism of each individual operator by calling "setParallelism()" after the operation, for example "result.writeAsText(path).setParallelism(1)". On Wed, May 13, 2015 at 8:02 PM, Mihail Vieru <vi...@informatik.hu-berlin.de > wrote: > Hi, > > I need to write a data set to a single file without setting the > parallelism to 1. > How can I achieve this? > > Cheers, > Mihail > > P.S.: it's for persisting intermediate results in loops and reading those > in the next iteration. > Which btw work for higher iteration counts with explicit persistence. >