Re: Iteration stats logging

2015-06-22 Thread Stephan Ewen
Hi Nam-Luc! Having per-iteration statistics and accumulators is on the roadmap. The way I have done this so far is to create accumulators like shown below, which creates a new accumulator for each superstep: class MyFunction extends RichMapFunction{ private LongCounter counter; public

Re: Iteration stats logging

2015-06-20 Thread Robert Metzger
Are you running a fixed number of iterations or do you use a dynamic termination criterion? For fixed iterations, you can get the id of the current iteration ... which allows you to find out when you are running the last iterations. Would it be feasible for you to just log these statistics to the

Re: Iteration stats logging

2015-06-15 Thread Nam-Luc Tran
Hi Ufuk, The kind of things we'd like to log are: time spent in the iteration, residual of the algorithm (convergence), current iteration. Best regards, Tran Nam-Luc   At Monday, 15/06/2015 on 16:15 Ufuk Celebi wrote: Hey Tran Nam-Luc, there is currently no way to do this. The iteration sync

Re: Iteration stats logging

2015-06-15 Thread Ufuk Celebi
Hey Tran Nam-Luc, there is currently no way to do this. The iteration sync tasks keeps track of iteration convergence/max number of iterations and signals termination to the iteration head. After this, the head flushes the produced result to the next task (after the iteration) and the intermed

Iteration stats logging

2015-06-15 Thread Nam-Luc Tran
Hello Everyone, I would like to log certain stats during iterations in a bulk iterative job. The way I do this is store the things I want at each iteration and plan to flush everything to HDFS once all the iterations are done. To do that I would need to know when the last iteration is invoked in o