It is nice to see that we converge on the issues we find.
Means that this is getting pretty stable :-)
On Tue, Jan 19, 2016 at 8:17 PM, Stephan Ewen wrote:
> Yeah, we saw this as well this morning, in a job that triggers checkpoints
> super fast (50msecs).
>
> I think we have a good fix figured
Yeah, we saw this as well this morning, in a job that triggers checkpoints
super fast (50msecs).
I think we have a good fix figured out, let's solve this for 1.0...
On Tue, Jan 19, 2016 at 3:25 PM, Gyula Fóra wrote:
> I just got back to this issue. The problem wasn't with the locking but that
>
I just got back to this issue. The problem wasn't with the locking but that
the StreamTask wasn't in running state before the first checkpoint trigger
message.
I actually just saw your JIRA as well, funny... :)
Regards,
Gyula
Stephan Ewen ezt írta (időpont: 2016. jan. 8., P, 15:36):
> Hmm, stra
Hmm, strange issue indeed.
So, checkpoints are definitely triggered (log message by coordinator to
trigger checkpoint) but are not completing?
Can you check which is the first checkpoint to complete? Is it Checkpoint
1, or a later one (indicating that checkpoint 1 was somehow subsumed).
Can you c
Hey,
I have encountered a weird issue in a checkpointing test I am trying to
write. The logic is the same as with the previous checkpointing tests,
there is a OnceFailingReducer.
My problem is that before the reducer fails, my job cannot take any
snapshots. The Runnables executing the checkpointi