GitHub user xinyuiscool opened a pull request:
https://github.com/apache/samza/pull/364
SAMZA-1505: Fix CheckpointTool writing only one ssp per task
Currently when using CheckpointTool to write checkpoints, it only writes a
checkpoint of a single ssp per task. By debugging the code, looks like the
flatMap() on the checkpoint of Optional tuple(taskname -> Map(ssp -> offset))
merges the results by key taskname. This patch stores the results explicitly in
a list and then groupBy() on it, which fixes the problem.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/xinyuiscool/samza SAMZA-1505
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/samza/pull/364.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #364
----
commit 98fb40bdb1f0c5f1c8ab55c231b6fddb0862e739
Author: xiliu <[email protected]>
Date: 2017-11-20T21:36:44Z
SAMZA-1505: Fix CheckpointTool writing only one ssp per task
----
---