Re: [DISCUSS] State Management API for Streaming

Stephan Ewen Wed, 03 Jun 2015 05:40:05 -0700

Hi!

Sorry for not responding earlier.


Thanks for taking an initiative there and drafting this. I would like to
have a close look after the 0.9 release is out.

I am in favor of making this a major item for 0.10, to get the state with
all its characteristics defined in a principles fashion: repartitionable,
synchronous vs. asynchronous checkpointing, full vs incremental checkpoints.

Greetings,
Stephan


On Fri, May 29, 2015 at 2:13 PM, Paris Carbone <par...@kth.se> wrote:

> Hello fellow squirrels!
>
>
> We just made a PR [1] of a prototype targeting flexible state management
> for streaming tasks with the prospect of further implementing on top
> different strategies such as lazy state updates, incremental snapshots and
> state partitioning. You can read more regarding the motivation behind this
> design in the doc Gyula created [2].
>
>
> As described in the doc, "managed" operator state and updates are
> explicitly specified by the OperatorState abstraction. Furthermore,
> OperatorState can be partitioned by a key and retrieved by the getState
> method. If the state is partitioned the getState method implicitly returns
> the state of the respective key that is currently processed by the
> operator. It will still be possible to add functionality in order to access
> keys arbitrarily if needed but for now this looks hopefully clear enough.
>
>
> Let's discuss here how we can use/modify this api to fit our general needs
> and vision for state management.
>
>
> Paris
>
>
> [1] https://github.com/apache/flink/pull/747
>
> [2]
> https://docs.google.com/document/d/1nTn4Tpafsnt-TCT6L1vlHtGGgRevU90yRsUQEmkRMjk
>
> ?
>
>

Re: [DISCUSS] State Management API for Streaming

Reply via email to