For future reference here is a cross link to the referred ML thread
discussion [1].

[1]
http://mail-archives.apache.org/mod_mbox/flink-user/201904.mbox/%3cm2ef5tpfwy.wl-nings...@gmail.com%3E

Cheers,
Till

On Wed, Apr 24, 2019 at 4:00 AM Ning Shi <nings...@gmail.com> wrote:

> Hi Congxian,
>
> I think I have figured out the issue. It's related to the checkpoint
> directory
> collision issue you responded to in the other thread. We reproduced this
> bug on
> 1.6.1 after unchaining the operators.
>
> There are two stateful operators in the chain, one is a
> CoBroadcastWithKeyedOperator, the other one is a StreamMapper. The
> CoBroadcastWithKeyedOperator creates timer states in RocksDB, the latter
> doesn’t. Because of the checkpoint directory collision bug, we always end
> up
> saving the states for CoBroadcastWithKeyedOperator.
>
> After breaking these two operators apart, they try to restore from the
> same set
> of saved states. When the StreamMapper opens the RocksDB files, it doesn’t
> care
> about any of the column families in there, including the timer states.
> Hence the
> error.
>
> --
> Ning
>

Reply via email to