Hi Zakelly,

Thank you for driving this proposal!

Overall good for me. I have some questions about these names.

1. How about execution.checkpointing.storage.type instead of
execution.checkpointing.storage?

It's similar to state.backend.type.

2. How about execution.checkpointing.local-copy.enabled instead of
execution.checkpointing.local-copy?

You added a new option: execution.checkpointing.local-copy.dir.
IIUC, one option name shouldn't be the prefix of other options.
If you add a new option execution.checkpointing.local-copy,
flink CI will fail directly.

3. execution.checkpointing.savepoint.dir is a little weird.

For old options: state.savepoints.dir and state.checkpoints.dir,
the savepoint and checkpoint are the same level. It means
it's a checkpoint or savepoint.

The new option execution.checkpointing.dir is fine for me.
However, execution.checkpointing.savepoint.dir is a little weird.
I don't know which name is better now. Let us think about it more.

4. How about execution.recovery.claim-mode instead of
execution.recovery.mode?

The meaning of mode is too broad. The claim-mode may
be more accurate for users.

WDYT?

Best,
Rui

On Mon, Dec 25, 2023 at 5:14 PM Zakelly Lan <zakelly....@gmail.com> wrote:

> Hi devs,
>
> I'd like to start a discussion on FLIP-406: Reorganize State &
> Checkpointing & Recovery Configuration[1].
>
> Currently, the configuration options pertaining to checkpointing, recovery,
> and state management are primarily grouped under the following prefixes:
>
>    - state.backend.* : configurations related to state accessing and
>    checkpointing, as well as specific options for individual state backends
>    - execution.checkpointing.* : configurations associated with checkpoint
>    execution and recovery
>    - execution.savepoint.*: configurations for recovery from savepoint
>
> In addition, there are several individual options such as '
> *state.checkpoint-storage*' and '*state.checkpoints.dir*' that fall outside
> of these prefixes. The current arrangement of these options, which span
> multiple modules, is somewhat haphazard and lacks a systematic structure.
> For example, the options under the '*CheckpointingOptions*' and '
> *ExecutionCheckpointingOptions*' are related and have no clear boundaries
> from the user's perspective, but there is no unified prefix for them. With
> the upcoming release of Flink 2.0, we have an excellent opportunity to
> overhaul and restructure the configurations related to checkpointing,
> recovery, and state management. This FLIP proposes to reorganize these
> settings, making it more coherent by module, which would significantly
> lower the barriers for understanding and reduce the development costs
> moving forward.
>
> Looking forward to hearing from you!
>
> [1]
> https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=284789560
>
> Best,
> Zakelly
>

Reply via email to