Hi Rui Fan and Junrui,

Thanks for the reminder! I agree to change the
'execution.checkpointing.local-copy' to
'execution.checkpointing.local-copy.enabled'.

And for other suggestions Rui proposed:

1. How about execution.checkpointing.storage.type instead
> of execution.checkpointing.storage?


Ah, I missed something here. Actually I suggest we could merge the current
'state.checkpoints.dir' and 'state.checkpoint-storage' into one URI
configuration named 'execution.checkpointing.dir'. WDYT?

3. execution.checkpointing.savepoint.dir is a little weird.
>

Yes, I think it is better to make 'savepoint' and 'checkpoint' the same
level. But I'm not so sure since there is only one savepoint-related
option. Maybe someone else could share some thoughts here.

4. How about execution.recovery.claim-mode instead of
> execution.recovery.mode?
>

 Agreed. That's more accurate.


Many thanks for your suggestions!

Best,
Zakelly

On Mon, Dec 25, 2023 at 8:18 PM Junrui Lee <jrlee....@gmail.com> wrote:

> Hi Zakelly,
>
> Thanks for driving this. I agree that the proposed restructuring of the
> configuration options is largely positive. It will make understanding and
> working with Flink configurations more intuitive.
>
> Most of the proposed changes look great. Just a heads-up, as Rui Fan
> mentioned, Flink currently requires that no configOption's key be the
> prefix of another to avoid issues when we eventually adopt a standard YAML
> parser, as detailed in FLINK-29372 (
> https://issues.apache.org/jira/browse/FLINK-29372). Therefore, it's better
> to change the key 'execution.checkpointing.local-copy' because it serves as
> a prefix to the key 'execution.checkpointing.local-copy.dir'.
>
> Best regards,
> Junrui
>
> Rui Fan <1996fan...@gmail.com> 于2023年12月25日周一 19:11写道:
>
> > Hi Zakelly,
> >
> > Thank you for driving this proposal!
> >
> > Overall good for me. I have some questions about these names.
> >
> > 1. How about execution.checkpointing.storage.type instead of
> > execution.checkpointing.storage?
> >
> > It's similar to state.backend.type.
> >
> > 2. How about execution.checkpointing.local-copy.enabled instead of
> > execution.checkpointing.local-copy?
> >
> > You added a new option: execution.checkpointing.local-copy.dir.
> > IIUC, one option name shouldn't be the prefix of other options.
> > If you add a new option execution.checkpointing.local-copy,
> > flink CI will fail directly.
> >
> > 3. execution.checkpointing.savepoint.dir is a little weird.
> >
> > For old options: state.savepoints.dir and state.checkpoints.dir,
> > the savepoint and checkpoint are the same level. It means
> > it's a checkpoint or savepoint.
> >
> > The new option execution.checkpointing.dir is fine for me.
> > However, execution.checkpointing.savepoint.dir is a little weird.
> > I don't know which name is better now. Let us think about it more.
> >
> > 4. How about execution.recovery.claim-mode instead of
> > execution.recovery.mode?
> >
> > The meaning of mode is too broad. The claim-mode may
> > be more accurate for users.
> >
> > WDYT?
> >
> > Best,
> > Rui
> >
> > On Mon, Dec 25, 2023 at 5:14 PM Zakelly Lan <zakelly....@gmail.com>
> wrote:
> >
> > > Hi devs,
> > >
> > > I'd like to start a discussion on FLIP-406: Reorganize State &
> > > Checkpointing & Recovery Configuration[1].
> > >
> > > Currently, the configuration options pertaining to checkpointing,
> > recovery,
> > > and state management are primarily grouped under the following
> prefixes:
> > >
> > >    - state.backend.* : configurations related to state accessing and
> > >    checkpointing, as well as specific options for individual state
> > backends
> > >    - execution.checkpointing.* : configurations associated with
> > checkpoint
> > >    execution and recovery
> > >    - execution.savepoint.*: configurations for recovery from savepoint
> > >
> > > In addition, there are several individual options such as '
> > > *state.checkpoint-storage*' and '*state.checkpoints.dir*' that fall
> > outside
> > > of these prefixes. The current arrangement of these options, which span
> > > multiple modules, is somewhat haphazard and lacks a systematic
> structure.
> > > For example, the options under the '*CheckpointingOptions*' and '
> > > *ExecutionCheckpointingOptions*' are related and have no clear
> boundaries
> > > from the user's perspective, but there is no unified prefix for them.
> > With
> > > the upcoming release of Flink 2.0, we have an excellent opportunity to
> > > overhaul and restructure the configurations related to checkpointing,
> > > recovery, and state management. This FLIP proposes to reorganize these
> > > settings, making it more coherent by module, which would significantly
> > > lower the barriers for understanding and reduce the development costs
> > > moving forward.
> > >
> > > Looking forward to hearing from you!
> > >
> > > [1]
> > >
> >
> https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=284789560
> > >
> > > Best,
> > > Zakelly
> > >
> >
>

Reply via email to