[ https://issues.apache.org/jira/browse/FLINK-26577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17504076#comment-17504076 ]
Gyula Fora commented on FLINK-26577:
------------------------------------

cc [~wangyang0918] [~thw]

> Avoid state loss when switching to last-state upgrade mode
> ----------------------------------------------------------
>
>                 Key: FLINK-26577
>                 URL: https://issues.apache.org/jira/browse/FLINK-26577
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Kubernetes Operator
>            Reporter: Gyula Fora
>            Priority: Major
>
> At the moment there are several corner cases that can lead to accidental
> state loss (or at least weird behaviour) when switching to last-state upgrade
> mode from other modes.
> 2 cases that immediately come to mind:
> savepoint to last-state:
> When the new upgrade mode is last-state, the job deployment will simply be
> deleted. If HA was not enabled previously, the last savepoint might be very
> far back in time.
> stateless to last-state:
> If checkpointing and HA are not enabled, the deployment will simply be killed
> as before, and we might start the job from empty state. Maybe taking a
> savepoint would be the right approach in this case, continuing from there.
> Maybe when switching between modes we should consider the previous mode as
> well as the target mode when deciding on the suspend strategy. We could
> also simply not allow switching to last-state if HA was not enabled previously,
> but that might be too restrictive.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
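
The idea from the issue description, choosing the suspend strategy from both the previous and the target upgrade mode, could be sketched roughly as below. This is only an illustration and not the operator's actual code: the class `SuspendStrategySketch`, the enums `UpgradeMode` and `SuspendAction`, the `decide` method, and the `haEnabledPreviously` flag are all hypothetical names. It also assumes that a job previously run in last-state mode already had HA enabled.

```java
// Hypothetical sketch; none of these names come from the Flink Kubernetes Operator.
final class SuspendStrategySketch {

    enum UpgradeMode { STATELESS, SAVEPOINT, LAST_STATE }

    enum SuspendAction {
        CANCEL_WITHOUT_STATE,           // kill the deployment, keep no state
        SAVEPOINT_THEN_CANCEL,          // take a fresh savepoint before stopping
        DELETE_DEPLOYMENT_KEEP_HA_DATA  // delete the deployment, restore from HA metadata
    }

    /**
     * Picks a suspend action from the previous mode, the target mode, and
     * whether HA was enabled for the currently running deployment.
     */
    static SuspendAction decide(
            UpgradeMode previous, UpgradeMode target, boolean haEnabledPreviously) {
        switch (target) {
            case STATELESS:
                return SuspendAction.CANCEL_WITHOUT_STATE;
            case SAVEPOINT:
                return SuspendAction.SAVEPOINT_THEN_CANCEL;
            case LAST_STATE:
                // Assumption: a job that was already in last-state mode had HA enabled.
                boolean haMetadataExpected =
                        previous == UpgradeMode.LAST_STATE || haEnabledPreviously;
                // Without HA metadata, deleting the deployment could restart the job
                // from a stale savepoint or from empty state, so fall back to a
                // fresh savepoint instead.
                return haMetadataExpected
                        ? SuspendAction.DELETE_DEPLOYMENT_KEEP_HA_DATA
                        : SuspendAction.SAVEPOINT_THEN_CANCEL;
            default:
                throw new IllegalArgumentException("Unknown upgrade mode: " + target);
        }
    }
}
```

With this shape, both corner cases from the description (savepoint to last-state and stateless to last-state, each without HA) end up taking a savepoint rather than deleting the deployment outright. The stricter alternative mentioned in the description, rejecting the switch when HA was not enabled, would replace that fallback with a validation error.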