Re: Using retained checkpoints as savepoints

Aljoscha Krettek Tue, 18 Feb 2020 04:07:13 -0800

Hi,

the reason why we are quite conservative when it comes to statingproperties of checkpoints is that we don't want to prevent ourselvesfrom implementing possibly optimized checkpoint formats that would notsupport these features.

You're right that currently checkpoints support most of the features ofsavepoints because they did not diverge far in their formats (or not atall).

AFAIK, this is not written down anywhere so it would be good to discussif we want to give those guarantees (which ties our hands a bit more) orkeep it as is but properly document it.


Best,
Aljoscha

On 30.01.20 01:58, Ken Krugler wrote:

Hi all,

Currently 
https://ci.apache.org/projects/flink/flink-docs-master/ops/state/checkpoints.html#difference-to-savepoints
 
<https://ci.apache.org/projects/flink/flink-docs-master/ops/state/checkpoints.html#difference-to-savepoints>
 says checkpoints…

"do not support Flink specific features like rescaling"

But I believe they do, and really must if you can use them like a savepoint. 
Should that sentence be changed, or removed?

Also this page doesn’t talk about state migration, which is another aspect of 
restarting a (modified) workflow from a retained checkpoint…will that work?

This sentence about checkpoints on 
https://ci.apache.org/projects/flink/flink-docs-master/ops/state/savepoints.html#what-is-a-savepoint-how-is-a-savepoint-different-from-a-checkpoint
 
<https://ci.apache.org/projects/flink/flink-docs-master/ops/state/savepoints.html#what-is-a-savepoint-how-is-a-savepoint-different-from-a-checkpoint>
 implies not:

"Optimizations towards those goals can exploit certain properties, e.g. that the job 
code doesn’t change between the execution attempts"

Thanks,

— Ken

--------------------------
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Cassandra & Solr

Re: Using retained checkpoints as savepoints

Reply via email to