Hi All,

I have two questions:

a) Are the records/elements themselves checkpointed, or only the
record offsets? That is, what data is included in a checkpoint
besides state?

b) Where does Flink store the state globally, so that the job manager
can restore it on each task manager when the job restarts after a
failure?

For the heap backend, do all task managers send their state to the
job manager, which then keeps it on its own heap? Is that correct?

For the fs/rocksdb backends, do all task managers save their state
(incrementally or not) to a local path temporarily, and then send it
(in RocksDB snapshot format in the rocksdb case?) to the job manager
at checkpoint time?

Is the path we configure for the backend a path on the job manager
machine rather than on the task managers' machines? If so, is that a
bottleneck and a single point of failure? Would it then be better to
use an HDFS path, so that we can scale the storage and make it highly
available as well?
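
To make the last question concrete, this is roughly the setup I have
in mind in flink-conf.yaml. The namenode host, port, and directory
are placeholders, and the key names are as I understand them from the
docs, so they may differ between Flink versions:

```yaml
# Sketch only: host, port, and path below are placeholders.
state.backend: rocksdb
# Ask RocksDB for incremental checkpoints (my assumption of what we want).
state.backend.incremental: true
# Checkpoint directory on HDFS instead of a local path.
state.checkpoints.dir: hdfs://namenode-host:8020/flink/checkpoints
```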

Thank you all.
