Hi All, I have two questions:
a) does the records/elements themselves would be checkpointed? or just record offset checkpointed? That is, what data included in the checkpoint except for states? b) where flink stores the state globally? so that the job manager could restore them on each task manger at failure restart. For the heap backend, all task managers would send states to job manager, and job manager would save it in its heap, correct? For the fs/rocksdb backend, all task managers would save states (incrementally or not) in local path temporarily, and send them (in rocksdb snapshot format for the rocksdb case?) to the job manager at checkpoint? The path we used to configure backend is the path on the job manager machine but not on the task managers' machines? So that's the bottleneck and single failure point? So it's better to use hdfs path so that we could scale the storage and make it high availability as well? Thank you all.