I'm trying to get my head around the impact of setting max parallelism.

* Does max parallelism primarily serve as a reservation for future
increases to parallelism? The reservation being the ability to restore
from checkpoints and savepoints after increases to parallelism.

* Does it serve as a runtime suggestion for how many instances of an
operator the job could spin up? Or is it just a reservation like I
asked above?

* It also appears to impact the distribution of key groups among
subtasks from what I've read and seen from testing. Is that
understanding correct?

* What are the other important implications?


Thank you,
Aeden

Reply via email to