Hey Folks:
I am trying to figure out the options for running Flink on Kubernetes and would
like to understand the pros and cons of running a Flink session cluster vs a
Flink job cluster
(https://ci.apache.org/projects/flink/flink-docs-stable/ops/deployment/kubernetes.html#flink-session-cluster-on-kubernetes).
I understand that in job mode there is no need to submit the job since it is
part of the job image. But what are the other pros and cons of this approach
vs session mode, where a job manager is deployed and Flink jobs can be submitted
to it? Are there any benefits with regard to:
1. Configuring the jobs
2. Scaling the taskmanagers
3. Restarting jobs
4. Managing the Flink jobs
5. Passing credentials (in case of AWS, etc.)
6. Fault tolerance and recovery of jobs from failure
Also, we will be keeping the checkpoints for the jobs on S3. Is there any need
to specify a volume for the pods? If a volume is required, do we need a
provisioned volume, and what are the recommended alternatives/considerations,
especially with AWS?
If there are any other considerations, please let me know.
Thanks for your advice.