Re: Flink Job Manager Recovery from EKS Node Terminations

2023-01-11 Thread Yang Wang
First, JobManager does not store any persistent data to local when the Kubernetes HA + S3 used. It means that you do not need to mount a PV for JobMananger deployment. Secondly, node failures or terminations should not cause the CrashLoopBackOff status. One possible reason I could imagine is a bug

Flink Job Manager Recovery from EKS Node Terminations

2023-01-05 Thread Vijay Jammi
Hi, Have a query on the Job Manager HA for flink 1.15. We currently run a standalone flink cluster with a single JobManager and multiple TaskManagers, deployed on top of a kubernetes cluster (EKS cluster) in application mode (reactive mode). The Task Managers are deployed as a ReplicaSet and the