Re: "Error while retrieving the leader gateway" when using Kubernetes HA

2023-01-31 Thread Anton Ippolitov via user
Makes sense, thank you! On Tue, Jan 31, 2023 at 10:48 AM Gyula Fóra wrote: > Thanks @Anton Ippolitov > At this stage I would highly recommend the native mode if you have the > liberty to try that. > I think that has better production characteristics and will work out of > the box with the autos

Re: "Error while retrieving the leader gateway" when using Kubernetes HA

2023-01-31 Thread Gyula Fóra
Thanks @Anton Ippolitov At this stage I would highly recommend the native mode if you have the liberty to try that. I think that has better production characteristics and will work out of the box with the autoscaler. (the standalone mode won't) Gyula On Tue, Jan 31, 2023 at 10:41 AM Anton Ippoli

Re: "Error while retrieving the leader gateway" when using Kubernetes HA

2023-01-31 Thread Anton Ippolitov via user
I am using the Standalone Mode indeed, should've mentioned it right away. This fix looks exactly like what I need, thank you!! On Tue, Jan 31, 2023 at 9:16 AM Gyula Fóra wrote: > There is also a pending fix for the standalone + k8s HA case : > https://github.com/apache/flink-kubernetes-operator/

Re: "Error while retrieving the leader gateway" when using Kubernetes HA

2023-01-31 Thread Gyula Fóra
There is also a pending fix for the standalone + k8s HA case: https://github.com/apache/flink-kubernetes-operator/pull/518 You could maybe try and review the fix :) Gyula On Tue, Jan 31, 2023 at 8:36 AM Yang Wang wrote: > I assume you are using the standalone mode. Right? > > For the native K

Re: "Error while retrieving the leader gateway" when using Kubernetes HA

2023-01-30 Thread Yang Wang
I assume you are using the standalone mode. Right? For the native K8s mode, the leader address should be akka.tcp://flink@JM_POD_IP:6123/user/rpc/dispatcher_1 when HA is enabled. Best, Yang On Tue, Jan 31, 2023 at 00:21, Anton Ippolitov via user wrote: > This is actually what I'm already doing, I'm only sett
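
The distinction Yang draws above (a JobManager pod IP in the leader address for native mode, versus the service-style host name seen in the original error) can be checked mechanically. The following is only a rough sketch, not anything from Flink or the operator: the helper name leader_host_kind, the sample IP 10.0.0.5, and the host my-service.my-namespace are all hypothetical placeholders, and the second address is an invented illustration rather than the address from the report.

```python
import ipaddress
from urllib.parse import urlparse


def leader_host_kind(leader_address: str) -> str:
    """Classify the host part of a leader address (hypothetical helper).

    Returns "pod-ip" if the host is an IP literal, otherwise "dns-name".
    """
    # urlparse accepts "akka.tcp" as a scheme and strips the "flink@" user part.
    host = urlparse(leader_address).hostname
    if host is None:
        raise ValueError(f"no host found in {leader_address!r}")
    try:
        ipaddress.ip_address(host)
    except ValueError:
        # Not an IP literal, so treat it as a DNS name (e.g. a Kubernetes service).
        return "dns-name"
    return "pod-ip"


# Placeholder addresses, assumed for illustration only.
print(leader_host_kind("akka.tcp://flink@10.0.0.5:6123/user/rpc/dispatcher_1"))
# prints "pod-ip"
print(leader_host_kind("akka.tcp://flink@my-service.my-namespace:6123/user/rpc/dispatcher_1"))
# prints "dns-name"
```

Running something like this over the leader addresses logged by the TaskManagers would show at a glance whether they carry a pod IP, as expected for native mode with HA, or a DNS-style host name.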

Re: "Error while retrieving the leader gateway" when using Kubernetes HA

2023-01-30 Thread Anton Ippolitov via user
This is actually what I'm already doing, I'm only setting high-availability: kubernetes myself. The other values are either defaults or set by the Operator: - jobmanager.rpc.port: 6123 is the default value (docs

Re: "Error while retrieving the leader gateway" when using Kubernetes HA

2023-01-27 Thread Gyula Fóra
We never encountered this problem before but also we don't configure those settings. Can you simply try: high-availability: kubernetes And remove the other configs? I think that can only cause problems and should not achieve anything :) Gyula On Fri, Jan 27, 2023 at 6:44 PM Anton Ippolitov via

"Error while retrieving the leader gateway" when using Kubernetes HA

2023-01-27 Thread Anton Ippolitov via user
Hi everyone, I've been experimenting with Kubernetes HA and the Kubernetes Operator and ran into the following issue which is happening regularly on TaskManagers with Flink 1.16: Error while retrieving the leader gateway. Retrying to connect to akka.tcp://flink@SERVICE-NAME-HERE.NAMESPACE-HERE:61