[jira] [Comment Edited] (FLINK-30518) [flink-operator] Kubernetes HA Service not working with standalone mode

Binh-Nguyen Tran (Jira) Wed, 28 Dec 2022 08:04:05 -0800


    [ 
https://issues.apache.org/jira/browse/FLINK-30518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17652530#comment-17652530
 ]


Binh-Nguyen Tran edited comment on FLINK-30518 at 12/28/22 4:03 PM:
--------------------------------------------------------------------

[~gyfora] I think my assumption is wrong about the flink-conf configmap as even 
with native mode the rpc.address is not set in flink-conf.yaml (file is 
read-only) but the advertised address is correct in cluster configmap. May be 
due to something else. You can try with the basic-checkpoint-ha-example and 
switch the mode to "standalone".

Update: I think may be retrieving Pod IP requires Service Acccount which is not 
used in "standalone" mode?


was (Author: tbnguyen1407):
[~gyfora] I think my assumption is wrong about the flink-conf configmap as even 
with native mode the rpc.address is not set in flink-conf.yaml (file is 
read-only) but the advertised address is correct in cluster configmap. May be 
due to something else. You can try with the basic-checkpoint-ha-example and 
switch the mode to "standalone".

> [flink-operator] Kubernetes HA Service not working with standalone mode
> -----------------------------------------------------------------------
>
>                 Key: FLINK-30518
>                 URL: https://issues.apache.org/jira/browse/FLINK-30518
>             Project: Flink
>          Issue Type: Bug
>          Components: Kubernetes Operator
>    Affects Versions: kubernetes-operator-1.3.0
>            Reporter: Binh-Nguyen Tran
>            Priority: Major
>         Attachments: flink-configmap.png, screenshot-1.png
>
>
> -Since flink-conf.yaml is mounted as read-only configmap, the 
> /docker-entrypoint.sh script is not able to inject correct Pod IP to 
> `jobmanager.rpc.address`. This leads to same address (e.g flink.ns-ext) being 
> set for all Job Manager pods. This causes:-
> Setting up FlinkDeployment in Standalone mode with Kubernetes HA Service. 
> Problems:
> (1) flink-cluster-config-map always contains wrong address for all 3 
> component leaders (see screenshot, should be pod IP instead of clusterIP 
> service name)
> (2) Accessing Web UI when jobmanager.replicas > 1 is not possible with error
> {code:java}
> {"errors":["Service temporarily unavailable due to an ongoing leader 
> election. Please refresh."]} {code}
>  
> ~ flinkdeployment.yaml ~
> {code:java}
> spec:
>   mode: standalone
>   flinkConfiguration:
>     high-availability: kubernetes
>     high-availability.storageDir: "file:///opt/flink/storage"
>     ...
>   jobManager:
>     replicas: 3
>   ... {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Comment Edited] (FLINK-30518) [flink-operator] Kubernetes HA Service not working with standalone mode

Reply via email to