Hello, thanks for the response I am using kubernetes standalone application mode not the native one.
and this error happens randomly at some point while running the job. Also i am using just one replicas of the jobmanager here is some other logs:: {"@timestamp":"2021-08-30T15:43:44.970+02:00","@version":"1","message":"Exception occurred while renewing lock: Unable to update ConfigMapLock","logger_name":"io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector", "thread_name":"pool-685-thread-1","level":"DEBUG","level_value":10000,"stack_trace":"io.fabric8.kubernetes.client.extended.leaderelection.resourcelock.LockException: Unable to update ConfigMapLock io.fabric8.kubernetes.client.extended.leaderelection.resourcelock.ConfigMapLock.update(ConfigMapLock.java:108) io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.tryAcquireOrRenew(LeaderElector.java:156) io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.renew(LeaderElector.java:120) io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.lambda$null$1(LeaderElector.java:104) java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)\n\tat java.util.concurrent.FutureTask.run(FutureTask.java:266) java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180) java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293) java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)\n\tat java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) java.lang.Thread.run(Thread.java:748) Caused by: io.fabric8.kubernetes.client.KubernetesClientException: Failure executing: PUT at: https://172.31.64.1/api/v1/namespaces/flink-pushavoo-flink-rec/configmaps/elifibre-00000000000000000000000000000000-jobmanager-leader. Message: Operation cannot be fulfilled on configmaps \"elifibre-00000000000000000000000000000000-jobmanager-leader\": the object has been modified; please apply your changes to the latest version and try again. Received status: Status(apiVersion=v1, code=409, details=StatusDetails(causes=[], group=null, kind=configmaps, name=elifibre-00000000000000000000000000000000-jobmanager-leader, retryAfterSeconds=null, uid=null, additionalProperties={}), kind=Status, message=Operation cannot be fulfilled on configmaps \"elifibre-00000000000000000000000000000000-jobmanager-leader\": the object has been modified; please apply your changes to the latest version and try again, metadata=ListMeta(_continue=null, remainingItemCount=null, resourceVersion=null, selfLink=null, additionalProperties={}), reason=Conflict, status=Failure, additionalProperties={}). io.fabric8.kubernetes.client.dsl.base.OperationSupport.requestFailure(OperationSupport.java:568) io.fabric8.kubernetes.client.dsl.base.OperationSupport.assertResponseCode(OperationSupport.java:507) io.fabric8.kubernetes.client.dsl.base.OperationSupport.handleResponse(OperationSupport.java:471) io.fabric8.kubernetes.client.dsl.base.OperationSupport.handleResponse(OperationSupport.java:430) io.fabric8.kubernetes.client.dsl.base.OperationSupport.handleReplace(OperationSupport.java:289) io.fabric8.kubernetes.client.dsl.base.OperationSupport.handleReplace(OperationSupport.java:269) io.fabric8.kubernetes.client.dsl.base.BaseOperation.handleReplace(BaseOperation.java:820) io.fabric8.kubernetes.client.dsl.base.HasMetadataOperation.lambda$replace$1(HasMetadataOperation.java:86) io.fabric8.kubernetes.api.model.DoneableConfigMap.done(DoneableConfigMap.java:26) io.fabric8.kubernetes.api.model.DoneableConfigMap.done(DoneableConfigMap.java:5) io.fabric8.kubernetes.client.dsl.base.HasMetadataOperation.replace(HasMetadataOperation.java:92) io.fabric8.kubernetes.client.dsl.base.HasMetadataOperation.replace(HasMetadataOperation.java:36) io.fabric8.kubernetes.client.extended.leaderelection.resourcelock.ConfigMapLock.update(ConfigMapLock.java:106) ... 10 common frames omitted\n"} ********************************************************************************************************** On 2021/08/30 10:53:10, Roman Khachatryan <ro...@apache.org> wrote: > Hello, > > Do I understand correctly that you are using native Kubernetes > deployment in application mode; > and the issue *only* happens if you set kubernetes-jobmanager-replicas > [1] to a value greater than 1? > > Does it happen during deployment or at some point while running the job? > > Could you share Flink and Kubernetes versions and HA configuration > [2]? (I'm assuming you're using Kubernetes for HA, not ZK). > > [1] > https://ci.apache.org/projects/flink/flink-docs-master/docs/deployment/config/#kubernetes-jobmanager-replicas > [2] > https://ci.apache.org/projects/flink/flink-docs-master/docs/deployment/ha/kubernetes_ha/ > > Regards, > Roman > > On Fri, Aug 27, 2021 at 2:31 PM mejri houssem <mejrihousse...@gmail.com> > wrote: > > > > hello i am deploying a flink application cluster with kubernetes HA mode, > > but i am facing this recurrent problem and i didn't know how to solve it. > > > > Any help would be appreciated. > > > > > > > > this of the jobManager: > > > > {"@timestamp":"2021-08-27T14:19:42.447+02:00","@version":"1","message":"Exception > > occurred while renewing lock: Unable to update > > ConfigMapLock","logger_name":"io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector","thread_name":"pool-4092-thread-1","level":"DEBUG","level_value":10000,"stack_trace":"io.fabric8.kubernetes.client.extended.leaderelection.resourcelock.LockException: > > Unable to update ConfigMapLock\n\tat > > io.fabric8.kubernetes.client.extended.leaderelection.resourcelock.ConfigMapLock.update(ConfigMapLock.java:108)\n\tat > > > > io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.tryAcquireOrRenew(LeaderElector.java:156)\n\tat > > > > io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.renew(LeaderElector.java:120)\n\tat > > > > io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.lambda$null$1(LeaderElector.java:104)\n\tat > > > > java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)\n\tat > > java.util.concurrent.FutureTask.run(Fut ureT > > ask.java:266)\n\tat > > java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)\n\tat > > > > java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)\n\tat > > > > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)\n\tat > > > > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)\n\tat > > java.lang.Thread.run(Thread.java:748)\nCaused by: > > io.fabric8.kubernetes.client.KubernetesClientException: Failure executing: > > PUT at: > > https://172.31.64.1/api/v1/namespaces/flink-pushavoo-flink-rec/configmaps/elifibre-00000000000000000000000000000000-jobmanager-leader. > > Message: Operation cannot be fulfilled on configmaps > > \"elifibre-00000000000000000000000000000000-jobmanager-leader\": the object > > has been modified; please apply your changes to the latest version and try > > again. Received status: Status(apiVersion=v1, code=409, > > details=StatusDetails(causes=[], gro > > up=null, kind=configmaps, > > name=elifibre-00000000000000000000000000000000-jobmanager-leader, > > retryAfterSeconds=null, uid=null, additionalProperties={}), kind=Status, > > message=Operation cannot be fulfilled on configmaps > > \"elifibre-00000000000000000000000000000000-jobmanager-leader\": the object > > has been modified; please apply your changes to the latest version and try > > again, metadata=ListMeta(_continue=null, remainingItemCount=null, > > resourceVersion=null, selfLink=null, additionalProperties={}), > > reason=Conflict, status=Failure, additionalProperties={}).\n\tat > > io.fabric8.kubernetes.client.dsl.base.OperationSupport.requestFailure(OperationSupport.java:568)\n\tat > > > > io.fabric8.kubernetes.client.dsl.base.OperationSupport.assertResponseCode(OperationSupport.java:507)\n\tat > > > > io.fabric8.kubernetes.client.dsl.base.OperationSupport.handleResponse(OperationSupport.java:471)\n\tat > > > > io.fabric8.kubernetes.client.dsl.base.OperationSupport.handleResponse(OperationSupport.java:430)\n\tat > > io.fabric8.kube rnet > > > > es.client.dsl.base.OperationSupport.handleReplace(OperationSupport.java:289)\n\tat > > > > io.fabric8.kubernetes.client.dsl.base.OperationSupport.handleReplace(OperationSupport.java:269)\n\tat > > > > io.fabric8.kubernetes.client.dsl.base.BaseOperation.handleReplace(BaseOperation.java:820)\n\tat > > > > io.fabric8.kubernetes.client.dsl.base.HasMetadataOperation.lambda$replace$1(HasMetadataOperation.java:86)\n\tat > > > > io.fabric8.kubernetes.api.model.DoneableConfigMap.done(DoneableConfigMap.java:26)\n\tat > > > > io.fabric8.kubernetes.api.model.DoneableConfigMap.done(DoneableConfigMap.java:5)\n\tat > > > > io.fabric8.kubernetes.client.dsl.base.HasMetadataOperation.replace(HasMetadataOperation.java:92)\n\tat > > > > io.fabric8.kubernetes.client.dsl.base.HasMetadataOperation.replace(HasMetadataOperation.java:36)\n\tat > > > > io.fabric8.kubernetes.client.extended.leaderelection.resourcelock.ConfigMapLock.update(ConfigMapLock.java:106)\n\t... > > 10 common frames omitted\n"} > > >