Yes, it does sound quite a lot like FLINK-10225. I assume it is only happening for some task executors and not all of them?

Unfortunately I don't think this issue will be fixed anytime soon.

On 1/28/2021 12:59 PM, Daniel Peled wrote:
Hi,

We have followed the instructions in the following link ""Enabling Queryable State" with kubernetes: https://ci.apache.org/projects/flink/flink-docs-stable/deployment/resource-providers/standalone/kubernetes.html#enabling-queryable-state <https://ci.apache.org/projects/flink/flink-docs-stable/deployment/resource-providers/standalone/kubernetes.html#enabling-queryable-state>

*When the replicas of the task-manager pods is 1 we get NO error
But when the replicas is greater than 1 for example 7 we get the following error when trying to access flink state: We think it might be related to jira issue *FLINK-10225 <https://issues.apache.org/jira/browse/FLINK-10225> *that has been abandoned*

java.util.concurrent.ExecutionException: java.lang.RuntimeException: Failed request 1.  Caused by: org.apache.flink.queryablestate.exceptions.UnknownLocationException: Could not retrieve location of state=stopJobValueState of job=d5d14923157f5c3d3c4b2e1b7c02a942. Potential reasons are: i) the state is not ready, or ii) the job does not exist.  Caused by: org.apache.flink.queryablestate.exceptions.UnknownLocationException: Could not retrieve location of state=stopJobValueState of job=d5d14923157f5c3d3c4b2e1b7c02a942. Potential reasons are: i) the state is not ready, or ii) the job does not exist.

at org.apache.flink.queryablestate.client.proxy.KvStateClientProxyHandler.getKvStateLookupInfo(KvStateClientProxyHandler.java:247) at org.apache.flink.queryablestate.client.proxy.KvStateClientProxyHandler.getState(KvStateClientProxyHandler.java:164) at org.apache.flink.queryablestate.client.proxy.KvStateClientProxyHandler.executeActionAsync(KvStateClientProxyHandler.java:131) at org.apache.flink.queryablestate.client.proxy.KvStateClientProxyHandler.handleRequest(KvStateClientProxyHandler.java:121) at org.apache.flink.queryablestate.client.proxy.KvStateClientProxyHandler.handleRequest(KvStateClientProxyHandler.java:63) at org.apache.flink.queryablestate.network.AbstractServerHandler$AsyncRequestTask.run(AbstractServerHandler.java:258) at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
at java.base/java.lang.Thread.run(Thread.java:834)

BR,
Danny


Reply via email to