Re: REST API in an HA setup - must the leading JM be called?

Chesnay Schepler Wed, 18 Aug 2021 04:28:44 -0700

You've pretty much answered the question yourself. *thumbs up*


For the vast majority of cases you can call any JobManager.

The exceptions are jar operations, because uploaded jars are persisted in the JM-local filesystem and other JMs don't know about them, and triggering savepoints, because the metadata for ongoing savepoint operations (i.e., the information returned when querying the savepoint operation status) is also kept locally in the JM.
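
To illustrate the two exceptions, here is a minimal sketch of a client helper that builds every URL of a multi-step operation against one fixed JobManager address instead of a load balancer. The class name, the host name and the sample responses are hypothetical. The response field names ("filename" for the jar upload, "request-id" for the savepoint trigger) reflect my understanding of the REST API and should be checked against the REST documentation of your Flink version. The sketch only builds URLs and sends no requests.

```python
from posixpath import basename
from urllib.parse import quote


class PinnedJobManager:
    """Builds REST URLs that all target one fixed JobManager address."""

    def __init__(self, base_url: str) -> None:
        # base_url is the address of a single JobManager, not a load balancer.
        self.base_url = base_url.rstrip("/")

    def upload_url(self) -> str:
        # Jar uploads are stored on the local disk of the JM that receives them.
        return f"{self.base_url}/jars/upload"

    def run_url(self, upload_response: dict) -> str:
        # Assumption: the upload response carries the stored path in "filename"
        # and the jar id is the last segment of that path.
        jar_id = basename(upload_response["filename"])
        return f"{self.base_url}/jars/{quote(jar_id, safe='')}/run"

    def savepoint_trigger_url(self, job_id: str) -> str:
        return f"{self.base_url}/jobs/{job_id}/savepoints"

    def savepoint_status_url(self, job_id: str, trigger_response: dict) -> str:
        # Assumption: the trigger response carries the operation id in
        # "request-id". Its status is only known to the JM that was triggered.
        request_id = trigger_response["request-id"]
        return f"{self.base_url}/jobs/{job_id}/savepoints/{request_id}"


# Hypothetical host and responses, for illustration only.
jm = PinnedJobManager("http://jm1.example.com:8081/")
upload_response = {"filename": "/tmp/flink-web-upload/abc_job.jar", "status": "success"}
print(jm.upload_url())
print(jm.run_url(upload_response))
print(jm.savepoint_status_url("job-1", {"request-id": "req-1"}))
```

All other endpoints can go through the load balancer. Note that pinning does not help across a JM failover, as discussed below.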


This does indeed imply that on JM failover all this information is lost.

There are ideas to solve this, but no concrete timeline. See https://issues.apache.org/jira/browse/FLINK-18312


On 18/08/2021 11:54, Juha Mynttinen wrote:

I have questions related to REST API in the case of ZooKeeper HA and a standalone cluster. But I think the questions apply to other setups too such as YARN.
Let's assume a standalone cluster with multiple JobManagers. The JobManagers elect the leader among themselves and register that to ZooKeeper. When using the Flink command line, AFAIK the code will go to ZooKeeper to find the host and port of the leading JobManager and send HTTP requests there.
My question is: when accessing the REST API directly (e.g. curl) does one need to call the leading JobManager or will any up and running JobManager do? And if the leader needs to be called, why is it so?
Behind the scenes the REST API will connect to the leading "JobManager" over RPC, making it irrelevant which JobManager receives the HTTP request.
By experimenting, I found the Web UI works fine if all the JobManagers are behind a load balancer and both leading and standby JobManagers are called. The only issue I found was that when a jar is submitted (/jars/upload), it is stored on the local disk of the JobManager that happens to handle that request. As a consequence, creating a job from that jar only succeeds if the HTTP request hits the JobManager that has the file. There might be a "hack" to overcome this limitation: set web.upload.dir to be in S3 / GCS or elsewhere accessible by all JobManagers. I didn't try this. Or in the case of uploading jars and creating jobs, ensure the same JobManager is called (bypass the load balancer).
But I wonder if there's some other reason why the leading JM should be called.
A follow-up question arises. If the jars are stored only on the leading JobManager, doesn't that mean that if the leader changes, the new leader is not aware of the jars uploaded to the old leader? From the REST API's perspective this means that even in the JobManager HA setup and when always calling the leader, a simple "upload a jar and deploy a job" cycle is not guaranteed to work if the leader happens to change between the requests. Did I miss something?
--
Regards,
Juha
