A quick idea is that we separate the deployment from user program that it has always been done outside the program. On user program executed there is always a ClusterClient that communicates with an existing cluster, remote or local. It will be another thread so just for your information.
Best, tison. tison <wander4...@gmail.com> 于2019年12月12日周四 下午4:40写道: > Hi Peter, > > Another concern I realized recently is that with current Executors > abstraction(FLIP-73) > I'm afraid that user program is designed to ALWAYS run on the client side. > Specifically, > we deploy the job in executor when env.execute called. This abstraction > possibly prevents > Flink runs user program on the cluster side. > > For your proposal, in this case we already compiled the program and run on > the client side, > even we deploy a cluster and retrieve job graph from program metadata, it > doesn't make > many sense. > > cc Aljoscha & Kostas what do you think about this constraint? > > Best, > tison. > > > Peter Huang <huangzhenqiu0...@gmail.com> 于2019年12月10日周二 下午12:45写道: > >> Hi Tison, >> >> Yes, you are right. I think I made the wrong argument in the doc. >> Basically, the packaging jar problem is only for platform users. In our >> internal deploy service, >> we further optimized the deployment latency by letting users to packaging >> flink-runtime together with the uber jar, so that we don't need to >> consider >> multiple flink version >> support for now. In the session client mode, as Flink libs will be shipped >> anyway as local resources of yarn. Users actually don't need to package >> those libs into job jar. >> >> >> >> Best Regards >> Peter Huang >> >> On Mon, Dec 9, 2019 at 8:35 PM tison <wander4...@gmail.com> wrote: >> >> > > 3. What do you mean about the package? Do users need to compile their >> > jars >> > inlcuding flink-clients, flink-optimizer, flink-table codes? >> > >> > The answer should be no because they exist in system classpath. >> > >> > Best, >> > tison. >> > >> > >> > Yang Wang <danrtsey...@gmail.com> 于2019年12月10日周二 下午12:18写道: >> > >> > > Hi Peter, >> > > >> > > Thanks a lot for starting this discussion. I think this is a very >> useful >> > > feature. >> > > >> > > Not only for Yarn, i am focused on flink on Kubernetes integration and >> > come >> > > across the same >> > > problem. I do not want the job graph generated on client side. >> Instead, >> > the >> > > user jars are built in >> > > a user-defined image. When the job manager launched, we just need to >> > > generate the job graph >> > > based on local user jars. >> > > >> > > I have some small suggestion about this. >> > > >> > > 1. `ProgramJobGraphRetriever` is very similar to >> > > `ClasspathJobGraphRetriever`, the differences >> > > are the former needs `ProgramMetadata` and the latter needs some >> > arguments. >> > > Is it possible to >> > > have an unified `JobGraphRetriever` to support both? >> > > 2. Is it possible to not use a local user jar to start a per-job >> cluster? >> > > In your case, the user jars has >> > > existed on hdfs already and we do need to download the jars to >> deployer >> > > service. Currently, we >> > > always need a local user jar to start a flink cluster. It is be great >> if >> > we >> > > could support remote user jars. >> > > >> In the implementation, we assume users package flink-clients, >> > > flink-optimizer, flink-table together within the job jar. Otherwise, >> the >> > > job graph generation within JobClusterEntryPoint will fail. >> > > 3. What do you mean about the package? Do users need to compile their >> > jars >> > > inlcuding flink-clients, flink-optimizer, flink-table codes? >> > > >> > > >> > > >> > > Best, >> > > Yang >> > > >> > > Peter Huang <huangzhenqiu0...@gmail.com> 于2019年12月10日周二 上午2:37写道: >> > > >> > > > Dear All, >> > > > >> > > > Recently, the Flink community starts to improve the yarn cluster >> > > descriptor >> > > > to make job jar and config files configurable from CLI. It improves >> the >> > > > flexibility of Flink deployment Yarn Per Job Mode. For platform >> users >> > > who >> > > > manage tens of hundreds of streaming pipelines for the whole org or >> > > > company, we found the job graph generation in client-side is another >> > > > pinpoint. Thus, we want to propose a configurable feature for >> > > > FlinkYarnSessionCli. The feature can allow users to choose the job >> > graph >> > > > generation in Flink ClusterEntryPoint so that the job jar doesn't >> need >> > to >> > > > be locally for the job graph generation. The proposal is organized >> as a >> > > > FLIP >> > > > >> > > > >> > > >> > >> https://cwiki.apache.org/confluence/display/FLINK/FLIP-85+Delayed+JobGraph+Generation >> > > > . >> > > > >> > > > Any questions and suggestions are welcomed. Thank you in advance. >> > > > >> > > > >> > > > Best Regards >> > > > Peter Huang >> > > > >> > > >> > >> >