It would be helpful to expand on how, in job mode, the job graph would be produced. The phrase 'which contains the single job you want to execute' has a few meanings; I believe Till means a serialized job graph, not an executable JAR w/ main method. Till is that correct?
On Tue, Mar 20, 2018 at 2:16 AM, Till Rohrmann <trohrm...@apache.org> wrote: > Hi Edward, > > you're right that Flink's Kubernetes documentation has not been updated > with respect to Flip-6. This will be one of the tasks during the Flink 1.5 > release testing and is still pending. > > A Flink cluster can be run in two modes: session mode vs per-job mode. The > former starts a cluster to which you can submit multiple jobs. The cluster > shares the same ResourceManager and a Dispatcher which is responsible for > spawning JobMasters which execute a single job each. The latter starts a > Flink cluster which is pre-initialized with a JobGraph and only runs this > job. Here we also start a ResourceManager and a MiniDispatcher whose job it > is to simply start a single JobMaster with the pre-initialized JobGraph. > > StandaloneSessionClusterEntrypoint is the entrypoint for the session mode. > > The JobClusterEntrypoint is the entrypoint for the per-job mode. Take a > look at YarnJobClusterEntrypoint to see how the entrypoint retrieves the > JobGraph from HDFS and then automatically starts executing it. There is no > script which directly starts this entrypoint, but the YarnClusterDescriptor > uses it when `deployJobCluster` is called. > > Depending on what you want to achieve: Either building generic K8 images > to which you can submit any number of Flink jobs or having a special image > which contains the single job you want to exeucte, you either have to call > into the SessionClusterEntrypoint or the JobClusterEntrypoint. When > starting a session cluster, then you can use bin/flink run to submit a job > to this cluster. > > Let me know if you have other questions. > > Cheers, > Till > > On Thu, Mar 15, 2018 at 7:53 PM, Edward Rojas <edward.roja...@gmail.com> > wrote: > >> Hello, >> >> Currently I have a Flink 1.4 cluster running on kubernetes based on the >> configuration describe on >> https://ci.apache.org/projects/flink/flink-docs-release-1.4/ >> ops/deployment/kubernetes.html >> with additional config for HA with Zookeeper. >> >> With this I have several Taskmanagers, a single Jobmanager and I create a >> container for each job to perform the Job submission and manage Job >> updates >> with savepoints. >> >> >> I'm looking into what would be needed to migrate to the new architecture >> on >> FLIP6 as we are planning to use Flink 1.5 once it's ready. >> >> If I understand correctly from >> https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=65147077 >> and the current code on master: >> >> * Taskmanagers would continue the same, i.e they will execute the >> taskmanager.sh start-foreground script, which with the flip6 mode >> activated >> will execute the new taskexecutor.TaskManagerRunner. >> >> * We will have now one Job Manager per Job which is really good; but I >> don't >> fully understand how this would be started. >> >> I notice that the jobmanager.sh with flip6 mode activated will execute >> entrypoint.StandaloneSessionClusterEntrypoint but I don't see how we >> could >> pass the job jar and parameters (?) >> >> So I think the other possibility to start the job would be via the /flink >> run/ command with maybe an option to tell that we are creating a job with >> job manager or would be this the default behaviour ? >> >> Or would be this the role of the JobMaster ? I didn't take a look to its >> code but it's mentioned on the flip6 page. (however I don't see an >> entrypoint from the scripts (?)) >> >> Could you help me to understand how this is expected to be done ? >> >> >> * Also I'm not sure to understand whether it would be better to have a >> ResourceManager per job or a single ResourceManager per cluster, as in the >> page is stated that there is a ResourceManager for >> Self-contained-single-job, but it seems to me that it needs to have the >> information about all JobManagers and TaskManagers (?) >> >> >> Thanks in advance for the help you could provide. >> >> I'm interested in using Flip6 on kubernetes when it will be ready, so I >> could help with some testing if needed. >> >> -- >> Edward >> >> >> >> -- >> Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4. >> nabble.com/ >> > >