Hi Averell,

According to the AWS documentation [1], the master node only runs the YARN
ResourceManager and the HDFS NameNode. Containers can only by launched on
nodes that are running the YARN NodeManager [2]. Therefore, if you want TMs
or
JMs to be launched on your EMR master node, you have to start the
NodeManager
process there but I do not know how well this is supported by AWS EMR.

You can choose a smaller server for the master node but keep in mind that
it is
running the HDFS NameNode as well. The hardware requirements will therefore
partially depend on the HDFS workload.

Best,
Gary

[1]
https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-plan-instances.html
[2]
https://hadoop.apache.org/docs/r2.8.0/hadoop-yarn/hadoop-yarn-site/NodeManager.html

On Mon, Sep 17, 2018 at 5:22 AM, Averell <lvhu...@gmail.com> wrote:

> Hello everyone,
>
> I'm trying to run Flink on AWS EMR following the guides from  Flink doc
> <https://ci.apache.org/projects/flink/flink-docs-
> master/ops/deployment/yarn_setup.html#run-a-single-flink-
> job-on-hadoop-yarn>
> and from  AWS
> <https://docs.aws.amazon.com/emr/latest/ReleaseGuide/flink-configure.html>
>
> , and it looks like the EMR master is never used, neither for JM nor TM.
> "bin/yarn-session.sh -q" only shows the core nodes. We are only running
> Flink on that EMR, so it is wasting of resources.
>
> So, is there any way to use the master node for the job, at least for the
> JM
> only?
>
> If that is not possible, should I have different hardware configurations
> between the master node and core nodes (smaller server for the master)?
>
> Thanks and best regards,
> Averell
>
>
>
>
> --
> Sent from: http://apache-flink-user-mailing-list-archive.2336050.
> n4.nabble.com/
>

Reply via email to