[ https://issues.apache.org/jira/browse/FLINK-12343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16831489#comment-16831489 ]
Till Rohrmann commented on FLINK-12343: --------------------------------------- [~ZhenqiuHuang] I think the {{ResourceManager}} only stores the {{TaskExecutors}} configuration file to HDFS when setting up the {{TaskExecutorContext}}. Ideally we treat this situation similarly to the upload of the other Flink cluster artifacts. But this should not be hard to do. I guess we have agreed on the overall approach and can now start with the implementation. I think in order to test it, we either need a {{HdfsMiniCluster}} or the {{YarnMiniCluster}} to deploy and check the replication factor. > Allow set file.replication in Yarn Configuration > ------------------------------------------------ > > Key: FLINK-12343 > URL: https://issues.apache.org/jira/browse/FLINK-12343 > Project: Flink > Issue Type: Improvement > Components: Command Line Client, Deployment / YARN > Affects Versions: 1.6.4, 1.7.2, 1.8.0 > Reporter: Zhenqiu Huang > Assignee: Zhenqiu Huang > Priority: Minor > Labels: pull-request-available > Time Spent: 10m > Remaining Estimate: 0h > > Currently, FlinkYarnSessionCli upload jars into hdfs with default 3 > replications. From our production experience, we find that 3 replications > will block big job (256 containers) to launch, when the HDFS is slow due to > big workload for batch pipelines. Thus, we want to make the factor > customizable from FlinkYarnSessionCli by adding an option. -- This message was sent by Atlassian JIRA (v7.6.3#76005)