Wenlong Lyu created FLINK-5815:
----------------------------------

             Summary: Add resource files configuration for Yarn Mode
                 Key: FLINK-5815
                 URL: https://issues.apache.org/jira/browse/FLINK-5815
             Project: Flink
          Issue Type: Improvement
          Components: Client, Distributed Coordination
            Reporter: Wenlong Lyu
            Assignee: Wenlong Lyu


Currently in flink, when we want to setup a resource file to distributed cache, 
we need to make the file accessible remotely by a url, which is often difficult 
to maintain a service like that. What's more, when we want do add some extra 
jar files to job classpath, we need to copy the jar files to blob server when 
submitting the jobgraph. In yarn, especially in flip-6, the blob server is not 
running yet when we try to start a flink job. 
Yarn has a efficient distributed cache implementation for application running 
on it, what's more we can be easily share the files stored in hdfs in different 
application by distributed cache without extra IO operations. 
I suggest to introduce -yfiles, -ylibjars -yarchives options to FlinkYarnCLI to 
enable yarn user setup their job resource files by yarn distributed cache. The 
options is compatible with what is used in mapreduce, which make it easy to use 
for yarn user who generally has experience on using mapreduce.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to