Dian Fu created FLINK-35208:
-------------------------------

             Summary: Respect pipeline.cached-files during processing Python 
dependencies
                 Key: FLINK-35208
                 URL: https://issues.apache.org/jira/browse/FLINK-35208
             Project: Flink
          Issue Type: Bug
          Components: API / Python
            Reporter: Dian Fu
            Assignee: Dian Fu


Currently, PyFlink will make use of distributed cache (update 
StreamExecutionEnvironment#cachedFiles) during handling the Python 
dependencies(See 
[https://github.com/apache/flink/blob/master/flink-python/src/main/java/org/apache/flink/python/util/PythonDependencyUtils.java#L339]
 for more details). 

However, if pipeline.cached-files is configured, it will clear 
StreamExecutionEnvironment#cachedFiles(see 
[https://github.com/apache/flink/blob/master/flink-streaming-java/src/main/java/org/apache/flink/streaming/api/environment/StreamExecutionEnvironment.java#L1132]
 for more details) which may break the above functionalities.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to