[jira] [Commented] (FLINK-20115) Test Batch execution for the DataStream API

Till Rohrmann (Jira) Tue, 17 Nov 2020 09:36:16 -0800


    [ 
https://issues.apache.org/jira/browse/FLINK-20115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233765#comment-17233765
 ]


Till Rohrmann commented on FLINK-20115:
---------------------------------------

I started testing the new batch execution. Since the documentation isn't ready 
yet, I used the corresponding 
[FLIP-134|https://cwiki.apache.org/confluence/display/FLINK/FLIP-134%3A+Batch+execution+for+the+DataStream+API]
 as a starting point. The first thing I noticed is that the actual 
implementation does not fully match the FLIP, because some of the proposed API 
does not exist. For example,

{code}
StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
env.setRuntimeMode(RuntimeMode.BATCH);
{code}

does not work. Moreover, some of the mentioned configuration parameters do not 
exist (only {{execution.runtime-mode}} exists). I suggest updating the FLIP to 
reflect the actual progress in order to avoid confusion.

I am not entirely sold on configuring the execution mode as a configuration 
parameter. I see a couple of problems with it:

1) The execution mode is probably a job specific configuration (e.g. depending 
on the actual sources). Why should it be configurable by the client's 
{{flink-conf.yaml}}?
2) The discoverability of these options is poor imo. The problem is that users 
need to know about {{ExecutionOptions}} in order to find 
{{ExecutionOptions.RUNTIME_MODE}}. Allowing the user to configure the mode more 
prominently could avoid confusion.

Another problem I ran into is that one can construct a {{BATCH}} job which 
needs more than a single slot to execute. Hence, the FLIP's statement that one 
only needs a single slot is wrong: by using {{rescale}}, one can still 
construct a pipelined region which requires more than a single slot. 
A job showing the problem can be found 
[here|https://github.com/tillrohrmann/flink-streaming-batch-execution/tree/rescale-fails-scheduling-with-single-slot].
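
To illustrate why such a region can need more than one slot, here is a toy 
model in plain Java. It is only a sketch and not Flink's scheduler code. The 
class and method names are made up for illustration. It assumes that 
{{rescale}} wires subtasks pointwise over a pipelined exchange, that all 
subtasks connected by pipelined exchanges form one region which must run at 
the same time, and that a slot holds at most one subtask per operator.

```java
/** Toy model of pipelined regions; this is NOT Flink's scheduler code. */
class RescaleSlotSketch {

    /** Finds the representative of x's region (union-find with path halving). */
    static int find(int[] parent, int x) {
        while (parent[x] != x) {
            parent[x] = parent[parent[x]];
            x = parent[x];
        }
        return x;
    }

    /** Merges the regions containing a and b. */
    static void union(int[] parent, int a, int b) {
        parent[find(parent, a)] = find(parent, b);
    }

    /**
     * Slots needed by the largest pipelined region when an upstream operator
     * with parallelism p feeds a downstream operator with parallelism q
     * through an (assumed) pointwise, pipelined exchange.
     */
    static int slotsForLargestRegion(int p, int q) {
        if (p < 1 || q < 1) {
            throw new IllegalArgumentException("parallelism must be >= 1");
        }
        // Nodes 0..p-1 are upstream subtasks, nodes p..p+q-1 are downstream subtasks.
        int[] parent = new int[p + q];
        for (int i = 0; i < p + q; i++) {
            parent[i] = i;
        }
        // Assumed pointwise wiring: the larger side is split evenly across the smaller side.
        if (q >= p) {
            for (int d = 0; d < q; d++) {
                union(parent, d * p / q, p + d);
            }
        } else {
            for (int u = 0; u < p; u++) {
                union(parent, u, p + u * q / p);
            }
        }
        // Count the subtasks of each operator per region.
        int[] up = new int[p + q];
        int[] down = new int[p + q];
        for (int u = 0; u < p; u++) {
            up[find(parent, u)]++;
        }
        for (int d = 0; d < q; d++) {
            down[find(parent, p + d)]++;
        }
        // One slot holds at most one subtask per operator (modelling assumption).
        int slots = 0;
        for (int i = 0; i < p + q; i++) {
            slots = Math.max(slots, Math.max(up[i], down[i]));
        }
        return slots;
    }

    public static void main(String[] args) {
        System.out.println(slotsForLargestRegion(1, 1)); // 1
        System.out.println(slotsForLargestRegion(1, 2)); // 2
        System.out.println(slotsForLargestRegion(2, 4)); // 2
    }
}
```

Under these assumptions, any {{rescale}} between operators of different 
parallelism yields a region that cannot be scheduled with a single slot, which 
is consistent with the scheduling failure described above.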

Next I tried to use the new {{FileSource}}. The first problem I ran into was 
that the connector depends on {{flink-connector-base}}, which I had to add as 
a separate dependency. This seems a bit cumbersome. The ticket to discuss this 
problem is FLINK-20196.

Then I tried to add the new {{FileSink}}. The job can be found 
[here|https://github.com/tillrohrmann/flink-streaming-batch-execution/tree/file-sink-fails-with-default-bucket-assigner].
 Here I immediately ran into an {{UnsupportedOperationException}} when using 
the default {{BucketAssigner}}. The ticket to discuss this problem is 
FLINK-20197.

> Test Batch execution for the DataStream API 
> --------------------------------------------
>
>                 Key: FLINK-20115
>                 URL: https://issues.apache.org/jira/browse/FLINK-20115
>             Project: Flink
>          Issue Type: Sub-task
>          Components: API / DataStream
>    Affects Versions: 1.12.0
>            Reporter: Robert Metzger
>            Assignee: Till Rohrmann
>            Priority: Critical
>             Fix For: 1.12.0
>
>
> Test the following new features:
>  - https://issues.apache.org/jira/browse/FLINK-19316
>  - https://issues.apache.org/jira/browse/FLINK-19268
>  - https://issues.apache.org/jira/browse/FLINK-19758 
> The three issues can really only be tested in combination. FLINK-19316 is 
> done but missing documentation.
> Write an example that uses a (new) FileSource, a (new) FileSink, some random 
> transformations
> Run the example in BATCH mode
> How ergonomic is the API/configuration?
> Are there any weird log messages/exceptions in the JM/TM logs
> Maybe try sth that doesn't work on BATCH execution, such as 
> iterations/feedback edges.
> ----
> [General Information about the Flink 1.12 release 
> testing|https://cwiki.apache.org/confluence/display/FLINK/1.12+Release+-+Community+Testing]
> When testing a feature, consider the following aspects:
> - Is the documentation easy to understand
> - Are the error messages, log messages, APIs etc. easy to understand
> - Is the feature working as expected under normal conditions
> - Is the feature working / failing as expected with invalid input, induced 
> errors etc.
> If you find a problem during testing, please file a ticket 
> (Priority=Critical; Fix Version = 1.12.0), and link it in this testing ticket.
> During the testing, and once you are finished, please write a short summary 
> of all things you have tested.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
