Re: [DISCUSS] Behaviour of startNewChain() in Streaming

2015-05-26 Thread Maximilian Michels
I second Aljoscha's and Matthias' opinion on the behavior of `startNewChain()`. In the case of `setParallelism(..)`, we set the parallelism of the operator but in case of `startNewChain()`, we explicitly start a new chain; for the user, this is not connected to the previous operation even though th

Re: [DISCUSS] Behaviour of startNewChain() in Streaming

2015-05-25 Thread Matthias J. Sax
I agree with Aljoschas argumentation. It would be more intuitive if "startNewChain()" splits the chain where is it put. On 05/25/2015 10:48 AM, Aljoscha Krettek wrote: > Yes, this is another example where it might be problematic but I think > there are different ideas here: Methods such as setPar

Re: [DISCUSS] Behaviour of startNewChain() in Streaming

2015-05-25 Thread Aljoscha Krettek
Yes, this is another example where it might be problematic but I think there are different ideas here: Methods such as setParallelism(), name() and so on can be seen as modifying the operation that was previously constructed. Method such as groupBy(), startNewChain() can be seen as acting at that p

Re: [DISCUSS] Behaviour of startNewChain() in Streaming

2015-05-25 Thread Gyula Fóra
I see your point but this is a general problem with any property that we set on the operators itself. Same goes for instance for parallelism : input .map(new Map1()) .setParallelism(2) .map(new Map2)) .print() Do we change the parallelism after map 1 so it applies to map 2? Gyula On Mo

[DISCUSS] Behaviour of startNewChain() in Streaming

2015-05-25 Thread Aljoscha Krettek
Hi, I think people will be confused by the behaviour of startNewChain() in the streaming API. I myself had wrong assumptions about how it behaves when I was writing a test Job and the only other Job where someone not coming from Streaming wrote a Streaming Test is also making wrong assumptions. (St