Re: [DISCUSS] Dedicated streaming mode

2015-05-27 Thread Maximilian Michels
Hi Henry! I think the idea was to have a dedicated streaming mode as long as the default cluster mode does not support batch and streaming equally well. Once we have reached this level in the dedicated streaming mode, this will be the default cluster mode. I share your doubts about whether it is a

Re: [DISCUSS] Dedicated streaming mode

2015-05-26 Thread Henry Saputra
Ah yes, technically the streaming mode could run batch jobs as well in Flink. I am thinking that it could cause confusion with users since most systems that does batch and stream (well, pretty much Spark ^_^) does not differentiate the deployment topologies for the cluster to support different mode

Re: [DISCUSS] Dedicated streaming mode

2015-05-26 Thread Stephan Ewen
The streaming mode runs batch jobs as well :-) There should be slightly reduced predictability in the memory management in the streaming mode, but otherwise there should not be a problem. So if you want to run mixed workloads, you start the streaming mode. (Note: Currently, the batch mode runs

Re: [DISCUSS] Dedicated streaming mode

2015-05-26 Thread Henry Saputra
One immediate concern I have is the deployment topology. With streaming has its own cluster deployment, this means that in standalone mode, if ops would like to deploy Flink it has to know what mode it needs to deploy Flink as, either batch or Streaming. So, if the use case was to support both batc

Re: [DISCUSS] Dedicated streaming mode

2015-05-26 Thread Maximilian Michels
+1 great changes coming up! I like the idea that, ultimately, Flink will handle streaming and batch programs equally well independently of the chosen cluster startup mode. What is the time frame for these changes? On Tue, May 26, 2015 at 7:34 AM, Henry Saputra wrote: > Thanks Aljoscha and Steph

Re: [DISCUSS] Dedicated streaming mode

2015-05-25 Thread Henry Saputra
Thanks Aljoscha and Stephan, this helps - Henry On Fri, May 22, 2015 at 4:37 AM, Stephan Ewen wrote: > Aljoscha is right. There are plans to migrate the streaming state to the > MemoryManager as well, but streaming state is not managed at this point. > > What is managed in streaming jobs is the

Re: [DISCUSS] Dedicated streaming mode

2015-05-22 Thread Stephan Ewen
Aljoscha is right. There are plans to migrate the streaming state to the MemoryManager as well, but streaming state is not managed at this point. What is managed in streaming jobs is the data buffered and cached in the network stack. But that is a different memory pool than the memory manager. We

Re: [DISCUSS] Dedicated streaming mode

2015-05-22 Thread Aljoscha Krettek
Hi, streaming currently does not use any memory manager. All state is kept in Java Objects on the Java Heap, for example an ArrayList<> for the window buffer. On Thu, May 21, 2015 at 11:56 PM, Henry Saputra wrote: > Hi Stephan, Gyula, Paris, > > How does streaming currently different in term of m

Re: [DISCUSS] Dedicated streaming mode

2015-05-21 Thread Henry Saputra
Hi Stephan, Gyula, Paris, How does streaming currently different in term of memory management? Currently we only have one MemoryManager which is used by both modes I believe. - Henry On Thu, May 21, 2015 at 12:34 PM, Stephan Ewen wrote: > I discussed a bit via Skype with Gyula and Paris. > > >

Re: [DISCUSS] Dedicated streaming mode

2015-05-21 Thread Gyula Fóra
Huge +1 from my side :) Sorry for the late response. On Thu, May 21, 2015 at 9:54 PM, Aljoscha Krettek wrote: > This sounds very reasonable. > On May 21, 2015 9:34 PM, "Stephan Ewen" wrote: > > > I discussed a bit via Skype with Gyula and Paris. > > > > > > We thought about the following way t

Re: [DISCUSS] Dedicated streaming mode

2015-05-21 Thread Aljoscha Krettek
This sounds very reasonable. On May 21, 2015 9:34 PM, "Stephan Ewen" wrote: > I discussed a bit via Skype with Gyula and Paris. > > > We thought about the following way to do it: > > - We add a dedicated streaming mode for now. The streaming mode supersedes > the batch mode, so it can run both t

Re: [DISCUSS] Dedicated streaming mode

2015-05-21 Thread Stephan Ewen
I discussed a bit via Skype with Gyula and Paris. We thought about the following way to do it: - We add a dedicated streaming mode for now. The streaming mode supersedes the batch mode, so it can run both type of programs. - The streaming mode sets the memory manager to "lazy allocation".

Re: [DISCUSS] Dedicated streaming mode

2015-05-21 Thread Aljoscha Krettek
Would it not be possible to start the snapshot service once the user starts the first streaming job? About 2) with checkpointing coming up, would it not make sense to shift to managed memory rather sooner than later. Then this point would become moot. On Thu, May 21, 2015 at 3:47 PM, Matthias J. S

Re: [DISCUSS] Dedicated streaming mode

2015-05-21 Thread Matthias J. Sax
What would be the consequences on "mixed" programs? (If there is any plan to support those?) Would it be necessary to have a third mode? Or would those programs simple run in streaming mode? -Matthias On 05/21/2015 03:12 PM, Stephan Ewen wrote: > Hi all! > > We discussed a while back about intr

[DISCUSS] Dedicated streaming mode

2015-05-21 Thread Stephan Ewen
Hi all! We discussed a while back about introducing a dedicated streaming mode for Flink. I would like to take a go at this and implement the changes, but discuss them before. Here is a brief summary why we wanted to introduce the dedicated streaming mode: Even though both batch and streaming ar

Re: [DISCUSS] Dedicated streaming mode and start scripts

2015-02-27 Thread Márton Balassi
Today we had a discussion with Robert on this issue. I would like to eventually have the streaming grouped and the windowing buffers/state maybe along with the crucial state of the user in the managed memory. If we had this separating the two modes could became less important as streaming would als

Re: [DISCUSS] Dedicated streaming mode and start scripts

2015-02-18 Thread Paris Carbone
+1 I agree it’s a proper way to go. On 18 Feb 2015, at 10:41, Max Michels mailto:m...@apache.org>> wrote: +1 On Tue, Feb 17, 2015 at 2:40 PM, Aljoscha Krettek mailto:aljos...@apache.org>> wrote: +1 On Tue, Feb 17, 2015 at 1:34 PM, Till Rohrmann mailto:trohrm...@apache.org>> wrote: +1 On Tu

Re: [DISCUSS] Dedicated streaming mode and start scripts

2015-02-18 Thread Max Michels
+1 On Tue, Feb 17, 2015 at 2:40 PM, Aljoscha Krettek wrote: > +1 > > On Tue, Feb 17, 2015 at 1:34 PM, Till Rohrmann wrote: >> +1 >> >> On Tue, Feb 17, 2015 at 1:34 PM, Kostas Tzoumas wrote: >> >>> +1 >>> >>> On Tue, Feb 17, 2015 at 12:14 PM, Márton Balassi >>> wrote: >>> >>> > When it comes to

Re: [DISCUSS] Dedicated streaming mode and start scripts

2015-02-17 Thread Aljoscha Krettek
+1 On Tue, Feb 17, 2015 at 1:34 PM, Till Rohrmann wrote: > +1 > > On Tue, Feb 17, 2015 at 1:34 PM, Kostas Tzoumas wrote: > >> +1 >> >> On Tue, Feb 17, 2015 at 12:14 PM, Márton Balassi >> wrote: >> >> > When it comes to the current use cases I'm for this separation. >> > @Ufuk: As Gyula has alre

Re: [DISCUSS] Dedicated streaming mode and start scripts

2015-02-17 Thread Till Rohrmann
+1 On Tue, Feb 17, 2015 at 1:34 PM, Kostas Tzoumas wrote: > +1 > > On Tue, Feb 17, 2015 at 12:14 PM, Márton Balassi > wrote: > > > When it comes to the current use cases I'm for this separation. > > @Ufuk: As Gyula has already pointed out with the current design of > > integration it should not

Re: [DISCUSS] Dedicated streaming mode and start scripts

2015-02-17 Thread Kostas Tzoumas
+1 On Tue, Feb 17, 2015 at 12:14 PM, Márton Balassi wrote: > When it comes to the current use cases I'm for this separation. > @Ufuk: As Gyula has already pointed out with the current design of > integration it should not be a problem. Even if we submitted programs to > the wrong clusters it wou

Re: [DISCUSS] Dedicated streaming mode and start scripts

2015-02-17 Thread Márton Balassi
When it comes to the current use cases I'm for this separation. @Ufuk: As Gyula has already pointed out with the current design of integration it should not be a problem. Even if we submitted programs to the wrong clusters it would only cause performance issues. Eventually it would be nice to have

Re: [DISCUSS] Dedicated streaming mode and start scripts

2015-02-17 Thread Gyula Fóra
So the current setup is to share results between the two apis by files. So I dont see any reason why this couldnt work with the 2 cluster setup. It makes deployment a little trickier but still feasible. On Tue, Feb 17, 2015 at 11:55 AM, Ufuk Celebi wrote: > I think this separation reflects the w

Re: [DISCUSS] Dedicated streaming mode and start scripts

2015-02-17 Thread Ufuk Celebi
I think this separation reflects the way that Flink is used currently anyways. I would be in favor of it as well. - What about the ongoing efforts (I think by Gyula) to combine both the batch and stream processing APIs? I assume that this would only effect the performance and wouldn't pose a funda

Re: [DISCUSS] Dedicated streaming mode and start scripts

2015-02-17 Thread Gyula Fóra
+1 Let's do this soon to avoid performance issues for streaming. On Tue, Feb 17, 2015 at 11:39 AM, Fabian Hueske wrote: > sounds like a good idea to me. > +1 > > 2015-02-17 11:28 GMT+01:00 Stephan Ewen : > > > Hi everyone! > > > > What do you think about making the streaming execution mode of th

Re: [DISCUSS] Dedicated streaming mode and start scripts

2015-02-17 Thread Fabian Hueske
sounds like a good idea to me. +1 2015-02-17 11:28 GMT+01:00 Stephan Ewen : > Hi everyone! > > What do you think about making the streaming execution mode of the system > explicit? That means that people start a Flink cluster explicitly in Batch > mode or in Streaming mode. > > The rational behin

[DISCUSS] Dedicated streaming mode and start scripts

2015-02-17 Thread Stephan Ewen
Hi everyone! What do you think about making the streaming execution mode of the system explicit? That means that people start a Flink cluster explicitly in Batch mode or in Streaming mode. The rational behind this idea is that I am not sure how batch and streaming clusters are really shared in a