Re: Towards a spec for robust streaming SQL, Part 1

2017-05-12 Thread Shaoxuan Wang
Tyler, Yes, dynamic table changes over time. You can find more details about dynamic table from this Flink blog ( https://flink.apache.org/news/2017/04/04/dynamic-tables.html). Fabian, me and Xiaowei posted it a week before the flink-forward@SF. "A dynamic table is a table that is continuously upda

Re: Towards a spec for robust streaming SQL, Part 1

2017-05-12 Thread Tyler Akidau
Being able to support an EMIT config independent of the query itself sounds great for compatible use cases (which should be many :-). Shaoxuan, can you please refresh my memory what a dynamic table means in Flink? It's basically just a state table, right? The "dynamic" part of the name is to simpl

Re: Towards a spec for robust streaming SQL, Part 1

2017-05-12 Thread Shaoxuan Wang
Thanks to Tyler and Fabian for sharing your thoughts. Regarding to the early/late update control of FLINK. IMO, each dynamic table can have an EMIT config. For FLINK table-API, this can be easily implemented in different manners, case by case. For instance, in window aggregate, we could define "wh

Re: Towards a spec for robust streaming SQL, Part 1

2017-05-12 Thread Fabian Hueske
2017-05-11 7:14 GMT+02:00 Tyler Akidau : > On Tue, May 9, 2017 at 3:06 PM Fabian Hueske wrote: > > > Hi Tyler, > > > > thank you very much for this excellent write-up and the super nice > > visualizations! > > You are discussing a lot of the things that we have been thinking about > as > > well f

Re: Towards a spec for robust streaming SQL, Part 1

2017-05-10 Thread Tyler Akidau
On Tue, May 9, 2017 at 3:06 PM Fabian Hueske wrote: > Hi Tyler, > > thank you very much for this excellent write-up and the super nice > visualizations! > You are discussing a lot of the things that we have been thinking about as > well from a different perspective. > IMO, yours and the Flink mod

Re: Towards a spec for robust streaming SQL, Part 1

2017-05-09 Thread Fabian Hueske
Hi Tyler, thank you very much for this excellent write-up and the super nice visualizations! You are discussing a lot of the things that we have been thinking about as well from a different perspective. IMO, yours and the Flink model are pretty much aligned although we use a different terminology

Re: Towards a spec for robust streaming SQL, Part 1

2017-05-08 Thread Tyler Akidau
Any thoughts here Fabian? I'm planning to start sending out some more emails towards the end of the week. -Tyler On Wed, Apr 26, 2017 at 8:18 AM Tyler Akidau wrote: > No worries, thanks for the heads up. Good luck wrapping all that stuff up. > > -Tyler > > On Tue, Apr 25, 2017 at 12:07 AM Fabi

Re: Towards a spec for robust streaming SQL, Part 1

2017-04-26 Thread Tyler Akidau
No worries, thanks for the heads up. Good luck wrapping all that stuff up. -Tyler On Tue, Apr 25, 2017 at 12:07 AM Fabian Hueske wrote: > Hi Tyler, > > thanks for pushing this effort and including the Flink list. > I haven't managed to read the doc yet, but just wanted to thank you for the > wr

Re: Towards a spec for robust streaming SQL, Part 1

2017-04-25 Thread Fabian Hueske
Hi Tyler, thanks for pushing this effort and including the Flink list. I haven't managed to read the doc yet, but just wanted to thank you for the write-up and let you know that I'm very interested in this discussion. We are very close to the feature freeze of Flink 1.3 and I'm quite busy getting

Re: Towards a spec for robust streaming SQL, Part 1

2017-04-21 Thread Tyler Akidau
Good point, when you start talking about anything less than a full join, triggers get involved to describe how one actually achieves the desired semantics, and they may end up being tied to just one of the inputs (e.g., you may only care about the watermark for one side of the join). Am expecting u

Re: Towards a spec for robust streaming SQL, Part 1

2017-04-21 Thread Kenneth Knowles
There's something to be said about having different triggering depending on which side of a join data comes from, perhaps? (delightful doc, as usual) Kenn On Fri, Apr 21, 2017 at 1:33 PM, Tyler Akidau wrote: > Thanks for reading, Luke. The simple answer is that CoGBK is basically > flatten + G

Re: Towards a spec for robust streaming SQL, Part 1

2017-04-21 Thread Tyler Akidau
Thanks for reading, Luke. The simple answer is that CoGBK is basically flatten + GBK. Flatten is a non-grouping operation that merges the input streams into a single output stream. GBK then groups the data within that single union stream as you might otherwise expect, yielding a single table. So I

Re: Towards a spec for robust streaming SQL, Part 1

2017-04-21 Thread Lukasz Cwik
The doc is a good read. I think you do a great job of explaining table -> stream, stream -> stream, and stream -> table when there is only one stream. But when there are multiple streams reading/writing to a table, how does that impact what occurs? For example, with CoGBK you have multiple streams

Towards a spec for robust streaming SQL, Part 1

2017-04-20 Thread Tyler Akidau
Hello Beam, Calcite, and Flink dev lists! Apologies for the big cross post, but I thought this might be something all three communities would find relevant. Beam is finally making progress on a SQL DSL utilizing Calcite, thanks to Mingmin Xu. As you can imagine, we need to come to some conclusion