Re: correctness issue on chained streaming-streaming join

2019-06-12 Thread Jungtaek Lim
General representation of this issue would be:
- a stateful operator evicts rows from state when the watermark passes by
- in append mode, the evicted rows are used as output rows, in other words, as input rows for the next stateful operator
- the next stateful operator would discard late input rows using same wat
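
The mechanism described above can be illustrated with a toy model. This is only a sketch under stated assumptions, not Spark code: the function names `evict` and `drop_late`, the integer event times, the watermark value, and the `<=` comparison are all hypothetical choices made for illustration. It shows that when both operators share one watermark, every row the first operator evicts is already late by the time it reaches the second.

```python
# Toy model of two chained stateful operators sharing one global watermark.
# All names and values here are illustrative assumptions, not Spark APIs.

def evict(state, watermark):
    """Split state into (evicted, kept); rows at or behind the watermark are evicted."""
    evicted = [t for t in state if t <= watermark]
    kept = [t for t in state if t > watermark]
    return evicted, kept

def drop_late(rows, watermark):
    """Discard rows at or behind the watermark, keeping only on-time rows."""
    return [t for t in rows if t > watermark]

watermark = 10                 # single watermark shared by both operators
first_state = [5, 8, 12]       # event times buffered in the first operator

# Append mode: evicted rows are the first operator's output.
evicted, kept = evict(first_state, watermark)

# The second operator filters its input with the same watermark.
accepted = drop_late(evicted, watermark)

print(evicted)   # [5, 8]
print(kept)      # [12]
print(accepted)  # []
```

In this model the second operator accepts nothing, because a row is only emitted once it is behind the very watermark that the next operator uses to reject late input.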

correctness issue on chained streaming-streaming join

2019-06-12 Thread Jungtaek Lim
Hi devs, While helping a user on the user mailing list, I started to suspect that chained streaming-streaming joins work incorrectly, but Structured Streaming doesn't prevent them. The reason is actually similar to why chained streaming aggregations are not supported in Structured Streaming, global watermark