But the join requirement is to match the records from two streams occurring
within one hour (besides the normal join key condition), if I use the
second join window, those records wouldn't be in the same window any more.



On Thu, Apr 28, 2016 at 11:47 PM, Ashutosh Kumar <kmr.ashutos...@gmail.com>
wrote:

> Time unit can be in seconds as well. Is there specific need to get bursts
> hourly?
>
> On Fri, Apr 29, 2016 at 11:48 AM, Henry Cai <h...@pinterest.com> wrote:
>
>> For the below standard stream/stream join, does flink store the results
>> of stream 1 and stream 2 into state store for the current hour and at the
>> end of the hour window it will fire the window by iterating through all
>> stored elements in the state store to find join matches?
>>
>> My concern is during most of the time in the hour, the output (assuming
>> the output is going to another stream) will be idle and on each hour mark
>> there will be huge outputs of joined records emitted, any way to make it
>> more gradual?
>>
>>
>> dataStream.join(otherStream)
>>     .where(0).equalTo(1)
>>     .window(TumblingEventTimeWindows.of(Time.hours(1)))
>>     .apply (new JoinFunction () {...});
>>
>>
>

Reply via email to