[ 
https://issues.apache.org/jira/browse/FLINK-9422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16493571#comment-16493571
 ] 

Fabian Hueske commented on FLINK-9422:
--------------------------------------

As for most relational operators, the user should not need to worry about the 
implementation.
If both inputs have the same time attribute, the first implementation is 
chosen, otherwise the second. If users want more confidence in the 
deduplication, they should configure a more conservative watermark 
strategy instead of waiting for two watermarks.
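
As a rough illustration of what "more conservative" can mean: a watermark that 
trails the highest timestamp seen so far by a larger delay gives late rows more 
time to arrive before state is dropped. The sketch below is plain Java, not 
Flink's API; the class name, method names, and delay values are assumptions 
made up for this example.

```java
/** Toy bounded-delay watermark generator (hypothetical, not a Flink class). */
final class BoundedDelayWatermarks {
    private final long maxDelayMillis;
    private long maxTimestamp = Long.MIN_VALUE;

    BoundedDelayWatermarks(long maxDelayMillis) {
        this.maxDelayMillis = maxDelayMillis;
    }

    /** Records the timestamp of an incoming row. */
    void onEvent(long timestampMillis) {
        maxTimestamp = Math.max(maxTimestamp, timestampMillis);
    }

    /** Watermark = highest timestamp seen minus the configured delay. */
    long currentWatermark() {
        if (maxTimestamp == Long.MIN_VALUE) {
            return Long.MIN_VALUE; // no rows seen yet
        }
        return maxTimestamp - maxDelayMillis;
    }
}
```

With the same input, a generator built with a delay of 5000 ms always reports a 
lower watermark than one built with 1000 ms, so it tolerates more 
out-of-orderness at the cost of holding state longer.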

If a user really wants to use the non-watermarked implementation, they can cast 
the time attribute (with WMs) into a regular timestamp.

> Dedicated operator for UNION on streaming tables with time attributes
> ---------------------------------------------------------------------
>
>                 Key: FLINK-9422
>                 URL: https://issues.apache.org/jira/browse/FLINK-9422
>             Project: Flink
>          Issue Type: Improvement
>          Components: Table API & SQL
>            Reporter: Fabian Hueske
>            Assignee: Ruidong Li
>            Priority: Minor
>
> We can implement a dedicated operator for a {{UNION}} operator on tables with 
> time attributes. Currently, {{UNION}} is translated into a {{UNION ALL}} and 
> a subsequent {{GROUP BY}} on all attributes without aggregation functions. 
> The state of the grouping operator is only cleaned up using state retention 
> timers. 
> The dedicated operator would leverage the monotonicity property of the time 
> attribute and watermarks to automatically clean up its state.
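
The cleanup idea in the description can be sketched as follows. Because UNION 
deduplicates on all attributes, including the time attribute, two duplicate 
rows always carry the same timestamp; once the watermark passes that timestamp, 
no further duplicate can arrive and the state for it can be dropped. This is a 
toy model in plain Java, not the proposed Flink operator: the class name, the 
String row representation, and the choice to discard late rows are all 
assumptions for illustration.

```java
import java.util.HashSet;
import java.util.Set;
import java.util.TreeMap;

/** Toy watermark-aware deduplicating UNION (hypothetical, not Flink code). */
final class WatermarkUnionSketch {
    // Rows seen so far, grouped by the value of their time attribute.
    private final TreeMap<Long, Set<String>> seenByTime = new TreeMap<>();
    private long currentWatermark = Long.MIN_VALUE;

    /** Returns true if the row is emitted, false if it is a duplicate or late. */
    boolean processRow(long rowtime, String row) {
        if (rowtime <= currentWatermark) {
            // Assumption: late rows are discarded, since their state is gone.
            return false;
        }
        // Set.add returns false when the same row was already seen at this time.
        return seenByTime.computeIfAbsent(rowtime, t -> new HashSet<>()).add(row);
    }

    /** Advances the watermark and drops state for all timestamps <= watermark. */
    void processWatermark(long watermark) {
        if (watermark > currentWatermark) {
            currentWatermark = watermark;
            seenByTime.headMap(watermark, true).clear();
        }
    }

    /** Number of rows currently held in state. */
    int stateSize() {
        return seenByTime.values().stream().mapToInt(Set::size).sum();
    }
}
```

In this model the state size is bounded by the number of distinct rows between 
the current watermark and the newest timestamp, without relying on state 
retention timers.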



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)