[ 
https://issues.apache.org/jira/browse/FLINK-2107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14960424#comment-14960424
 ] 

ASF GitHub Bot commented on FLINK-2107:
---------------------------------------

GitHub user fhueske opened a pull request:

    https://github.com/apache/flink/pull/1262

    [FLINK-2107] Add hash-based strategies for left and right outer joins.

    This PR adds hash-based execution strategies for left and right outer 
joins, that have the outer side as the probe side of a hash table.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/fhueske/flink outerJoinHash

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/flink/pull/1262.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #1262
    
----
commit e24f487112694e1cd757601f5ad59037c0312499
Author: Fabian Hueske <[email protected]>
Date:   2015-10-15T08:58:58Z

    [FLINK-2107] Add hash-based strategies for left and right outer joins.

----


> Implement Hash Outer Join algorithm
> -----------------------------------
>
>                 Key: FLINK-2107
>                 URL: https://issues.apache.org/jira/browse/FLINK-2107
>             Project: Flink
>          Issue Type: New Feature
>          Components: Local Runtime
>            Reporter: Fabian Hueske
>            Assignee: Chiwan Park
>            Priority: Minor
>             Fix For: pre-apache
>
>
> Flink does not natively support outer joins at the moment.
> This issue proposes to implement a hash outer join algorithm that can cover 
> left and right outer joins.
> The implementation can be based on the regular hash join iterators (for 
> example `ReusingBuildFirstHashMatchIterator` and 
> `NonReusingBuildFirstHashMatchIterator`, see also `MatchDriver` class)
> The Reusing and NonReusing variants differ in whether object instances are 
> reused or new objects are created. I would start with the NonReusing variant 
> which is safer from a user's point of view and should also be easier to 
> implement.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to