[ 
https://issues.apache.org/jira/browse/HIVE-15808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Deepak Jaiswal updated HIVE-15808:
----------------------------------
    Description: If there is a semijoin branch on the same operator pipeline 
which contains a hash join then it is by design on big table which is not 
optimal. The operator cycle detection logic may not find a cycle as there is no 
cycle at operator level. However, once Tez builds its task there can be a cycle 
at task level causing the query to fail.  (was: It is found that the current 
logic of cycle detection does not find cycles created when there is a semijoin 
branch parallel to a hash join on a reducer.
To avoid such cycles, remove the semijoin reduction optimization.)
        Summary: Remove semijoin reduction branch if it is on bigtable along 
with hash join  (was: Remove Semijoin reduction branch on reducers if there is 
hash join)

> Remove semijoin reduction branch if it is on bigtable along with hash join
> --------------------------------------------------------------------------
>
>                 Key: HIVE-15808
>                 URL: https://issues.apache.org/jira/browse/HIVE-15808
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Deepak Jaiswal
>            Assignee: Deepak Jaiswal
>         Attachments: HIVE-15808.patch
>
>
> If there is a semijoin branch on the same operator pipeline which contains a 
> hash join then it is by design on big table which is not optimal. The 
> operator cycle detection logic may not find a cycle as there is no cycle at 
> operator level. However, once Tez builds its task there can be a cycle at 
> task level causing the query to fail.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to