[ 
https://issues.apache.org/jira/browse/HIVE-24969?focusedWorklogId=631956&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-631956
 ]

ASF GitHub Bot logged work on HIVE-24969:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 31/Jul/21 10:56
            Start Date: 31/Jul/21 10:56
    Worklog Time Spent: 10m 
      Work Description: dengzhhu653 commented on a change in pull request #2145:
URL: https://github.com/apache/hive/pull/2145#discussion_r680344086



##########
File path: ql/src/test/results/clientpositive/llap/subquery_multi.q.out
##########
@@ -4769,7 +4769,7 @@ POSTHOOK: Input: default@tempty
 85768  almond antique chartreuse lavender yellow       Manufacturer#1  
Brand#12        LARGE BRUSHED STEEL     34      SM BAG  1753.76 refull
 86428  almond aquamarine burnished black steel Manufacturer#1  Brand#12        
STANDARD ANODIZED STEEL 28      WRAP BAG        1414.42 arefully 
 90681  almond antique chartreuse khaki white   Manufacturer#3  Brand#31        
MEDIUM BURNISHED TIN    17      SM CASE 1671.68 are slyly after the sl
-Warning: Shuffle Join MERGEJOIN[39][tables = [$hdt$_0, $hdt$_1, $hdt$_2]] in 
Stage 'Reducer 3' is a cross product
+Warning: Shuffle Join MERGEJOIN[39][tables = [$hdt$_0, $hdt$_2]] in Stage 
'Reducer 3' is a cross product

Review comment:
       Same as above, the `$hdt$_1` does not participate in `Reducer 3`, as it 
produces no output after the semi join(`p_name IN (select p_name from 
part_null)`)




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 631956)
    Time Spent: 1.5h  (was: 1h 20m)

> Predicates may be removed when decorrelating subqueries with lateral
> --------------------------------------------------------------------
>
>                 Key: HIVE-24969
>                 URL: https://issues.apache.org/jira/browse/HIVE-24969
>             Project: Hive
>          Issue Type: Bug
>          Components: Logical Optimizer
>            Reporter: Zhihua Deng
>            Assignee: Zhihua Deng
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> Step to reproduce:
> {code:java}
> select count(distinct logItem.triggerId)
> from service_stat_log LATERAL VIEW explode(logItems) LogItemTable AS logItem
> where logItem.dsp in ('delivery', 'ocpa')
> and logItem.iswin = true
> and logItem.adid in (
>  select distinct adId
>  from ad_info
>  where subAccountId in (16010, 14863));  {code}
> For predicates _logItem.dsp in ('delivery', 'ocpa')_  and _logItem.iswin = 
> true_ are removed when doing ppd: JOIN ->   RS  -> LVJ.  The JOIN has 
> candicates: logitem -> [logItem.dsp in ('delivery', 'ocpa'), logItem.iswin = 
> true],when pushing them to the RS followed by LVJ,  none of them are pushed, 
> the candicates of logitem are removed finally by default, which cause to the 
> wrong result.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to