[ 
https://issues.apache.org/jira/browse/HIVE-24601?focusedWorklogId=542829&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-542829
 ]

ASF GitHub Bot logged work on HIVE-24601:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 27/Jan/21 13:34
            Start Date: 27/Jan/21 13:34
    Worklog Time Spent: 10m 
      Work Description: zabetak commented on a change in pull request #1875:
URL: https://github.com/apache/hive/pull/1875#discussion_r565310416



##########
File path: ql/src/java/org/apache/hadoop/hive/ql/parse/CalcitePlanner.java
##########
@@ -679,27 +680,15 @@ Operator genOPTree(ASTNode ast, PlannerContext 
plannerCtx) throws SemanticExcept
 
           // Determine if we should re-throw the exception OR if we try to 
mark plan as reAnayzeAST to retry
           // planning as non-CBO.
-          if (e instanceof CalciteSubquerySemanticException || e instanceof 
CalciteViewSemanticException
-              || e instanceof CalciteSubqueryRuntimeException) {
-            // Non-CBO path for CalciteSubquerySemanticException fails with 
completely different error
-            // and masks the original failure.
-            // Non-CBO path for CalciteViewSemanticException would fail in a 
similar way as CBO path.
-            throw new SemanticException(e);
-          }
-
-          boolean isHiveTest = conf.getBoolVar(ConfVars.HIVE_IN_TEST);
-          // At this point we retry with CBO off:
-          // 1) If this is not test mode (common case)
-          // 2) If we are in test mode and we are missing stats
-          // 3) if we are in test mode and a CalciteSemanticException is 
generated
-          reAnalyzeAST = (!isHiveTest || isMissingStats ||  e instanceof 
CalciteSemanticException);

Review comment:
       Many thanks for keeping an eye @kgyrtkirk !
   
   In production, (now `CONSERVATIVE` mode) `isHiveTest` is `false` so 
`!isHiveTest` becomes `true` thus no matter what happens with `isMissingStats` 
and `CalciteSemanticException`, we fallback to legacy optimizer.
   
   In tests, before these changes we fallaback to legacy when the problem is 
due to missing stats or CSE. I see this mostly as a convenient way to not 
update a big amount of existing tests which otherwise will fail. From my 
perspective the fact that CBO fails is not something that should be considered 
normal. When writting a test if we know that we cannot handle it with CBO then 
I think it would be better to explicitly disable CBO instead of relying on this 
fallback mechanism. After these changes, future tests might (depends on the 
kind of exception that they come with) fail if they are missing stats.
   
   Now regarding the `RuntimeException`:
   
   - in `ALWAYS` mode we fallback to legacy;
   - in `CONSERVATIVE` mode we fallback to legacy (same as before);
   - in `NEVER` mode we stop and fail the query;
   - in `TEST` mode we stop and fail the query (slightly different than before 
where we could retry if `RuntimeException` was paired with a missing stats 
problem).




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 542829)
    Time Spent: 2.5h  (was: 2h 20m)

> Control CBO fallback behavior via property
> ------------------------------------------
>
>                 Key: HIVE-24601
>                 URL: https://issues.apache.org/jira/browse/HIVE-24601
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Stamatis Zampetakis
>            Assignee: Stamatis Zampetakis
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 2.5h
>  Remaining Estimate: 0h
>
> When CBO optimizer fails there is a fallback mechanism(HIVE-7413) that will 
> retry to process the query using the legacy Hive optimizer. 
> There are use-cases where this behavior is not desirable notably for the 
> tests (HIVE-16058) but also for end users who would like to disable the 
> fall-back mechanism to avoid running problematic queries without realizing.
> The goal of this issue is to introduce a dedicated Hive property controlling 
> this behavior,{{hive.cbo.fallback.enable}}, for both tests and production. 
> The default value should be true and tests should run with this property set 
> to false. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to