[ https://issues.apache.org/jira/browse/HIVE-24601?focusedWorklogId=542829&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-542829 ]
ASF GitHub Bot logged work on HIVE-24601: ----------------------------------------- Author: ASF GitHub Bot Created on: 27/Jan/21 13:34 Start Date: 27/Jan/21 13:34 Worklog Time Spent: 10m Work Description: zabetak commented on a change in pull request #1875: URL: https://github.com/apache/hive/pull/1875#discussion_r565310416 ########## File path: ql/src/java/org/apache/hadoop/hive/ql/parse/CalcitePlanner.java ########## @@ -679,27 +680,15 @@ Operator genOPTree(ASTNode ast, PlannerContext plannerCtx) throws SemanticExcept // Determine if we should re-throw the exception OR if we try to mark plan as reAnayzeAST to retry // planning as non-CBO. - if (e instanceof CalciteSubquerySemanticException || e instanceof CalciteViewSemanticException - || e instanceof CalciteSubqueryRuntimeException) { - // Non-CBO path for CalciteSubquerySemanticException fails with completely different error - // and masks the original failure. - // Non-CBO path for CalciteViewSemanticException would fail in a similar way as CBO path. - throw new SemanticException(e); - } - - boolean isHiveTest = conf.getBoolVar(ConfVars.HIVE_IN_TEST); - // At this point we retry with CBO off: - // 1) If this is not test mode (common case) - // 2) If we are in test mode and we are missing stats - // 3) if we are in test mode and a CalciteSemanticException is generated - reAnalyzeAST = (!isHiveTest || isMissingStats || e instanceof CalciteSemanticException); Review comment: Many thanks for keeping an eye @kgyrtkirk ! In production, (now `CONSERVATIVE` mode) `isHiveTest` is `false` so `!isHiveTest` becomes `true` thus no matter what happens with `isMissingStats` and `CalciteSemanticException`, we fallback to legacy optimizer. In tests, before these changes we fallaback to legacy when the problem is due to missing stats or CSE. I see this mostly as a convenient way to not update a big amount of existing tests which otherwise will fail. From my perspective the fact that CBO fails is not something that should be considered normal. When writting a test if we know that we cannot handle it with CBO then I think it would be better to explicitly disable CBO instead of relying on this fallback mechanism. After these changes, future tests might (depends on the kind of exception that they come with) fail if they are missing stats. Now regarding the `RuntimeException`: - in `ALWAYS` mode we fallback to legacy; - in `CONSERVATIVE` mode we fallback to legacy (same as before); - in `NEVER` mode we stop and fail the query; - in `TEST` mode we stop and fail the query (slightly different than before where we could retry if `RuntimeException` was paired with a missing stats problem). ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking ------------------- Worklog Id: (was: 542829) Time Spent: 2.5h (was: 2h 20m) > Control CBO fallback behavior via property > ------------------------------------------ > > Key: HIVE-24601 > URL: https://issues.apache.org/jira/browse/HIVE-24601 > Project: Hive > Issue Type: Improvement > Reporter: Stamatis Zampetakis > Assignee: Stamatis Zampetakis > Priority: Major > Labels: pull-request-available > Time Spent: 2.5h > Remaining Estimate: 0h > > When CBO optimizer fails there is a fallback mechanism(HIVE-7413) that will > retry to process the query using the legacy Hive optimizer. > There are use-cases where this behavior is not desirable notably for the > tests (HIVE-16058) but also for end users who would like to disable the > fall-back mechanism to avoid running problematic queries without realizing. > The goal of this issue is to introduce a dedicated Hive property controlling > this behavior,{{hive.cbo.fallback.enable}}, for both tests and production. > The default value should be true and tests should run with this property set > to false. -- This message was sent by Atlassian Jira (v8.3.4#803005)