Yi Hu created CALCITE-7101:
------------------------------

             Summary: Query parsed to LogicalJoin changed to LogicalFilter 
causing ClassCastException then CannotPlanException in newer versions
                 Key: CALCITE-7101
                 URL: https://issues.apache.org/jira/browse/CALCITE-7101
             Project: Calcite
          Issue Type: Bug
          Components: core
    Affects Versions: 1.39.0, 1.34.0
         Environment: Java11
            Reporter: Yi Hu


We are trying to upgrade Apache Calcite version in our project (Apache Beam 
https://github.com/apache/beam/pull/35588). Found a potential breaking change.

 

Prior to Calcite <=1.33, the following query works for the test

 
{quote}"select * from CUSTOMER "
+ " where exists ( "
+ " select * from ORDERS "
+ " where o_custkey = c_custkey )";{quote}
 
However, since Calcite 1.34 (to 1.38), the following error is observed
 
{quote}java.lang.RuntimeException: Error while applying rule FilterToCalcRule, 
args [rel#55:LogicalFilter.NONE(input=RelSubset#54,condition=EXISTS({

LogicalFilter(condition=[=($1, $cor0.c_custkey)])

  BeamIOSourceRel(table=[[beam, ORDERS]])

}),variablesSet=[$cor0])]

at 
org.apache.calcite.plan.volcano.VolcanoRuleCall.onMatch(VolcanoRuleCall.java:250)

at 
org.apache.calcite.plan.volcano.IterativeRuleDriver.drive(IterativeRuleDriver.java:59)

at 
org.apache.calcite.plan.volcano.VolcanoPlanner.findBestExp(VolcanoPlanner.java:523)

at org.apache.calcite.tools.Programs$RuleSetProgram.run(Programs.java:317)

at org.apache.calcite.prepare.PlannerImpl.transform(PlannerImpl.java:385)

at 
org.apache.beam.sdk.extensions.sql.impl.CalciteQueryPlanner.convertToBeamRel(CalciteQueryPlanner.java:210)



Caused by: java.lang.ClassCastException: class 
org.apache.calcite.rex.RexSubQuery cannot be cast to class 
org.apache.calcite.rex.RexLocalRef (org.apache.calcite.rex.RexSubQuery and 
org.apache.calcite.rex.RexLocalRef are in unnamed module of loader 'app')

at 
org.apache.calcite.rex.RexProgramBuilder.registerInput(RexProgramBuilder.java:304)
{quote}
 

It is appeared due to CALCITE-6874, which is fixed in Calcite 1.39. However the 
fix does not fix our use case In 1.39/1.40, a different error is seen:

 
{quote}org.apache.beam.sdk.extensions.sql.impl.SqlConversionException: Unable 
to convert query select * from CUSTOMER  where exists (  select * from ORDERS  
where o_custkey = c_custkey )

at 
app//org.apache.beam.sdk.extensions.sql.impl.CalciteQueryPlanner.convertToBeamRel(CalciteQueryPlanner.java:214)

at 
app//org.apache.beam.sdk.extensions.sql.impl.BeamSqlEnv.parseQuery(BeamSqlEnv.java:116)

 

Caused by: org.apache.calcite.plan.RelOptPlanner$CannotPlanException: There are 
not enough rules to produce a node with desired properties: 
convention=BEAM_LOGICAL.

Missing conversion is LogicalFilter[convention: NONE -> BEAM_LOGICAL]

at 
app//org.apache.calcite.plan.volcano.RelSubset$CheapestPlanReplacer.visit(RelSubset.java:718)

at 
app//org.apache.calcite.plan.volcano.RelSubset.buildCheapestPlan(RelSubset.java:391)

at 
app//org.apache.calcite.plan.volcano.VolcanoPlanner.findBestExp(VolcanoPlanner.java:535)

at app//org.apache.calcite.tools.Programs$RuleSetProgram.run(Programs.java:353)

at app//org.apache.calcite.prepare.PlannerImpl.transform(PlannerImpl.java:385)

at 
app//org.apache.beam.sdk.extensions.sql.impl.CalciteQueryPlanner.convertToBeamRel(CalciteQueryPlanner.java:210)

... 48 more
{quote}
 

Taking a closer look, In Calcite<=1.33 the query is parsed (planner.parse)

as

 
{quote}LogicalProject(c_custkey=[$0], c_acctbal=[$1], c_city=[$2])

  LogicalProject(c_custkey=[$0], c_acctbal=[$1], c_city=[$2], 
o_custkey=[CAST($3):INTEGER], $f1=[CAST($4):BOOLEAN])

    LogicalJoin(condition=[=($0, $3)], joinType=[inner])

      BeamIOSourceRel(table=[[beam, CUSTOMER]])

      LogicalAggregate(group=[\{0}], agg#0=[MIN($1)])

        LogicalProject(o_custkey=[$1], $f0=[true])

          BeamIOSourceRel(table=[[beam, ORDERS]])
{quote}
 

Then send to planner.rel(). After Calcite 1.34, the query parsed differently

 
{quote}LogicalProject(c_custkey=[$0], c_acctbal=[$1], c_city=[$2]) 
LogicalFilter(condition=[EXISTS(\{ LogicalFilter(condition=[=($1, 
$cor0.c_custkey)]) BeamIOSourceRel(table=[[beam, ORDERS]]) })], 
variablesSet=[[$cor0]]) BeamIOSourceRel(table=[[beam, CUSTOMER]]){quote}
 

such that a LogicalFilter no longer converted to LogicalJoin. Then later on 
planner.rel() fails.

 

What might have caused this change (between Calcite 1.33 -> 1.34). This is 
currently blocking upgrade. Is it possible to mitigate this without 
implementing a custom "LogicalFilter[convention: NONE -> BEAM_LOGICAL]" rule 
such that Calcite still knows to apply FilterToJoinRule?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to