milenkovicm opened a new issue, #14679:
URL: https://github.com/apache/datafusion/issues/14679

   ### Describe the bug
   
   While testing ballista builds with latest main I've noticed tests failing 
with:
   
   ```
   Error: Internal("Could not create `ExprBoundaries`: in `try_from_column` 
`col_index` \n                has gone out of bounds with a value of 3, the 
schema has 3 columns.")
   ```
   
   This was not the case with df 45, nor there is a problem if remote context 
is replaced with datafusion context.
   
   ### To Reproduce
   
   Apparently difference between datafusion and ballista execution is logical 
and physical plan plans serde. After looking at the wrong place (logical plan) 
I've managed to reproduced it with:
   
   ```rust
           let ctx = SessionContext::new();
           ctx.register_parquet(
               "test",
               "alltypes_plain.parquet",
               Default::default(),
           )
           .await?;
   
           let plan = ctx
               .sql("select string_col, timestamp_col from test where id > 4")
               .await?
               .create_physical_plan()
               .await?;
   
           let node: PhysicalPlanNode = 
PhysicalPlanNode::try_from_physical_plan(
               plan,
               &DefaultPhysicalExtensionCodec {},
           )?;
           // fails here
           let plan = node.try_into_physical_plan(
               &ctx,
               &ctx.runtime_env(),
               &DefaultPhysicalExtensionCodec {},
           )?;
   
           let _ = plan.execute(0, ctx.task_ctx()).unwrap();
   ```
   where parquet file can be found at 
https://github.com/apache/datafusion-ballista/blob/46a67459e61467a2e86c23f0c1c2920dd49c877f/ballista/client/testdata/alltypes_plain.parquet
   
   datafusion commit used for testing 
[a104661](https://github.com/apache/datafusion/commit/a104661a020b895eb155af12575bafe693b8edaf)
   
   (for what its worth, this issue is there 15 - 16 commits in the past)
   
   note that queries, will execute without any problems: 
   - `select string_col, timestamp_col from test`
   - `select * from test where id > 4`
   
   query will execute without problems without plan serde 
   
   
   
   ### Expected behavior
   
   round trip to be successful 
   
   ### Additional context
   
   #14631


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to