milenkovicm opened a new issue, #14679: URL: https://github.com/apache/datafusion/issues/14679
### Describe the bug While testing ballista builds with latest main I've noticed tests failing with: ``` Error: Internal("Could not create `ExprBoundaries`: in `try_from_column` `col_index` \n has gone out of bounds with a value of 3, the schema has 3 columns.") ``` This was not the case with df 45, nor there is a problem if remote context is replaced with datafusion context. ### To Reproduce Apparently difference between datafusion and ballista execution is logical and physical plan plans serde. After looking at the wrong place (logical plan) I've managed to reproduced it with: ```rust let ctx = SessionContext::new(); ctx.register_parquet( "test", "alltypes_plain.parquet", Default::default(), ) .await?; let plan = ctx .sql("select string_col, timestamp_col from test where id > 4") .await? .create_physical_plan() .await?; let node: PhysicalPlanNode = PhysicalPlanNode::try_from_physical_plan( plan, &DefaultPhysicalExtensionCodec {}, )?; // fails here let plan = node.try_into_physical_plan( &ctx, &ctx.runtime_env(), &DefaultPhysicalExtensionCodec {}, )?; let _ = plan.execute(0, ctx.task_ctx()).unwrap(); ``` where parquet file can be found at https://github.com/apache/datafusion-ballista/blob/46a67459e61467a2e86c23f0c1c2920dd49c877f/ballista/client/testdata/alltypes_plain.parquet datafusion commit used for testing [a104661](https://github.com/apache/datafusion/commit/a104661a020b895eb155af12575bafe693b8edaf) (for what its worth, this issue is there 15 - 16 commits in the past) note that queries, will execute without any problems: - `select string_col, timestamp_col from test` - `select * from test where id > 4` query will execute without problems without plan serde ### Expected behavior round trip to be successful ### Additional context #14631 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org