alamb opened a new issue, #10926:
URL: https://github.com/apache/datafusion/issues/10926
### Describe the bug
I noticed this while working on
https://github.com/apache/datafusion/pull/10852 with @marvinlanhenke
Basially, when generating statistics for a non existent column, the
StatisticsExtractor will return a null array of the type of the column not a
UInt64Array
Specifically
https://github.com/apache/datafusion/blob/2f4347647172f6997448b2e24d322b50c856f3a0/datafusion/core/src/datasource/physical_plan/parquet/statistics.rs#L871-L886
The same problem exists for `data_page_null_counts` and
`data_page_row_counts` (not for `row_group_row_counts`
### To Reproduce
Try to call row_group_null_counts for a column that isn't in the parquet file
### Expected behavior
1. row_group_null_counts should always return an `UInt64Array` (not an
`ArrayRef`)
2. If there is not a column, the UInt64Array should be all nulls
### Additional context
_No response_
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]