wiedld commented on PR #15503: URL: https://github.com/apache/datafusion/pull/15503#issuecomment-2787732689
> > Can we review the Statistics::with_fetch code for how the column statistics are treated? Because in this case we know that num_rows=2, but the column stats (min/max values) are gone, when I think they could be set to inexact?
>
> Are you suggesting that we change with_fetch to set them to inexact in this case?

Yes. Because we are removing the min/max values (instead of setting them to inexact when the limit is applied), we are not able to later assess which partitions have non-overlapping ranges. In other words, we will not be able to use the progressive eval exec.
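
To make the suggestion concrete, here is a minimal, self-contained sketch of the behavior being asked for. The types only mirror the general shape of DataFusion's statistics (an exact/inexact/absent precision wrapper around each value); they are not the real definitions. The names `with_fetch_keep_bounds` and `may_overlap` are hypothetical and used for illustration only. The idea is that rows surviving a limit are a subset of the input, so the old min/max still bound the values but may no longer be attained. They can therefore be demoted to inexact rather than dropped.

```rust
// Illustrative stand-ins only: NOT the real DataFusion types.
#[derive(Debug, Clone, PartialEq)]
enum Precision<T> {
    Exact(T),
    Inexact(T),
    Absent,
}

impl<T> Precision<T> {
    // Demote an exact value to inexact; inexact and absent stay unchanged.
    fn to_inexact(self) -> Self {
        match self {
            Precision::Exact(v) => Precision::Inexact(v),
            other => other,
        }
    }

    // Return the value if any estimate is present.
    fn value(&self) -> Option<&T> {
        match self {
            Precision::Exact(v) | Precision::Inexact(v) => Some(v),
            Precision::Absent => None,
        }
    }
}

#[derive(Debug, Clone, PartialEq)]
struct ColumnStats {
    min_value: Precision<i64>,
    max_value: Precision<i64>,
}

#[derive(Debug, Clone, PartialEq)]
struct Stats {
    num_rows: Precision<usize>,
    columns: Vec<ColumnStats>,
}

// Hypothetical limit adjustment: cap the row count at `fetch` and keep the
// column bounds as inexact instead of discarding them.
fn with_fetch_keep_bounds(stats: Stats, fetch: usize) -> Stats {
    let num_rows = match stats.num_rows {
        Precision::Exact(n) => Precision::Exact(n.min(fetch)),
        Precision::Inexact(n) => Precision::Inexact(n.min(fetch)),
        Precision::Absent => Precision::Absent,
    };
    let columns = stats
        .columns
        .into_iter()
        .map(|c| ColumnStats {
            min_value: c.min_value.to_inexact(),
            max_value: c.max_value.to_inexact(),
        })
        .collect();
    Stats { num_rows, columns }
}

// Hypothetical range check: with any bound missing we must assume the two
// ranges may overlap, which is what happens when min/max are dropped.
fn may_overlap(a: &ColumnStats, b: &ColumnStats) -> bool {
    match (
        a.min_value.value(),
        a.max_value.value(),
        b.min_value.value(),
        b.max_value.value(),
    ) {
        (Some(a_min), Some(a_max), Some(b_min), Some(b_max)) => {
            a_min <= b_max && b_min <= a_max
        }
        _ => true,
    }
}
```

With bounds kept as inexact, two partitions covering, say, 1..=9 and 20..=30 can still be recognized as non-overlapping after a limit. With the bounds absent, the check has to fall back to "may overlap".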