adriangb commented on PR #16080:
URL: https://github.com/apache/datafusion/pull/16080#issuecomment-2992639239

   Thanks for linking Andrew.
   
   So it seems to me that previously:
   - Statistics were collected by `ListingTable` despite the config being set 
to not collect them
   - This PR made `ListingTable` obey the config, but in doing so changed the 
default for some use patterns of `ListingTable` from `true` to `false`
   - #16158 made the config default `true` - thus reverting the change for the 
users in the point above but introduce a change for other users, e.g. I think 
default usage of `datafusion-cli` will now collect stats wheres it did not 
before
   
   My read of this is that collecting stats can be beneficial or hurt depending 
on details of the query, etc. and before this PR there were 2 different 
defaults going around and after there is just one. So we just need to decide 
what the default is. We've decided it's going to be `true`. There's a 
possibility that other users complain about the behavior change by setting the 
default to `true` in which case we either have to stick to our guns with a 
choice (my vote) or revert this PR and go back to having 2 different defaults.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to