clflushopt commented on issue #3843: URL: https://github.com/apache/datafusion/issues/3843#issuecomment-2646048358
Small update: I started looking at the initial join selection rule by backtracking from the reference to `HashBuildProbeOrder`, and reviewed the PR that introduced the `JoinSelection` rule in the physical optimizer (ref: https://github.com/apache/datafusion/pull/4219). I've also been looking at the work done for the cost calculations and cost-based optimizations epic here: https://github.com/apache/datafusion/issues/3929.

I've also started revisiting DuckDB's no-statistics approach to get a better intuition for how their cardinality estimator works. Implementation-wise, I want to get a small set of queries with nested joins running first, so I have a better view of how they are currently handled before I draft an early implementation.