Re: [I] Implement nested join optimization [datafusion]

2025-04-28 Thread via GitHub
alamb commented on issue #3843: URL: https://github.com/apache/datafusion/issues/3843#issuecomment-2836816875 - I am trying to organize a join task force for planning joins / subqueries: https://github.com/apache/datafusion/issues/15885 -- This is an automated message from the Apache Git

Re: [I] Implement nested join optimization [datafusion]

2025-04-28 Thread via GitHub
alamb commented on issue #3843: URL: https://github.com/apache/datafusion/issues/3843#issuecomment-2836536572 BTW I wrote some thoughts on join ordering in a blog post (part 2) - https://www.influxdata.com/blog/optimizing-sql-dataframes-part-one/ - -- This is an automated message fr

Re: [I] Implement nested join optimization [datafusion]

2025-02-18 Thread via GitHub
clflushopt commented on issue #3843: URL: https://github.com/apache/datafusion/issues/3843#issuecomment-2667413769 Hey @alamb following up on #14688 I made a new pull request in #14735 to add an example that demonstrates how analysis works for `AND` conjunctions and a placeholder for `OR` c

Re: [I] Implement nested join optimization [datafusion]

2025-02-16 Thread via GitHub
alamb commented on issue #3843: URL: https://github.com/apache/datafusion/issues/3843#issuecomment-2661556159 > Hey [@alamb](https://github.com/alamb) I have a small change in [#14688](https://github.com/apache/datafusion/pull/14688) to demo boundary analysis (as I understand from the exist

Re: [I] Implement nested join optimization [datafusion]

2025-02-15 Thread via GitHub
clflushopt commented on issue #3843: URL: https://github.com/apache/datafusion/issues/3843#issuecomment-2661208132 Hey @alamb I have a small change[1] to demo boundary analysis (as I understand from the existing code), If this looks like a suitable initial example I can add one that demonst

Re: [I] Implement nested join optimization [datafusion]

2025-02-12 Thread via GitHub
alamb commented on issue #3843: URL: https://github.com/apache/datafusion/issues/3843#issuecomment-2654991464 Thanks @clflushopt I don't have a great handle in my head on the current state of Boundary and Selectivity anaylsis. Maybe your first PRs could focus on adding some docs and

Re: [I] Implement nested join optimization [datafusion]

2025-02-10 Thread via GitHub
clflushopt commented on issue #3843: URL: https://github.com/apache/datafusion/issues/3843#issuecomment-2649742371 Hey @alamb thanks for the clear answer, yes that sounds good ! It's seems that both ticket for interval boundary and selectivity analysis for `AND` & `OR` conjunctions seem ope

Re: [I] Implement nested join optimization [datafusion]

2025-02-09 Thread via GitHub
clflushopt commented on issue #3843: URL: https://github.com/apache/datafusion/issues/3843#issuecomment-2646640247 @alamb quick question what's considered higher priority here between better join ordering approach (potentially like DuckDB's) vs picking up the couple tickets left in the EPIC

Re: [I] Implement nested join optimization [datafusion]

2025-02-08 Thread via GitHub
clflushopt commented on issue #3843: URL: https://github.com/apache/datafusion/issues/3843#issuecomment-2646048358 Small update; I started looking at the initial join selection rule implemented by backtracking from the reference to `HashBuildProbeOrder` and did a review over the PR that int

Re: [I] Implement nested join optimization [datafusion]

2025-02-06 Thread via GitHub
clflushopt commented on issue #3843: URL: https://github.com/apache/datafusion/issues/3843#issuecomment-2641934903 take -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To u

Re: [I] Implement nested join optimization [datafusion]

2025-02-06 Thread via GitHub
clflushopt commented on issue #3843: URL: https://github.com/apache/datafusion/issues/3843#issuecomment-2641928732 Hi, I've been doing some reading on the side and I am interested into taking a stab at this if the issue is still open and no one is working on it. -- This is an automated me

Re: [I] Implement nested join optimization [datafusion]

2024-11-03 Thread via GitHub
maruschin commented on issue #3843: URL: https://github.com/apache/datafusion/issues/3843#issuecomment-2453801494 Hi, is there any progress? I can take the task for initial development. -- This is an automated message from the Apache Git Service. To respond to the message, please log on