Re: [PR] Set HashJoin seed [datafusion]

2025-04-29 Thread via GitHub
alamb commented on PR #15783: URL: https://github.com/apache/datafusion/pull/15783#issuecomment-2838662506 Thanks again @ctsk -- sorry for the delay in review / merge -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use th

Re: [PR] Set HashJoin seed [datafusion]

2025-04-29 Thread via GitHub
alamb merged PR #15783: URL: https://github.com/apache/datafusion/pull/15783 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusi

Re: [PR] Set HashJoin seed [datafusion]

2025-04-28 Thread via GitHub
alamb commented on PR #15783: URL: https://github.com/apache/datafusion/pull/15783#issuecomment-2837274656 > │ QQuery 4 │ 887.43ms │ 740.13ms │ +1.20x faster │ Given these queries don't have joins I am not sure that is reproduceable 😬 -- This is an automated message

Re: [PR] Set HashJoin seed [datafusion]

2025-04-28 Thread via GitHub
alamb commented on PR #15783: URL: https://github.com/apache/datafusion/pull/15783#issuecomment-2837217773 🤖: Benchmark completed Details ``` Comparing HEAD and fix_hash-join-seed Benchmark clickbench_extended.json

Re: [PR] Set HashJoin seed [datafusion]

2025-04-28 Thread via GitHub
alamb commented on code in PR #15783: URL: https://github.com/apache/datafusion/pull/15783#discussion_r2065171783 ## datafusion/physical-plan/src/joins/hash_join.rs: ## @@ -86,6 +86,10 @@ use datafusion_physical_expr_common::physical_expr::fmt_sql; use futures::{ready, Stream,

Re: [PR] Set HashJoin seed [datafusion]

2025-04-28 Thread via GitHub
alamb commented on PR #15783: URL: https://github.com/apache/datafusion/pull/15783#issuecomment-2837160708 🤖 `./gh_compare_branch.sh` [Benchmark Script](https://github.com/alamb/datafusion-benchmarking/blob/main/gh_compare_branch.sh) Running Linux aal-dev 6.11.0-1013-gcp #13~24.04.1-Ubun

Re: [PR] Set HashJoin seed [datafusion]

2025-04-21 Thread via GitHub
Weijun-H commented on code in PR #15783: URL: https://github.com/apache/datafusion/pull/15783#discussion_r2052164212 ## datafusion/physical-plan/src/joins/hash_join.rs: ## @@ -86,6 +86,9 @@ use datafusion_physical_expr_common::physical_expr::fmt_sql; use futures::{ready, Stream

[PR] Set HashJoin seed [datafusion]

2025-04-20 Thread via GitHub
ctsk opened a new pull request, #15783: URL: https://github.com/apache/datafusion/pull/15783 ## Which issue does this PR close? - Closes #15620. ## What changes are included in this PR? The hash join seed is hard-coded to a different value that the RepartitionExec seed.