Re: [D] how to run tpch benchmark datafusion [datafusion]

2025-06-27 Thread via GitHub
GitHub user zhuqi-lucas added a comment to the discussion: how to run tpch benchmark datafusion It isn’t a bug in DataFusion so much as in how the TPCH benchmark runner expects your data laid out. By default it will look under your --path for one directory per table (named exactly after the t

Re: [I] Improve performance of ClickBench Q21 by removing the cast [datafusion]

2025-06-27 Thread via GitHub
zhuqi-lucas commented on issue #16591: URL: https://github.com/apache/datafusion/issues/16591#issuecomment-3015037296 > Wasn't clickbench writing strings as binary columns (an option was added for this)? > > Though there is something to say that filtering should not have to validata

Re: [D] Optimizing dafafusion Build in CI [datafusion]

2025-06-27 Thread via GitHub
GitHub user rain2307 closed a discussion: Optimizing dafafusion Build in CI My project depends on dafafusion and is built on CI, with CPU usage exceeding 90% and memory around 24GB. Are there any ways to optimize it? For example, by reducing features or similar adjustments—I'm currently using

Re: [D] DISCUSSION: DataFusion Meetup in New York, NY, USA [datafusion]

2025-06-27 Thread via GitHub
GitHub user leoDYL added a comment to the discussion: DISCUSSION: DataFusion Meetup in New York, NY, USA I'll get in touch! GitHub link: https://github.com/apache/datafusion/discussions/16265#discussioncomment-13600313 This is an automatically sent email for github@datafusion.apache.org