Re: [PR] fix case_column_or_null with nullable when conditions [datafusion]

2024-12-23 Thread via GitHub
richox commented on code in PR #13886: URL: https://github.com/apache/datafusion/pull/13886#discussion_r1896512603 ## datafusion/sqllogictest/test_files/case.slt: ## @@ -50,7 +50,7 @@ NULL 6 NULL NULL -7 +NULL Review Comment: done -- This is an automated message from

[PR] Default to ZSTD compression when writing Parquet [datafusion-python]

2024-12-23 Thread via GitHub
kosiew opened a new pull request, #981: URL: https://github.com/apache/datafusion-python/pull/981 # Which issue does this PR close? Closes #978. # Rationale for this change Currently, the write_parquet method defaults to "uncompressed" Parquet files, whi

Re: [I] Functionality of `array_repeat` udf [datafusion]

2024-12-23 Thread via GitHub
jatin510 commented on issue #13872: URL: https://github.com/apache/datafusion/issues/13872#issuecomment-2560679722 > I think we can return null for this case This will be a breaking change. @jayzhan211 cc: @alamb -- This is an automated message from the Apache

Re: [I] Default to some compression when writing Parquet [datafusion-python]

2024-12-23 Thread via GitHub
kosiew commented on issue #978: URL: https://github.com/apache/datafusion-python/issues/978#issuecomment-2560675582 take -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

Re: [PR] Introduce LogicalPlan invariants, begin automatically checking them [datafusion]

2024-12-23 Thread via GitHub
wiedld commented on code in PR #13651: URL: https://github.com/apache/datafusion/pull/13651#discussion_r1896088892 ## datafusion/optimizer/src/optimizer.rs: ## @@ -384,9 +394,16 @@ impl Optimizer { // rule handles recursion itself None =

Re: [PR] Introduce LogicalPlan invariants, begin automatically checking them [datafusion]

2024-12-23 Thread via GitHub
wiedld commented on PR #13651: URL: https://github.com/apache/datafusion/pull/13651#issuecomment-2560616923 Fixed the performance regression. It wasn't where we thought it was. The problems was a recursive check (down the LP) of the check_fields within the `assert_unique_field_names()

Re: [PR] Introduce LogicalPlan invariants, begin automatically checking them [datafusion]

2024-12-23 Thread via GitHub
wiedld commented on code in PR #13651: URL: https://github.com/apache/datafusion/pull/13651#discussion_r1896088892 ## datafusion/optimizer/src/optimizer.rs: ## @@ -384,9 +394,16 @@ impl Optimizer { // rule handles recursion itself None =

Re: [PR] Support (order by / sort) for DataFrameWriteOptions [datafusion]

2024-12-23 Thread via GitHub
zhuqi-lucas commented on PR #13874: URL: https://github.com/apache/datafusion/pull/13874#issuecomment-2560602968 Thank you @alamb @Dandandan for review, it makes sense we continue investigating the solution for communicating the ordering of a file back with the existing listing table imple

[I] Find a way to communicate the ordering of a file back with the existing listing table implementation [datafusion]

2024-12-23 Thread via GitHub
zhuqi-lucas opened a new issue, #13891: URL: https://github.com/apache/datafusion/issues/13891 ### Is your feature request related to a problem or challenge? We add support (order by / sort) for DataFrameWriteOptions, but when a user try to query the table which the file already order

Re: [I] sql result discrepency with sqlite, postgres and duckdb bug #3 [datafusion]

2024-12-23 Thread via GitHub
jonahgao commented on issue #13784: URL: https://github.com/apache/datafusion/issues/13784#issuecomment-2560545696 If [fail_on_overflow](https://github.com/apache/datafusion/blob/e718c1a5c5770c071c9c2e14a7681a7f1a2f3f23/datafusion/physical-expr/src/expressions/binary.rs#L88) is enabled, Dat

Re: [PR] Support 1 or 3 arg in generate_series() UDTF [datafusion]

2024-12-23 Thread via GitHub
goldmedal commented on PR #13856: URL: https://github.com/apache/datafusion/pull/13856#issuecomment-2560542467 Thanks @UBarney -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific commen

Re: [I] Support 1 or 3 arg in `generate_series()` UDTF [datafusion]

2024-12-23 Thread via GitHub
goldmedal closed issue #13615: Support 1 or 3 arg in `generate_series()` UDTF URL: https://github.com/apache/datafusion/issues/13615 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comme

Re: [PR] Support 1 or 3 arg in generate_series() UDTF [datafusion]

2024-12-23 Thread via GitHub
goldmedal merged PR #13856: URL: https://github.com/apache/datafusion/pull/13856 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@data

Re: [PR] Branch 43 downgraded tonic [datafusion]

2024-12-23 Thread via GitHub
matthewmturner closed pull request #13890: Branch 43 downgraded tonic URL: https://github.com/apache/datafusion/pull/13890 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To un

[PR] Branch 43 downgraded tonic [datafusion]

2024-12-23 Thread via GitHub
matthewmturner opened a new pull request, #13890: URL: https://github.com/apache/datafusion/pull/13890 ## Which issue does this PR close? Closes #. ## Rationale for this change ## What changes are included in this PR? ## Are these changes te

Re: [PR] ci improvements [datafusion]

2024-12-23 Thread via GitHub
Omega359 commented on code in PR #13876: URL: https://github.com/apache/datafusion/pull/13876#discussion_r1896196677 ## .github/workflows/rust.yml: ## @@ -288,17 +318,20 @@ jobs: mv *.tbl ../datafusion/sqllogictest/test_files/tpch/data - name: Verify that bench

Re: [PR] ci improvements [datafusion]

2024-12-23 Thread via GitHub
Omega359 commented on code in PR #13876: URL: https://github.com/apache/datafusion/pull/13876#discussion_r1896192661 ## .github/actions/setup-builder/action.yaml: ## @@ -42,6 +42,8 @@ runs: "${RETRY[@]}" rustup component add rustfmt - name: Configure rust runtime e

Re: [PR] ci improvements [datafusion]

2024-12-23 Thread via GitHub
Omega359 commented on code in PR #13876: URL: https://github.com/apache/datafusion/pull/13876#discussion_r1896187659 ## .github/actions/setup-builder/action.yaml: ## @@ -42,6 +42,8 @@ runs: "${RETRY[@]}" rustup component add rustfmt - name: Configure rust runtime e

Re: [I] Improve efficiency of CI checks (so we can add MORE!) [datafusion]

2024-12-23 Thread via GitHub
alamb commented on issue #13845: URL: https://github.com/apache/datafusion/issues/13845#issuecomment-2560403938 Thanks for the ideas @edmondop -- FWIW the ASF doens't have its own hosted workers (individual projects could do that -- arrow did for a time -- but it requires dedicated time /

Re: [PR] ci improvements [datafusion]

2024-12-23 Thread via GitHub
Omega359 commented on code in PR #13876: URL: https://github.com/apache/datafusion/pull/13876#discussion_r1896184558 ## .github/actions/setup-rust-runtime/action.yaml: ## @@ -34,5 +34,6 @@ runs: echo "RUSTC_WRAPPER=sccache" >> $GITHUB_ENV echo "SCCACHE_GHA_EN

Re: [PR] ci improvements [datafusion]

2024-12-23 Thread via GitHub
alamb commented on code in PR #13876: URL: https://github.com/apache/datafusion/pull/13876#discussion_r1896174869 ## .github/actions/setup-rust-runtime/action.yaml: ## @@ -34,5 +34,6 @@ runs: echo "RUSTC_WRAPPER=sccache" >> $GITHUB_ENV echo "SCCACHE_GHA_ENABL

[PR] Alamb/ci without cache [datafusion]

2024-12-23 Thread via GitHub
alamb opened a new pull request, #13889: URL: https://github.com/apache/datafusion/pull/13889 This is an experiment on top of - https://github.com/apache/datafusion/pull/13876 To test how much a difference the rust cache crate actually makes -- This is an automated messa

Re: [PR] Support (order by / sort) for DataFrameWriteOptions [datafusion]

2024-12-23 Thread via GitHub
alamb commented on code in PR #13874: URL: https://github.com/apache/datafusion/pull/13874#discussion_r1896169061 ## datafusion/core/src/dataframe/parquet.rs: ## @@ -74,8 +74,16 @@ impl DataFrame { let file_type = format_as_file_type(format); +let plan = if

Re: [PR] doc-gen: migrate builtin scalar functions documentation to attribute based [datafusion]

2024-12-23 Thread via GitHub
alamb commented on PR #13878: URL: https://github.com/apache/datafusion/pull/13878#issuecomment-2560387220 Marking as draft as I think this PR is no longer waiting on feedback. Please ping us on the other PRs 🙏 Thanks aain @Chen-Yuan-Lai -- This is an automated message from the A

Re: [I] Panic when querying a hive-partitioned parquet dataset created with wrong column name [datafusion]

2024-12-23 Thread via GitHub
jwimberl closed issue #10020: Panic when querying a hive-partitioned parquet dataset created with wrong column name URL: https://github.com/apache/datafusion/issues/10020 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use th

Re: [I] Panic when querying a hive-partitioned parquet dataset created with wrong column name [datafusion]

2024-12-23 Thread via GitHub
jwimberl commented on issue #10020: URL: https://github.com/apache/datafusion/issues/10020#issuecomment-2560386636 Belatedly confirmed that this issue is now resolved and using the wrong partition column name yields an error `Exception: DataFusion error: Plan("Inferred partitions to be ..."

Re: [PR] minor: fix typos in comments / structure names [datafusion]

2024-12-23 Thread via GitHub
alamb merged PR #13879: URL: https://github.com/apache/datafusion/pull/13879 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusi

Re: [PR] minor: fix typos in comments / structure names [datafusion]

2024-12-23 Thread via GitHub
alamb commented on PR #13879: URL: https://github.com/apache/datafusion/pull/13879#issuecomment-2560378852 Thanks again @zhuliquan and @berkaysynnada 🚀 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

Re: [PR] fix case_column_or_null with nullable when conditions [datafusion]

2024-12-23 Thread via GitHub
alamb commented on code in PR #13886: URL: https://github.com/apache/datafusion/pull/13886#discussion_r1896165275 ## datafusion/sqllogictest/test_files/case.slt: ## @@ -50,7 +50,7 @@ NULL 6 NULL NULL -7 +NULL Review Comment: Can you also please add a test to this file whe

Re: [I] Improve efficiency of CI checks (so we can add MORE!) [datafusion]

2024-12-23 Thread via GitHub
Omega359 commented on issue #13845: URL: https://github.com/apache/datafusion/issues/13845#issuecomment-2560352170 > > Maintaining an extended workflow shouldn't be too bad tbh. I think having a workflow that runs outside of PR's (iow runs nightly) could be useful as well for expensive test

Re: [I] Datafusion binary size has been getting bigger [datafusion]

2024-12-23 Thread via GitHub
Omega359 commented on issue #13816: URL: https://github.com/apache/datafusion/issues/13816#issuecomment-2560327632 ``` [profile.release] codegen-units = 1 strip = true panic = "abort" opt-level = "s" ``` Expanding on @comphead's idea adding opt-level = "s" reduced the

Re: [I] inner join involving hive-partitioned parquet dataset and filters on LHS and RHS causes panic [datafusion]

2024-12-23 Thread via GitHub
jwimberl commented on issue #9797: URL: https://github.com/apache/datafusion/issues/9797#issuecomment-2560326293 Apologies for the delay -- this issue no longer reproduces using the datafusion version 43! I did not attempt to find where between [37,43] the resolution happened but could if y

Re: [PR] [minor] make recursive package dependency optional [datafusion]

2024-12-23 Thread via GitHub
alamb commented on PR #13778: URL: https://github.com/apache/datafusion/pull/13778#issuecomment-2560323839 Here is a PR to try and fix this: https://github.com/apache/datafusion/pull/13887 -- This is an automated message from the Apache Git Service. To respond to the message, please log o

Re: [I] Improve efficiency of CI checks (so we can add MORE!) [datafusion]

2024-12-23 Thread via GitHub
edmondop commented on issue #13845: URL: https://github.com/apache/datafusion/issues/13845#issuecomment-2560320481 Other alternatives: # Split test executions in multiple jobs We can generate a docker image with the code and the tests compiled, and then having separate jobs that pu

Re: [PR] Fix `recursive-protection` feature flag [datafusion]

2024-12-23 Thread via GitHub
alamb commented on code in PR #13887: URL: https://github.com/apache/datafusion/pull/13887#discussion_r1895897853 ## datafusion/common/Cargo.toml: ## @@ -36,12 +36,11 @@ name = "datafusion_common" path = "src/lib.rs" [features] -default = ["recursive-protection"] avro = ["a

Re: [PR] chore: Consolidate Example: simplify_udwf_expression.rs into advanced_udwf.rs [datafusion]

2024-12-23 Thread via GitHub
alamb commented on PR #13883: URL: https://github.com/apache/datafusion/pull/13883#issuecomment-2560316188 Thanks again @xarus01 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comm

Re: [PR] chore: Consolidate Example: simplify_udwf_expression.rs into advanced_udwf.rs [datafusion]

2024-12-23 Thread via GitHub
alamb merged PR #13883: URL: https://github.com/apache/datafusion/pull/13883 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusi

Re: [I] Consolidate Example: simplify_udwf_expression.rs into advanced_udwf.rs #13842 [datafusion]

2024-12-23 Thread via GitHub
alamb closed issue #13843: Consolidate Example: simplify_udwf_expression.rs into advanced_udwf.rs #13842 URL: https://github.com/apache/datafusion/issues/13843 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

Re: [PR] feat: Add ConfigOptions to ScalarFunctionArgs [datafusion]

2024-12-23 Thread via GitHub
Omega359 commented on PR #13527: URL: https://github.com/apache/datafusion/pull/13527#issuecomment-2560262193 I think this may be ready for review again. For this round I refactored the code to use &ConfigOptions everywhere except for ScalarFunctionExpr so the cost for cloning ConfigOptions

Re: [PR] Introduce LogicalPlan invariants, begin automatically checking them [datafusion]

2024-12-23 Thread via GitHub
wiedld commented on PR #13651: URL: https://github.com/apache/datafusion/pull/13651#issuecomment-2560247024 We no longer have any per-optimizer-pass validation in release mode. Instead, the checks run before and after ALL optimizer passes are completed. When running in debug mode, the

Re: [PR] Add substrait tpch round trip tests from sql query [datafusion]

2024-12-23 Thread via GitHub
robtandy commented on PR #13888: URL: https://github.com/apache/datafusion/pull/13888#issuecomment-2560251488 Thank you @vbarua and @Blizzara for your review and comments! Yes, I think the existing tests do not go far enough, and I encountered bugs not covered by these tests already,

Re: [PR] Introduce LogicalPlan invariants, begin automatically checking them [datafusion]

2024-12-23 Thread via GitHub
wiedld commented on code in PR #13651: URL: https://github.com/apache/datafusion/pull/13651#discussion_r1896089589 ## datafusion/optimizer/src/optimizer.rs: ## @@ -445,35 +462,38 @@ impl Optimizer { } i += 1; } + +// verify LP is valid,

Re: [PR] Introduce LogicalPlan invariants, begin automatically checking them [datafusion]

2024-12-23 Thread via GitHub
wiedld commented on code in PR #13651: URL: https://github.com/apache/datafusion/pull/13651#discussion_r1896088892 ## datafusion/optimizer/src/optimizer.rs: ## @@ -384,9 +394,16 @@ impl Optimizer { // rule handles recursion itself None =

Re: [PR] feat: support inner iejoin [datafusion]

2024-12-23 Thread via GitHub
timsaucer commented on PR #12754: URL: https://github.com/apache/datafusion/pull/12754#issuecomment-2560224119 I'm planning on taking a look at this over the next week or so, but it will take a little time for me to get up to speed on the details of what you're trying to do here. Can you ad

Re: [PR] Add substrait tpch round trip tests from sql query [datafusion]

2024-12-23 Thread via GitHub
Blizzara commented on PR #13888: URL: https://github.com/apache/datafusion/pull/13888#issuecomment-2560223118 Haha, looks like @vbarua commented pretty pretty much the same thing while I was writing my own reply! 😄 -- This is an automated message from the Apache Git Service. To respond t

Re: [PR] Add substrait tpch round trip tests from sql query [datafusion]

2024-12-23 Thread via GitHub
Blizzara commented on PR #13888: URL: https://github.com/apache/datafusion/pull/13888#issuecomment-2560214683 I like the idea, more testing the better! We already have some Substrait TCP testing, but I think that's from "known Substrait" -> DF, so it only tests the consumer, while this woul

Re: [PR] Add substrait tpch round trip tests from sql query [datafusion]

2024-12-23 Thread via GitHub
vbarua commented on code in PR #13888: URL: https://github.com/apache/datafusion/pull/13888#discussion_r1896067593 ## datafusion/substrait/tests/testdata/tpch_queries/query_01.sql: ## @@ -0,0 +1,22 @@ +SELECT +l_returnflag, +l_linestatus, +sum(l_quantity) AS sum_qty,

Re: [PR] Add substrait tpch round trip tests from sql query [datafusion]

2024-12-23 Thread via GitHub
vbarua commented on PR #13888: URL: https://github.com/apache/datafusion/pull/13888#issuecomment-2560209539 There are already some tests for TPCH functionality in https://github.com/apache/datafusion/blob/main/datafusion/substrait/tests/cases/consumer_integration.rs, but IMO those are weake

Re: [PR] Add substrait tpch round trip tests from sql query [datafusion]

2024-12-23 Thread via GitHub
vbarua commented on code in PR #13888: URL: https://github.com/apache/datafusion/pull/13888#discussion_r1896058000 ## datafusion/substrait/tests/testdata/tpch_queries/query_01.sql: ## @@ -0,0 +1,22 @@ +SELECT +l_returnflag, +l_linestatus, +sum(l_quantity) AS sum_qty,

Re: [PR] feat: Move shuffle block decompression and decoding to native code and add LZ4 support [datafusion-comet]

2024-12-23 Thread via GitHub
Dandandan commented on code in PR #1192: URL: https://github.com/apache/datafusion-comet/pull/1192#discussion_r1896016551 ## native/core/benches/shuffle_writer.rs: ## @@ -35,23 +35,42 @@ fn criterion_benchmark(c: &mut Criterion) { group.bench_function("shuffle_writer: encod

Re: [PR] feat: Move shuffle block decompression and decoding to native code and add LZ4 support [datafusion-comet]

2024-12-23 Thread via GitHub
Dandandan commented on code in PR #1192: URL: https://github.com/apache/datafusion-comet/pull/1192#discussion_r1896016551 ## native/core/benches/shuffle_writer.rs: ## @@ -35,23 +35,42 @@ fn criterion_benchmark(c: &mut Criterion) { group.bench_function("shuffle_writer: encod

Re: [PR] Fix MySQL parsing of GRANT, REVOKE, and CREATE VIEW [datafusion-sqlparser-rs]

2024-12-23 Thread via GitHub
mvzink commented on code in PR #1538: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/1538#discussion_r1896016157 ## src/parser/mod.rs: ## @@ -11375,7 +11466,11 @@ impl<'a> Parser<'a> { } else { let object_type = self.parse_one

Re: [PR] feat: Move shuffle block decompression and decoding to native code and add LZ4 support [datafusion-comet]

2024-12-23 Thread via GitHub
andygrove commented on PR #1192: URL: https://github.com/apache/datafusion-comet/pull/1192#issuecomment-2560087256 @Dandandan you may be interested in the benchmark results -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

Re: [PR] feat: Add support for LZ4 compression [datafusion-comet]

2024-12-23 Thread via GitHub
andygrove commented on PR #1181: URL: https://github.com/apache/datafusion-comet/pull/1181#issuecomment-2560034413 LZ4 support is now part of https://github.com/apache/datafusion-comet/pull/1192 -- This is an automated message from the Apache Git Service. To respond to the message, pleas

Re: [PR] feat: Add support for LZ4 compression [datafusion-comet]

2024-12-23 Thread via GitHub
andygrove closed pull request #1181: feat: Add support for LZ4 compression URL: https://github.com/apache/datafusion-comet/pull/1181 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comme

Re: [I] Complete / integrate sqlite sqllogictest test scripts integrattion [datafusion]

2024-12-23 Thread via GitHub
Omega359 commented on issue #13812: URL: https://github.com/apache/datafusion/issues/13812#issuecomment-2559955748 Apparently there isn't a way to fork an empty repository in github, or at least I can't seem to find a way so I'm unable to push a PR there for the slt files. Can someone with

[PR] Add substrait tpch round trip tests from sql query [datafusion]

2024-12-23 Thread via GitHub
robtandy opened a new pull request, #13888: URL: https://github.com/apache/datafusion/pull/13888 ## Which issue does this PR close? I've been investigating and experimenting with federating tpch query plans and sending the federated portion downstream encoded as substrait protos (Side

Re: [I] Incorrect CASE WHEN + ELSE NULL behavior [datafusion]

2024-12-23 Thread via GitHub
Omega359 commented on issue #13885: URL: https://github.com/apache/datafusion/issues/13885#issuecomment-2559893186 possibly related: #13779 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the spe

Re: [PR] Fix `recursive-protection` feature flag [datafusion]

2024-12-23 Thread via GitHub
alamb commented on PR #13887: URL: https://github.com/apache/datafusion/pull/13887#issuecomment-2559860749 Also before I merge this I want to rename the flag to `recursive_protection` (underscore) to be consistent with the rest -- This is an automated message from the Apache Git Service.

Re: [PR] Fix `recursive-protection` feature flag [datafusion]

2024-12-23 Thread via GitHub
alamb commented on code in PR #13887: URL: https://github.com/apache/datafusion/pull/13887#discussion_r1895840813 ## datafusion/core/Cargo.toml: ## @@ -69,6 +69,13 @@ pyarrow = ["datafusion-common/pyarrow", "parquet"] regex_expressions = [ "datafusion-functions/regex_expre

Re: [PR] ci improvements [datafusion]

2024-12-23 Thread via GitHub
Omega359 commented on PR #13876: URL: https://github.com/apache/datafusion/pull/13876#issuecomment-2559855782 Thanks for the update - no rush on my side -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

Re: [PR] Fix `recursive-protection` feature flag [datafusion]

2024-12-23 Thread via GitHub
alamb commented on PR #13887: URL: https://github.com/apache/datafusion/pull/13887#issuecomment-2559851448 Given this experience, we really need a test for the `recursive-protection` feature flag / compiling datafusion with various feature flags. Maybe someone can figure out how to do so, f

Re: [PR] ci improvements [datafusion]

2024-12-23 Thread via GitHub
alamb commented on PR #13876: URL: https://github.com/apache/datafusion/pull/13876#issuecomment-2559850885 I can not wait to test / review this but I think I will run out of time today -- hopefully either later today or tomorrow -- This is an automated message from the Apache Git Service.

[PR] Fix `recursive-protection` feature flag [datafusion]

2024-12-23 Thread via GitHub
alamb opened a new pull request, #13887: URL: https://github.com/apache/datafusion/pull/13887 ## Which issue does this PR close? - Closes https://github.com/apache/datafusion/issues/13766 ## Rationale for this change The recursive-protect flag was added in https://github

Re: [I] Making the `recursive` dependency an optional feature [datafusion]

2024-12-23 Thread via GitHub
alamb commented on issue #13766: URL: https://github.com/apache/datafusion/issues/13766#issuecomment-2559842765 I think this issue is not quite closed by https://github.com/apache/datafusion/pull/13778, reopening See https://github.com/apache/datafusion/issues/13334#issuecomment-25585769

Re: [PR] Improve error messages for incorrect zero argument signatures [datafusion]

2024-12-23 Thread via GitHub
alamb merged PR #13881: URL: https://github.com/apache/datafusion/pull/13881 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusi

Re: [I] Make migration to `Signature::nullary` in 44.0.0 easier / less confusing [datafusion]

2024-12-23 Thread via GitHub
alamb closed issue #13763: Make migration to `Signature::nullary` in 44.0.0 easier / less confusing URL: https://github.com/apache/datafusion/issues/13763 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to g

Re: [PR] Improve error messages for incorrect zero argument signatures [datafusion]

2024-12-23 Thread via GitHub
alamb commented on PR #13881: URL: https://github.com/apache/datafusion/pull/13881#issuecomment-2559813476 Thank you for the review @jayzhan211 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to th

Re: [PR] Feature scalar regexp match benchmark [datafusion]

2024-12-23 Thread via GitHub
zhuliquan commented on PR #13789: URL: https://github.com/apache/datafusion/pull/13789#issuecomment-2559812675 > I ran your benchmark and it looks good. I am unsure though the benefit of this benchmark over the existing 'regx' benchmark which targets the udf versions of the Postgresql symbo

Re: [PR] Improve error messages for incorrect zero argument signatures [datafusion]

2024-12-23 Thread via GitHub
alamb commented on PR #13881: URL: https://github.com/apache/datafusion/pull/13881#issuecomment-2559813030 here is a PR to just update the signatures: - https://github.com/apache/datafusion/pull/13881 -- This is an automated message from the Apache Git Service. To respond to the message

Re: [I] Release DataFusion `44.0.0` [datafusion]

2024-12-23 Thread via GitHub
alamb commented on issue #13334: URL: https://github.com/apache/datafusion/issues/13334#issuecomment-2559811428 > Since @buraksenn did the work to make this optional in [minor] make recursive package dependency optional #13778, perhaps they have some idea? Update I think I know what

Re: [PR] chore: Upgrade to DataFusion 44.0.0 [datafusion-comet]

2024-12-23 Thread via GitHub
alamb commented on PR #1154: URL: https://github.com/apache/datafusion-comet/pull/1154#issuecomment-2559808577 I found the issue: - https://github.com/apache/datafusion/pull/13778#issuecomment-2559808094 Working on a fix -- This is an automated message from the Apache Git Servi

Re: [PR] [minor] make recursive package dependency optional [datafusion]

2024-12-23 Thread via GitHub
alamb commented on PR #13778: URL: https://github.com/apache/datafusion/pull/13778#issuecomment-2559808094 While testing this in comet, I am pretty sure this PR didn't quite fix the problem - https://github.com/apache/datafusion-comet/pull/1154#issuecomment-2559777474 When someone

Re: [PR] chore: Upgrade to DataFusion 44.0.0 [datafusion-comet]

2024-12-23 Thread via GitHub
alamb commented on PR #1154: URL: https://github.com/apache/datafusion-comet/pull/1154#issuecomment-2559777474 > I have upgraded to latest DF and reinstated the miri checks. I'll keep this in draft until the RC is available later this week. I see the MIRI CI check fails: http

Re: [PR] Implement predicate pruning for `like` expressions (prefix matching) [datafusion]

2024-12-23 Thread via GitHub
alamb commented on PR #12978: URL: https://github.com/apache/datafusion/pull/12978#issuecomment-2559758701 This is my top priority after DF 44 is released: - #13334 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use t

Re: [PR] Implement predicate pruning for `like` expressions (prefix matching) [datafusion]

2024-12-23 Thread via GitHub
alamb commented on PR #12978: URL: https://github.com/apache/datafusion/pull/12978#issuecomment-2559758189 This is still on my list, hopefully other people can check it out too -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub a

Re: [PR] Add stacker and recursive [datafusion]

2024-12-23 Thread via GitHub
alamb commented on PR #13310: URL: https://github.com/apache/datafusion/pull/13310#issuecomment-2559754026 BTW we have been having problems downstream wiht the new recursive / stacker dependency. See for example: - https://github.com/apache/datafusion/issues/13513 - https://github.com/

Re: [PR] Introduce `UserDefinedLogicalNodeUnparser` for User-defined Logical Plan unparsing [datafusion]

2024-12-23 Thread via GitHub
phillipleblanc commented on code in PR #13880: URL: https://github.com/apache/datafusion/pull/13880#discussion_r1895780031 ## datafusion/sql/src/unparser/extension_unparser.rs: ## @@ -0,0 +1,66 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contrib

Re: [PR] Parse Postgres's LOCK TABLE statement [datafusion-sqlparser-rs]

2024-12-23 Thread via GitHub
freshtonic commented on code in PR #1614: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/1614#discussion_r1895681055 ## src/parser/mod.rs: ## @@ -9604,7 +9604,13 @@ impl<'a> Parser<'a> { top = Some(self.parse_top()?); } -let projecti

Re: [PR] Parse Postgres's LOCK TABLE statement [datafusion-sqlparser-rs]

2024-12-23 Thread via GitHub
freshtonic commented on code in PR #1614: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/1614#discussion_r1895681055 ## src/parser/mod.rs: ## @@ -9604,7 +9604,13 @@ impl<'a> Parser<'a> { top = Some(self.parse_top()?); } -let projecti

[PR] fix case_column_or_null with nullable when conditions [datafusion]

2024-12-23 Thread via GitHub
richox opened a new pull request, #13886: URL: https://github.com/apache/datafusion/pull/13886 ## Which issue does this PR close? Closes #13885 . ## Rationale for this change ## What changes are included in this PR? ## Are these changes test

Re: [PR] Fix MySQL parsing of GRANT, REVOKE, and CREATE VIEW [datafusion-sqlparser-rs]

2024-12-23 Thread via GitHub
iffyio commented on code in PR #1538: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/1538#discussion_r1895651779 ## src/parser/mod.rs: ## @@ -3869,11 +3873,16 @@ impl<'a> Parser<'a> { .is_some(); let persistent = dialect_of!(self is DuckDbDial

Re: [PR] Support Snowflake Update-From-Select [datafusion-sqlparser-rs]

2024-12-23 Thread via GitHub
yuval-illumex commented on code in PR #1604: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/1604#discussion_r1895655809 ## src/keywords.rs: ## @@ -941,6 +941,7 @@ pub const RESERVED_FOR_TABLE_ALIAS: &[Keyword] = &[ // Reserved for Snowflake table sample K

Re: [PR] Support Snowflake Update-From-Select [datafusion-sqlparser-rs]

2024-12-23 Thread via GitHub
iffyio commented on code in PR #1604: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/1604#discussion_r1895636641 ## src/ast/query.rs: ## @@ -2790,3 +2790,14 @@ impl fmt::Display for ValueTableMode { } } } + +/// The update table from options Review

[I] Incorrect CASE WHEN + ELSE NULL behavior [datafusion]

2024-12-23 Thread via GitHub
richox opened a new issue, #13885: URL: https://github.com/apache/datafusion/issues/13885 ### Describe the bug some queries containing `case when .. ELSE NULL end` produces incorrect results. ### To Reproduce preparing test table: ```sql $ datafusion-cli DataFus

Re: [PR] minor: fix typos in comments / structure names [datafusion]

2024-12-23 Thread via GitHub
berkaysynnada commented on code in PR #13879: URL: https://github.com/apache/datafusion/pull/13879#discussion_r1895488724 ## datafusion/physical-plan/src/repartition/mod.rs: ## @@ -170,7 +170,7 @@ impl RepartitionExecState { /// which is commonly set to the number of CPU cores

[PR] chore(deps): update sqllogictest requirement from 0.23.0 to 0.24.0 [datafusion]

2024-12-23 Thread via GitHub
dependabot[bot] opened a new pull request, #13884: URL: https://github.com/apache/datafusion/pull/13884 Updates the requirements on [sqllogictest](https://github.com/risinglightdb/sqllogictest-rs) to permit the latest version. Release notes Sourced from https://github.com/risingli

Re: [PR] chore: Consolidate Example: simplify_udwf_expression.rs into advanced_udwf.rs [datafusion]

2024-12-23 Thread via GitHub
xarus01 commented on PR #13883: URL: https://github.com/apache/datafusion/pull/13883#issuecomment-2559110139 @alamb - mentioning per [contribution guide](https://datafusion.apache.org/contributor-guide/index.html#pull-request-overview). Please trigger the CI tasks. -- This is an automat