Re: [I] sql result discrepency with sqlite, postgres and duckdb bug #2 [datafusion]

2025-01-03 Thread via GitHub
Omega359 commented on issue #13782: URL: https://github.com/apache/datafusion/issues/13782#issuecomment-2569330925 It's not the nullif, it's the real datatype: ```sql > SELECT 1 - 213220800 * 1::REAL; ++ | Int64(1) - Int64(213220800) * Int64

Re: [PR] Update substrait requirement from 0.50 to 0.51 [datafusion]

2025-01-03 Thread via GitHub
crepererum merged PR #13978: URL: https://github.com/apache/datafusion/pull/13978 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dat

Re: [PR] doc-gen: migrate scalar functions (string) documentation 2/4 [datafusion]

2025-01-03 Thread via GitHub
crepererum merged PR #13925: URL: https://github.com/apache/datafusion/pull/13925 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dat

Re: [PR] Consolidate csv_opener.rs and json_opener.rs into a single example (#โ€ฆ [datafusion]

2025-01-03 Thread via GitHub
cj-zhukov commented on PR #13981: URL: https://github.com/apache/datafusion/pull/13981#issuecomment-2569196286 @alamb Andrew, I've noticed some checks have failed. Could you help me understand if this is related to my changes or something else? I'd be happy to address the issue. -- This

Re: [PR] feat: add `AsyncCatalogProvider` helpers for asynchronous catalogs [datafusion]

2025-01-03 Thread via GitHub
westonpace commented on PR #13800: URL: https://github.com/apache/datafusion/pull/13800#issuecomment-2569258311 Ok, I just needed a good night's sleep. I converted the `schema_name` / `catalog_name` methods into parameters on the `resolve` function. Now each of the new traits has only a s

Re: [PR] doc-gen: migrate scalar functions (encoding & regex) documentation [datafusion]

2025-01-03 Thread via GitHub
comphead commented on code in PR #13919: URL: https://github.com/apache/datafusion/pull/13919#discussion_r1901961326 ## docs/source/user-guide/sql/scalar_functions.md: ## @@ -1758,167 +1758,6 @@ encode(expression, format) - [decode](#decode) -## Regular Expression Functions

[PR] chore: Remove unreachable code [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove opened a new pull request, #1213: URL: https://github.com/apache/datafusion-comet/pull/1213 ## Which issue does this PR close? N/A ## Rationale for this change Fix this warning: ``` [INFO] compiling 6 Scala sources and 12 Java sources t

Re: [PR] chore: Follow-on PR to fully enable onheap memory usage [datafusion-comet]

2025-01-03 Thread via GitHub
kazuyukitanimura commented on code in PR #1210: URL: https://github.com/apache/datafusion-comet/pull/1210#discussion_r1902182027 ## spark/src/main/scala/org/apache/comet/CometSparkSessionExtensions.scala: ## @@ -919,14 +919,6 @@ class CometSparkSessionExtensions } ov

Re: [PR] feat: Move shuffle block decompression and decoding to native code and add LZ4 & Snappy support [datafusion-comet]

2025-01-03 Thread via GitHub
kazuyukitanimura commented on code in PR #1192: URL: https://github.com/apache/datafusion-comet/pull/1192#discussion_r1902145561 ## common/src/main/scala/org/apache/comet/CometConf.scala: ## @@ -272,18 +272,19 @@ object CometConf extends ShimCometConf { .booleanConf

Re: [PR] feat: Move shuffle block decompression and decoding to native code and add LZ4 & Snappy support [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove commented on code in PR #1192: URL: https://github.com/apache/datafusion-comet/pull/1192#discussion_r1902182688 ## native/core/src/execution/shuffle/shuffle_writer.rs: ## @@ -1567,17 +1585,41 @@ pub fn write_ipc_compressed( let mut timer = ipc_time.timer(); l

Re: [PR] feat: Move shuffle block decompression and decoding to native code and add LZ4 & Snappy support [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove commented on code in PR #1192: URL: https://github.com/apache/datafusion-comet/pull/1192#discussion_r1902182125 ## native/core/src/execution/shuffle/shuffle_writer.rs: ## @@ -1567,17 +1585,41 @@ pub fn write_ipc_compressed( let mut timer = ipc_time.timer(); l

Re: [I] `CometBuffer` can potentially lead to concurrent modification of a held buffer (aka is "Unsound" in Rust terms) [datafusion-comet]

2025-01-03 Thread via GitHub
viirya closed issue #1035: `CometBuffer` can potentially lead to concurrent modification of a held buffer (aka is "Unsound" in Rust terms) URL: https://github.com/apache/datafusion-comet/issues/1035 -- This is an automated message from the Apache Git Service. To respond to the message, please

Re: [PR] chore: Add safety check to CometBuffer [datafusion-comet]

2025-01-03 Thread via GitHub
viirya merged PR #1050: URL: https://github.com/apache/datafusion-comet/pull/1050 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dat

[PR] feat: add test to check for `ctx.enable_url_table()` [datafusion-ballista]

2025-01-03 Thread via GitHub
milenkovicm opened a new pull request, #1155: URL: https://github.com/apache/datafusion-ballista/pull/1155 # Which issue does this PR close? Closes None. # Rationale for this change Test which covers (unsupported) `SessionContext` option `enable_url_table()` so we can a

Re: [PR] verify TPC-DS results [datafusion]

2025-01-03 Thread via GitHub
Lordworms commented on code in PR #13791: URL: https://github.com/apache/datafusion/pull/13791#discussion_r1902127243 ## benchmarks/transfer_dat_parquet.py: ## Review Comment: I'll refactor that, thanks a lot -- This is an automated message from the Apache Git Service.

Re: [PR] feat: Move shuffle block decompression and decoding to native code and add LZ4 & Snappy support [datafusion-comet]

2025-01-03 Thread via GitHub
comphead commented on code in PR #1192: URL: https://github.com/apache/datafusion-comet/pull/1192#discussion_r1902232963 ## native/core/src/execution/shuffle/shuffle_writer.rs: ## @@ -1567,17 +1585,41 @@ pub fn write_ipc_compressed( let mut timer = ipc_time.timer(); le

Re: [PR] feat: add support for array_remove expression [datafusion-comet]

2025-01-03 Thread via GitHub
parthchandra commented on code in PR #1179: URL: https://github.com/apache/datafusion-comet/pull/1179#discussion_r1902256464 ## spark/src/test/scala/org/apache/comet/CometExpressionSuite.scala: ## @@ -2517,4 +2517,16 @@ class CometExpressionSuite extends CometTestBase with Adap

Re: [PR] feat: Move shuffle block decompression and decoding to native code and add LZ4 & Snappy support [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove commented on code in PR #1192: URL: https://github.com/apache/datafusion-comet/pull/1192#discussion_r1902235813 ## native/core/src/execution/shuffle/shuffle_writer.rs: ## @@ -1567,17 +1585,41 @@ pub fn write_ipc_compressed( let mut timer = ipc_time.timer(); l

Re: [PR] feat: Move shuffle block decompression and decoding to native code and add LZ4 & Snappy support [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove commented on code in PR #1192: URL: https://github.com/apache/datafusion-comet/pull/1192#discussion_r1902239523 ## spark/src/main/scala/org/apache/spark/sql/comet/execution/shuffle/NativeBatchDecoderIterator.scala: ## @@ -0,0 +1,182 @@ +/* + * Licensed to the Apache So

Re: [PR] chore: Remove unreachable code [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove merged PR #1213: URL: https://github.com/apache/datafusion-comet/pull/1213 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@

Re: [PR] feat: Move shuffle block decompression and decoding to native code and add LZ4 & Snappy support [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove commented on code in PR #1192: URL: https://github.com/apache/datafusion-comet/pull/1192#discussion_r1902236276 ## common/src/main/scala/org/apache/comet/CometConf.scala: ## @@ -272,18 +272,19 @@ object CometConf extends ShimCometConf { .booleanConf .crea

Re: [PR] chore: Follow-on PR to fully enable onheap memory usage [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove commented on code in PR #1210: URL: https://github.com/apache/datafusion-comet/pull/1210#discussion_r1902234781 ## spark/src/main/scala/org/apache/comet/CometSparkSessionExtensions.scala: ## @@ -919,14 +919,6 @@ class CometSparkSessionExtensions } override

Re: [PR] test: Enable Comet by default except some tests in SparkSessionExtensionSuite [datafusion-comet]

2025-01-03 Thread via GitHub
kazuyukitanimura commented on PR #1201: URL: https://github.com/apache/datafusion-comet/pull/1201#issuecomment-2569950958 Thanks @parthchandra @andygrove @viirya can I get a committer approval? -- This is an automated message from the Apache Git Service. To respond to the message, ple

Re: [PR] test: Enable Comet by default except some tests in SparkSessionExtensionSuite [datafusion-comet]

2025-01-03 Thread via GitHub
kazuyukitanimura merged PR #1201: URL: https://github.com/apache/datafusion-comet/pull/1201 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsub

Re: [PR] chore: Follow-on PR to fully enable onheap memory usage [datafusion-comet]

2025-01-03 Thread via GitHub
codecov-commenter commented on PR #1210: URL: https://github.com/apache/datafusion-comet/pull/1210#issuecomment-2569962633 ## [Codecov](https://app.codecov.io/gh/apache/datafusion-comet/pull/1210?dropdown=coverage&src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_ca

Re: [PR] test: Enable Comet by default except some tests in SparkSessionExtensionSuite [datafusion-comet]

2025-01-03 Thread via GitHub
kazuyukitanimura commented on PR #1201: URL: https://github.com/apache/datafusion-comet/pull/1201#issuecomment-2569962967 Thanks, merged @viirya @parthchandra -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL a

[PR] fix: Preserve session id when using `ctx.enable_url_table()` [datafusion]

2025-01-03 Thread via GitHub
milenkovicm opened a new pull request, #14004: URL: https://github.com/apache/datafusion/pull/14004 ## Which issue does this PR close? Closes #14003. ## Rationale for this change ## What changes are included in this PR? ## Are these changes tested? Yes

[I] Preserve session context id when `ctx.enable_url_table()` [datafusion]

2025-01-03 Thread via GitHub
milenkovicm opened a new issue, #14003: URL: https://github.com/apache/datafusion/issues/14003 ### Is your feature request related to a problem or challenge? `SessionContext.enable_url_table()` will create new context with different session context id. In most cases this does not make

[PR] Update docs [datafusion]

2025-01-03 Thread via GitHub
matthewmturner opened a new pull request, #14002: URL: https://github.com/apache/datafusion/pull/14002 ## Which issue does this PR close? Closes #. ## Rationale for this change Fix outdated doc example ## What changes are included in this PR? Update

Re: [PR] Update release README for datafusion-cli publishing [datafusion]

2025-01-03 Thread via GitHub
comphead merged PR #13982: URL: https://github.com/apache/datafusion/pull/13982 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dataf

[PR] chore: no need to run python test in rust [datafusion-ballista]

2025-01-03 Thread via GitHub
milenkovicm opened a new pull request, #1154: URL: https://github.com/apache/datafusion-ballista/pull/1154 as there is separate set of tests testing python functionality. # Which issue does this PR close? Closes none. # Rationale for this change we test python mod

Re: [PR] Minor: sort requirement check for `Last` function's `merge_batch` [datafusion]

2025-01-03 Thread via GitHub
jayzhan211 commented on PR #13980: URL: https://github.com/apache/datafusion/pull/13980#issuecomment-2569905955 Not pretty sure is the logic applied to `first` too. I will work on it later on -- This is an automated message from the Apache Git Service. To respond to the message, please lo

Re: [PR] Minor: sort requirement check for `Last` function's `merge_batch` [datafusion]

2025-01-03 Thread via GitHub
jayzhan211 merged PR #13980: URL: https://github.com/apache/datafusion/pull/13980 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dat

Re: [I] Functionality of `array_repeat` udf [datafusion]

2025-01-03 Thread via GitHub
jayzhan211 commented on issue #13872: URL: https://github.com/apache/datafusion/issues/13872#issuecomment-2569907599 Which one do you think should return null? > SELECT cardinality(array_remove([1], 1)); I think we should return 0 for this case ๐Ÿค” ? -- This is an automated

[PR] Add H2O.ai Database-like Ops benchmark to dfbench [datafusion]

2025-01-03 Thread via GitHub
zhuqi-lucas opened a new pull request, #13996: URL: https://github.com/apache/datafusion/pull/13996 ## Which issue does this PR close? Closes [#7209](https://github.com/apache/datafusion/issues/7209) ## Rationale for this change ## What changes are include

Re: [I] Implement SQLancer (a end-to-end SQL fuzz testing library) [datafusion]

2025-01-03 Thread via GitHub
rluvaton commented on issue #11030: URL: https://github.com/apache/datafusion/issues/11030#issuecomment-2568920755 I would like to help with this, I'm big into tests -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

[PR] Add support for Snowflake LIST and REMOVE [datafusion-sqlparser-rs]

2025-01-03 Thread via GitHub
yoavcloud opened a new pull request, #1639: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/1639 (no comment) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[PR] Add support for the SQL OVERLAPS predicate [datafusion-sqlparser-rs]

2025-01-03 Thread via GitHub
yoavcloud opened a new pull request, #1638: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/1638 https://jakewheat.github.io/sql-overview/sql-2016-foundation-grammar.html#overlaps-predicate I did not gate this one by dialect, as it doesn't seem to conflict with anything a

Re: [PR] feat(substrait): introduce consume_rel and consume_expression [datafusion]

2025-01-03 Thread via GitHub
Blizzara commented on code in PR #13963: URL: https://github.com/apache/datafusion/pull/13963#discussion_r1901637948 ## datafusion/substrait/src/logical_plan/consumer.rs: ## @@ -744,10 +752,10 @@ pub async fn from_substrait_plan_with_consumer( match plan.relations[0

Re: [PR] feat: Reenable tests for filtered SMJ anti join [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove commented on PR #1211: URL: https://github.com/apache/datafusion-comet/pull/1211#issuecomment-2569622997 > Thanks, so tests are passed, should we merge this PR? If we merge this PR we will probably no longer be able to run the TPC-DS benchmark. I will try running it now.

Re: [PR] feat: Reenable tests for filtered SMJ anti join [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove commented on PR #1211: URL: https://github.com/apache/datafusion-comet/pull/1211#issuecomment-2569624567 > f we merge this PR we will probably no longer be able to run the TPC-DS benchmark. I will try running it now. Well, maybe if we don't enable SMJ with filter it is ok.

Re: [PR] feat: Reenable tests for filtered SMJ anti join [datafusion-comet]

2025-01-03 Thread via GitHub
viirya commented on PR #1211: URL: https://github.com/apache/datafusion-comet/pull/1211#issuecomment-2569624977 The two configs are still disabled by default now, isn't? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

Re: [PR] feat: Reenable tests for filtered SMJ anti join [datafusion-comet]

2025-01-03 Thread via GitHub
comphead commented on PR #1211: URL: https://github.com/apache/datafusion-comet/pull/1211#issuecomment-2569564363 > > Thanks @andygrove for running Q21, is it right the result row count is the same? > > Yes, the output from the final two SMJs is correct, as is the final output.

Re: [I] Functionality of `array_repeat` udf [datafusion]

2025-01-03 Thread via GitHub
jatin510 commented on issue #13872: URL: https://github.com/apache/datafusion/issues/13872#issuecomment-2569547645 ``` > SELECT array_remove([1,null], 1); +--+ | array_remove(make_array(Int64(1),NULL),Int64(1)) | +---

[PR] fix: unwrapping Err(DataFusionError::Plan) for use in plan_datafusion_err [datafusion]

2025-01-03 Thread via GitHub
avkirilishin opened a new pull request, #14000: URL: https://github.com/apache/datafusion/pull/14000 ## Which issue does this PR close? Closes #13979. ## Are these changes tested? No new tests were added. It was manually tes

Re: [PR] doc-gen: migrate scalar functions (encoding & regex) documentation [datafusion]

2025-01-03 Thread via GitHub
Chen-Yuan-Lai commented on code in PR #13919: URL: https://github.com/apache/datafusion/pull/13919#discussion_r1902001514 ## docs/source/user-guide/sql/scalar_functions.md: ## @@ -1758,167 +1758,6 @@ encode(expression, format) - [decode](#decode) -## Regular Expression Func

Re: [PR] doc-gen: migrate scalar functions (encoding & regex) documentation [datafusion]

2025-01-03 Thread via GitHub
Chen-Yuan-Lai commented on code in PR #13919: URL: https://github.com/apache/datafusion/pull/13919#discussion_r1902001514 ## docs/source/user-guide/sql/scalar_functions.md: ## @@ -1758,167 +1758,6 @@ encode(expression, format) - [decode](#decode) -## Regular Expression Func

Re: [PR] doc-gen: migrate scalar functions (encoding & regex) documentation [datafusion]

2025-01-03 Thread via GitHub
Chen-Yuan-Lai commented on code in PR #13919: URL: https://github.com/apache/datafusion/pull/13919#discussion_r1902001514 ## docs/source/user-guide/sql/scalar_functions.md: ## @@ -1758,167 +1758,6 @@ encode(expression, format) - [decode](#decode) -## Regular Expression Func

Re: [PR] doc-gen: migrate scalar functions (encoding & regex) documentation [datafusion]

2025-01-03 Thread via GitHub
Chen-Yuan-Lai commented on code in PR #13919: URL: https://github.com/apache/datafusion/pull/13919#discussion_r1902005630 ## datafusion/functions/src/encoding/inner.rs: ## @@ -126,10 +124,21 @@ impl ScalarUDFImpl for EncodeFunc { } fn documentation(&self) -> Option<&

Re: [PR] doc-gen: migrate scalar functions (encoding & regex) documentation [datafusion]

2025-01-03 Thread via GitHub
comphead commented on code in PR #13919: URL: https://github.com/apache/datafusion/pull/13919#discussion_r1902025654 ## docs/source/user-guide/sql/scalar_functions.md: ## @@ -1758,167 +1758,6 @@ encode(expression, format) - [decode](#decode) -## Regular Expression Functions

Re: [PR] doc-gen: migrate scalar functions (encoding & regex) documentation [datafusion]

2025-01-03 Thread via GitHub
comphead commented on code in PR #13919: URL: https://github.com/apache/datafusion/pull/13919#discussion_r1902030922 ## docs/source/user-guide/sql/scalar_functions.md: ## @@ -1758,167 +1758,6 @@ encode(expression, format) - [decode](#decode) -## Regular Expression Functions

[I] Doc attribution: make `user_doc` to work with predefined consts. [datafusion]

2025-01-03 Thread via GitHub
comphead opened a new issue, #14001: URL: https://github.com/apache/datafusion/issues/14001 I'll file a ticket to make `user_doc` to work with predefined consts. Currently the `doc_section` attribute must match fully the predefined DocSection consts, for example ```

Re: [I] Implement SQLancer (a end-to-end SQL fuzz testing library) [datafusion]

2025-01-03 Thread via GitHub
2010YOUY01 commented on issue #11030: URL: https://github.com/apache/datafusion/issues/11030#issuecomment-2569092518 @rluvaton Thank you! I'm still interested in this project (though I havenโ€™t been working on it for a few months ๐Ÿ˜…) and I'm happy to help with any contributions. I thin

Re: [PR] Include license and notice files in more crates [datafusion]

2025-01-03 Thread via GitHub
ankane commented on PR #13985: URL: https://github.com/apache/datafusion/pull/13985#issuecomment-2569369567 Thanks @alamb! Will defer to you on a CI check (but isn't a big deal to manually add them). -- This is an automated message from the Apache Git Service. To respond to the message, p

Re: [PR] feat: Reenable tests for filtered SMJ anti join [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove commented on PR #1211: URL: https://github.com/apache/datafusion-comet/pull/1211#issuecomment-2569362549 > Thanks @andygrove for running Q21, is it right the result row count is the same? Yes, the output from the final two SMJs is correct, as is the final output. -- This

Re: [PR] extract expressions to folders based on spark grouping [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove commented on PR #1206: URL: https://github.com/apache/datafusion-comet/pull/1206#issuecomment-2569642133 @rluvaton I think that this looks great but it is challenging to really review the PR with so many changes. ![2025-01-03_11-20](https://github.com/user-attachments/asse

Re: [I] java.lang.ClassNotFoundException: org.apache.spark.sql.comet.execution.shuffle.CometShuffleManager [datafusion-comet]

2025-01-03 Thread via GitHub
ramyadass commented on issue #864: URL: https://github.com/apache/datafusion-comet/issues/864#issuecomment-2569648366 Any thoughts on above failure ? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

Re: [PR] chore: Add safety check to CometBuffer [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove commented on PR #1050: URL: https://github.com/apache/datafusion-comet/pull/1050#issuecomment-2569652201 LGTM. I am going to run benchmarks before approving (I do not expect to see any performance difference). -- This is an automated message from the Apache Git Service. To resp

[PR] Add support for MySQL's INSERT INTO ... SET syntax [datafusion-sqlparser-rs]

2025-01-03 Thread via GitHub
yoavcloud opened a new pull request, #1641: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/1641 https://dev.mysql.com/doc/refman/8.4/en/insert.html -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL a

Re: [PR] chore: Follow-on PR to fully enable onheap memory usage [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove commented on PR #1210: URL: https://github.com/apache/datafusion-comet/pull/1210#issuecomment-2569445901 @Kontinuation, could you review this, particularly the updated Tuning Guide, to make sure I understand it correctly? -- This is an automated message from the Apache Git Serv

Re: [PR] feat(substrait): introduce consume_rel and consume_expression [datafusion]

2025-01-03 Thread via GitHub
vbarua commented on code in PR #13963: URL: https://github.com/apache/datafusion/pull/13963#discussion_r1901924005 ## datafusion/substrait/src/logical_plan/consumer.rs: ## @@ -233,6 +233,10 @@ pub trait SubstraitConsumer: Send + Sync + Sized { // These methods have default

Re: [PR] chore: Follow-on PR to fully enable onheap memory usage [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove commented on code in PR #1210: URL: https://github.com/apache/datafusion-comet/pull/1210#discussion_r1901922136 ## native/core/src/execution/jni_api.rs: ## @@ -281,14 +279,13 @@ fn parse_memory_pool_config( memory_pool_type: String, memory_limit: i64, me

Re: [PR] feat(substrait): introduce consume_rel and consume_expression [datafusion]

2025-01-03 Thread via GitHub
vbarua commented on code in PR #13963: URL: https://github.com/apache/datafusion/pull/13963#discussion_r1901926751 ## datafusion/substrait/src/logical_plan/consumer.rs: ## @@ -744,10 +752,10 @@ pub async fn from_substrait_plan_with_consumer( match plan.relations[0].

[I] Support fast group accumulator for `first` and `last` [datafusion]

2025-01-03 Thread via GitHub
jayzhan211 opened a new issue, #13998: URL: https://github.com/apache/datafusion/issues/13998 ### Is your feature request related to a problem or challenge? ```rust pub(crate) fn create_group_accumulator( agg_expr: &Arc, ) -> Result> { if agg_expr.groups_accumulator

[PR] Add support for various Snowflake grantees [datafusion-sqlparser-rs]

2025-01-03 Thread via GitHub
yoavcloud opened a new pull request, #1640: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/1640 (no comment) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

Re: [I] Introduce ProjectionMask To Allow Nested Projection Pushdown [datafusion]

2025-01-03 Thread via GitHub
gatesn commented on issue #2581: URL: https://github.com/apache/datafusion/issues/2581#issuecomment-2569432044 I'm not sure I agree that these are two separate ideas, rather, a generalization of the existing notion of projection. Projection today is all about selecting some subset of

Re: [PR] Change trigger, rename `hash_collision.yml` to `extended.yml` and add comments [datafusion]

2025-01-03 Thread via GitHub
alamb commented on PR #13988: URL: https://github.com/apache/datafusion/pull/13988#issuecomment-2569052790 Thank you for the review @ozankabak ๐Ÿ™ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to t

Re: [PR] Improve deserialize_to_struct example [datafusion]

2025-01-03 Thread via GitHub
alamb commented on PR #13958: URL: https://github.com/apache/datafusion/pull/13958#issuecomment-2569054376 Thank you @jonahgao -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific commen

Re: [PR] Change trigger, rename `hash_collision.yml` to `extended.yml` and add comments [datafusion]

2025-01-03 Thread via GitHub
alamb merged PR #13988: URL: https://github.com/apache/datafusion/pull/13988 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusi

Re: [PR] Deprecate ScalarUDFImpl::return_type [datafusion]

2025-01-03 Thread via GitHub
findepi commented on code in PR #13717: URL: https://github.com/apache/datafusion/pull/13717#discussion_r1901954669 ## datafusion/core/src/catalog_common/information_schema.rs: ## @@ -406,6 +406,7 @@ fn get_udf_args_and_return_types( .into_iter() .map(|

Re: [PR] chore: Add safety check to CometBuffer [datafusion-comet]

2025-01-03 Thread via GitHub
viirya commented on PR #1050: URL: https://github.com/apache/datafusion-comet/pull/1050#issuecomment-2569654791 Thank you @andygrove -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

Re: [PR] feat: Reenable tests for filtered SMJ anti join [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove merged PR #1211: URL: https://github.com/apache/datafusion-comet/pull/1211 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@

Re: [I] Support sort merge join with a join condition [datafusion-comet]

2025-01-03 Thread via GitHub
andygrove closed issue #398: Support sort merge join with a join condition URL: https://github.com/apache/datafusion-comet/issues/398 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comm

Re: [PR] chore: Follow-on PR to fully enable onheap memory usage [datafusion-comet]

2025-01-03 Thread via GitHub
Kontinuation commented on code in PR #1210: URL: https://github.com/apache/datafusion-comet/pull/1210#discussion_r1902289845 ## native/core/src/execution/jni_api.rs: ## @@ -281,14 +279,13 @@ fn parse_memory_pool_config( memory_pool_type: String, memory_limit: i64,

Re: [PR] chore: Follow-on PR to fully enable onheap memory usage [datafusion-comet]

2025-01-03 Thread via GitHub
Kontinuation commented on code in PR #1210: URL: https://github.com/apache/datafusion-comet/pull/1210#discussion_r1902290275 ## docs/source/user-guide/tuning.md: ## @@ -23,11 +23,52 @@ Comet provides some tuning options to help you get the best performance from you ## Memory

Re: [PR] chore: Follow-on PR to fully enable onheap memory usage [datafusion-comet]

2025-01-03 Thread via GitHub
viirya commented on code in PR #1210: URL: https://github.com/apache/datafusion-comet/pull/1210#discussion_r1902629149 ## docs/source/user-guide/tuning.md: ## @@ -23,11 +23,52 @@ Comet provides some tuning options to help you get the best performance from you ## Memory Tunin

Re: [PR] chore: Follow-on PR to fully enable onheap memory usage [datafusion-comet]

2025-01-03 Thread via GitHub
viirya commented on code in PR #1210: URL: https://github.com/apache/datafusion-comet/pull/1210#discussion_r1902636032 ## docs/source/user-guide/tuning.md: ## @@ -23,11 +23,52 @@ Comet provides some tuning options to help you get the best performance from you ## Memory Tunin

Re: [PR] chore: Follow-on PR to fully enable onheap memory usage [datafusion-comet]

2025-01-03 Thread via GitHub
viirya commented on code in PR #1210: URL: https://github.com/apache/datafusion-comet/pull/1210#discussion_r1902634368 ## docs/source/user-guide/tuning.md: ## @@ -23,11 +23,52 @@ Comet provides some tuning options to help you get the best performance from you ## Memory Tunin

Re: [I] Add H2O.ai Database-like Ops benchmark to `dfbench` [datafusion]

2025-01-03 Thread via GitHub
zhuqi-lucas commented on issue #7209: URL: https://github.com/apache/datafusion/issues/7209#issuecomment-2570474523 Hi @alamb , the draft PR works well and tested group by h2o benchmark, but the join seems have some problems, i also submitted the question: https://github.com/MrPowers/

[PR] Chore: update wasm-supported crates [datafusion]

2025-01-03 Thread via GitHub
Lordworms opened a new pull request, #14005: URL: https://github.com/apache/datafusion/pull/14005 ## Which issue does this PR close? part of #7652 Closes #. ## Rationale for this change ## What changes are included in this PR? ## Are these c