Re: [PR] Signature::Coercible with user defined implicit casting [datafusion]

2025-02-18 Thread via GitHub
shehabgamin commented on code in PR #14440: URL: https://github.com/apache/datafusion/pull/14440#discussion_r1960715986 ## datafusion/functions/src/datetime/date_part.rs: ## @@ -95,24 +96,29 @@ impl DatePartFunc { signature: Signature::one_of( vec![

Re: [PR] Signature::Coercible with user defined implicit casting [datafusion]

2025-02-18 Thread via GitHub
shehabgamin commented on code in PR #14440: URL: https://github.com/apache/datafusion/pull/14440#discussion_r1960718930 ## datafusion/functions/src/datetime/date_part.rs: ## @@ -95,24 +96,29 @@ impl DatePartFunc { signature: Signature::one_of( vec![

Re: [PR] Signature::Coercible with user defined implicit casting [datafusion]

2025-02-18 Thread via GitHub
shehabgamin commented on code in PR #14440: URL: https://github.com/apache/datafusion/pull/14440#discussion_r1960718930 ## datafusion/functions/src/datetime/date_part.rs: ## @@ -95,24 +96,29 @@ impl DatePartFunc { signature: Signature::one_of( vec![

Re: [PR] chore(deps): bump blake3 from 1.5.5 to 1.6.0 [datafusion]

2025-02-18 Thread via GitHub
comphead merged PR #14741: URL: https://github.com/apache/datafusion/pull/14741 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dataf

Re: [PR] Speed up `chr` UDF (~4x faster) [datafusion]

2025-02-18 Thread via GitHub
comphead merged PR #14700: URL: https://github.com/apache/datafusion/pull/14700 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dataf

Re: [PR] chore(deps): bump parquet from 54.1.0 to 54.2.0 [datafusion]

2025-02-18 Thread via GitHub
comphead merged PR #14744: URL: https://github.com/apache/datafusion/pull/14744 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dataf

Re: [I] Implement nested join optimization [datafusion]

2025-02-18 Thread via GitHub
clflushopt commented on issue #3843: URL: https://github.com/apache/datafusion/issues/3843#issuecomment-2667413769 Hey @alamb following up on #14688 I made a new pull request in #14735 to add an example that demonstrates how analysis works for `AND` conjunctions and a placeholder for `OR` c

Re: [I] Replace `TypeSignature::String` with `TypeSignature::Coercible` [datafusion]

2025-02-18 Thread via GitHub
zjregee commented on issue #14759: URL: https://github.com/apache/datafusion/issues/14759#issuecomment-2667425857 take -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To un

Re: [I] [EPIC] Improved support for nested / structured types (`Struct` , `List`, `ListArray`, and other Composite types) [datafusion]

2025-02-18 Thread via GitHub
TheBuilderJR commented on issue #2326: URL: https://github.com/apache/datafusion/issues/2326#issuecomment-2667439161 Filed an issue here: https://github.com/apache/datafusion/issues/14757 -- This is an automated message from the Apache Git Service. To respond to the message, please log on

Re: [PR] Signature::Coercible with user defined implicit casting [datafusion]

2025-02-18 Thread via GitHub
shehabgamin commented on code in PR #14440: URL: https://github.com/apache/datafusion/pull/14440#discussion_r1960715986 ## datafusion/functions/src/datetime/date_part.rs: ## @@ -95,24 +96,29 @@ impl DatePartFunc { signature: Signature::one_of( vec![

Re: [I] Overflow happened on: -2147483648 % -1 [datafusion-comet]

2025-02-18 Thread via GitHub
kazuyukitanimura commented on issue #1412: URL: https://github.com/apache/datafusion-comet/issues/1412#issuecomment-2667077131 It looks addition is working, I wonder what would be the difference ``` checkSparkAnswerAndOperator("select c1 + c2 from t1") ``` -- This is an automated

[PR] Allow setting the recursion limit for sql parsing [datafusion]

2025-02-18 Thread via GitHub
cetra3 opened a new pull request, #14756: URL: https://github.com/apache/datafusion/pull/14756 ## Which issue does this PR close? No issue, just running into this in production. ## Rationale for this change At the moment there isn't a clean way to set the recursion limit

Re: [PR] feat: add spark_signed_integer_remainder native function for compatibility with spark [datafusion-comet]

2025-02-18 Thread via GitHub
wForget commented on PR #1416: URL: https://github.com/apache/datafusion-comet/pull/1416#issuecomment-2667358803 > wondering if we can fix in arrow/datafusion > It looks addition is working, I wonder what would be the difference between `+` and `%` It seems to be by design in rust

Re: [I] [DISCUSSION] Lowering the barrier to new users (Lessons from-799 CMU Optimizer Class) [datafusion]

2025-02-18 Thread via GitHub
niebayes commented on issue #14373: URL: https://github.com/apache/datafusion/issues/14373#issuecomment-2667370515 > # Substait > > [@lmwnshn](https://github.com/lmwnshn) reported that for their [first project(link)](https://15799.courses.cs.cmu.edu/spring2025/project1.html), they tr

Re: [PR] Fix CI fail for extended test (by freeing up more disk space in CI runner) [datafusion]

2025-02-18 Thread via GitHub
2010YOUY01 commented on code in PR #14745: URL: https://github.com/apache/datafusion/pull/14745#discussion_r1961058668 ## .github/workflows/extended.yml: ## @@ -39,43 +39,54 @@ jobs: linux-build-lib: name: linux build test runs-on: ubuntu-latest -container: -

Re: [PR] Set projection before configuring the source [datafusion]

2025-02-18 Thread via GitHub
mertak-synnada commented on PR #14685: URL: https://github.com/apache/datafusion/pull/14685#issuecomment-2667684649 Yes, I agree with the builder approach, it might be helpful for such cases. However, I'm a bit concerned about with the explicit approach, since it might be forgotten. It look

Re: [PR] StatisticsV2: initial statistics framework redesign [datafusion]

2025-02-18 Thread via GitHub
niebayes commented on code in PR #14699: URL: https://github.com/apache/datafusion/pull/14699#discussion_r1961065951 ## datafusion/physical-expr/src/intervals/cp_solver.rs: ## @@ -15,7 +15,130 @@ // specific language governing permissions and limitations // under the License.

[I] Nested Fields Access on StructArray field not working [datafusion]

2025-02-18 Thread via GitHub
erchirag opened a new issue, #14768: URL: https://github.com/apache/datafusion/issues/14768 ### Discussed in https://github.com/apache/datafusion/discussions/11730 Originally posted by **erchirag** July 30, 2024 Hi I have a parquet file with following example data

Re: [I] Apache DataFusion Google Summer of Code (GSoC) Application Guidelines [datafusion]

2025-02-18 Thread via GitHub
ozankabak closed issue #14577: Apache DataFusion Google Summer of Code (GSoC) Application Guidelines URL: https://github.com/apache/datafusion/issues/14577 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

Re: [PR] [wip] attach diagnostic to duplicate table name error [datafusion]

2025-02-18 Thread via GitHub
zjregee commented on code in PR #14767: URL: https://github.com/apache/datafusion/pull/14767#discussion_r1961079113 ## datafusion/common/src/dfschema.rs: ## @@ -230,6 +230,13 @@ impl DFSchema { return _schema_err!(SchemaError::DuplicateQualifiedField {

Re: [I] Enable `used_underscore_binding` clippy lint [datafusion]

2025-02-18 Thread via GitHub
findepi commented on issue #14649: URL: https://github.com/apache/datafusion/issues/14649#issuecomment-2667781939 We could move logic of `PushDownLimit::rewrite` to make it inherent method (`impl PushDownLimit` block). This way we could recurse without threading useless `config` parameter

Re: [PR] chore: migrate crypto functions to invoke_with_args [datafusion]

2025-02-18 Thread via GitHub
jatin510 commented on PR #14764: URL: https://github.com/apache/datafusion/pull/14764#issuecomment-2667556139 lgtm -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To un

Re: [PR] Allow setting the recursion limit for sql parsing [datafusion]

2025-02-18 Thread via GitHub
jatin510 commented on PR #14756: URL: https://github.com/apache/datafusion/pull/14756#issuecomment-2667568452 I see that a recursion limit of 50 is introduced for SQL parsing. Is this based on specific performance benchmarks or potential stack overflow risks? Also, is there any refer

Re: [PR] feat: add spark_signed_integer_remainder native function for compatibility with spark [datafusion-comet]

2025-02-18 Thread via GitHub
codecov-commenter commented on PR #1416: URL: https://github.com/apache/datafusion-comet/pull/1416#issuecomment-2667576051 ## [Codecov](https://app.codecov.io/gh/apache/datafusion-comet/pull/1416?dropdown=coverage&src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_ca

Re: [I] Return `NativeType` instead of `DataType` for `get_example_types` [datafusion]

2025-02-18 Thread via GitHub
irenjj commented on issue #14761: URL: https://github.com/apache/datafusion/issues/14761#issuecomment-2667601156 take -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To uns

Re: [PR] Fix CI fail for extended test (by freeing up more disk space in CI runner) [datafusion]

2025-02-18 Thread via GitHub
ozankabak commented on code in PR #14745: URL: https://github.com/apache/datafusion/pull/14745#discussion_r1961014910 ## .github/workflows/extended.yml: ## @@ -95,7 +106,7 @@ jobs: - name: Run tests run: | cd datafusion - cargo test --profile

Re: [PR] Fix CI fail for extended test (by freeing up more disk space in CI runner) [datafusion]

2025-02-18 Thread via GitHub
ozankabak commented on code in PR #14745: URL: https://github.com/apache/datafusion/pull/14745#discussion_r1961014910 ## .github/workflows/extended.yml: ## @@ -95,7 +106,7 @@ jobs: - name: Run tests run: | cd datafusion - cargo test --profile

[PR] feat: Improve datafusion-cli memory usage and considering reserve mem… [datafusion]

2025-02-18 Thread via GitHub
zhuqi-lucas opened a new pull request, #14766: URL: https://github.com/apache/datafusion/pull/14766 …ory for the result batches ## Which issue does this PR close? - Closes[ #14751](https://github.com/apache/datafusion/issues/14751) ## Rationale for this change This

Re: [PR] feat: Improve datafusion-cli memory usage and considering reserve mem… [datafusion]

2025-02-18 Thread via GitHub
zhuqi-lucas commented on PR #14766: URL: https://github.com/apache/datafusion/pull/14766#issuecomment-2667614609 cc @alamb @2010YOUY01 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specifi

Re: [PR] feat: Improve datafusion-cli memory usage and considering reserve mem… [datafusion]

2025-02-18 Thread via GitHub
zhuqi-lucas commented on PR #14766: URL: https://github.com/apache/datafusion/pull/14766#issuecomment-2667624674 1. The memory usage now is accurate, it will not collect all result to memory. 2. We now register datafusion-cli result batch to memory pool also. The testing result for

Re: [PR] feat: Improve datafusion-cli memory usage and considering reserve mem… [datafusion]

2025-02-18 Thread via GitHub
zhuqi-lucas commented on PR #14766: URL: https://github.com/apache/datafusion/pull/14766#issuecomment-2667636669 Before this PR, the result is double: ```rust /usr/bin/time -l cargo run --release -- --mem-pool-type fair -m 5G --maxrows 10 -f '/Users/zhuqi/arrow-datafusion/benchmark

[PR] chore: migrate crypto functions to invoke_with_args [datafusion]

2025-02-18 Thread via GitHub
Chen-Yuan-Lai opened a new pull request, #14764: URL: https://github.com/apache/datafusion/pull/14764 ## Which issue does this PR close? - Closes #14707 . ## Rationale for this change ## What changes are included in this PR? ## Are these cha

[I] TypeSignature::Coercible for math functions [datafusion]

2025-02-18 Thread via GitHub
jayzhan211 opened a new issue, #14763: URL: https://github.com/apache/datafusion/issues/14763 ### Is your feature request related to a problem or challenge? `Log` for example can be handled with `TypeSignature::Coercible` where the desired type is float and allow source types are inte

Re: [PR] chore(deps): bump uuid from 1.13.1 to 1.13.2 [datafusion]

2025-02-18 Thread via GitHub
comphead merged PR #14739: URL: https://github.com/apache/datafusion/pull/14739 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dataf

Re: [I] TypeSignature::Coercible for math functions [datafusion]

2025-02-18 Thread via GitHub
alan910127 commented on issue #14763: URL: https://github.com/apache/datafusion/issues/14763#issuecomment-2667281094 take -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

Re: [PR] chore: fixes for kube build [datafusion-comet]

2025-02-18 Thread via GitHub
codecov-commenter commented on PR #1421: URL: https://github.com/apache/datafusion-comet/pull/1421#issuecomment-2667378784 ## [Codecov](https://app.codecov.io/gh/apache/datafusion-comet/pull/1421?dropdown=coverage&src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_ca

Re: [I] Library Guide: Extending DataFusion's operators: custom LogicalPlan and `ExecutionPlans` [datafusion]

2025-02-18 Thread via GitHub
Tanger commented on issue #7308: URL: https://github.com/apache/datafusion/issues/7308#issuecomment-2667394052 Can I take this issue? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the speci

Re: [I] Feature: support `to_char(date, timstamp format)` [datafusion]

2025-02-18 Thread via GitHub
friendlymatthew commented on issue #14536: URL: https://github.com/apache/datafusion/issues/14536#issuecomment-2667388287 > @Omega359 wrote: We submit a PR to arrow-rs to attempt the casting of dates to timestamps in the display code in arrow-cast upon receiving a format failure. I a

Re: [I] TypeSignature::Coercible for crypto functions [datafusion]

2025-02-18 Thread via GitHub
Chen-Yuan-Lai commented on issue #14762: URL: https://github.com/apache/datafusion/issues/14762#issuecomment-2667277631 take -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

Re: [PR] fix: fix various unit test failures in native_datafusion and native_iceberg_compat readers [datafusion-comet]

2025-02-18 Thread via GitHub
parthchandra commented on PR #1415: URL: https://github.com/apache/datafusion-comet/pull/1415#issuecomment-2667311891 @kazuyukitanimura @andygrove @huaxingao @comphead review requested. -- This is an automated message from the Apache Git Service. To respond to the message, please log on

[PR] Dev [datafusion-comet]

2025-02-18 Thread via GitHub
comphead opened a new pull request, #1421: URL: https://github.com/apache/datafusion-comet/pull/1421 ## Which issue does this PR close? Related #1417. ## Rationale for this change ## What changes are included in this PR? ## How are these cha

Re: [I] Simplify `EXPR LIKE 'constant'` to `expr = 'constant'` [datafusion]

2025-02-18 Thread via GitHub
ngli-me commented on issue #13192: URL: https://github.com/apache/datafusion/issues/13192#issuecomment-2667317716 Hi! I'm just going through older issues, it looks like this one can be closed as of #13260, since #13061 got merged into that one? -- This is an automated message from the Apa

Re: [PR] chore(deps): bump clap from 4.5.29 to 4.5.30 [datafusion]

2025-02-18 Thread via GitHub
comphead merged PR #14743: URL: https://github.com/apache/datafusion/pull/14743 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dataf

Re: [PR] chore(deps): bump tempfile from 3.17.0 to 3.17.1 [datafusion]

2025-02-18 Thread via GitHub
comphead merged PR #14742: URL: https://github.com/apache/datafusion/pull/14742 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dataf

Re: [PR] POC to show performance improvements of not copying token [datafusion-sqlparser-rs]

2025-02-18 Thread via GitHub
github-actions[bot] closed pull request #1561: POC to show performance improvements of not copying token URL: https://github.com/apache/datafusion-sqlparser-rs/pull/1561 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

Re: [PR] feat: add spark_signed_integer_remainder native function for compatibility with spark [datafusion-comet]

2025-02-18 Thread via GitHub
kazuyukitanimura commented on PR #1416: URL: https://github.com/apache/datafusion-comet/pull/1416#issuecomment-2667093535 wondering if we can fix in arrow/datafusion It looks addition is working, I wonder what would be the difference between `+` and `%` ``` checkSparkAnswerAnd

Re: [I] Datafusion can't seem to cast evolving structs [datafusion]

2025-02-18 Thread via GitHub
TheBuilderJR commented on issue #14757: URL: https://github.com/apache/datafusion/issues/14757#issuecomment-2667240380 cc @alamb many of my users can't query their data because of this evolution. any chance you can take a look to see if there's any workaround I can do for now? -- This is

Re: [PR] Fix Type Coercion for UDF Arguments [datafusion]

2025-02-18 Thread via GitHub
jayzhan211 commented on PR #14268: URL: https://github.com/apache/datafusion/pull/14268#issuecomment-2667238494 Close by #14440 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comme

Re: [I] DataFusion Regression (Starting in v43): Type Coercion for UDF Arguments (X --> String) for Specified UDFs [datafusion]

2025-02-18 Thread via GitHub
jayzhan211 closed issue #14230: DataFusion Regression (Starting in v43): Type Coercion for UDF Arguments (X --> String) for Specified UDFs URL: https://github.com/apache/datafusion/issues/14230 -- This is an automated message from the Apache Git Service. To respond to the message, please log

[I] Datafusion can't seem to cast evolving structs [datafusion]

2025-02-18 Thread via GitHub
TheBuilderJR opened a new issue, #14757: URL: https://github.com/apache/datafusion/issues/14757 ### Describe the bug I'd expect as I add fields to structs, I should be able to cast one into another. You can see in the repro below this doesn't seem to be allowed: ### To Reproduc

Re: [I] DataFusion Regression (Starting in v43): Type Coercion for UDF Arguments (X --> String) for Specified UDFs [datafusion]

2025-02-18 Thread via GitHub
jayzhan211 commented on issue #14230: URL: https://github.com/apache/datafusion/issues/14230#issuecomment-2667238810 Close by #14440 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

Re: [PR] Signature::Coercible with user defined implicit casting [datafusion]

2025-02-18 Thread via GitHub
jayzhan211 commented on code in PR #14440: URL: https://github.com/apache/datafusion/pull/14440#discussion_r1960783349 ## datafusion/expr-common/src/signature.rs: ## @@ -466,6 +551,186 @@ fn get_data_types(native_type: &NativeType) -> Vec { } } +/// Represents type coer

Re: [PR] Fix Type Coercion for UDF Arguments [datafusion]

2025-02-18 Thread via GitHub
jayzhan211 closed pull request #14268: Fix Type Coercion for UDF Arguments URL: https://github.com/apache/datafusion/pull/14268 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

Re: [PR] Support aliases in ConstEvaluator [datafusion]

2025-02-18 Thread via GitHub
jayzhan211 merged PR #14734: URL: https://github.com/apache/datafusion/pull/14734 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dat

Re: [PR] Support aliases in ConstEvaluator [datafusion]

2025-02-18 Thread via GitHub
jayzhan211 commented on PR #14734: URL: https://github.com/apache/datafusion/pull/14734#issuecomment-2667239582 Thanks @joroKr21 @Dandandan -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the sp

[I] Support Self Join Eliminate [datafusion]

2025-02-18 Thread via GitHub
comphead opened a new issue, #14758: URL: https://github.com/apache/datafusion/issues/14758 ### Is your feature request related to a problem or challenge? PostgreSQL just landed elimination for the self joins for some queries https://www.phoronix.com/news/PostgreSQL-Self-Join-Elimi

Re: [PR] dependabot: group arrow/parquet minor/patch bumps, remove limit [datafusion]

2025-02-18 Thread via GitHub
comphead commented on code in PR #14730: URL: https://github.com/apache/datafusion/pull/14730#discussion_r1960781327 ## .github/dependabot.yml: ## @@ -21,19 +21,30 @@ updates: directory: "/" schedule: interval: daily -open-pull-requests-limit: 10 target

[I] Replace `TypeSignature::Numeric` with `TypeSignature::Coercible` [datafusion]

2025-02-18 Thread via GitHub
jayzhan211 opened a new issue, #14760: URL: https://github.com/apache/datafusion/issues/14760 ### Is your feature request related to a problem or challenge? After #14440, we can use `Coercible` for existing `Numeric` signature ### Describe the solution you'd like _No resp

[I] Replace `TypeSignature::String` with `TypeSignature::Coercible` [datafusion]

2025-02-18 Thread via GitHub
jayzhan211 opened a new issue, #14759: URL: https://github.com/apache/datafusion/issues/14759 ### Is your feature request related to a problem or challenge? After #14440, we can use Coercible signature for existing String ```rust TypeSignature::Coercible(v

[I] TypeSignature::Coercible for crypto functions [datafusion]

2025-02-18 Thread via GitHub
jayzhan211 opened a new issue, #14762: URL: https://github.com/apache/datafusion/issues/14762 ### Is your feature request related to a problem or challenge? `DigestFunc`, `Md5Func`, `SHA` family allow string and binary types as input. We can use `TypeSignature::Coercible` for these fu

[I] Return `NativeType` instead of `DataType` for `get_example_types` [datafusion]

2025-02-18 Thread via GitHub
jayzhan211 opened a new issue, #14761: URL: https://github.com/apache/datafusion/issues/14761 ### Is your feature request related to a problem or challenge? `get_example_types` is used to list possible types for a function in information schema Instead of DataType, NativeType i

Re: [I] Document PREPARE statements [datafusion]

2025-02-18 Thread via GitHub
dhegberg commented on issue #13570: URL: https://github.com/apache/datafusion/issues/13570#issuecomment-2667446243 I was looking at how to support named parameters for PREPARE statements. I expected the supported syntax in SQL dialects to looks something like: ``` PREPARE fun

[I] Add DataFrame fill_nan/fill_null [datafusion]

2025-02-18 Thread via GitHub
kosiew opened a new issue, #14765: URL: https://github.com/apache/datafusion/issues/14765 ### Is your feature request related to a problem or challenge? There is a common operation in libraries such as pyspark to fill nulls in an entire DataFrame (or to limit by columns). It would be

Re: [I] Enable `used_underscore_binding` clippy lint [datafusion]

2025-02-18 Thread via GitHub
ding-young commented on issue #14649: URL: https://github.com/apache/datafusion/issues/14649#issuecomment-2667701042 Hi, @findepi I've set `used_underscore_binding=deny` and renamed each `_var` that was actually used or passed down as fn arg to `var`. However, this gives another clippy

[PR] [wip] attach diagnostic to duplicate table name error [datafusion]

2025-02-18 Thread via GitHub
zjregee opened a new pull request, #14767: URL: https://github.com/apache/datafusion/pull/14767 ## Which issue does this PR close? - Closes #14436. ## Rationale for this change ## What changes are included in this PR? ## Are these changes tested? ## Are there any user-facing ch

Re: [PR] feat: Improve datafusion-cli memory usage and considering reserve mem… [datafusion]

2025-02-18 Thread via GitHub
2010YOUY01 commented on PR #14766: URL: https://github.com/apache/datafusion/pull/14766#issuecomment-2667753322 Thank you for the help. This change will stop execution once `maxrow` is reached. I think this is the optimal behavior for application developers using `datafusion-cli` for qu

[I] Add scripts to automate release process [datafusion-comet]

2025-02-18 Thread via GitHub
andygrove opened a new issue, #1419: URL: https://github.com/apache/datafusion-comet/issues/1419 ### What is the problem the feature request solves? The release process has some manual steps, such as updating snapshot and release version numbers in the documentation. I would like to a

Re: [PR] fix: Substrait serializer clippy error: not calling truncate [datafusion]

2025-02-18 Thread via GitHub
niebayes commented on code in PR #14723: URL: https://github.com/apache/datafusion/pull/14723#discussion_r1960068258 ## datafusion/substrait/src/serializer.rs: ## @@ -26,9 +26,30 @@ use substrait::proto::Plan; use std::fs::OpenOptions; use std::io::{Read, Write}; +use std::p

Re: [PR] fix: Substrait serializer clippy error: not calling truncate [datafusion]

2025-02-18 Thread via GitHub
niebayes commented on PR #14723: URL: https://github.com/apache/datafusion/pull/14723#issuecomment-2666178275 @alamb Applied suggestions from @mbrobbel. Please check it. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

Re: [PR] Comet 0.6.0 [datafusion-site]

2025-02-18 Thread via GitHub
andygrove commented on PR #56: URL: https://github.com/apache/datafusion-site/pull/56#issuecomment-2666080238 Thanks for the reviews! I plan on publishing this once https://github.com/apache/datafusion-comet/pull/1418 is merged. -- This is an automated message from the Apache Git Service.

[PR] chore: Update released version in documentation [datafusion-comet]

2025-02-18 Thread via GitHub
andygrove opened a new pull request, #1418: URL: https://github.com/apache/datafusion-comet/pull/1418 ## Which issue does this PR close? Closes #. ## Rationale for this change ## What changes are included in this PR? ## How are these changes

[I] Improve datafusion-cli memory usage and considering reserve memory for the result batches [datafusion]

2025-02-18 Thread via GitHub
zhuqi-lucas opened a new issue, #14751: URL: https://github.com/apache/datafusion/issues/14751 ### Is your feature request related to a problem or challenge? This is the follow-up for the discussion https://github.com/apache/datafusion/pull/14644#issuecomment-2665614584 **P

Re: [PR] bug: Fix memory reservation and allocation problems for SortExec [datafusion]

2025-02-18 Thread via GitHub
zhuqi-lucas commented on PR #14644: URL: https://github.com/apache/datafusion/pull/14644#issuecomment-2666113013 > > > Problem > > > I tried one query and this PR is not working as expected, I specified one query to run under 5GB memory (select * without order requires 7GB) but it's stil

Re: [I] ListingTable cannot handle partition evolution [datafusion]

2025-02-18 Thread via GitHub
TheBuilderJR commented on issue #13270: URL: https://github.com/apache/datafusion/issues/13270#issuecomment-2666309651 I just created an issue for this: https://github.com/apache/datafusion/issues/14753 -- This is an automated message from the Apache Git Service. To respond to the message

Re: [I] Docker image for 0.6.0 failed to build [datafusion-comet]

2025-02-18 Thread via GitHub
andygrove commented on issue #1417: URL: https://github.com/apache/datafusion-comet/issues/1417#issuecomment-2666318484 Once this is resolved we should update the `kubernetes.md` doc to point to the new version, as discussed in https://github.com/apache/datafusion-comet/pull/1418 -- Thi

Re: [PR] chore: Update released version in documentation [datafusion-comet]

2025-02-18 Thread via GitHub
andygrove merged PR #1418: URL: https://github.com/apache/datafusion-comet/pull/1418 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@

Re: [PR] Comet 0.6.0 [datafusion-site]

2025-02-18 Thread via GitHub
andygrove merged PR #56: URL: https://github.com/apache/datafusion-site/pull/56 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dataf

Re: [PR] Preallocate Ray Workers [datafusion-ray]

2025-02-18 Thread via GitHub
robtandy commented on PR #62: URL: https://github.com/apache/datafusion-ray/pull/62#issuecomment-2666307285 Updated to squash messy commit history from source branch -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

Re: [PR] Preallocate Ray Workers [datafusion-ray]

2025-02-18 Thread via GitHub
andygrove commented on code in PR #62: URL: https://github.com/apache/datafusion-ray/pull/62#discussion_r1960168520 ## requirements-in.txt: ## @@ -7,6 +7,6 @@ numpy pyarrow>=18.0.0 pytest ray==2.40.0 -datafusion==43.1.0 +datafusion==43.0.0 Review Comment: There is no 43.0

Re: [I] Datafusion can't seem to handle schema evolution [datafusion]

2025-02-18 Thread via GitHub
TheBuilderJR commented on issue #14753: URL: https://github.com/apache/datafusion/issues/14753#issuecomment-2666342013 Actually this does seem to work with the latest version of datafusion! -- This is an automated message from the Apache Git Service. To respond to the message, please log o

Re: [I] Remove hard-coded Comet version numbers from GitHub actions [datafusion-comet]

2025-02-18 Thread via GitHub
andygrove commented on issue #1406: URL: https://github.com/apache/datafusion-comet/issues/1406#issuecomment-2666355667 This is fixed now -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the spec

Re: [I] Remove hard-coded Comet version numbers from GitHub actions [datafusion-comet]

2025-02-18 Thread via GitHub
andygrove closed issue #1406: Remove hard-coded Comet version numbers from GitHub actions URL: https://github.com/apache/datafusion-comet/issues/1406 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

Re: [PR] Add `#[recursive]` [datafusion-sqlparser-rs]

2025-02-18 Thread via GitHub
alamb commented on PR #1522: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/1522#issuecomment-2666793579 > Hi @alamb , @iffyio ! Just wanted to report that I just received a crash report that seems to come from here: [sqlpage/SQLPage#814](https://github.com/sqlpage/SQLPage/iss

Re: [PR] fix: Substrait serializer clippy error: not calling truncate [datafusion]

2025-02-18 Thread via GitHub
mbrobbel commented on code in PR #14723: URL: https://github.com/apache/datafusion/pull/14723#discussion_r1960083499 ## datafusion/substrait/src/serializer.rs: ## @@ -26,28 +26,50 @@ use substrait::proto::Plan; use std::fs::OpenOptions; use std::io::{Read, Write}; +use std::

Re: [PR] chore: Update released version in documentation [datafusion-comet]

2025-02-18 Thread via GitHub
codecov-commenter commented on PR #1418: URL: https://github.com/apache/datafusion-comet/pull/1418#issuecomment-2666238969 ## [Codecov](https://app.codecov.io/gh/apache/datafusion-comet/pull/1418?dropdown=coverage&src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_ca

[I] Comet 0.7.0 (March 2025) [datafusion-comet]

2025-02-18 Thread via GitHub
andygrove opened a new issue, #1420: URL: https://github.com/apache/datafusion-comet/issues/1420 ### What is the problem the feature request solves? Tracking the next release. Issues to resolve: - https://github.com/apache/datafusion-comet/issues/1414 - https://github.

Re: [I] Datafusion can't seem to handle schema evolution [datafusion]

2025-02-18 Thread via GitHub
TheBuilderJR closed issue #14753: Datafusion can't seem to handle schema evolution URL: https://github.com/apache/datafusion/issues/14753 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

Re: [PR] Skip target in taplo checks [datafusion]

2025-02-18 Thread via GitHub
findepi merged PR #14747: URL: https://github.com/apache/datafusion/pull/14747 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafu

Re: [PR] fix: Substrait serializer clippy error: not calling truncate [datafusion]

2025-02-18 Thread via GitHub
niebayes commented on code in PR #14723: URL: https://github.com/apache/datafusion/pull/14723#discussion_r1960056475 ## datafusion/substrait/src/serializer.rs: ## @@ -26,9 +26,30 @@ use substrait::proto::Plan; use std::fs::OpenOptions; use std::io::{Read, Write}; +use std::p

Re: [PR] fix: EnforceSorting should not remove a needed coalesces [datafusion]

2025-02-18 Thread via GitHub
berkaysynnada commented on PR #14637: URL: https://github.com/apache/datafusion/pull/14637#issuecomment-2666086572 I'd like to take a look at this tomorrow, please hold if it's not urgent. -- This is an automated message from the Apache Git Service. To respond to the message, please log on

Re: [PR] chore: Update released version in documentation [datafusion-comet]

2025-02-18 Thread via GitHub
andygrove commented on code in PR #1418: URL: https://github.com/apache/datafusion-comet/pull/1418#discussion_r1959993933 ## docs/source/user-guide/kubernetes.md: ## @@ -65,13 +65,13 @@ metadata: spec: type: Scala mode: cluster - image: ghcr.io/apache/datafusion-comet:sp

Re: [PR] fix: fix various unit test failures in native_datafusion and native_iceberg_compat readers [datafusion-comet]

2025-02-18 Thread via GitHub
parthchandra commented on PR #1415: URL: https://github.com/apache/datafusion-comet/pull/1415#issuecomment-2666140365 > #1413 shouldn't really conflict with this (I mean it will from a merge standpoint, but not from a logic standpoint). This just adds more true cases to `cast_supported` an

Re: [PR] feat: add Win-amd64 profile [datafusion-comet]

2025-02-18 Thread via GitHub
wForget commented on PR #1410: URL: https://github.com/apache/datafusion-comet/pull/1410#issuecomment-2666254913 > > Thanks @wForget. I have no way to test this, but LGTM. > > @andygrove Thank you. I have verified this locally, I'll provide screenshots later. I have updated sc

Re: [PR] Skip target in taplo checks [datafusion]

2025-02-18 Thread via GitHub
findepi commented on PR #14747: URL: https://github.com/apache/datafusion/pull/14747#issuecomment-2666259330 > `trybuild` is an example of a tool that creates cargo files inside target directory. BTW i am using `trybuild` in https://github.com/apache/datafusion/pull/14668, so this wi

Re: [I] Docker image for 0.6.0 failed to build [datafusion-comet]

2025-02-18 Thread via GitHub
comphead commented on issue #1417: URL: https://github.com/apache/datafusion-comet/issues/1417#issuecomment-2666484290 it stucks for hours on ``` Downloading from central: https://repo.maven.apache.org/maven2/org/scala-lang/scala-reflect/2.12.17/scala-reflect-2.12.17.jar #33 40.95

Re: [PR] Preallocate Ray Workers [datafusion-ray]

2025-02-18 Thread via GitHub
robtandy commented on code in PR #62: URL: https://github.com/apache/datafusion-ray/pull/62#discussion_r1960252067 ## requirements-in.txt: ## @@ -7,6 +7,6 @@ numpy pyarrow>=18.0.0 pytest ray==2.40.0 -datafusion==43.1.0 +datafusion==43.0.0 Review Comment: Good eye. Correc

Re: [PR] Signature::Coercible with user defined implicit casting [datafusion]

2025-02-18 Thread via GitHub
findepi commented on code in PR #14440: URL: https://github.com/apache/datafusion/pull/14440#discussion_r1959483130 ## datafusion/expr-common/src/signature.rs: ## @@ -466,6 +551,186 @@ fn get_data_types(native_type: &NativeType) -> Vec { } } +/// Represents type coercio

[I] More accurate memory accounting in external sort [datafusion]

2025-02-18 Thread via GitHub
2010YOUY01 opened a new issue, #14748: URL: https://github.com/apache/datafusion/issues/14748 ### Is your feature request related to a problem or challenge? https://github.com/apache/datafusion/pull/14644 fixed an external sorting bug. Each batch's memory overhead is estimated as `2 *

Re: [I] Migrate Array Functions to `invoke_with_args` [datafusion]

2025-02-18 Thread via GitHub
goldmedal closed issue #14702: Migrate Array Functions to `invoke_with_args` URL: https://github.com/apache/datafusion/issues/14702 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific commen

Re: [PR] chore: Migrate Array Functions to invoke_with_args [datafusion]

2025-02-18 Thread via GitHub
goldmedal merged PR #14726: URL: https://github.com/apache/datafusion/pull/14726 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@data

  1   2   3   >