Re: [PR] Make scalar and array handling for array_has consistent [datafusion]

2024-12-07 Thread via GitHub
Weijun-H commented on PR #13683: URL: https://github.com/apache/datafusion/pull/13683#issuecomment-2525400134 > We follow DuckDB for array function mostly, the best I can think of is implementing spark function in [datafusion-contrib/datafusion-functions-extra](https://github.com/datafusion

Re: [I] Add `SessionConfig` reference to `ScalarFunctionArgs` [datafusion]

2024-12-07 Thread via GitHub
jayzhan211 commented on issue #13519: URL: https://github.com/apache/datafusion/issues/13519#issuecomment-2525424413 I guess we can also add `nullable` info to `ScalarFunctionArgs` https://github.com/apache/datafusion/issues/11923 -- This is an automated message from the Apache Git Se

Re: [PR] Make scalar and array handling for array_has consistent [datafusion]

2024-12-07 Thread via GitHub
Kimahriman commented on PR #13683: URL: https://github.com/apache/datafusion/pull/13683#issuecomment-2525401068 Interesting thing is that DuckDB says they based it on the PrestoDB behavior: https://github.com/duckdb/duckdb/pull/3065 But a quick look at the PrestoDB implementation sugg

Re: [I] Create memory table with target partitions [datafusion]

2024-12-07 Thread via GitHub
jayzhan211 commented on issue #12905: URL: https://github.com/apache/datafusion/issues/12905#issuecomment-2525406814 I'm thinking about make `ValuesExec` partitioned -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

Re: [PR] Make scalar and array handling for array_has consistent [datafusion]

2024-12-07 Thread via GitHub
jayzhan211 commented on PR #13683: URL: https://github.com/apache/datafusion/pull/13683#issuecomment-2525402657 > Interesting thing is that DuckDB says they based it on the PrestoDB behavior: [duckdb/duckdb#3065](https://github.com/duckdb/duckdb/pull/3065) > > But a quick look at the

Re: [PR] Refactor regexplike signature [datafusion]

2024-12-07 Thread via GitHub
jayzhan211 merged PR #13394: URL: https://github.com/apache/datafusion/pull/13394 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dat

Re: [I] Update ClickBench benchmarks with DataFusion `43.0.0` [datafusion]

2024-12-07 Thread via GitHub
jayzhan211 commented on issue #13099: URL: https://github.com/apache/datafusion/issues/13099#issuecomment-2525483850 > > > @alamb I would really appreciate any advice you could give when you have a moment. > > > > > > I think we would have to get some detailed profiling to really

Re: [PR] Refactor regexplike signature [datafusion]

2024-12-07 Thread via GitHub
jayzhan211 commented on PR #13394: URL: https://github.com/apache/datafusion/pull/13394#issuecomment-2525480746 Thanks @jiashenC @alamb @Omega359 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

Re: [PR] Make scalar and array handling for array_has consistent [datafusion]

2024-12-07 Thread via GitHub
Kimahriman commented on code in PR #13683: URL: https://github.com/apache/datafusion/pull/13683#discussion_r1874582555 ## datafusion/sqllogictest/test_files/array.slt: ## @@ -5260,6 +5270,13 @@ select array_has([], null), NULL NULL NULL +# If lhs is has any Nulls, we re

Re: [PR] chore: Remove unused StringView struct [datafusion-comet]

2024-12-07 Thread via GitHub
sunchao commented on code in PR #1143: URL: https://github.com/apache/datafusion-comet/pull/1143#discussion_r1874543948 ## native/core/src/data_type.rs: ## @@ -1,241 +0,0 @@ -// Licensed to the Apache Software Foundation (ASF) under one -// or more contributor license agreements

Re: [PR] fix: Enable scenarios accidentally commented out in CometExecBenchmark [datafusion-comet]

2024-12-07 Thread via GitHub
andygrove merged PR #1151: URL: https://github.com/apache/datafusion-comet/pull/1151 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@

[PR] Make scalar and array handling for array_has consistent [datafusion]

2024-12-07 Thread via GitHub
Kimahriman opened a new pull request, #13683: URL: https://github.com/apache/datafusion/pull/13683 ## Which issue does this PR close? Closes #13682 ## Rationale for this change Makes null handling for `array_has` consistent across scalars and arrays, and mak

Re: [I] Update ballista logo [datafusion-ballista]

2024-12-07 Thread via GitHub
andygrove commented on issue #1133: URL: https://github.com/apache/datafusion-ballista/issues/1133#issuecomment-2525226778 It looks like it is between 3 & 4. @pinarbayata would it be possible to see one more variation - option 3 but using the colors from option 4? -- This is an au

Re: [PR] Make scalar and array handling for array_has consistent [datafusion]

2024-12-07 Thread via GitHub
jayzhan211 commented on PR #13683: URL: https://github.com/apache/datafusion/pull/13683#issuecomment-2525371434 We follow DuckDB for array function, I think the better approach is having a easy way to customize the function -- This is an automated message from the Apache Git Service. To r

Re: [PR] Performance: enable array allocation reuse (`ScalarFunctionArgs` gets owned `ColumnReference`) [datafusion]

2024-12-07 Thread via GitHub
alamb commented on code in PR #13637: URL: https://github.com/apache/datafusion/pull/13637#discussion_r1874553258 ## datafusion-examples/examples/advanced_udf.rs: ## @@ -191,6 +199,51 @@ impl ScalarUDFImpl for PowUdf { } } +/// Evaluate `base ^ exp` *without* allocating

Re: [PR] Make scalar and array handling for array_has consistent [datafusion]

2024-12-07 Thread via GitHub
alamb commented on code in PR #13683: URL: https://github.com/apache/datafusion/pull/13683#discussion_r1874558578 ## datafusion/functions-nested/src/array_has.rs: ## @@ -215,7 +215,11 @@ fn array_has_dispatch_for_array( let needle_row = Scalar::new(needle.slice(i, 1));

Re: [PR] chore: Remove unused StringView struct [datafusion-comet]

2024-12-07 Thread via GitHub
viirya commented on code in PR #1143: URL: https://github.com/apache/datafusion-comet/pull/1143#discussion_r1874555050 ## native/core/src/data_type.rs: ## @@ -1,241 +0,0 @@ -// Licensed to the Apache Software Foundation (ASF) under one -// or more contributor license agreements.

[PR] build(deps): bump pyo3-build-config from 0.22.6 to 0.23.3 [datafusion-python]

2024-12-07 Thread via GitHub
dependabot[bot] opened a new pull request, #966: URL: https://github.com/apache/datafusion-python/pull/966 Bumps [pyo3-build-config](https://github.com/pyo3/pyo3) from 0.22.6 to 0.23.3. Release notes Sourced from https://github.com/pyo3/pyo3/releases";>pyo3-build-config's releases

[PR] build(deps): bump tokio from 1.41.1 to 1.42.0 [datafusion-python]

2024-12-07 Thread via GitHub
dependabot[bot] opened a new pull request, #968: URL: https://github.com/apache/datafusion-python/pull/968 Bumps [tokio](https://github.com/tokio-rs/tokio) from 1.41.1 to 1.42.0. Release notes Sourced from https://github.com/tokio-rs/tokio/releases";>tokio's releases. Tokio v

[PR] build(deps): bump prost from 0.13.3 to 0.13.4 [datafusion-python]

2024-12-07 Thread via GitHub
dependabot[bot] opened a new pull request, #967: URL: https://github.com/apache/datafusion-python/pull/967 Bumps [prost](https://github.com/tokio-rs/prost) from 0.13.3 to 0.13.4. Changelog Sourced from https://github.com/tokio-rs/prost/blob/master/CHANGELOG.md";>prost's changelog.

Re: [PR] chore: macros crate cleanup [datafusion]

2024-12-07 Thread via GitHub
findepi merged PR #13685: URL: https://github.com/apache/datafusion/pull/13685 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafu

[PR] feat: Expose Ballista Scheduler and Executor in Python [datafusion-ballista]

2024-12-07 Thread via GitHub
milenkovicm opened a new pull request, #1148: URL: https://github.com/apache/datafusion-ballista/pull/1148 # Which issue does this PR close? Closes #1107. # Rationale for this change # What changes are included in this PR? - added basic classes for scheduler & exe

Re: [PR] feat: default instance for executor configuration [datafusion-ballista]

2024-12-07 Thread via GitHub
andygrove merged PR #1147: URL: https://github.com/apache/datafusion-ballista/pull/1147 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr.

Re: [I] [EPIC] Improve sqlparser performance [datafusion-sqlparser-rs]

2024-12-07 Thread via GitHub
davisp commented on issue #1557: URL: https://github.com/apache/datafusion-sqlparser-rs/issues/1557#issuecomment-2525239854 @alamb How are you generating your flamegraphs? Locally, `cargo flamegraph --bench sqlparser_bench` does not appear to be doing the trick as the flamegraph appe

Re: [I] [DISCUSS] More extensive pre-release testing [datafusion]

2024-12-07 Thread via GitHub
findepi commented on issue #13661: URL: https://github.com/apache/datafusion/issues/13661#issuecomment-2525246516 > It is my understanding that the apache voting / approval process prevents automated builds That's my understanding too, but i hope this process isn't nonnegotiable.

Re: [PR] [comet-parquet-exec] Add Native Scan to CometReadBenchmark [datafusion-comet]

2024-12-07 Thread via GitHub
mbutrovich commented on PR #1150: URL: https://github.com/apache/datafusion-comet/pull/1150#issuecomment-2525245173 I am curious what's going on in this scenario in particular: ``` Sorted Lg Str Filter Scan (50.0% zeros): Best Time(ms) Avg Time(ms) Stdev(ms)Rate(M/s) Per Ro

Re: [PR] Add snapshot testing to CLI & set up AWS mock [datafusion]

2024-12-07 Thread via GitHub
findepi commented on code in PR #13672: URL: https://github.com/apache/datafusion/pull/13672#discussion_r1874533180 ## datafusion-cli/tests/integration_setup.bash: ## @@ -0,0 +1,16 @@ +# you should have localstack up, e.g by +#$ LOCALSTACK_VERSION=sha256:a0b79cb2430f1818de2c66c

Re: [PR] Fix S3 in CLI: Do not normalize options values [datafusion]

2024-12-07 Thread via GitHub
findepi commented on code in PR #13576: URL: https://github.com/apache/datafusion/pull/13576#discussion_r1874535271 ## datafusion/common/src/config.rs: ## @@ -973,16 +980,24 @@ impl ConfigField for Option { #[macro_export] macro_rules! config_field { -($t:ty) => { +(

Re: [PR] chore: Move more expressions from core crate to spark-expr crate [datafusion-comet]

2024-12-07 Thread via GitHub
andygrove commented on code in PR #1152: URL: https://github.com/apache/datafusion-comet/pull/1152#discussion_r1874535391 ## native/spark-expr/src/sum_decimal.rs: ## @@ -286,18 +284,16 @@ struct SumDecimalGroupsAccumulator { sum: Vec, result_type: DataType, precis

[PR] chore: Move more expressions from core crate to spark-expr crate [datafusion-comet]

2024-12-07 Thread via GitHub
andygrove opened a new pull request, #1152: URL: https://github.com/apache/datafusion-comet/pull/1152 ## Which issue does this PR close? Part of https://github.com/apache/datafusion-comet/issues/659 ## Rationale for this change We do not publish the core `

[PR] macros crate cleanup [datafusion]

2024-12-07 Thread via GitHub
findepi opened a new pull request, #13685: URL: https://github.com/apache/datafusion/pull/13685 ## Which issue does this PR close? none ## Rationale for this change - remove unused dependencies - move user_doc macro from top-level lib.rs to module (in prep for more ma

[PR] Reorganize the Parser module [datafusion-sqlparser-rs]

2024-12-07 Thread via GitHub
davisp opened a new pull request, #1581: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/1581 I mostly did this as an exercise to get a general feel of how the Parser implementation is organized. The basics here are that for every top level keyword in Parser::parse_statement I c

Re: [PR] Reorganize the Parser module [datafusion-sqlparser-rs]

2024-12-07 Thread via GitHub
davisp commented on PR #1581: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/1581#issuecomment-2525327041 I originally had intentions on spending today trying to see if I couldn't figure out how to help move #1561 forward, but my usual attempt at commenting out the non-clone A

Re: [PR] Make scalar and array handling for array_has consistent [datafusion]

2024-12-07 Thread via GitHub
comphead commented on code in PR #13683: URL: https://github.com/apache/datafusion/pull/13683#discussion_r1874573459 ## datafusion/sqllogictest/test_files/array.slt: ## @@ -5260,6 +5270,13 @@ select array_has([], null), NULL NULL NULL +# If lhs is has any Nulls, we retu

Re: [PR] Implement GroupsAccumulator for corr(x,y) aggregate function [datafusion]

2024-12-07 Thread via GitHub
2010YOUY01 commented on PR #13581: URL: https://github.com/apache/datafusion/pull/13581#issuecomment-2525258213 > Hi, I think this one is pretty close, do you have time to look at the review comments @2010YOUY01 ? Yes, I will be back and finish this PR in next 2 days, I'm traveling an

Re: [PR] Make scalar and array handling for array_has consistent [datafusion]

2024-12-07 Thread via GitHub
Kimahriman commented on code in PR #13683: URL: https://github.com/apache/datafusion/pull/13683#discussion_r1874569927 ## datafusion/functions-nested/src/array_has.rs: ## @@ -215,7 +215,11 @@ fn array_has_dispatch_for_array( let needle_row = Scalar::new(needle.slice(i,

[PR] refactor: use `LazyLock` in the `user_doc` macro [datafusion]

2024-12-07 Thread via GitHub
jonahgao opened a new pull request, #13684: URL: https://github.com/apache/datafusion/pull/13684 ## Which issue does this PR close? Part of https://github.com/apache/datafusion/issues/11687 ## Rationale for this change Replace `OnceLock` with `LazyLock`. ##

Re: [PR] WIP: example solution for part of Epic 13525; invariant checking for implicit LP changes [datafusion]

2024-12-07 Thread via GitHub
findepi commented on code in PR #13651: URL: https://github.com/apache/datafusion/pull/13651#discussion_r1874536239 ## datafusion/optimizer/src/optimizer.rs: ## @@ -451,6 +468,33 @@ impl Optimizer { } } +/// These are invariants to hold true for each logical plan. +/// D

Re: [I] Retry logic in ParquetSink [datafusion]

2024-12-07 Thread via GitHub
findepi commented on issue #13679: URL: https://github.com/apache/datafusion/issues/13679#issuecomment-2525252358 > ParquetSink (used for `COPY TO`) encodes bytes to parquet and writes to the sink (e.g. object store). It currently does not include retry logic for failed multipart PUTs to ob

Re: [PR] chore: Create devcontainer.json [datafusion]

2024-12-07 Thread via GitHub
rluvaton commented on PR #13520: URL: https://github.com/apache/datafusion/pull/13520#issuecomment-2525342446 @Omega359 do you wanna take this as you have better configuration? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub a

Re: [I] Statistic: data_size should be in ColumnStatistics. [datafusion]

2024-12-07 Thread via GitHub
findepi commented on issue #7548: URL: https://github.com/apache/datafusion/issues/7548#issuecomment-2525243892 @AdamGS +1 from me. The average data size sounds most logical from optimizer's perspective (i was involved in the introduction of ColumnStatistics.dataSize of Presto/Trino,

Re: [PR] Implement GroupsAccumulator for corr(x,y) aggregate function [datafusion]

2024-12-07 Thread via GitHub
alamb commented on PR #13581: URL: https://github.com/apache/datafusion/pull/13581#issuecomment-2525285569 I also harbor hopes of contributing a benchmark for `corr`, hopefully -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub a

Re: [PR] refactor: use `LazyLock` in the `user_doc` macro [datafusion]

2024-12-07 Thread via GitHub
comphead commented on code in PR #13684: URL: https://github.com/apache/datafusion/pull/13684#discussion_r1874552141 ## datafusion/macros/src/lib.rs: ## @@ -235,27 +248,23 @@ pub fn user_doc(args: TokenStream, input: TokenStream) -> TokenStream { } }); -let

Re: [PR] refactor: use `LazyLock` in the `user_doc` macro [datafusion]

2024-12-07 Thread via GitHub
comphead commented on code in PR #13684: URL: https://github.com/apache/datafusion/pull/13684#discussion_r1874552306 ## datafusion/macros/src/lib.rs: ## @@ -26,16 +26,19 @@ use syn::{parse_macro_input, DeriveInput, LitStr}; /// declared on `AggregateUDF`, `WindowUDFImpl`, `Scal

Re: [PR] Performance: enable array allocation reuse (`ScalarFunctionArgs` gets owned `ColumnReference`) [datafusion]

2024-12-07 Thread via GitHub
alamb commented on code in PR #13637: URL: https://github.com/apache/datafusion/pull/13637#discussion_r1868550990 ## datafusion/physical-expr/src/scalar_function.rs: ## @@ -134,20 +134,20 @@ impl PhysicalExpr for ScalarFunctionExpr { } fn evaluate(&self, batch: &Reco

[PR] build(deps): bump prost-types from 0.13.3 to 0.13.4 [datafusion-python]

2024-12-07 Thread via GitHub
dependabot[bot] opened a new pull request, #969: URL: https://github.com/apache/datafusion-python/pull/969 Bumps [prost-types](https://github.com/tokio-rs/prost) from 0.13.3 to 0.13.4. Changelog Sourced from https://github.com/tokio-rs/prost/blob/master/CHANGELOG.md";>prost-types's

Re: [PR] build(deps): bump pyo3-build-config from 0.22.6 to 0.23.2 [datafusion-python]

2024-12-07 Thread via GitHub
dependabot[bot] closed pull request #961: build(deps): bump pyo3-build-config from 0.22.6 to 0.23.2 URL: https://github.com/apache/datafusion-python/pull/961 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above t

Re: [PR] build(deps): bump pyo3-build-config from 0.22.6 to 0.23.2 [datafusion-python]

2024-12-07 Thread via GitHub
dependabot[bot] commented on PR #961: URL: https://github.com/apache/datafusion-python/pull/961#issuecomment-2525294073 Superseded by #966. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the sp

[I] The syntax of mysql RENAME TABLE tb1 TO tb2 is not supported. [datafusion-sqlparser-rs]

2024-12-07 Thread via GitHub
charmfocus opened a new issue, #1582: URL: https://github.com/apache/datafusion-sqlparser-rs/issues/1582 ```rust let sql = "RENAME TABLE `test`.`test` TO `test`.`test2`"; // parse to a Statement let dialect = GenericDialect {}; let ast = Parser::parse_sql(&dial

Re: [PR] RSS url needed the relative path removed to stage it correctly [datafusion-site]

2024-12-07 Thread via GitHub
timsaucer commented on PR #48: URL: https://github.com/apache/datafusion-site/pull/48#issuecomment-2525385983 @alamb I believe this is ready to go. I resolved the issue with the infra team. We had to make an adjustment to the .asf.yaml file within the `asf-site` branch. I had to make some u

Re: [PR] Redshift: Fix parsing for quoted numbered columns [datafusion-sqlparser-rs]

2024-12-07 Thread via GitHub
7phs commented on code in PR #1576: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/1576#discussion_r1874399142 ## tests/sqlparser_redshift.rs: ## @@ -353,3 +380,23 @@ fn test_parse_json_path_from() { _ => panic!(), } } + +#[test] +fn test_parse_selec

Re: [I] Release DataFusion `44.0.0` [datafusion]

2024-12-07 Thread via GitHub
alamb commented on issue #13334: URL: https://github.com/apache/datafusion/issues/13334#issuecomment-2525081316 I would personally love to see DataFusion 44.0.0 be lauded as "super stable" and have few upgrade issues (we would largely achieve this by testing upgrades with other projects pri

[PR] Add DataFusion 44.0.0 blog post [datafusion-site]

2024-12-07 Thread via GitHub
alamb opened a new pull request, #46: URL: https://github.com/apache/datafusion-site/pull/46 I am testing the instructions on https://github.com/apache/datafusion-site/pull/44 Also I aspire to write a post about 44.0.0's release -- This is an automated message from the Apache Git S

[PR] Add DataFusion 44.0.0 blog post [datafusion-site]

2024-12-07 Thread via GitHub
alamb opened a new pull request, #45: URL: https://github.com/apache/datafusion-site/pull/45 This is an aspirational PR with a DataFusion 44.0.0 release announcement It is mostly to test out https://github.com/apache/datafusion-site/pull/44 but I figured I would at least pretend I wou

Re: [PR] Add DataFusion 44.0.0 blog post [datafusion-site]

2024-12-07 Thread via GitHub
alamb commented on PR #45: URL: https://github.com/apache/datafusion-site/pull/45#issuecomment-2525083301 I didn't name the branch correctly, closing -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

Re: [PR] Add DataFusion 44.0.0 blog post [datafusion-site]

2024-12-07 Thread via GitHub
alamb closed pull request #45: Add DataFusion 44.0.0 blog post URL: https://github.com/apache/datafusion-site/pull/45 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubsc

[PR] Add DataFusion 44.0.0 blog post [datafusion-site]

2024-12-07 Thread via GitHub
alamb opened a new pull request, #47: URL: https://github.com/apache/datafusion-site/pull/47 I am testing the instructions on https://github.com/apache/datafusion-site/pull/44 Also I aspire to write a post about 44.0.0's release -- This is an automated message from the Apache Git S

Re: [PR] Add DataFusion 44.0.0 blog post [datafusion-site]

2024-12-07 Thread via GitHub
alamb closed pull request #46: Add DataFusion 44.0.0 blog post URL: https://github.com/apache/datafusion-site/pull/46 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubsc

Re: [PR] Add DataFusion 44.0.0 blog post [datafusion-site]

2024-12-07 Thread via GitHub
alamb commented on PR #46: URL: https://github.com/apache/datafusion-site/pull/46#issuecomment-2525084254 Staging push failed: - https://github.com/apache/datafusion-site/actions/runs/12212614636/job/34071269677?pr=46 -- This is an automated message from the Apache Git Service. To res

Re: [PR] Fix join with sort push down [datafusion]

2024-12-07 Thread via GitHub
haohuaijin commented on code in PR #13560: URL: https://github.com/apache/datafusion/pull/13560#discussion_r1874368912 ## datafusion/sqllogictest/test_files/joins.slt: ## @@ -4313,3 +4313,86 @@ physical_plan 04)--HashJoinExec: mode=CollectLeft, join_type=Inner, on=[(binary_

Re: [PR] Redshift: Fix parsing for quoted numbered columns [datafusion-sqlparser-rs]

2024-12-07 Thread via GitHub
iffyio commented on code in PR #1576: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/1576#discussion_r1874372151 ## tests/sqlparser_redshift.rs: ## @@ -353,3 +380,23 @@ fn test_parse_json_path_from() { _ => panic!(), } } + +#[test] +fn test_parse_sel

[PR] feat: support `RightAnti` for `SortMergeJoin` [datafusion]

2024-12-07 Thread via GitHub
irenjj opened a new pull request, #13680: URL: https://github.com/apache/datafusion/pull/13680 ## Which issue does this PR close? Closes #13472 ## Rationale for this change ## What changes are included in this PR? ## Are these changes tested

Re: [PR] Adjust site URL to be relative to /blog [datafusion-site]

2024-12-07 Thread via GitHub
alamb commented on code in PR #44: URL: https://github.com/apache/datafusion-site/pull/44#discussion_r1874404718 ## pelicanconf.py: ## @@ -37,9 +37,10 @@ ARCHIVES_SAVE_AS = '' # Disable articles by pointing to a (should-be-absent) subdir ARTICLE_PATHS = [ 'blog' ] -# needed t

Re: [PR] WIP Upgrade to arrow-rs/parquet `54.0.0` [datafusion]

2024-12-07 Thread via GitHub
alamb commented on code in PR #13663: URL: https://github.com/apache/datafusion/pull/13663#discussion_r1874406962 ## datafusion/common/src/pyarrow.rs: ## @@ -17,6 +17,10 @@ //! Conversions between PyArrow and DataFusion types +// TODO update to pyo3 new APIs Review Comment

[PR] Support specific `GroupsAccumulator` for `median` [datafusion]

2024-12-07 Thread via GitHub
Rachelint opened a new pull request, #13681: URL: https://github.com/apache/datafusion/pull/13681 ## Which issue does this PR close? Closes #. ## Rationale for this change ## What changes are included in this PR? ## Are these changes tested?

Re: [PR] Adjust site URL to be relative to /blog [datafusion-site]

2024-12-07 Thread via GitHub
timsaucer merged PR #44: URL: https://github.com/apache/datafusion-site/pull/44 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dataf

Re: [I] Automate deploying the website [datafusion-site]

2024-12-07 Thread via GitHub
timsaucer closed issue #13: Automate deploying the website URL: https://github.com/apache/datafusion-site/issues/13 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscri

Re: [I] Update ballista logo [datafusion-ballista]

2024-12-07 Thread via GitHub
alamb commented on issue #1133: URL: https://github.com/apache/datafusion-ballista/issues/1133#issuecomment-2525104848 I also like option 4: ![image](https://github.com/user-attachments/assets/b27df4c0-8b33-4f2e-9658-d2eeccb18ea2) -- This is an automated message from the Apache G

Re: [PR] RSS url needed the relative path removed to stage it correctly [datafusion-site]

2024-12-07 Thread via GitHub
timsaucer commented on PR #48: URL: https://github.com/apache/datafusion-site/pull/48#issuecomment-2525128812 There is still a problem with the new site serving up when I attempt to remove the old site content. I have opened a ticket with the infra team: https://issues.apache.org/jira/brows

Re: [PR] Implement GroupsAccumulator for corr(x,y) aggregate function [datafusion]

2024-12-07 Thread via GitHub
Dandandan commented on PR #13581: URL: https://github.com/apache/datafusion/pull/13581#issuecomment-2525138937 Hi, I think this one is pretty close, do you have time to look at the review comments @2010YOUY01 ? -- This is an automated message from the Apache Git Service. To respond to the

[I] `array_has` has inconsistent null handling for scalars and arrays [datafusion]

2024-12-07 Thread via GitHub
Kimahriman opened a new issue, #13682: URL: https://github.com/apache/datafusion/issues/13682 ### Describe the bug `array_has` handles nulls differently for scalars and arrays. Scalars return null if all the left hand side values are null, while arrays don't care what values in the l

Re: [I] The file with non-standard newline character can't be read when sqllogictests testing [datafusion]

2024-12-07 Thread via GitHub
korowa closed issue #12328: The file with non-standard newline character can't be read when sqllogictests testing URL: https://github.com/apache/datafusion/issues/12328 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

Re: [PR] fix: repartitioned reads of CSV with custom line terminator [datafusion]

2024-12-07 Thread via GitHub
korowa merged PR #13677: URL: https://github.com/apache/datafusion/pull/13677 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafus