Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-05-02 Thread via GitHub
alamb commented on PR #15168: URL: https://github.com/apache/datafusion/pull/15168#issuecomment-2846971785 > Fyi, the main CI has failed since the PR @blaginin has fixed it -- it appears to have been a logical conflict -- This is an automated message from the Apache Git Service. To

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-05-01 Thread via GitHub
xudong963 commented on PR #15168: URL: https://github.com/apache/datafusion/pull/15168#issuecomment-2846060139 Fyi, the main CI has failed since the PR -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to g

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-05-01 Thread via GitHub
alamb merged PR #15168: URL: https://github.com/apache/datafusion/pull/15168 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusi

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-05-01 Thread via GitHub
alamb commented on PR #15168: URL: https://github.com/apache/datafusion/pull/15168#issuecomment-2846025123 I have filed an epic to track filling out the datafusion-spark crate: - https://github.com/apache/datafusion/issues/15914 I will file some subtickets for follow on work as we

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-05-01 Thread via GitHub
alamb commented on PR #15168: URL: https://github.com/apache/datafusion/pull/15168#issuecomment-2846025618 Onward! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubs

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-04-29 Thread via GitHub
alamb commented on PR #15168: URL: https://github.com/apache/datafusion/pull/15168#issuecomment-2840589691 This looks great to me -- I plan to merge it tomorrow and start collecting next steps in a new `EPIC` ticket unless someone beats me to it -- This is an automated message from the Ap

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-04-28 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r2065542062 ## datafusion/spark/src/function/math/expm1.rs: ## @@ -0,0 +1,168 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor lic

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-04-28 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r2065539727 ## datafusion/spark/src/function/string/ascii.rs: ## @@ -0,0 +1,210 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor l

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-04-28 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r2065539353 ## datafusion/sqllogictest/src/engines/conversion.rs: ## @@ -77,7 +77,21 @@ pub(crate) fn f64_to_str(value: f64) -> String { } else if value == f64::NEG_I

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-04-28 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r2065538751 ## datafusion/spark/src/lib.rs: ## @@ -0,0 +1,154 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor license agreements.

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-04-28 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r2065538632 ## Cargo.lock: ## @@ -2558,6 +2558,27 @@ dependencies = [ "tokio", ] +[[package]] +name = "datafusion-spark" +version = "47.0.0" +dependencies = [ + "arro

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-04-28 Thread via GitHub
alamb commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r2065131934 ## datafusion/sqllogictest/test_files/spark/README.md: ## @@ -0,0 +1,57 @@ + + +# Spark Test Files + +This directory contains test files for the `spark` test suite.

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-04-26 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r2061232126 ## datafusion/sqllogictest/src/engines/datafusion_engine/normalize.rs: ## @@ -193,7 +192,7 @@ macro_rules! get_row_value { /// /// Floating numbers are round

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-04-18 Thread via GitHub
andygrove commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r2051099886 ## datafusion/spark/src/function/math/expm1.rs: ## @@ -0,0 +1,169 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor licen

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-04-10 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r2038609229 ## datafusion/spark/src/function/math/expm1.rs: ## @@ -0,0 +1,169 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor lic

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-22 Thread via GitHub
alamb commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r2008804385 ## datafusion/spark/src/function/math/expm1.rs: ## @@ -0,0 +1,169 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor license a

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-21 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r2008444729 ## datafusion/spark/src/function/math/expm1.rs: ## @@ -0,0 +1,169 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor lic

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-20 Thread via GitHub
alamb commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r2004518126 ## datafusion/spark/src/function/math/expm1.rs: ## @@ -0,0 +1,169 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor license a

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-18 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r2002067067 ## datafusion/spark/src/function/math/expm1.rs: ## @@ -0,0 +1,169 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor lic

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-16 Thread via GitHub
alamb commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1997557845 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contributor

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-16 Thread via GitHub
alamb commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1997557995 ## datafusion/sqllogictest/test_files/spark/string/ascii.slt: ## @@ -0,0 +1,43 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contributo

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-15 Thread via GitHub
andygrove commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1995461595 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contribu

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-15 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994892781 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contri

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-14 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994917169 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contri

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994946916 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contri

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994929734 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contri

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994884151 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contri

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994863716 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contri

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994823297 ## datafusion/core/src/execution/session_state_defaults.rs: ## @@ -104,28 +104,55 @@ impl SessionStateDefaults { /// returns the list of default [`Scala

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
andygrove commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994466480 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contribu

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994471938 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contri

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
andygrove commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994452292 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contribu

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
andygrove commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994474461 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contribu

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
andygrove commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994472992 ## datafusion/core/src/execution/session_state_defaults.rs: ## @@ -104,28 +104,55 @@ impl SessionStateDefaults { /// returns the list of default [`ScalarU

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994457478 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contri

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
andygrove commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994457586 ## datafusion/sqllogictest/test_files/spark/string/ascii.slt: ## @@ -0,0 +1,43 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contri

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
andygrove commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994469174 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contribu

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
andygrove commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994457586 ## datafusion/sqllogictest/test_files/spark/string/ascii.slt: ## @@ -0,0 +1,43 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contri

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994461973 ## datafusion/sqllogictest/test_files/spark/string/ascii.slt: ## @@ -0,0 +1,43 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more cont

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994461775 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contri

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
andygrove commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994461511 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contribu

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
andygrove commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994455591 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contribu

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
andygrove commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1994448064 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contribu

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-13 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1992573548 ## datafusion/spark/src/function/string/ascii.rs: ## @@ -0,0 +1,208 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor l

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-12 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1992589651 ## datafusion/spark/src/function/string/ascii.rs: ## @@ -0,0 +1,208 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor l

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-12 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1992589651 ## datafusion/spark/src/function/string/ascii.rs: ## @@ -0,0 +1,208 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor l

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-12 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1992592514 ## datafusion/spark/src/function/math/expm1.rs: ## @@ -0,0 +1,169 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor lic

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-12 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1992574619 ## datafusion/spark/src/function/string/ascii.rs: ## @@ -0,0 +1,208 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor l

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-12 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1992575249 ## datafusion/spark/src/function/math/expm1.rs: ## @@ -0,0 +1,169 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor lic

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-12 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1992574619 ## datafusion/spark/src/function/string/ascii.rs: ## @@ -0,0 +1,208 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor l

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-12 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1992573548 ## datafusion/spark/src/function/string/ascii.rs: ## @@ -0,0 +1,208 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor l

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-12 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1992569405 ## datafusion/sqllogictest/test_files/spark/README.md: ## @@ -0,0 +1,57 @@ + + Review Comment: Testing instructions here -- This is an automated messag

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-12 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1992569901 ## datafusion/sqllogictest/test_files/spark/math/expm1.slt: ## @@ -0,0 +1,32 @@ +# Licensed to the Apache Software Foundation (ASF) under one +# or more contri

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-12 Thread via GitHub
shehabgamin commented on code in PR #15168: URL: https://github.com/apache/datafusion/pull/15168#discussion_r1992569624 ## datafusion/spark/README.md: ## @@ -0,0 +1,38 @@ + + +# datafusion-spark: Spark-compatible Expressions + +This crate provides Apache Spark-compatible express

[PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-11 Thread via GitHub
shehabgamin opened a new pull request, #15168: URL: https://github.com/apache/datafusion/pull/15168 ## Which issue does this PR close? Closes https://github.com/apache/datafusion/issues/5600 ## Rationale for this change See discussion in https://github.com/apache/datafusi

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-10 Thread via GitHub
andygrove closed pull request #14392: feat: Add `datafusion-spark` crate URL: https://github.com/apache/datafusion/pull/14392 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-09 Thread via GitHub
shehabgamin commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2708820627 > > > Would you have bandwidth to help with adding some initial tests in the Comet repo? I think if we have some examples then it will be easier for others to contribute. > >

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-06 Thread via GitHub
alamb commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2703660394 > > Would you have bandwidth to help with adding some initial tests in the Comet repo? I think if we have some examples then it will be easier for others to contribute. > > Yeah

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-05 Thread via GitHub
shehabgamin commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2703064824 > Would you have bandwidth to help with adding some initial tests in the Comet repo? I think if we have some examples then it will be easier for others to contribute. Yeah

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-03-05 Thread via GitHub
andygrove commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2701788755 > @andygrove Just checking in. I've been adding tons of new functions to the Sail repo but will port them over after this PR is merged! Sounds good! The consensus was

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-26 Thread via GitHub
shehabgamin commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2686793279 @andygrove Just checking in. I've been adding tons of new functions to the Sail repo but will port them over after this PR is merged! -- This is an automated message from the A

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-05 Thread via GitHub
andygrove commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2637613868 I filed an issue in Comet to do the necessary work (testing and examples) to prepare to move the crate into DataFusion repo. https://github.com/apache/datafusion-comet/issues

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-05 Thread via GitHub
alamb commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2637338434 > All functions that Sail ports over can be tested without the JVM or Spark Client. Because there is no JVM involvement when running a Sail server, it would be a relatively straightfor

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-05 Thread via GitHub
alamb commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2637335904 > @andygrove @alamb maybe we can do a conditional compilation on function level instead of separate crate? Sort of introduce a feature spark and have different implementations >

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-03 Thread via GitHub
Blizzara commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2630634253 I'd be very happy to see this go in :) Having the functions here in main datafusion repo makes dealing with any breaks and downstream upgrades easier (we'll only need to track/match

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-02 Thread via GitHub
andygrove commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2629666721 I'm going to move this to draft for now. Let's keep the conversation going, though. I plan on working with the community to start adding tests to the `datafusion-comet-spark

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-02 Thread via GitHub
comphead commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2629508048 That is exactly what I was mentioning https://github.com/apache/datafusion/pull/14392#issuecomment-2628334890 not sure how to separate out functions with the same names. I think thi

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-02 Thread via GitHub
andygrove commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2629456979 > For Comet, we need the function names to match Spark. Actually, this isn't true. In Comet, we just need the Scala wrapper classes to have the same function names as Spark.

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-02 Thread via GitHub
andygrove commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2629435886 > @comphead @andygrove @alamb It may be a good idea to prefix all function names with `spark_` to avoid confusion, conflicts, or unknown behavior between functions that share the s

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-02 Thread via GitHub
shehabgamin commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2629317132 > If the behavior is the same we don't have any reason to copy one to spark crate, adding alias to the function is enough. @jayzhan211 I’ll need to verify this, but I reca

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-02 Thread via GitHub
jayzhan211 commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2629310335 > > In optimizer, we rely on the name to do such optimization so if we rename it to name like 'spark_count' we might need to add the spark name to those optimize rules as well, wh

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-02 Thread via GitHub
shehabgamin commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2629306233 > In optimizer, we rely on the name to do such optimization so if we rename it to name like 'spark_count' we might need to add the spark name to those optimize rules as well, whi

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-02 Thread via GitHub
jayzhan211 commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2629290634 > It may be a good idea to prefix all function names with spark_ to avoid confusion, conflicts, or unknown behavior between functions that share the same name. In optimizer

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-01 Thread via GitHub
shehabgamin commented on code in PR #14392: URL: https://github.com/apache/datafusion/pull/14392#discussion_r1938420167 ## datafusion/spark/src/agg_funcs/avg.rs: ## @@ -0,0 +1,344 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor license a

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-01 Thread via GitHub
shehabgamin commented on PR #14392: URL: https://github.com/apache/datafusion/pull/14392#issuecomment-2629161924 > cc @Blizzara and @shehabgamin as you have mentioned interest in helping here > Will catch up on this thread tonight. So much happening so fast, exciting! -- Th

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-01 Thread via GitHub
andygrove commented on code in PR #14392: URL: https://github.com/apache/datafusion/pull/14392#discussion_r1938313126 ## datafusion/spark/src/agg_funcs/avg.rs: ## @@ -0,0 +1,344 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor license agr

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-01 Thread via GitHub
andygrove commented on code in PR #14392: URL: https://github.com/apache/datafusion/pull/14392#discussion_r1938313058 ## datafusion/spark/src/agg_funcs/avg.rs: ## @@ -0,0 +1,344 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor license agr

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-01 Thread via GitHub
alamb commented on code in PR #14392: URL: https://github.com/apache/datafusion/pull/14392#discussion_r1938312603 ## datafusion/spark/src/comet_scalar_funcs.rs: ## @@ -0,0 +1,192 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor license ag

Re: [PR] feat: Add `datafusion-spark` crate [datafusion]

2025-02-01 Thread via GitHub
alamb commented on code in PR #14392: URL: https://github.com/apache/datafusion/pull/14392#discussion_r1938312400 ## datafusion/spark/src/agg_funcs/avg.rs: ## @@ -0,0 +1,344 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor license agreeme