Re: [PR] [SPARK-48353][SQL] Introduction of Error Handling mechanism in SQL Scripting [spark]

2025-01-22 Thread via GitHub
davidm-db commented on code in PR #49427: URL: https://github.com/apache/spark/pull/49427#discussion_r1925131311 ## sql/core/src/main/scala/org/apache/spark/sql/scripting/SqlScriptingExecutionContext.scala: ## @@ -50,12 +51,24 @@ class SqlScriptingExecutionContext { throw

Re: [PR] [SPARK-48353][SQL] Introduction of Error Handling mechanism in SQL Scripting [spark]

2025-01-22 Thread via GitHub
davidm-db commented on code in PR #49427: URL: https://github.com/apache/spark/pull/49427#discussion_r1925132255 ## sql/core/src/main/scala/org/apache/spark/sql/scripting/SqlScriptingExecutionContext.scala: ## @@ -111,15 +124,32 @@ class SqlScriptingExecutionFrame( } }

Re: [PR] [SPARK-48353][SQL] Introduction of Error Handling mechanism in SQL Scripting [spark]

2025-01-22 Thread via GitHub
davidm-db commented on code in PR #49427: URL: https://github.com/apache/spark/pull/49427#discussion_r1925131311 ## sql/core/src/main/scala/org/apache/spark/sql/scripting/SqlScriptingExecutionContext.scala: ## @@ -50,12 +51,24 @@ class SqlScriptingExecutionContext { throw

Re: [PR] [SPARK-50947][PYTHON][SQL][CONNECT] Assign appropriate error class and SparkException for duplicated artifacts [spark]

2025-01-22 Thread via GitHub
MaxGekk commented on code in PR #49598: URL: https://github.com/apache/spark/pull/49598#discussion_r1925003554 ## common/utils/src/main/resources/error/error-conditions.json: ## @@ -1229,6 +1229,12 @@ ], "sqlState" : "42710" }, + "DUPLICATED_ARTIFACT" : { Review C

Re: [PR] [SPARK-48353][SQL] Introduction of Error Handling mechanism in SQL Scripting [spark]

2025-01-22 Thread via GitHub
davidm-db commented on code in PR #49427: URL: https://github.com/apache/spark/pull/49427#discussion_r1925099480 ## sql/core/src/main/scala/org/apache/spark/sql/scripting/SqlScriptingExecutionContext.scala: ## @@ -27,7 +27,8 @@ import org.apache.spark.sql.scripting.SqlScripting

Re: [PR] [MINOR][SQL] Avoid importing java.util at DataFrameWriter [spark]

2025-01-22 Thread via GitHub
yaooqinn commented on code in PR #49602: URL: https://github.com/apache/spark/pull/49602#discussion_r1925101343 ## sql/api/src/main/scala/org/apache/spark/sql/DataFrameWriter.scala: ## @@ -16,7 +16,7 @@ */ package org.apache.spark.sql -import java.util +import java.util.{Lo

Re: [PR] [MINOR][SQL] Avoid importing java.util at DataFrameWriter [spark]

2025-01-22 Thread via GitHub
HyukjinKwon commented on PR #49602: URL: https://github.com/apache/spark/pull/49602#issuecomment-2606966479 I used a little bit diff approach :-). Thanks for comments. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use th

Re: [PR] [MINOR][SQL] Avoid importing java.util at DataFrameWriter [spark]

2025-01-22 Thread via GitHub
HyukjinKwon commented on code in PR #49602: URL: https://github.com/apache/spark/pull/49602#discussion_r1925146776 ## sql/api/src/main/scala/org/apache/spark/sql/DataFrameWriter.scala: ## @@ -139,7 +139,7 @@ abstract class DataFrameWriter[T] { * * @since 1.4.0 */ -

Re: [PR] [SPARK-50946][CORE][TESTS] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
LuciferYang commented on PR #49599: URL: https://github.com/apache/spark/pull/49599#issuecomment-2607633085 Thanks @MaxGekk -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

Re: [PR] [SPARK-50909][PYTHON] Setup faulthandler in PythonPlannerRunners [spark]

2025-01-22 Thread via GitHub
LuciferYang commented on PR #49592: URL: https://github.com/apache/spark/pull/49592#issuecomment-2607641305 > ![image](https://private-user-images.githubusercontent.com/1475305/405489523-21d33f5c-d29a-4def-ba84-41475b231a63.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29

Re: [PR] [SPARK-50906][SS] Add nullability check for if inputs of to_avro align with schema [spark]

2025-01-22 Thread via GitHub
MaxGekk commented on code in PR #49590: URL: https://github.com/apache/spark/pull/49590#discussion_r1925572716 ## connector/avro/src/test/scala/org/apache/spark/sql/avro/AvroCatalystDataConversionSuite.scala: ## @@ -241,6 +241,25 @@ class AvroCatalystDataConversionSuite extends

Re: [PR] [SPARK-50918][ML][PYTHON][CONNECT] Support Pipeline on connect [spark]

2025-01-22 Thread via GitHub
wbo4958 commented on code in PR #49596: URL: https://github.com/apache/spark/pull/49596#discussion_r1924953969 ## python/pyspark/ml/connect/readwrite.py: ## @@ -37,52 +38,99 @@ def sc(self) -> "SparkContext": raise RuntimeError("Accessing SparkContext is not supported o

Re: [PR] [WIP][SPARK-50838][SQL]Performs additional checks inside recursive CTEs to throw an error if forbidden case is encountered [spark]

2025-01-22 Thread via GitHub
cloud-fan commented on code in PR #49518: URL: https://github.com/apache/spark/pull/49518#discussion_r1925348974 ## sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CTESubstitution.scala: ## @@ -151,7 +157,7 @@ object CTESubstitution extends Rule[LogicalPlan] {

Re: [PR] [WIP][SPARK-50838][SQL]Performs additional checks inside recursive CTEs to throw an error if forbidden case is encountered [spark]

2025-01-22 Thread via GitHub
milanisvet commented on code in PR #49518: URL: https://github.com/apache/spark/pull/49518#discussion_r1925339756 ## sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveWithCTE.scala: ## @@ -183,4 +184,52 @@ object ResolveWithCTE extends Rule[LogicalPlan] {

Re: [PR] [WIP][SPARK-50838][SQL]Performs additional checks inside recursive CTEs to throw an error if forbidden case is encountered [spark]

2025-01-22 Thread via GitHub
cloud-fan commented on code in PR #49518: URL: https://github.com/apache/spark/pull/49518#discussion_r1925351425 ## sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveWithCTE.scala: ## @@ -183,4 +184,52 @@ object ResolveWithCTE extends Rule[LogicalPlan] {

Re: [PR] [WIP][SPARK-50838][SQL]Performs additional checks inside recursive CTEs to throw an error if forbidden case is encountered [spark]

2025-01-22 Thread via GitHub
cloud-fan commented on code in PR #49518: URL: https://github.com/apache/spark/pull/49518#discussion_r1925352139 ## sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveWithCTE.scala: ## @@ -183,4 +184,52 @@ object ResolveWithCTE extends Rule[LogicalPlan] {

Re: [PR] [WIP][SPARK-50838][SQL]Performs additional checks inside recursive CTEs to throw an error if forbidden case is encountered [spark]

2025-01-22 Thread via GitHub
milanisvet commented on code in PR #49518: URL: https://github.com/apache/spark/pull/49518#discussion_r1925363913 ## sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CTESubstitution.scala: ## @@ -151,7 +157,7 @@ object CTESubstitution extends Rule[LogicalPlan]

Re: [PR] [WIP][SPARK-50838][SQL]Performs additional checks inside recursive CTEs to throw an error if forbidden case is encountered [spark]

2025-01-22 Thread via GitHub
milanisvet commented on code in PR #49518: URL: https://github.com/apache/spark/pull/49518#discussion_r1925606130 ## sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveWithCTE.scala: ## @@ -183,4 +184,52 @@ object ResolveWithCTE extends Rule[LogicalPlan] {

Re: [PR] [WIP][SPARK-50838][SQL]Performs additional checks inside recursive CTEs to throw an error if forbidden case is encountered [spark]

2025-01-22 Thread via GitHub
milanisvet commented on code in PR #49518: URL: https://github.com/apache/spark/pull/49518#discussion_r1925610123 ## sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveWithCTE.scala: ## @@ -183,4 +184,52 @@ object ResolveWithCTE extends Rule[LogicalPlan] {

Re: [PR] [WIP][SPARK-50838][SQL]Performs additional checks inside recursive CTEs to throw an error if forbidden case is encountered [spark]

2025-01-22 Thread via GitHub
milanisvet commented on code in PR #49518: URL: https://github.com/apache/spark/pull/49518#discussion_r1925616520 ## sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CheckAnalysis.scala: ## @@ -1042,6 +1043,75 @@ trait CheckAnalysis extends PredicateHelper with

Re: [PR] [SPARK-50946][CORE][TESTS] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun commented on PR #49599: URL: https://github.com/apache/spark/pull/49599#issuecomment-2607713012 Thank you, @LuciferYang and @MaxGekk . -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go t

Re: [PR] [SPARK-50946][CORE][TESTS] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
LuciferYang commented on code in PR #49599: URL: https://github.com/apache/spark/pull/49599#discussion_r1925139317 ## core/src/test/scala/org/apache/spark/util/UtilsSuite.scala: ## @@ -526,10 +526,18 @@ class UtilsSuite extends SparkFunSuite with ResetSystemProperties {

Re: [PR] [WIP][SPARK-50838][SQL]Performs additional checks inside recursive CTEs to throw an error if forbidden case is encountered [spark]

2025-01-22 Thread via GitHub
milanisvet commented on code in PR #49518: URL: https://github.com/apache/spark/pull/49518#discussion_r1925274283 ## sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveWithCTE.scala: ## @@ -183,4 +184,52 @@ object ResolveWithCTE extends Rule[LogicalPlan] {

Re: [PR] [WIP][SPARK-50838][SQL]Performs additional checks inside recursive CTEs to throw an error if forbidden case is encountered [spark]

2025-01-22 Thread via GitHub
milanisvet commented on code in PR #49518: URL: https://github.com/apache/spark/pull/49518#discussion_r1925274874 ## sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CTESubstitution.scala: ## @@ -423,4 +429,20 @@ object CTESubstitution extends Rule[LogicalPlan]

Re: [PR] [WIP][SPARK-50838][SQL]Performs additional checks inside recursive CTEs to throw an error if forbidden case is encountered [spark]

2025-01-22 Thread via GitHub
milanisvet commented on code in PR #49518: URL: https://github.com/apache/spark/pull/49518#discussion_r1925279434 ## sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveWithCTE.scala: ## @@ -183,4 +184,52 @@ object ResolveWithCTE extends Rule[LogicalPlan] {

Re: [PR] [WIP][SPARK-50838][SQL]Performs additional checks inside recursive CTEs to throw an error if forbidden case is encountered [spark]

2025-01-22 Thread via GitHub
milanisvet commented on code in PR #49518: URL: https://github.com/apache/spark/pull/49518#discussion_r1925280827 ## sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveWithCTE.scala: ## @@ -183,4 +184,52 @@ object ResolveWithCTE extends Rule[LogicalPlan] {

Re: [PR] [WIP][SPARK-50838][SQL]Performs additional checks inside recursive CTEs to throw an error if forbidden case is encountered [spark]

2025-01-22 Thread via GitHub
milanisvet commented on code in PR #49518: URL: https://github.com/apache/spark/pull/49518#discussion_r1925291147 ## sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/cteOperators.scala: ## @@ -100,12 +100,14 @@ case class CTERelationDef( override def

Re: [PR] [WIP][SPARK-50838][SQL]Performs additional checks inside recursive CTEs to throw an error if forbidden case is encountered [spark]

2025-01-22 Thread via GitHub
milanisvet commented on code in PR #49518: URL: https://github.com/apache/spark/pull/49518#discussion_r1925294587 ## sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CTESubstitution.scala: ## @@ -151,7 +157,7 @@ object CTESubstitution extends Rule[LogicalPlan]

Re: [PR] [MINOR][SQL] Avoid importing java.util at DataFrameWriter [spark]

2025-01-22 Thread via GitHub
LuciferYang commented on code in PR #49602: URL: https://github.com/apache/spark/pull/49602#discussion_r1925193660 ## sql/api/src/main/scala/org/apache/spark/sql/DataFrameWriter.scala: ## @@ -139,7 +139,7 @@ abstract class DataFrameWriter[T] { * * @since 1.4.0 */ -

Re: [PR] [SPARK-50948][ML][PYTHON][CONNECT] Add support for StringIndexer/PCA on Connect [spark]

2025-01-22 Thread via GitHub
zhengruifeng commented on PR #49600: URL: https://github.com/apache/spark/pull/49600#issuecomment-2607079366 merged to master/4.0 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comm

Re: [PR] [SPARK-50948][ML][PYTHON][CONNECT] Add support for StringIndexer/PCA on Connect [spark]

2025-01-22 Thread via GitHub
zhengruifeng closed pull request #49600: [SPARK-50948][ML][PYTHON][CONNECT] Add support for StringIndexer/PCA on Connect URL: https://github.com/apache/spark/pull/49600 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

Re: [PR] [SPARK-50946][CORE][TESTS] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
MaxGekk commented on PR #49599: URL: https://github.com/apache/spark/pull/49599#issuecomment-2607548264 +1, LGTM. Merging to master/4.0. Thank you, @LuciferYang. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the U

Re: [PR] [SPARK-50946][CORE][TESTS] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
MaxGekk closed pull request #49599: [SPARK-50946][CORE][TESTS] Add version check for Java 17.0.14 to make `UtilsSuite` test pass URL: https://github.com/apache/spark/pull/49599 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

Re: [PR] [SPARK-48530][SQL] Support for local variables in SQL Scripting [spark]

2025-01-22 Thread via GitHub
dusantism-db commented on code in PR #49445: URL: https://github.com/apache/spark/pull/49445#discussion_r1925532381 ## sql/catalyst/src/main/scala/org/apache/spark/sql/connector/catalog/CatalogManager.scala: ## @@ -49,6 +49,18 @@ class CatalogManager( // TODO: create a real S

Re: [PR] [SPARK-50826][ML][CONNECT] Refactor the reflection checker [spark]

2025-01-22 Thread via GitHub
LuciferYang commented on code in PR #49601: URL: https://github.com/apache/spark/pull/49601#discussion_r1925237252 ## sql/connect/server/src/main/scala/org/apache/spark/sql/connect/ml/MLUtils.scala: ## @@ -448,98 +454,149 @@ private[ml] object MLUtils { // Since we're using r

Re: [PR] [SPARK-50826][ML][CONNECT] Refactor the reflection checker [spark]

2025-01-22 Thread via GitHub
zhengruifeng commented on code in PR #49601: URL: https://github.com/apache/spark/pull/49601#discussion_r1925254904 ## sql/connect/server/src/main/scala/org/apache/spark/sql/connect/ml/MLUtils.scala: ## @@ -448,98 +454,149 @@ private[ml] object MLUtils { // Since we're using

Re: [PR] [SPARK-50946][CORE][TESTS] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
MaxGekk commented on code in PR #49599: URL: https://github.com/apache/spark/pull/49599#discussion_r1925067453 ## core/src/test/scala/org/apache/spark/util/UtilsSuite.scala: ## @@ -526,10 +526,18 @@ class UtilsSuite extends SparkFunSuite with ResetSystemProperties { // T

Re: [PR] [SPARK-49700][CONNECT][SQL] Unified Scala Interface for Connect and Classic [spark]

2025-01-22 Thread via GitHub
cloud-fan commented on code in PR #48818: URL: https://github.com/apache/spark/pull/48818#discussion_r1925413987 ## sql/api/src/main/scala/org/apache/spark/sql/SparkSession.scala: ## @@ -776,40 +779,161 @@ abstract class SparkSession extends Serializable with Closeable { *

Re: [PR] [SPARK-49700][CONNECT][SQL] Unified Scala Interface for Connect and Classic [spark]

2025-01-22 Thread via GitHub
cloud-fan commented on code in PR #48818: URL: https://github.com/apache/spark/pull/48818#discussion_r1925409046 ## project/MimaExcludes.scala: ## @@ -205,10 +205,30 @@ object MimaExcludes { // SPARK-50112: Moving avro files from connector to sql/core ProblemFilters.ex

Re: [PR] [SPARK-49646][SQL] add spark config for fixing subquery decorrelation for union/set operations when parentOuterReferences has references not covered in collectedChildOuterReferences [spark]

2025-01-22 Thread via GitHub
cloud-fan commented on PR #49536: URL: https://github.com/apache/spark/pull/49536#issuecomment-2607406371 thanks, merging to master! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific c

Re: [PR] [SPARK-49646][SQL] add spark config for fixing subquery decorrelation for union/set operations when parentOuterReferences has references not covered in collectedChildOuterReferences [spark]

2025-01-22 Thread via GitHub
cloud-fan closed pull request #49536: [SPARK-49646][SQL] add spark config for fixing subquery decorrelation for union/set operations when parentOuterReferences has references not covered in collectedChildOuterReferences URL: https://github.com/apache/spark/pull/49536 -- This is an automated

Re: [PR] [SPARK-50082][CORE] Remove some unnecessary Jersey-related warning logs [spark]

2025-01-22 Thread via GitHub
LuciferYang commented on PR #48611: URL: https://github.com/apache/spark/pull/48611#issuecomment-2606699186 > javax If time permits, I suggest we try to migrate them to jakarta before Spark 4.0 release -- This is an automated message from the Apache Git Service. To respond

[PR] SPARK-50895][SQL] Create common interface for expressions which produce default string type [spark]

2025-01-22 Thread via GitHub
stefankandic opened a new pull request, #49603: URL: https://github.com/apache/spark/pull/49603 Creating a new PR for #49576 into a newly cut 4.0 branch. ### What changes were proposed in this pull request? Introducing a new interface `DefaultStringProducingExpression` which should

[PR] [MINOR][SQL] Avoid importing java.util at DataFrameWriter [spark]

2025-01-22 Thread via GitHub
HyukjinKwon opened a new pull request, #49602: URL: https://github.com/apache/spark/pull/49602 ### What changes were proposed in this pull request? This PR avoids importing java.util at DataFrameWriter. ### Why are the changes needed? Using `util` in the codebase is confu

Re: [PR] [SPARK-50826][ML][CONNECT] Refactor the reflection checker [spark]

2025-01-22 Thread via GitHub
zhengruifeng commented on code in PR #49601: URL: https://github.com/apache/spark/pull/49601#discussion_r1925372177 ## sql/connect/server/src/test/scala/org/apache/spark/sql/connect/ml/MLHelper.scala: ## @@ -298,7 +299,7 @@ class MyLogisticRegressionModel( override val uid:

Re: [PR] [SPARK-50826][ML][CONNECT] Refactor the reflection checker [spark]

2025-01-22 Thread via GitHub
zhengruifeng commented on code in PR #49601: URL: https://github.com/apache/spark/pull/49601#discussion_r1925370678 ## sql/connect/server/src/main/scala/org/apache/spark/sql/connect/ml/MLUtils.scala: ## @@ -448,98 +454,149 @@ private[ml] object MLUtils { // Since we're using

[PR] [SPARK-50849][Connect] Add example project to demonstrate Spark Connect Server Libraries [spark]

2025-01-22 Thread via GitHub
vicennial opened a new pull request, #49604: URL: https://github.com/apache/spark/pull/49604 ### What changes were proposed in this pull request? This PR adds a sample project, `server-library-example` (under a new directory `connect-examples`) to demonstrate the workings of u

Re: [PR] [SPARK-48530][SQL] Support for local variables in SQL Scripting [spark]

2025-01-22 Thread via GitHub
cloud-fan commented on code in PR #49445: URL: https://github.com/apache/spark/pull/49445#discussion_r1925386521 ## sql/catalyst/src/main/scala/org/apache/spark/sql/connector/catalog/CatalogManager.scala: ## @@ -49,6 +49,18 @@ class CatalogManager( // TODO: create a real SYST

Re: [PR] [SPARK-48530][SQL] Support for local variables in SQL Scripting [spark]

2025-01-22 Thread via GitHub
vladimirg-db commented on code in PR #49445: URL: https://github.com/apache/spark/pull/49445#discussion_r1925392103 ## sql/catalyst/src/main/scala/org/apache/spark/sql/connector/catalog/CatalogManager.scala: ## @@ -49,6 +49,18 @@ class CatalogManager( // TODO: create a real S

Re: [PR] [SPARK-50946][CORE][TESTS] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun commented on PR #49599: URL: https://github.com/apache/spark/pull/49599#issuecomment-2607766964 I backported this to `branch-3.5` too. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go t

Re: [PR] [WIP][SPARK-50892][SQL]Add UnionLoopExec, physical operator for recursion, to perform execution of recursive queries [spark]

2025-01-22 Thread via GitHub
milanisvet commented on code in PR #49571: URL: https://github.com/apache/spark/pull/49571#discussion_r1925763851 ## sql/core/src/main/scala/org/apache/spark/sql/execution/basicPhysicalOperators.scala: ## @@ -714,6 +718,147 @@ case class UnionExec(children: Seq[SparkPlan]) exten

Re: [PR] [SPARK-50946][CORE][TESTS][3.5] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun commented on code in PR #49606: URL: https://github.com/apache/spark/pull/49606#discussion_r1925762757 ## core/src/test/scala/org/apache/spark/util/UtilsSuite.scala: ## @@ -525,10 +525,14 @@ class UtilsSuite extends SparkFunSuite with ResetSystemProperties {

Re: [PR] [SPARK-50946][CORE][TESTS][3.5] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
LuciferYang commented on code in PR #49606: URL: https://github.com/apache/spark/pull/49606#discussion_r1925760510 ## core/src/test/scala/org/apache/spark/util/UtilsSuite.scala: ## @@ -525,10 +525,14 @@ class UtilsSuite extends SparkFunSuite with ResetSystemProperties {

Re: [PR] [SPARK-50578][PYTHON][SS] Add support for new version of state metadata for TransformWithStateInPandas [spark]

2025-01-22 Thread via GitHub
jingz-db commented on PR #49156: URL: https://github.com/apache/spark/pull/49156#issuecomment-2607941901 Sorry @HyukjinKwon, what is old dependencies you are referring to here? I think this suite is flaky because of timing issue. I'll take the ticket and fix the flakiness soon. -- This i

Re: [PR] [SPARK-50815][FOLLOW-UP] Handle Variant-related edge-case for createDataFrame [spark]

2025-01-22 Thread via GitHub
harshmotw-db commented on code in PR #49591: URL: https://github.com/apache/spark/pull/49591#discussion_r1925867531 ## python/pyspark/sql/types.py: ## @@ -1478,6 +1478,9 @@ def toInternal(self, obj: Tuple) -> Tuple: if obj is None: return +if isin

Re: [PR] [SPARK-48530][SQL] Support for local variables in SQL Scripting [spark]

2025-01-22 Thread via GitHub
davidm-db commented on code in PR #49445: URL: https://github.com/apache/spark/pull/49445#discussion_r1925883386 ## sql/core/src/main/scala/org/apache/spark/sql/scripting/SqlScriptingExecution.scala: ## @@ -55,7 +55,15 @@ class SqlScriptingExecution( } private val variab

Re: [PR] [SPARK-48530][SQL] Support for local variables in SQL Scripting [spark]

2025-01-22 Thread via GitHub
davidm-db commented on code in PR #49445: URL: https://github.com/apache/spark/pull/49445#discussion_r1925889523 ## sql/core/src/main/scala/org/apache/spark/sql/scripting/SqlScriptingExecution.scala: ## @@ -55,7 +55,15 @@ class SqlScriptingExecution( } private val variab

Re: [PR] [SPARK-50392][PYTHON][FOLLOWUP] Move `import`s into methods to fix `connect-only` builds [spark]

2025-01-22 Thread via GitHub
xinrong-meng commented on PR #49472: URL: https://github.com/apache/spark/pull/49472#issuecomment-2608154024 Thank you @ueshin for fixing that! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

Re: [PR] [SPARK-50855][SS][CONNECT] Spark Connect Support for TransformWithState [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun commented on PR #49488: URL: https://github.com/apache/spark/pull/49488#issuecomment-2608157455 > @jingz-db - lets update the PR description to mention that this only covers support in Scala. Thx Just a side question. Do you have some Spark committers in your mind to get

Re: [PR] [SPARK-48530][SQL] Support for local variables in SQL Scripting [spark]

2025-01-22 Thread via GitHub
davidm-db commented on code in PR #49445: URL: https://github.com/apache/spark/pull/49445#discussion_r1925894712 ## sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/SqlScriptingVariableManager.scala: ## @@ -0,0 +1,25 @@ +/* + * Licensed to the Apache Software Foundation

Re: [PR] [SPARK-50855][SS][CONNECT] Spark Connect Support for TransformWithState [spark]

2025-01-22 Thread via GitHub
anishshri-db commented on PR #49488: URL: https://github.com/apache/spark/pull/49488#issuecomment-2608160079 > Do you have some Spark committers in your mind to get reviews for this PR? Yes - cc - @HeartSaVioR - PTAL also, thx ! -- This is an automated message from the Apache Git Se

Re: [PR] [SPARK-50946][CORE][TESTS][3.5] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun commented on PR #49606: URL: https://github.com/apache/spark/pull/49606#issuecomment-2608165800 Merged to branch-3.5. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific co

Re: [PR] [SPARK-50946][CORE][TESTS][3.5] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun commented on PR #49606: URL: https://github.com/apache/spark/pull/49606#issuecomment-2607967237 Thank you, @LuciferYang ! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specifi

Re: [PR] [SPARK-50946][CORE][TESTS][3.5] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun closed pull request #49606: [SPARK-50946][CORE][TESTS][3.5] Add version check for Java 17.0.14 to make `UtilsSuite` test pass URL: https://github.com/apache/spark/pull/49606 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

[PR] [SPARK-50952][BUILD] Include `jjwt`-related libraries and provide `jjwt-provided` profile [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun opened a new pull request, #49608: URL: https://github.com/apache/spark/pull/49608 … ### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change?

Re: [PR] [SPARK-50946][CORE][TESTS][3.5] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
LuciferYang commented on code in PR #49606: URL: https://github.com/apache/spark/pull/49606#discussion_r1925760510 ## core/src/test/scala/org/apache/spark/util/UtilsSuite.scala: ## @@ -525,10 +525,14 @@ class UtilsSuite extends SparkFunSuite with ResetSystemProperties {

Re: [PR] [SPARK-48516][PYTHON][CONNECT] Turn on Arrow optimization for Python UDFs by default [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun commented on PR #49482: URL: https://github.com/apache/spark/pull/49482#issuecomment-2608097024 Gentle ping, @xinrong-meng . If this is targeting Apache Spark 4.0, we had better have this before February 1st. - https://spark.apache.org/versioning-policy.html -- Thi

Re: [PR] [SPARK-50946][CORE][TESTS][3.5] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun commented on PR #49606: URL: https://github.com/apache/spark/pull/49606#issuecomment-2608087924 Thank you, @MaxGekk -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific com

Re: [PR] [SPARK-50855][SS][CONNECT] Spark Connect Support for TransformWithState [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun commented on PR #49488: URL: https://github.com/apache/spark/pull/49488#issuecomment-2608106885 cc @hvanhovell -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

Re: [PR] [SPARK-50855][SS][CONNECT] Spark Connect Support for TransformWithState [spark]

2025-01-22 Thread via GitHub
anishshri-db commented on PR #49488: URL: https://github.com/apache/spark/pull/49488#issuecomment-2608109177 @jingz-db - lets update the PR description to mention that this only covers support in Scala. Thx -- This is an automated message from the Apache Git Service. To respond to the mes

Re: [PR] [SPARK-50951][BUILD][TESTS] Update Oracle free version from 23.5 to 23.6 [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun closed pull request #49605: [SPARK-50951][BUILD][TESTS] Update Oracle free version from 23.5 to 23.6 URL: https://github.com/apache/spark/pull/49605 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL ab

Re: [PR] [WIP][SPARK-50892][SQL]Add UnionLoopExec, physical operator for recursion, to perform execution of recursive queries [spark]

2025-01-22 Thread via GitHub
milanisvet commented on code in PR #49571: URL: https://github.com/apache/spark/pull/49571#discussion_r1925831054 ## sql/core/src/main/scala/org/apache/spark/sql/execution/basicPhysicalOperators.scala: ## @@ -714,6 +718,147 @@ case class UnionExec(children: Seq[SparkPlan]) exten

Re: [PR] [SPARK-50952][BUILD] Include `jjwt`-related libraries and provide `jjwt-provided` profile [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun closed pull request #49608: [SPARK-50952][BUILD] Include `jjwt`-related libraries and provide `jjwt-provided` profile URL: https://github.com/apache/spark/pull/49608 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub a

Re: [PR] [SPARK-50952][BUILD] Include `jjwt`-related libraries and provide `jjwt-provided` profile [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun commented on PR #49608: URL: https://github.com/apache/spark/pull/49608#issuecomment-2608358968 Merged to master/4.0. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific co

Re: [PR] [SPARK-48353][SQL] Introduction of Error Handling mechanism in SQL Scripting [spark]

2025-01-22 Thread via GitHub
miland-db commented on code in PR #49427: URL: https://github.com/apache/spark/pull/49427#discussion_r1926044733 ## sql/core/src/main/scala/org/apache/spark/sql/scripting/SqlScriptingInterpreter.scala: ## @@ -63,6 +67,79 @@ case class SqlScriptingInterpreter(session: SparkSessio

Re: [PR] [SPARK-48353][SQL] Introduction of Error Handling mechanism in SQL Scripting [spark]

2025-01-22 Thread via GitHub
miland-db commented on code in PR #49427: URL: https://github.com/apache/spark/pull/49427#discussion_r1926048316 ## sql/api/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBaseParser.g4: ## @@ -79,6 +81,29 @@ setStatementWithOptionalVarKeyword LEFT_PAREN query R

[PR] [SPARK-50946][CORE][TESTS][3.5] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun opened a new pull request, #49606: URL: https://github.com/apache/spark/pull/49606 ### What changes were proposed in this pull request? This pr adds Java version checks and logic adaptations to `scenario6` in the test case named "SPARK-35907: createDirectory" within the `Uti

Re: [PR] [SPARK-50946][CORE][TESTS] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun commented on PR #49599: URL: https://github.com/apache/spark/pull/49599#issuecomment-2607889075 I made a backporting PR to branch-3.5 by replacing `Runtime.Version` (Java 9+) - https://github.com/apache/spark/pull/49606 -- This is an automated message from the Apache Git

Re: [PR] [WIP][SPARK-50892][SQL]Add UnionLoopExec, physical operator for recursion, to perform execution of recursive queries [spark]

2025-01-22 Thread via GitHub
milanisvet commented on code in PR #49571: URL: https://github.com/apache/spark/pull/49571#discussion_r1925698021 ## sql/core/src/main/scala/org/apache/spark/sql/execution/basicPhysicalOperators.scala: ## @@ -714,6 +718,147 @@ case class UnionExec(children: Seq[SparkPlan]) exten

Re: [PR] [WIP][SPARK-50892][SQL]Add UnionLoopExec, physical operator for recursion, to perform execution of recursive queries [spark]

2025-01-22 Thread via GitHub
milanisvet commented on code in PR #49571: URL: https://github.com/apache/spark/pull/49571#discussion_r1925709323 ## sql/core/src/main/scala/org/apache/spark/sql/execution/basicPhysicalOperators.scala: ## @@ -714,6 +718,147 @@ case class UnionExec(children: Seq[SparkPlan]) exten

Re: [PR] [WIP][SPARK-50892][SQL]Add UnionLoopExec, physical operator for recursion, to perform execution of recursive queries [spark]

2025-01-22 Thread via GitHub
milanisvet commented on code in PR #49571: URL: https://github.com/apache/spark/pull/49571#discussion_r1925710464 ## sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala: ## @@ -4520,6 +4520,31 @@ object SQLConf { .checkValues(LegacyBehaviorPolicy.values.

[PR] [DRAFT] Resolve default string producing expressions in analyzer [spark]

2025-01-22 Thread via GitHub
stefankandic opened a new pull request, #49607: URL: https://github.com/apache/spark/pull/49607 ### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### Ho

Re: [PR] [SPARK-50946][CORE][TESTS][3.5] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun commented on PR #49606: URL: https://github.com/apache/spark/pull/49606#issuecomment-2607887517 cc @LuciferYang and @MaxGekk -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the spe

Re: [PR] [SPARK-50858][PYTHON] Add configuration to hide Python UDF stack trace [spark]

2025-01-22 Thread via GitHub
wengh commented on code in PR #49535: URL: https://github.com/apache/spark/pull/49535#discussion_r1925772564 ## sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala: ## @@ -3459,6 +3459,15 @@ object SQLConf { .checkValues(Set("legacy", "row", "dict"))

Re: [PR] [SPARK-50946][CORE][TESTS][3.5] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun commented on PR #49606: URL: https://github.com/apache/spark/pull/49606#issuecomment-2607958939 Here is the manual test. ``` $ java -version openjdk version "1.8.0_312" OpenJDK Runtime Environment AppleJDK-8.0.312.7.1 (build 1.8.0_312-b07) OpenJDK 64-Bit Server VM

Re: [PR] [SPARK-50951][BUILD][TESTS] Update Oracle free version from 23.5 to 23.6 [spark]

2025-01-22 Thread via GitHub
LucaCanali commented on PR #49605: URL: https://github.com/apache/spark/pull/49605#issuecomment-2608217614 Thank you @dongjoon-hyun -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific c

Re: [PR] [SPARK-50952][BUILD] Include `jjwt`-related libraries and provide `jjwt-provided` profile [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun commented on PR #49608: URL: https://github.com/apache/spark/pull/49608#issuecomment-2608241619 Could you review this when you have some time, @viirya ? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

Re: [PR] [SPARK-50118][CONNET] Reset isolated state cache when tasks are running [spark]

2025-01-22 Thread via GitHub
HyukjinKwon commented on PR #48665: URL: https://github.com/apache/spark/pull/48665#issuecomment-2608525172 I am debugging the flakiness, and seems like this causes the tests flaky for some reasons. I will revert this first. -- This is an automated message from the Apache Git Service. To

Re: [PR] [SPARK-50855][SS][CONNECT] Spark Connect Support for TransformWithState In Scala [spark]

2025-01-22 Thread via GitHub
anishshri-db commented on PR #49488: URL: https://github.com/apache/spark/pull/49488#issuecomment-2608542409 LGTM pending green CI -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific com

Re: [PR] [SPARK-50826][ML][CONNECT] Refactor the reflection checker [spark]

2025-01-22 Thread via GitHub
wbo4958 commented on code in PR #49601: URL: https://github.com/apache/spark/pull/49601#discussion_r1926192056 ## sql/connect/server/src/main/scala/org/apache/spark/sql/connect/ml/MLUtils.scala: ## @@ -448,98 +454,149 @@ private[ml] object MLUtils { // Since we're using refle

Re: [PR] [SPARK-50858][PYTHON] Add configuration to hide Python UDF stack trace [spark]

2025-01-22 Thread via GitHub
wengh commented on code in PR #49535: URL: https://github.com/apache/spark/pull/49535#discussion_r1925772564 ## sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala: ## @@ -3459,6 +3459,15 @@ object SQLConf { .checkValues(Set("legacy", "row", "dict"))

Re: [PR] [SPARK-50858][PYTHON] Add configuration to hide Python UDF stack trace [spark]

2025-01-22 Thread via GitHub
wengh commented on code in PR #49535: URL: https://github.com/apache/spark/pull/49535#discussion_r1926225302 ## python/pyspark/util.py: ## @@ -468,16 +468,19 @@ def handle_worker_exception(e: BaseException, outfile: IO) -> None: and exception traceback info to outfile. JVM

Re: [PR] [SPARK-50815][FOLLOW-UP] Handle Variant-related edge-case for createDataFrame [spark]

2025-01-22 Thread via GitHub
HyukjinKwon commented on PR #49591: URL: https://github.com/apache/spark/pull/49591#issuecomment-2608654196 Merged to master and branch-4.0. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the sp

Re: [PR] [SPARK-50815][FOLLOW-UP] Handle Variant-related edge-case for createDataFrame [spark]

2025-01-22 Thread via GitHub
HyukjinKwon closed pull request #49591: [SPARK-50815][FOLLOW-UP] Handle Variant-related edge-case for createDataFrame URL: https://github.com/apache/spark/pull/49591 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

Re: [PR] [SPARK-50946][CORE][TESTS][3.5] Add version check for Java 17.0.14 to make `UtilsSuite` test pass [spark]

2025-01-22 Thread via GitHub
LuciferYang commented on code in PR #49606: URL: https://github.com/apache/spark/pull/49606#discussion_r1925746802 ## core/src/test/scala/org/apache/spark/util/UtilsSuite.scala: ## @@ -525,10 +525,14 @@ class UtilsSuite extends SparkFunSuite with ResetSystemProperties {

Re: [PR] [SPARK-50909][PYTHON] Setup faulthandler in PythonPlannerRunners [spark]

2025-01-22 Thread via GitHub
ueshin commented on PR #49592: URL: https://github.com/apache/spark/pull/49592#issuecomment-2608006451 @LuciferYang Thanks for the fix! Let me merge it and rerun tests. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use t

Re: [PR] [SPARK-50806][SQL] Support InputRDDCodegen interruption on task cancellation [spark]

2025-01-22 Thread via GitHub
dongjoon-hyun commented on PR #49501: URL: https://github.com/apache/spark/pull/49501#issuecomment-2608007898 Gentle ping, @Ngone51 ~ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

Re: [PR] [MINOR][SQL] Avoid importing java.util at DataFrameWriter [spark]

2025-01-22 Thread via GitHub
HyukjinKwon commented on PR #49602: URL: https://github.com/apache/spark/pull/49602#issuecomment-2608421968 ah it's done in a different way. I will fix the linter and merge. Thanks guys! -- This is an automated message from the Apache Git Service. To respond to the message, please log on

Re: [PR] [SPARK-50883][SQL] Support altering multiple columns in the same command [spark]

2025-01-22 Thread via GitHub
ctring commented on code in PR #49559: URL: https://github.com/apache/spark/pull/49559#discussion_r1926074852 ## sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/v2AlterTableCommands.scala: ## @@ -201,45 +201,52 @@ case class RenameColumn( copy(table

Re: [PR] [MINOR][SQL] Avoid importing java.util at DataFrameWriter [spark]

2025-01-22 Thread via GitHub
HyukjinKwon closed pull request #49602: [MINOR][SQL] Avoid importing java.util at DataFrameWriter URL: https://github.com/apache/spark/pull/49602 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

  1   2   >