Re: [PR] docs: adopt issue #4419 terminology in Scala/Java UDF guide [datafusion-comet]

via GitHub Sat, 13 Jun 2026 15:40:45 -0700


comphead commented on code in PR #4651:
URL: https://github.com/apache/datafusion-comet/pull/4651#discussion_r3408707939



##########
docs/source/user-guide/latest/scala_java_udfs.md:
##########
@@ -19,24 +19,24 @@
 
 # Scala UDF and Java UDF Support
 
-Comet executes Spark's Scala and Java [scalar user-defined functions (UDFs)](https://spark.apache.org/docs/latest/sql-ref-functions-udf-scalar.html) on the native Comet path. The presence of a UDF does not force the enclosing operator off the native path; surrounding native operators stay native.
+Comet executes Spark's Scala and Java [scalar user-defined functions (UDFs)](https://spark.apache.org/docs/latest/sql-ref-functions-udf-scalar.html) within the Comet pipeline. The presence of a UDF does not force the enclosing operator out of the Comet pipeline; surrounding Rust-implemented operators stay in the pipeline.
 
 This page covers Spark's `ScalaUDF` (Scala `udf(...)`, `spark.udf.register(...)` over Scala or Java functional interfaces, and SQL `CREATE FUNCTION ... AS 'com.example.MyUDF'`). Other UDF kinds (Python / Pandas, Hive, aggregate) are out of scope and continue to fall back to Spark.
 
 This feature is enabled by default. Set `spark.comet.exec.scalaUDF.codegen.enabled` to `false` to route plans containing a `ScalaUDF` back to Spark for the enclosing operator.
 
 ## Configuration
 
-| Key                                         | Default | Description                                                                                                        |
-| ------------------------------------------- | ------- | ------------------------------------------------------------------------------------------------------------------ |
-| `spark.comet.exec.scalaUDF.codegen.enabled` | `true`  | When `true`, eligible `ScalaUDF`s run on the Comet path. When `false`, the enclosing operator falls back to Spark. |
+| Key                                         | Default | Description                                                                                                            |
+| ------------------------------------------- | ------- | ---------------------------------------------------------------------------------------------------------------------- |
+| `spark.comet.exec.scalaUDF.codegen.enabled` | `true`  | When `true`, eligible `ScalaUDF`s run in the Comet pipeline. When `false`, the enclosing operator falls back to Spark. |

Review Comment:
   Perhaps we need to highlight that this param controls a fallback for 
user-defined function codegen, with a separate config for built-in Spark 
function codegen?
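
   For illustration, a minimal configuration sketch of the fallback this key 
controls. It assumes a Spark session in which Comet is already installed and 
enabled; the app name and the `plusOne` UDF are hypothetical, and only the key 
itself comes from the doc under review. No built-in function config is shown, 
since the diff does not name one.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.{col, udf}

// Assumption: Comet is already set up for this session (plugin and jars
// are configured elsewhere). Only the ScalaUDF key below is from the doc.
val spark = SparkSession.builder()
  .appName("scala-udf-fallback-sketch") // hypothetical name
  // "false" routes the operator enclosing a ScalaUDF back to Spark.
  .config("spark.comet.exec.scalaUDF.codegen.enabled", "false")
  .getOrCreate()

// A plain Scala UDF; Spark represents it as a ScalaUDF expression.
val plusOne = udf((x: Long) => x + 1)

// Print the physical plan to see where the projection runs.
spark.range(5).select(plusOne(col("id")).as("id_plus_one")).explain()
```

   Comparing the `explain()` output with the key set to `true` and to `false` 
is one way to check which operators stay in the Comet pipeline.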



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

