Szehon Ho created SPARK-51273:
---------------------------------

             Summary: Spark Connect Call Procedure runs the procedure twice
                 Key: SPARK-51273
                 URL: https://issues.apache.org/jira/browse/SPARK-51273
             Project: Spark
          Issue Type: Bug
          Components: Connect, SQL
    Affects Versions: 4.0.0
            Reporter: Szehon Ho


Running 'call procedure' via Spark connect results in the procedure getting 
called twice.

 

This is because the 
org.apache.spark.sql.connect.SparkSession.sql sends the plan over to be 
evaluated, and that invokes it once.
 
This returns a org.apache.spark.sql.connect.DataSet, and then running 
df.collect() sends the plan to be evaluated, invoking it a second time.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to