[ https://issues.apache.org/jira/browse/SPARK-51273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dongjoon Hyun updated SPARK-51273: ---------------------------------- Parent: SPARK-44111 Issue Type: Sub-task (was: Bug) > Spark Connect Call Procedure runs the procedure twice > ----------------------------------------------------- > > Key: SPARK-51273 > URL: https://issues.apache.org/jira/browse/SPARK-51273 > Project: Spark > Issue Type: Sub-task > Components: Connect, SQL > Affects Versions: 4.0.0 > Reporter: Szehon Ho > Priority: Blocker > Labels: pull-request-available > > Running 'call procedure' via Spark connect results in the procedure getting > called twice. > > This is because the > org.apache.spark.sql.connect.SparkSession.sql sends the plan over to be > evaluated, and that invokes it once. > > This returns a org.apache.spark.sql.connect.DataSet, and then running > df.collect() sends the plan to be evaluated, invoking it a second time. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org