[ https://issues.apache.org/jira/browse/SPARK-51273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated SPARK-51273: ----------------------------------- Labels: pull-request-available (was: ) > Spark Connect Call Procedure runs the procedure twice > ----------------------------------------------------- > > Key: SPARK-51273 > URL: https://issues.apache.org/jira/browse/SPARK-51273 > Project: Spark > Issue Type: Bug > Components: Connect, SQL > Affects Versions: 4.0.0 > Reporter: Szehon Ho > Priority: Major > Labels: pull-request-available > > Running 'call procedure' via Spark connect results in the procedure getting > called twice. > > This is because the > org.apache.spark.sql.connect.SparkSession.sql sends the plan over to be > evaluated, and that invokes it once. > > This returns a org.apache.spark.sql.connect.DataSet, and then running > df.collect() sends the plan to be evaluated, invoking it a second time. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org