Dongjoon,

Thanks for fixing this!
https://github.com/apache/spark/pull/55355

Enrico

Am 15.04.26 um 18:22 schrieb Dongjoon Hyun:
`spark-app-selector` works fine in our CIs and my environment like the 
following. Could you double-check your environment, Enrico?


$ bin/docker-image-tool.sh -r docker.io/apache -t 4.2.0-preview4 build


$ bin/spark-submit \
--master k8s://$K8S_MASTER \
--deploy-mode cluster \
-c spark.kubernetes.container.image=apache/spark:4.2.0-preview4 \
-c spark.kubernetes.authenticate.driver.serviceAccountName=spark \
--class org.apache.spark.examples.SparkPi \
local:///opt/spark/examples/jars/spark-examples.jar 300000


$ kubectl get pods --show-labels
NAME                                                        READY   STATUS    
RESTARTS   AGE   LABELS
org-apache-spark-examples-sparkpi-8fea449d91f10411-driver   1/1     Running   0 
         6s    
spark-app-name=org-apache-spark-examples-sparkpi,spark-app-selector=spark-e88adf6c93e64834a26bd9162034f34b,spark-role=driver,spark-version=4.2.0-preview4
spark-pi-e33c809d91f10f25-exec-1                            1/1     Running   0 
         3s    
spark-app-name=spark-pi,spark-app-selector=spark-e88adf6c93e64834a26bd9162034f34b,spark-exec-id=1,spark-exec-resourceprofile-id=0,spark-role=executor,spark-version=4.2.0-preview4
spark-pi-e33c809d91f10f25-exec-2                            1/1     Running   0 
         3s    
spark-app-name=spark-pi,spark-app-selector=spark-e88adf6c93e64834a26bd9162034f34b,spark-exec-id=2,spark-exec-resourceprofile-id=0,spark-role=executor,spark-version=4.2.0-preview4


Dongjoon.

On 2026/04/14 09:04:44 Enrico Minack wrote:
Hi Spark-devs,

Looks like 4.2.0-preview4 is affected by regression SPARK-56238.

Building a docker image from 4.2.0-preview4 and running a Spark job on
Kubernetes fails due to a Spark app id mismatch in the selector.

Setting an app id explicitly fixes this mismatch: --conf spark.app.id=abc

Build Spark binaries and Docker image, load image into minikube:

cd spark-4.2.0-preview4-bin-hadoop3
./bin/docker-image-tool.sh -t "latest" build
docker save spark:latest -o spark.tar
minikube image load spark.tar

Run Spark Shell:

echo"spark.range(10).mapPartitions { it => Thread.sleep(60000); it }.collect" | 
./bin/spark-shell --master k8s://https://192.168.49.2:8443--conf
spark.kubernetes.container.image=spark:latest --conf
spark.kubernetes.authenticate.driver.serviceAccountName=spark-sa --conf
spark.kubernetes.executor.missingPodDetectDelta=1000

The driver says:

Spark context available as'sc' (master = k8s://https://192.168.49.2:8443, app 
id =
spark-c0f802279d8146c29b1ef3467b467590).

The executor pods say:

kubectl describe"pod/$(kubectl get pods | grep exec | grep Running | head -n 1 | cut -d " 
" -f 1)" | grep selec
spark-app-selector=spark-6c8d69be4f00410b884fd6e6417b872a

this mismatch causes:

26/03/26 11:40:29 ERROR dispatcher-CoarseGrainedScheduler 
org.apache.spark.scheduler.TaskSchedulerImpl: Lost executor 1 on 10.244.0.55: 
The executor with ID 1 (registered at 1774521602640 ms) was not found in the 
cluster at the polling time (1774521629051 ms) which is after the accepted 
detect delta time (1000 ms) configured by 
`spark.kubernetes.executor.missingPodDetectDelta`. The executor may have been 
deleted but the driver missed the deletion event. Markingthis executor as 
failed.


Either the app id mismatch should be fixed, or the app id should be
mandatory in this setup.

Cheers,
Enrico


Am 10.04.26 um 03:06 schrieb Hyukjin Kwon:
Hi all,

To enable wide-scale community testing of the upcoming Spark 4.2.0
release,
the Apache Spark community has posted a Spark 4.2.0-preview4 release
<https://dist.apache.org/repos/dist/release/spark/spark-4.2.0-preview4>.
This preview is not a stable release in terms of either API or
functionality,
but it is meant to give the community early access to try the code
that will
become Spark 4.2.0. If you would like to test the release, please
download
<https://dist.apache.org/repos/dist/release/spark/spark-4.2.0-preview3> it,
and send feedback using either mailing lists
<https://spark.apache.org/community.html> or JIRA
<https://issues.apache.org/jira/browse/SPARK/?selectedTab=com.atlassian.jira.jira-projects-plugin:summary-panel>.
The documentation is available at the link
<https://spark.apache.org/docs/4.2.0-preview3/>.

We’d like to thank our contributors and users for their contributions
and early feedback to this release. This release would not have been
possible without you.

To download Spark 4.2.0-preview4, head over to the download page:
https://dist.apache.org/repos/dist/release/spark/spark-4.2.0-preview4/
<https://dist.apache.org/repos/dist/release/spark/spark-4.2.0-preview4/>

To view the documentation:
https://spark.apache.org/docs/preview/

---------------------------------------------------------------------
To unsubscribe e-mail: [email protected]



---------------------------------------------------------------------
To unsubscribe e-mail: [email protected]

Reply via email to