Re: Spark Shuffle - in kubeflow spark operator installation on k8s

2025-04-06 Thread karan alang
One issue I've seen is that after about 24 hours, the SparkApplication job pods seem to be getting evicted. I've installed Spark History Server and am verifying the cause. It could be due to resource constraints; I'm checking this. Please note: kubeflow spark operator is installed in

Re: Spark Shuffle - in kubeflow spark operator installation on k8s

2025-04-06 Thread karan alang
> >>> hello all - checking to see if anyone has any input on this >>> >>> thanks! >>> >>> >>> On Tue, Mar 25, 2025 at 12:22 PM karan alang >>> wrote: >>> >>>> hello All, >>>> >>>> I have kubef

Re: Spark Shuffle - in kubeflow spark operator installation on k8s

2025-04-06 Thread megh vidani
t > me know. > > thanks! > > > On Mon, Mar 31, 2025 at 1:58 PM karan alang wrote: > >> hello all - checking to see if anyone has any input on this >> >> thanks! >> >> >> On Tue, Mar 25, 2025 at 12:22 PM karan alang >> wrote: >

Kubeflow Spark-Operator

2025-04-04 Thread Hamish Whittal
Hello folks, My colleague has posted this issue on GitHub: https://github.com/kubeflow/spark-operator/issues/2491 I'm wondering whether anyone here is using the Kubeflow Spark-Operator and could provide any insight into what's happening here. I know he's been stumped for a

Re: Spark Shuffle - in kubeflow spark operator installation on k8s

2025-03-31 Thread Mich Talebzadeh
> thanks! > > > On Mon, Mar 31, 2025 at 1:58 PM karan alang wrote: > >> hello all - checking to see if anyone has any input on this >> >> thanks! >> >> >> On Tue, Mar 25, 2025 at 12:22 PM karan alang >> wrote: >> >>> hell

Re: Spark Shuffle - in kubeflow spark operator installation on k8s

2025-03-31 Thread karan alang
wrote: > >> hello All, >> >> I have kubeflow Spark Operator installed on k8s and from what i >> understand - Spark Shuffle is not officially supported on kubernetes. >> >> Looking for feedback from the community on what approach is being taken >> t

Re: Spark Shuffle - in kubeflow spark operator installation on k8s

2025-03-31 Thread karan alang
hello all - checking to see if anyone has any input on this thanks! On Tue, Mar 25, 2025 at 12:22 PM karan alang wrote: > hello All, > > I have kubeflow Spark Operator installed on k8s and from what i understand > - Spark Shuffle is not officially supported on kubernetes. >

Spark Shuffle - in kubeflow spark operator installation on k8s

2025-03-25 Thread karan alang
hello All, I have kubeflow Spark Operator installed on k8s and from what i understand - Spark Shuffle is not officially supported on kubernetes. Looking for feedback from the community on what approach is being taken to handle this issue - especially since dynamicAllocation cannot be enabled

Re: kubeflow spark operator & SparkHistoryService on k8s - spark driver/executor logs not showing up Spark History Server

2024-10-23 Thread Mat Schaffer
alang wrote: > Hello All, > I have kubeflow spark operator installed on GKE (in namespace - so350), as > well as Spark History Server installed on GKE in namespace shs-350. > The spark job is launched in a separate namespaces - spark-apps. > > When I launch the spark job, it runs fine

kubeflow spark operator & SparkHistoryService on k8s - spark driver/executor logs not showing up Spark History Server

2024-10-23 Thread karan alang
Hello All, I have kubeflow spark operator installed on GKE (in namespace - so350), as well as Spark History Server installed on GKE in namespace shs-350. The spark job is launched in a separate namespace - spark-apps. When I launch the spark job, it runs fine and I'm able to see the job de

Re: kubeflow spark-operator - error in querying strimzi kafka using structured streaming

2024-10-11 Thread karan alang
:///opt/spark/other-jars/mongo-spark-connector_2.12-3.0.2.jar,file:///opt/spark/other-jars/bson-4.0.5.jar,file:///opt/spark/other-jars/mongodb-driver-sync-4.0.5.jar,file:///opt/spark/other-jars/mongodb-driver-core-4.0.5.jar,file:///opt/spark/other-jars/org.apache.spark_spark-sql-kafka-0-10_2.

Re: kubeflow spark-operator - error in querying strimzi kafka using structured streaming

2024-10-07 Thread Nimrod Ofek
ar,file:///opt/spark/other-jars/org.mongodb_mongodb-driver-sync-4.0.5.jar,file:///opt/spark/other-jars/org.mongodb_bson-4.0.5.jar,file:///opt/spark/other-jars/org.mongodb_mongodb-driver-core-4.0.5.jar" >>> "spark.executor.extraClassPath": >>> "file:///opt/spark/othe

Re: kubeflow spark-operator - error in querying strimzi kafka using structured streaming

2024-10-06 Thread karan alang
0.jar,file:///opt/spark/other-jars/org.apache.commons_commons-pool2-2.6.2.jar,file:///opt/spark/other-jars/com.github.luben_zstd-jni-1.4.8-1.jar,file:///opt/spark/other-jars/org.lz4_lz4-java-1.7.1.jar,file:///opt/spark/other-jars/org.xerial.snappy_snappy-java-1.1.8.2.jar,file:///opt/spark/other-jars

Re: kubeflow spark-operator - error in querying strimzi kafka using structured streaming

2024-10-06 Thread Nimrod Ofek
/spark/zips/streams.zip,file:///opt/spark/zips/utils.zip" > hadoopConf: > "fs.gs.impl": "com.google.cloud.hadoop.fs.gcs.GoogleHadoopFileSystem" > "fs.AbstractFileSystem.gs.impl": > "com.google.cloud.hadoop.fs.gcs.GoogleHadoopFS" > "google.cloud.a

Re: kubeflow spark-operator - error in querying strimzi kafka using structured streaming

2024-10-05 Thread karan alang
"--isdebug={{ .Values.isdebug }}" - "--istest={{ .Values.isdebug }}" here is snapshot of the secret : ``` (base) Karans-MacBook-Pro:spark-k8s-operator karanalang$ kc get secret spark-gcs-creds -n so350 -o yaml apiVersion: v1 data: spark-gcs-key.json: <--- KEY ---> kind:

Re: kubeflow spark-operator - error in querying strimzi kafka using structured streaming

2024-10-03 Thread Nimrod Ofek
Where is the checkpoint location? Not in GCS? Probably the location of the checkpoint is there, and you don't have permissions for that... On Thu, Oct 3, 2024, 02:43, karan alang wrote: > This seems to be the cause of this -> > github.com/kubeflow/spark-operator/issues/1619

Re: kubeflow spark-operator - error in querying strimzi kafka using structured streaming

2024-10-02 Thread karan alang
This seems to be the cause of this -> github.com/kubeflow/spark-operator/issues/1619 .. the secret is not getting mounted due to this error -> MountVolume.SetUp failed for volume “spark-conf-volume-driver I'm getting same error in event logs, and the secret mounted is not getting read

kubeflow spark-operator - error in querying strimzi kafka using structured streaming

2024-10-01 Thread karan alang
I have kubeflow spark-operator installed on K8s (GKE), and I'm running a structured streaming job which reads data from Kafka .. the job is run every 10 mins. It is giving the error shown below: ``` Traceback (most recent call last): File "/opt/spark/custom-dir/main.py