Re: Running Spark Connect Server in Cluster Mode on Kubernetes

Nagatomi Yasukazu Sun, 03 Sep 2023 18:31:21 -0700

Hi Cley,

Thank you for taking the time to respond to my query. Your insights on
Spark cluster deployment are much appreciated.


However, I'd like to clarify that my specific challenge is related to
running the Spark Connect Server on Kubernetes in Cluster Mode. While I
understand the general deployment strategies for Spark on Kubernetes, I am
seeking guidance particularly on the Spark Connect Server aspect.

cf. Spark Connect Overview - Spark 3.4.1 Documentation
    https://spark.apache.org/docs/latest/spark-connect-overview.html

To reiterate, when I connect from an external Python client and execute
scripts, the server operates in Local Mode instead of the expected
Kubernetes Cluster Mode (with master as k8s://... and deploy-mode set to
cluster).

If I've misunderstood your initial response and it was indeed related to
Spark Connect, I sincerely apologize for the oversight. In that case, could
you please expand a bit on the Spark Connect-specific aspects?

Do you, or anyone else in the community, have experience with this specific
setup or encountered a similar issue with Spark Connect Server on
Kubernetes? Any targeted advice or guidance would be invaluable.

Thank you again for your time and help.

Best regards,
Yasukazu

2023年9月4日(月) 0:23 Cleyson Barros <euroc...@gmail.com>:

> Hi Nagatomi,
> Use Apache imagers, then run your master node, then start your many
> slavers. You can add a command line in the docker files to call for the
> master using the docker container names in your service composition if you
> wish to run 2 masters active and standby follow the instructions in the
> Apache docs to do this configuration, the recipe is the same except when
> you start the masters and how you expect the behaviour of your cluster.
> I hope it helps.
> Have a nice day :)
> Cley
>
> Nagatomi Yasukazu <yassan0...@gmail.com> escreveu no dia sábado,
> 2/09/2023 à(s) 15:37:
>
>> Hello Apache Spark community,
>>
>> I'm currently trying to run Spark Connect Server on Kubernetes in Cluster
>> Mode and facing some challenges. Any guidance or hints would be greatly
>> appreciated.
>>
>> ## Environment:
>> Apache Spark version: 3.4.1
>> Kubernetes version:  1.23
>> Command executed:
>>  /opt/spark/sbin/start-connect-server.sh \
>>    --packages
>> org.apache.spark:spark-connect_2.13:3.4.1,org.apache.iceberg:iceberg-spark-runtime-3.4_2.13:1.3.1...
>> Note that I'm running it with the environment variable
>> SPARK_NO_DAEMONIZE=1.
>>
>> ## Issue:
>> When I connect from an external Python client and run scripts, it
>> operates in Local Mode instead of the expected Cluster Mode.
>>
>> ## Expected Behavior:
>> When connecting from a Python client to the Spark Connect Server, I
>> expect it to run in Cluster Mode.
>>
>> If anyone has any insights, advice, or has faced a similar issue, I'd be
>> grateful for your feedback.
>> Thank you in advance.
>>
>>
>>

Re: Running Spark Connect Server in Cluster Mode on Kubernetes

Reply via email to