Messages by Thread
-
why advisoryPartitionSize <= maxShuffledHashJoinLocalMapThreshold
??????
-
Spark Vulnerabilities
Sankavi Nagalingam
-
Spark-SQL - Query Hanging, How To Troubleshoot
Patrick Tucci
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Mich Talebzadeh
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Patrick Tucci
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Mich Talebzadeh
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Mich Talebzadeh
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Patrick Tucci
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Stephen Coy
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Mich Talebzadeh
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Patrick Tucci
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Mich Talebzadeh
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Patrick Tucci
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Mich Talebzadeh
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Patrick Tucci
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Mich Talebzadeh
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Patrick Tucci
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Mich Talebzadeh
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Patrick Tucci
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Mich Talebzadeh
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Patrick Tucci
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Mich Talebzadeh
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Patrick Tucci
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Mich Talebzadeh
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Patrick Tucci
-
Re: Spark-SQL - Query Hanging, How To Troubleshoot
Mich Talebzadeh
-
[PySpark][UDF][PickleException]
Sanket Sharma
-
Spark Connect, Master, and Workers
Kezhi Xiong
-
dockerhub does not contain apache/spark-py 3.4.1
Mark Elliot
-
[PySpark] Failed to add file [file:///tmp/app-submodules.zip] specified in 'spark.submit.pyFiles' to Python path:
lnxpgn
-
Use of ML in certain aspects of Spark to improve the performance
Mich Talebzadeh
-
Spark 3.4.1 with Java 11 performance on k8s serverless/autopilot
Mich Talebzadeh
-
Custom Session Windowing in Spark using Scala/Python
Ravi Teja
-
Extracting Logical Plan
Vibhatha Abeykoon
-
Spark-SQL - Concurrent Inserts Into Same Table Throws Exception
Patrick Tucci
-
The performance difference when running Apache Spark on K8s and traditional server
Trường Trần Phan An
-
Dynamic allocation does not deallocate executors
Sergei Zhgirovski
-
[ANNOUNCE] Apache Celeborn(incubating) 0.3.0 available
zhongqiang chen
-
convert pandas image column to spark dataframe
second_co...@yahoo.com.INVALID
-
spark context list_packages()
second_co...@yahoo.com.INVALID
-
Map Partition is called Multiple Times
Deepak Patankar
-
Fwd: Interested in contributing to SPARK-24815
Pavan Kotikalapudi
-
Re: Spark 3.3 + parquet 1.10
Mich Talebzadeh
-
Spark3.3 with parquet 1.10.x
Pralabh Kumar
-
Unable to launch Spark connect on Docker image
Edmondo Porcu
-
Argo for general purpose k8s scheduling
Mich Talebzadeh
-
Spark Scala SBT Local build fails
Varun Shah
-
Spark File Output Committer algorithm for GCS
Dipayan Dev
-
Contributing to Spark MLLib
Dipayan Dev
-
[no subject]
Varun Shah
-
[Spark RPC]: Yarn - Application Master / executors to Driver communication issue
Sunayan Saikia
-
Spark Not Connecting
timi ayoade
-
Loading in custom Hive jars for spark
Yeachan Park
-
Jobs that have join & have .rdd calls get executed 2x when AQE is enabled.
Priyanka Raju
-
Spark UI - Bug Executors tab when using proxy port
Bruno Pistone
-
Performance Issue with Column Addition in Spark 3.4.x: Time Doubling with Increased Columns
KO Dukhyun
-
Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Gengliang Wang
-
Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
-
CFP for the 2nd Performance Engineering track at Community over Code NA 2023
Brebner, Paul
-
PySpark error java.lang.IllegalArgumentException
elango vaidyanathan
-
[Spark SQL] Data objects from query history
Ruben Mennes
-
checkpoint file deletion
Lingzhe Sun
-
subscribe
mojianan2015
-
[PySpark] Intermittent Spark session initialization error on M1 Mac
BeoumSuk Kim
-
[k8s] Fail to expose custom port on executor container specified in my executor pod template
James Yu
-
[Spark-SQL] Dataframe write saveAsTable failed
Anil Dasari
-
Unable to populate spark metrics using custom metrics API
Surya Soma
-
Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Patrick Tucci
-
[Spark streaming]: Microbatch id in logs
Anil Dasari
-
Apache Spark with watermark - processing data different LogTypes in same kafka topic
karan alang
-
[ANNOUNCE] Apache Spark 3.4.1 released
Dongjoon Hyun
-
Rename columns without manually setting them all
John Paul Jayme
-
Shuffle data on pods which get decommissioned
Nikhil Goyal
-
How to read excel file in PySpark
John Paul Jayme
-
implement a distribution without shuffle like RDD.coalesce for DataSource V2 write
Pengfei Li
-
TAC Applications for Community Over Code North America and Asia now open
Gavin McDonald
-
Fwd: iceberg queries
Gaurav Agarwal
-
Spark using iceberg
Gaurav Agarwal
-
Announcing the Community Over Code 2023 Streaming Track
James Hughes
-
Apache Spark not reading UTC timestamp from MongoDB correctly
karan alang
-
Getting SparkRuntimeException: Unexpected value for length in function slice: length must be greater than or equal to 0
Bariudin, Daniel
-
[Feature Request] create *permanent* Spark View from DataFrame via PySpark
keen
-
Comparison of Trino, Spark, and Hive-MR3
Sungwoo Park
-
Viewing UI for spark jobs running on K8s
Nikhil Goyal