user
Earlier messages
Later messages
Messages by Thread
dockerhub does not contain apache/spark-py 3.4.1
Mark Elliot
Re: dockerhub does not contain apache/spark-py 3.4.1
Mich Talebzadeh
Re: dockerhub does not contain apache/spark-py 3.4.1
Mich Talebzadeh
[PySpark] Failed to add file [file:///tmp/app-submodules.zip] specified in 'spark.submit.pyFiles' to Python path:
lnxpgn
Re: [PySpark] Failed to add file [file:///tmp/app-submodules.zip] specified in 'spark.submit.pyFiles' to Python path:
Mich Talebzadeh
Re: [PySpark] Failed to add file [file:///tmp/app-submodules.zip] specified in 'spark.submit.pyFiles' to Python path:
lnxpgn
Use of ML in certain aspects of Spark to improve the performance
Mich Talebzadeh
Re: [EXTERNAL] Use of ML in certain aspects of Spark to improve the performance
Daniel Tavares de Santana
unsubscribe
Daniel Tavares de Santana
Spark 3.41 with Java 11 performance on k8s serverless/autopilot
Mich Talebzadeh
Custom Session Windowing in Spark using Scala/Python
Ravi Teja
Extracting Logical Plan
Vibhatha Abeykoon
Re: Extracting Logical Plan
Winston Lai
Re: Extracting Logical Plan
Vibhatha Abeykoon
Re: Extracting Logical Plan
Winston Lai
Re: Extracting Logical Plan
Vibhatha Abeykoon
Re: Extracting Logical Plan
Ruifeng Zheng
Re: Extracting Logical Plan
Vibhatha Abeykoon
Re: Extracting Logical Plan
Ruifeng Zheng
Re: Extracting Logical Plan
Vibhatha Abeykoon
Re: Extracting Logical Plan
Winston Lai
Re: Extracting Logical Plan
Vibhatha Abeykoon
Re: Extracting Logical Plan
Vibhatha Abeykoon
Spark-SQL - Concurrent Inserts Into Same Table Throws Exception
Patrick Tucci
Re: Spark-SQL - Concurrent Inserts Into Same Table Throws Exception
Mich Talebzadeh
Re: Spark-SQL - Concurrent Inserts Into Same Table Throws Exception
Pol Santamaria
Re: Spark-SQL - Concurrent Inserts Into Same Table Throws Exception
Patrick Tucci
Re: Spark-SQL - Concurrent Inserts Into Same Table Throws Exception
Mich Talebzadeh
The performance difference when running Apache Spark on K8s and traditional server
Trường Trần Phan An
Re: The performance difference when running Apache Spark on K8s and traditional server
Mich Talebzadeh
Dynamic allocation does not deallocate executors
Sergei Zhgirovski
Re: Dynamic allocation does not deallocate executors
Mich Talebzadeh
Re: Dynamic allocation does not deallocate executors
Holden Karau
Re: Dynamic allocation does not deallocate executors
Mich Talebzadeh
Re: Dynamic allocation does not deallocate executors
Holden Karau
[ANNOUNCE] Apache Celeborn(incubating) 0.3.0 available
zhongqiang chen
conver panda image column to spark dataframe
second_co...@yahoo.com.INVALID
Re: conver panda image column to spark dataframe
Adrian Pop-Tifrea
Re: conver panda image column to spark dataframe
second_co...@yahoo.com.INVALID
Re: conver panda image column to spark dataframe
Adrian Pop-Tifrea
Re: conver panda image column to spark dataframe
second_co...@yahoo.com.INVALID
Re: conver panda image column to spark dataframe
Sean Owen
spark context list_packages()
second_co...@yahoo.com.INVALID
Re: spark context list_packages()
Sean Owen
Map Partition is called Multiple Times
Deepak Patankar
Fwd: Interested in contributing to SPARK-24815
Pavan Kotikalapudi
Re: Interested in contributing to SPARK-24815
Sean Owen
Re: Interested in contributing to SPARK-24815
Kent Yao
Re: Interested in contributing to SPARK-24815
Pavan Kotikalapudi
Re: Interested in contributing to SPARK-24815
Rinat Shangeeta
Re: Interested in contributing to SPARK-24815
Sean Owen
Re: Spark 3.3 + parquet 1.10
Mich Talebzadeh
Spark3.3 with parquet 1.10.x
Pralabh Kumar
Unable to launch Spark connect on Docker image
Edmondo Porcu
Re: Unable to launch Spark connect on Docker image
Mich Talebzadeh
Argo for general purpose k8s scheduling
Mich Talebzadeh
Spark Scala SBT Local build fails
Varun Shah
Re: Spark Scala SBT Local build fails
Varun Shah
Re: Spark Scala SBT Local build fails
Varun Shah
Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Mich Talebzadeh
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Jay
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Yeachan Park
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Mich Talebzadeh
Re: Spark File Output Committer algorithm for GCS
Jay
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Mich Talebzadeh
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Mich Talebzadeh
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Mich Talebzadeh
Contributing to Spark MLLib
Dipayan Dev
Re: Contributing to Spark MLLib
Brian Huynh
Re: Contributing to Spark MLLib
Gourav Sengupta
[no subject]
Varun Shah
[Spark RPC]: Yarn - Application Master / executors to Driver communication issue
Sunayan Saikia
Spark Not Connecting
timi ayoade
Re: [EXTERNAL] Spark Not Connecting
Daniel Tavares de Santana
Re: Spark Not Connecting
Artemis User
Loading in custom Hive jars for spark
Yeachan Park
Re: Loading in custom Hive jars for spark
Mich Talebzadeh
Jobs that have join & have .rdd calls get executed 2x when AQE is enabled.
Priyanka Raju
Spark UI - Bug Executors tab when using proxy port
Bruno Pistone
Performance Issue with Column Addition in Spark 3.4.x: Time Doubling with Increased Columns
KO Dukhyun
Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Gengliang Wang
Re: Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Farshid Ashouri
Re: Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Hyukjin Kwon
Re: Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Gavin Ray
Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
Re: Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
Re: Filtering JSON records when there isn't an exact schema match in Spark
Vikas Kumar
Re: Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
Re: Filtering JSON records when there isn't an exact schema match in Spark
Hill Liu
Re: Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
CFP for the 2nd Performance Engineering track at Community over Code NA 2023
Brebner, Paul
PySpark error java.lang.IllegalArgumentException
elango vaidyanathan
Re: PySpark error java.lang.IllegalArgumentException
elango vaidyanathan
Re: PySpark error java.lang.IllegalArgumentException
Khalid Mammadov
Re: PySpark error java.lang.IllegalArgumentException
Brian Huynh
Re: PySpark error java.lang.IllegalArgumentException
elango vaidyanathan
[Spark SQL] Data objects from query history
Ruben Mennes
Re: [Spark SQL] Data objects from query history
Jack Wells
checkpoint file deletion
Lingzhe Sun
subscribe
mojianan2015
Re:subscribe
mojianan2015
subscribe
Sahib Aulakh
subscribe
Sahib Aulakh
[PySpark] Intermittent Spark session initialization error on M1 Mac
BeoumSuk Kim
[k8s] Fail to expose custom port on executor container specified in my executor pod template
James Yu
[Spark-SQL] Dataframe write saveAsTable failed
Anil Dasari
Unable to populate spark metrics using custom metrics API
Surya Soma
Unable to populate spark metrics using custom metrics API
Surya Soma
Re: Unable to populate spark metrics using custom metrics API
Surya Soma
Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Patrick Tucci
Re: Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Mich Talebzadeh
Re: Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Patrick Tucci
Re: Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Mich Talebzadeh
[Spark streaming]: Microbatch id in logs
Anil Dasari
Re: [Spark streaming]: Microbatch id in logs
Mich Talebzadeh
Apache Spark with watermark - processing data different LogTypes in same kafka topic
karan alang
[ANNOUNCE] Apache Spark 3.4.1 released
Dongjoon Hyun
Re: [ANNOUNCE] Apache Spark 3.4.1 released
Mridul Muralidharan
Re: [ANNOUNCE] Apache Spark 3.4.1 released
Hyukjin Kwon
Re: [ANNOUNCE] Apache Spark 3.4.1 released
L. C. Hsieh
Re: [ANNOUNCE] Apache Spark 3.4.1 released
yangjie01
Re:[ANNOUNCE] Apache Spark 3.4.1 released
beliefer
Rename columns without manually setting them all
John Paul Jayme
Re: Rename columns without manually setting them all
Farshid Ashouri
Re: Rename columns without manually setting them all
Bjørn Jørgensen
Shuffle data on pods which get decomissioned
Nikhil Goyal
Re: Shuffle data on pods which get decomissioned
Mich Talebzadeh
How to read excel file in PySpark
John Paul Jayme
Re: How to read excel file in PySpark
Sean Owen
Re: How to read excel file in PySpark
Bjørn Jørgensen
Re: How to read excel file in PySpark
Mich Talebzadeh
Re: How to read excel file in PySpark
Bjørn Jørgensen
Re: How to read excel file in PySpark
Mich Talebzadeh
Re: How to read excel file in PySpark
Sean Owen
Re: How to read excel file in PySpark
Mich Talebzadeh
Re: How to read excel file in PySpark
Bjørn Jørgensen
Re: How to read excel file in PySpark
Mich Talebzadeh
implement a distribution without shuffle like RDD.coalesce for DataSource V2 write
Pengfei Li
Re: implement a distribution without shuffle like RDD.coalesce for DataSource V2 write
Mich Talebzadeh
Re: implement a distribution without shuffle like RDD.coalesce for DataSource V2 write
Mich Talebzadeh
TAC Applications for Community Over Code North America and Asia now open
Gavin McDonald
Fwd: iceberg queries
Gaurav Agarwal
Spark using iceberg
Gaurav Agarwal
Re: Spark using iceberg
Gaurav Agarwal
Announcing the Community Over Code 2023 Streaming Track
James Hughes
Apache Spark not reading UTC timestamp from MongoDB correctly
karan alang
Re: Apache Spark not reading UTC timestamp from MongoDB correctly
Sean Owen
Re: Apache Spark not reading UTC timestamp from MongoDB correctly
Enrico Minack
Getting SparkRuntimeException: Unexpected value for length in function slice: length must be greater than or equal to 0
Bariudin, Daniel
[Feature Request] create *permanent* Spark View from DataFrame via PySpark
keen
Re: [Feature Request] create *permanent* Spark View from DataFrame via PySpark
keen
Re: [Feature Request] create *permanent* Spark View from DataFrame via PySpark
Mich Talebzadeh
Re: [Feature Request] create *permanent* Spark View from DataFrame via PySpark
Wenchen Fan
Comparison of Trino, Spark, and Hive-MR3
Sungwoo Park
Viewing UI for spark jobs running on K8s
Nikhil Goyal
Re: Viewing UI for spark jobs running on K8s
Qian Sun
ChatGPT and prediction of Spark future
Mich Talebzadeh
Re: ChatGPT and prediction of Spark future
Winston Lai
Re: ChatGPT and prediction of Spark future
Mich Talebzadeh
Structured streaming append mode picture question
Hill Liu
JDK version support information
Poorna Murali
Re: JDK version support information
Aironman DirtDiver
Re: JDK version support information
Poorna Murali
Re: JDK version support information
Sean Owen
Re: maven with Spark 3.4.0 fails compilation
Bjørn Jørgensen
Re: maven with Spark 3.4.0 fails compilation
Mich Talebzadeh
Re: Re: maven with Spark 3.4.0 fails compilation
Lingzhe Sun
Re: Re: maven with Spark 3.4.0 fails compilation
Bjørn Jørgensen
Re: Re: maven with Spark 3.4.0 fails compilation
Mich Talebzadeh
Re: Re: maven with Spark 3.4.0 fails compilation
Bjørn Jørgensen
[Spark Structured Streaming]: Dynamic Scaling of Executors
Aishwarya Panicker
Re: [Spark Structured Streaming]: Dynamic Scaling of Executors
Mich Talebzadeh
Re: [Spark Structured Streaming]: Dynamic Scaling of Executors
Aishwarya Panicker
Re: [MLlib] how-to find implementation of Decision Tree Regressor fit function
Sean Owen
Dynamic value as the offset of lag() function
Nipuna Shantha
Incremental Value dependents on another column of Data frame Spark
Nipuna Shantha
Re: Incremental Value dependents on another column of Data frame Spark
Raghavendra Ganesh
Re: Incremental Value dependents on another column of Data frame Spark
Enrico Minack
cannot load model using pyspark
second_co...@yahoo.com.INVALID
Data Stream Processing applications testing
Alexandre Strapacao Guedes Vianna
Understanding Spark S3 Read Performance
Shashank Rao
RE: Understanding Spark S3 Read Performance
info
Spark shuffle and inevitability of writing to Disk
Mich Talebzadeh
Re: Spark shuffle and inevitability of writing to Disk
Mich Talebzadeh
[spark-core] Can executors recover/reuse shuffle files upon failure?
Faiz Halde
Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
Mich Talebzadeh
Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
vaquar khan
RE: Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
Maksym M
Re: Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
Mich Talebzadeh
Re: Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
Mich Talebzadeh
Pyspark cluster mode on standalone deployment
خالد القحطاني