Messages by Date
-
2023/07/19
Argo for general purpose k8s scheduling
Mich Talebzadeh
-
2023/07/19
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
-
2023/07/19
Re: Spark File Output Committer algorithm for GCS
Mich Talebzadeh
-
2023/07/18
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
-
2023/07/18
Re: Spark File Output Committer algorithm for GCS
Mich Talebzadeh
-
2023/07/18
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
-
2023/07/17
Re: Spark Scala SBT Local build fails
Varun Shah
-
2023/07/17
Re: Spark Scala SBT Local build fails
Varun Shah
-
2023/07/17
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
-
2023/07/17
Re: Contributing to Spark MLLib
Gourav Sengupta
-
2023/07/17
Re: Spark File Output Committer algorithm for GCS
Jay
-
2023/07/17
Re: Spark File Output Committer algorithm for GCS
Mich Talebzadeh
-
2023/07/17
Unsubscribe
mojianan2015
-
2023/07/17
Unsubscribe
Zoran Jeremic
-
2023/07/17
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
-
2023/07/17
Re: Spark File Output Committer algorithm for GCS
Yeachan Park
-
2023/07/17
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
-
2023/07/17
Re: Spark File Output Committer algorithm for GCS
Jay
-
2023/07/17
Unsubscribe
Bode, Meikel
-
2023/07/17
Spark Scala SBT Local build fails
Varun Shah
-
2023/07/17
Re: Unsubscribe
srini subramanian
-
2023/07/17
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
-
2023/07/17
Re: Spark File Output Committer algorithm for GCS
Mich Talebzadeh
-
2023/07/17
Spark File Output Committer algorithm for GCS
Dipayan Dev
-
2023/07/16
Unsubscribe
Bode, Meikel
-
2023/07/16
Re: Contributing to Spark MLLib
Brian Huynh
-
2023/07/16
Contributing to Spark MLLib
Dipayan Dev
-
2023/07/16
[no subject]
Varun Shah
-
2023/07/14
[Spark RPC]: Yarn - Application Master / executors to Driver communication issue
Sunayan Saikia
-
2023/07/13
Re: Unable to populate spark metrics using custom metrics API
Surya Soma
-
2023/07/12
Re: Spark Not Connecting
Artemis User
-
2023/07/12
Re: [EXTERNAL] Spark Not Connecting
Daniel Tavares de Santana
-
2023/07/12
Spark Not Connecting
timi ayoade
-
2023/07/11
Re: Loading in custom Hive jars for spark
Mich Talebzadeh
-
2023/07/11
Loading in custom Hive jars for spark
Yeachan Park
-
2023/07/11
Jobs that have join & have .rdd calls get executed 2x when AQE is enabled.
Priyanka Raju
-
2023/07/10
Re: PySpark error java.lang.IllegalArgumentException
elango vaidyanathan
-
2023/07/09
Unsubscribe
chen...@birdiexx.com
-
2023/07/08
Unable to populate spark metrics using custom metrics API
Surya Soma
-
2023/07/08
Unsubscribe
yixu2...@163.com
-
2023/07/07
Re: PySpark error java.lang.IllegalArgumentException
Brian Huynh
-
2023/07/07
Re: PySpark error java.lang.IllegalArgumentException
Khalid Mammadov
-
2023/07/07
Re: Unsubscribe
Atheeth SH
-
2023/07/06
Unsubscribe
Mihai Musat
-
2023/07/06
Spark UI - Bug Executors tab when using proxy port
Bruno Pistone
-
2023/07/04
Re: PySpark error java.lang.IllegalArgumentException
elango vaidyanathan
-
2023/07/04
Performance Issue with Column Addition in Spark 3.4.x: Time Doubling with Increased Columns
KO Dukhyun
-
2023/07/04
Re: Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
-
2023/07/04
Re: Filtering JSON records when there isn't an exact schema match in Spark
Hill Liu
-
2023/07/04
Re: Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
-
2023/07/03
Re: Filtering JSON records when there isn't an exact schema match in Spark
Vikas Kumar
-
2023/07/03
Re: Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Gavin Ray
-
2023/07/03
Re: Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Hyukjin Kwon
-
2023/07/03
Re: Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Farshid Ashouri
-
2023/07/03
Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Gengliang Wang
-
2023/07/03
Re: Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
-
2023/07/03
Re: [Spark SQL] Data objects from query history
Jack Wells
-
2023/07/03
Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
-
2023/07/03
CFP for the 2nd Performance Engineering track at Community over Code NA 2023
Brebner, Paul
-
2023/07/03
PySpark error java.lang.IllegalArgumentException
elango vaidyanathan
-
2023/06/30
[Spark SQL] Data objects from query history
Ruben Mennes
-
2023/06/29
checkpoint file deletion
Lingzhe Sun
-
2023/06/29
Unsubscribe
lee
-
2023/06/28
Unsubscribe
Ghazi Naceur
-
2023/06/28
Re:subscribe
mojianan2015
-
2023/06/28
subscribe
mojianan2015
-
2023/06/27
[PySpark] Intermittent Spark session initialization error on M1 Mac
BeoumSuk Kim
-
2023/06/26
[k8s] Fail to expose custom port on executor container specified in my executor pod template
James Yu
-
2023/06/26
[Spark-SQL] Dataframe write saveAsTable failed
Anil Dasari
-
2023/06/26
Re: Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Mich Talebzadeh
-
2023/06/26
Re: Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Patrick Tucci
-
2023/06/26
Unable to populate spark metrics using custom metrics API
Surya Soma
-
2023/06/26
Re: Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Mich Talebzadeh
-
2023/06/26
Unsubscribe
Ghazi Naceur
-
2023/06/26
Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Patrick Tucci
-
2023/06/26
Re: [Spark streaming]: Microbatch id in logs
Mich Talebzadeh
-
2023/06/25
[Spark streaming]: Microbatch id in logs
Anil Dasari
-
2023/06/24
Re: [ANNOUNCE] Apache Spark 3.4.1 released
yangjie01
-
2023/06/24
Re:[ANNOUNCE] Apache Spark 3.4.1 released
beliefer
-
2023/06/24
Apache Spark with watermark - processing data different LogTypes in same kafka topic
karan alang
-
2023/06/23
Re: [ANNOUNCE] Apache Spark 3.4.1 released
L. C. Hsieh
-
2023/06/23
Re: [ANNOUNCE] Apache Spark 3.4.1 released
Hyukjin Kwon
-
2023/06/23
Re: [ANNOUNCE] Apache Spark 3.4.1 released
Mridul Muralidharan
-
2023/06/23
[ANNOUNCE] Apache Spark 3.4.1 released
Dongjoon Hyun
-
2023/06/21
Re: Rename columns without manually setting them all
Bjørn Jørgensen
-
2023/06/21
Re: Rename columns without manually setting them all
Farshid Ashouri
-
2023/06/21
Rename columns without manually setting them all
John Paul Jayme
-
2023/06/20
Re: How to read excel file in PySpark
Mich Talebzadeh
-
2023/06/20
Unsubscribe
Bhargava Sukkala
-
2023/06/20
Re: How to read excel file in PySpark
Bjørn Jørgensen
-
2023/06/20
Re: How to read excel file in PySpark
Mich Talebzadeh
-
2023/06/20
Re: Shuffle data on pods which get decomissioned
Mich Talebzadeh
-
2023/06/20
Re: How to read excel file in PySpark
Sean Owen
-
2023/06/20
Re: How to read excel file in PySpark
Mich Talebzadeh
-
2023/06/20
Re: How to read excel file in PySpark
Bjørn Jørgensen
-
2023/06/20
Shuffle data on pods which get decomissioned
Nikhil Goyal
-
2023/06/20
Re: How to read excel file in PySpark
Mich Talebzadeh
-
2023/06/20
Re: How to read excel file in PySpark
Bjørn Jørgensen
-
2023/06/20
Re: How to read excel file in PySpark
Sean Owen
-
2023/06/20
How to read excel file in PySpark
John Paul Jayme
-
2023/06/18
Re: implement a distribution without shuffle like RDD.coalesce for DataSource V2 write
Mich Talebzadeh
-
2023/06/18
Re: implement a distribution without shuffle like RDD.coalesce for DataSource V2 write
Mich Talebzadeh
-
2023/06/18
implement a distribution without shuffle like RDD.coalesce for DataSource V2 write
Pengfei Li
-
2023/06/16
TAC Applications for Community Over Code North America and Asia now open
Gavin McDonald
-
2023/06/15
Fwd: iceberg queries
Gaurav Agarwal
-
2023/06/15
Re: Spark using iceberg
Gaurav Agarwal
-
2023/06/15
Spark using iceberg
Gaurav Agarwal
-
2023/06/11
Unsubscribe
Yu voidy
-
2023/06/09
Re: [Feature Request] create *permanent* Spark View from DataFrame via PySpark
Wenchen Fan
-
2023/06/09
Announcing the Community Over Code 2023 Streaming Track
James Hughes
-
2023/06/08
Re: Apache Spark not reading UTC timestamp from MongoDB correctly
Enrico Minack
-
2023/06/08
Re: Apache Spark not reading UTC timestamp from MongoDB correctly
Sean Owen
-
2023/06/08
Apache Spark not reading UTC timestamp from MongoDB correctly
karan alang
-
2023/06/06
Getting SparkRuntimeException: Unexpected value for length in function slice: length must be greater than or equal to 0
Bariudin, Daniel
-
2023/06/04
Re: [Feature Request] create *permanent* Spark View from DataFrame via PySpark
Mich Talebzadeh
-
2023/06/04
Re: [Feature Request] create *permanent* Spark View from DataFrame via PySpark
keen
-
2023/06/01
[Feature Request] create *permanent* Spark View from DataFrame via PySpark
keen
-
2023/06/01
Re: ChatGPT and prediction of Spark future
Mich Talebzadeh
-
2023/05/31
Comparison of Trino, Spark, and Hive-MR3
Sungwoo Park
-
2023/05/31
Re: Viewing UI for spark jobs running on K8s
Qian Sun
-
2023/05/31
Re: ChatGPT and prediction of Spark future
Winston Lai
-
2023/05/31
Viewing UI for spark jobs running on K8s
Nikhil Goyal
-
2023/05/31
ChatGPT and prediction of Spark future
Mich Talebzadeh
-
2023/05/31
Structured streaming append mode picture question
Hill Liu
-
2023/05/29
Re: [Spark Structured Streaming]: Dynamic Scaling of Executors
Aishwarya Panicker
-
2023/05/29
Re: Re: maven with Spark 3.4.0 fails compilation
Bjørn Jørgensen
-
2023/05/29
Re: Re: maven with Spark 3.4.0 fails compilation
Mich Talebzadeh
-
2023/05/29
Re: Re: maven with Spark 3.4.0 fails compilation
Bjørn Jørgensen
-
2023/05/29
Re: JDK version support information
Sean Owen
-
2023/05/29
Re: JDK version support information
Poorna Murali
-
2023/05/29
Re: JDK version support information
Aironman DirtDiver
-
2023/05/29
JDK version support information
Poorna Murali
-
2023/05/29
Re: Re: maven with Spark 3.4.0 fails compilation
Lingzhe Sun
-
2023/05/29
Re: maven with Spark 3.4.0 fails compilation
Mich Talebzadeh
-
2023/05/28
Re: maven with Spark 3.4.0 fails compilation
Bjørn Jørgensen
-
2023/05/25
Re: [Spark Structured Streaming]: Dynamic Scaling of Executors
Mich Talebzadeh
-
2023/05/25
[Spark Structured Streaming]: Dynamic Scaling of Executors
Aishwarya Panicker
-
2023/05/25
Re: [MLlib] how-to find implementation of Decision Tree Regressor fit function
Sean Owen
-
2023/05/24
Re: Incremental Value dependents on another column of Data frame Spark
Enrico Minack
-
2023/05/23
Dynamic value as the offset of lag() function
Nipuna Shantha
-
2023/05/23
Re: Incremental Value dependents on another column of Data frame Spark
Raghavendra Ganesh
-
2023/05/23
Incremental Value dependents on another column of Data frame Spark
Nipuna Shantha
-
2023/05/23
Re: Shuffle with Window().partitionBy(<column>)
ashok34...@yahoo.com.INVALID
-
2023/05/23
Re: Shuffle with Window().partitionBy(<column>)
Rauf Khan
-
2023/05/22
cannot load model using pyspark
second_co...@yahoo.com.INVALID
-
2023/05/22
Data Stream Processing applications testing
Alexandre Strapacao Guedes Vianna
-
2023/05/22
Re: Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
Mich Talebzadeh
-
2023/05/22
Re: Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
Mich Talebzadeh
-
2023/05/22
RE: Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
Maksym M
-
2023/05/17
Re: Spark shuffle and inevitability of writing to Disk
Mich Talebzadeh
-
2023/05/17
Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
vaquar khan
-
2023/05/16
RE: Understanding Spark S3 Read Performance
info
-
2023/05/16
Understanding Spark S3 Read Performance
Shashank Rao
-
2023/05/16
Spark shuffle and inevitability of writing to Disk
Mich Talebzadeh
-
2023/05/15
Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
Mich Talebzadeh
-
2023/05/15
[spark-core] Can executors recover/reuse shuffle files upon failure?
Faiz Halde
-
2023/05/14
Pyspark cluster mode on standalone deployment
خالد القحطاني
-
2023/05/12
Re: Error while merge in delta table
Farhan Misarwala
-
2023/05/12
Shuffle with Window().partitionBy(<column>)
ashok34...@yahoo.com.INVALID
-
2023/05/12
Re: Error while merge in delta table
Karthick Nk
-
2023/05/11
Does spark read the same file twice, if two stages are using the same DataFrame?
Vijay B
-
2023/05/11
Re: Error while merge in delta table
Farhan Misarwala
-
2023/05/11
Re: Error while merge in delta table
Jacek Laskowski
-
2023/05/10
Error while merge in delta table
Karthick Nk
-
2023/05/10
RE: Can Spark SQL (not DataFrame or Dataset) aggregate array into map of element of count?
Vijay B
-
2023/05/09
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Mich Talebzadeh
-
2023/05/09
Re: Can Spark SQL (not DataFrame or Dataset) aggregate array into map of element of count?
Yong Zhang
-
2023/05/09
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Nitin Siwach
-
2023/05/09
Re: Can Spark SQL (not DataFrame or Dataset) aggregate array into map of element of count?
Yong Zhang
-
2023/05/09
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Mich Talebzadeh
-
2023/05/09
unsubscribe
Balakumar iyer S
-
2023/05/07
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Nitin Siwach
-
2023/05/07
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Mich Talebzadeh
-
2023/05/07
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Nitin Siwach
-
2023/05/07
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Mich Talebzadeh
-
2023/05/07
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Nitin Siwach
-
2023/05/07
unsubscribe
Utkarsh Jain
-
2023/05/06
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Winston Lai
-
2023/05/06
Re: Can Spark SQL (not DataFrame or Dataset) aggregate array into map of element of count?
Mich Talebzadeh
-
2023/05/06
Re: Write DataFrame with Partition and choose Filename in PySpark
Mich Talebzadeh
-
2023/05/05
Can Spark SQL (not DataFrame or Dataset) aggregate array into map of element of count?
Yong Zhang
-
2023/05/05
Re: Write DataFrame with Partition and choose Filename in PySpark
Marco Costantini
-
2023/05/04
Re: Write DataFrame with Partition and choose Filename in PySpark
Mich Talebzadeh
-
2023/05/04
Re: Write DataFrame with Partition and choose Filename in PySpark
Marco Costantini
-
2023/05/04
Re: Write DataFrame with Partition and choose Filename in PySpark
Mich Talebzadeh
-
2023/05/04
Write DataFrame with Partition and choose Filename in PySpark
Marco Costantini
-
2023/05/04
Re: Write custom JSON from DataFrame in PySpark
Marco Costantini
-
2023/05/04
Re: Write custom JSON from DataFrame in PySpark
Enrico Minack
-
2023/05/03
Write custom JSON from DataFrame in PySpark
Marco Costantini
-
2023/05/03
How to create spark udf use functioncatalog?
tzxxh
-
2023/05/03
unsubscribe
Kang
-
2023/05/02
Re: How to determine the function of tasks on each stage in an Apache Spark application?
Trường Trần Phan An
-
2023/05/02
CVE-2023-32007: Apache Spark: Shell command injection via Spark UI
Arnout Engelen
-
2023/05/01
Unsubscribe
rau-jannik
-
2023/05/01
Unsubscribe
peng
-
2023/05/01
Unsubscribe
sandeep vura
-
2023/05/01
Re: Change column values using several when conditions
Bjørn Jørgensen
-
2023/05/01
Change column values using several when conditions
marc nicole
-
2023/04/30
How to change column values using several when conditions ?
marc nicole
-
2023/04/30
Any experience with K8s Remote Shuffling Service at scale?
Andrey Gourine