user
Thread
Date
Earlier messages
Later messages
Messages by Thread
Re: Interested in contributing to SPARK-24815
Kent Yao
Re: Interested in contributing to SPARK-24815
Pavan Kotikalapudi
Re: Interested in contributing to SPARK-24815
Rinat Shangeeta
Re: Interested in contributing to SPARK-24815
Sean Owen
Re: Spark 3.3 + parquet 1.10
Mich Talebzadeh
Spark3.3 with parquet 1.10.x
Pralabh Kumar
Unable to launch Spark connect on Docker image
Edmondo Porcu
Re: Unable to launch Spark connect on Docker image
Mich Talebzadeh
Argo for general purpose k8s scheduling
Mich Talebzadeh
Spark Scala SBT Local build fails
Varun Shah
Re: Spark Scala SBT Local build fails
Varun Shah
Re: Spark Scala SBT Local build fails
Varun Shah
Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Mich Talebzadeh
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Jay
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Yeachan Park
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Mich Talebzadeh
Re: Spark File Output Committer algorithm for GCS
Jay
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Mich Talebzadeh
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Mich Talebzadeh
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Dipayan Dev
Re: Spark File Output Committer algorithm for GCS
Mich Talebzadeh
Contributing to Spark MLLib
Dipayan Dev
Re: Contributing to Spark MLLib
Brian Huynh
Re: Contributing to Spark MLLib
Gourav Sengupta
[no subject]
Varun Shah
[Spark RPC]: Yarn - Application Master / executors to Driver communication issue
Sunayan Saikia
Spark Not Connecting
timi ayoade
Re: [EXTERNAL] Spark Not Connecting
Daniel Tavares de Santana
Re: Spark Not Connecting
Artemis User
Loading in custom Hive jars for spark
Yeachan Park
Re: Loading in custom Hive jars for spark
Mich Talebzadeh
Jobs that have join & have .rdd calls get executed 2x when AQE is enabled.
Priyanka Raju
Spark UI - Bug Executors tab when using proxy port
Bruno Pistone
Performance Issue with Column Addition in Spark 3.4.x: Time Doubling with Increased Columns
KO Dukhyun
Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Gengliang Wang
Re: Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Farshid Ashouri
Re: Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Hyukjin Kwon
Re: Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Gavin Ray
Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
Re: Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
Re: Filtering JSON records when there isn't an exact schema match in Spark
Vikas Kumar
Re: Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
Re: Filtering JSON records when there isn't an exact schema match in Spark
Hill Liu
Re: Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
CFP for the 2nd Performance Engineering track at Community over Code NA 2023
Brebner, Paul
PySpark error java.lang.IllegalArgumentException
elango vaidyanathan
Re: PySpark error java.lang.IllegalArgumentException
elango vaidyanathan
Re: PySpark error java.lang.IllegalArgumentException
Khalid Mammadov
Re: PySpark error java.lang.IllegalArgumentException
Brian Huynh
Re: PySpark error java.lang.IllegalArgumentException
elango vaidyanathan
[Spark SQL] Data objects from query history
Ruben Mennes
Re: [Spark SQL] Data objects from query history
Jack Wells
checkpoint file deletion
Lingzhe Sun
subscribe
mojianan2015
Re:subscribe
mojianan2015
subscribe
Sahib Aulakh
subscribe
Sahib Aulakh
[PySpark] Intermittent Spark session initialization error on M1 Mac
BeoumSuk Kim
[k8s] Fail to expose custom port on executor container specified in my executor pod template
James Yu
[Spark-SQL] Dataframe write saveAsTable failed
Anil Dasari
Unable to populate spark metrics using custom metrics API
Surya Soma
Unable to populate spark metrics using custom metrics API
Surya Soma
Re: Unable to populate spark metrics using custom metrics API
Surya Soma
Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Patrick Tucci
Re: Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Mich Talebzadeh
Re: Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Patrick Tucci
Re: Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Mich Talebzadeh
[Spark streaming]: Microbatch id in logs
Anil Dasari
Re: [Spark streaming]: Microbatch id in logs
Mich Talebzadeh
Apache Spark with watermark - processing data different LogTypes in same kafka topic
karan alang
[ANNOUNCE] Apache Spark 3.4.1 released
Dongjoon Hyun
Re: [ANNOUNCE] Apache Spark 3.4.1 released
Mridul Muralidharan
Re: [ANNOUNCE] Apache Spark 3.4.1 released
Hyukjin Kwon
Re: [ANNOUNCE] Apache Spark 3.4.1 released
L. C. Hsieh
Re: [ANNOUNCE] Apache Spark 3.4.1 released
yangjie01
Re:[ANNOUNCE] Apache Spark 3.4.1 released
beliefer
Rename columns without manually setting them all
John Paul Jayme
Re: Rename columns without manually setting them all
Farshid Ashouri
Re: Rename columns without manually setting them all
Bjørn Jørgensen
Shuffle data on pods which get decomissioned
Nikhil Goyal
Re: Shuffle data on pods which get decomissioned
Mich Talebzadeh
How to read excel file in PySpark
John Paul Jayme
Re: How to read excel file in PySpark
Sean Owen
Re: How to read excel file in PySpark
Bjørn Jørgensen
Re: How to read excel file in PySpark
Mich Talebzadeh
Re: How to read excel file in PySpark
Bjørn Jørgensen
Re: How to read excel file in PySpark
Mich Talebzadeh
Re: How to read excel file in PySpark
Sean Owen
Re: How to read excel file in PySpark
Mich Talebzadeh
Re: How to read excel file in PySpark
Bjørn Jørgensen
Re: How to read excel file in PySpark
Mich Talebzadeh
implement a distribution without shuffle like RDD.coalesce for DataSource V2 write
Pengfei Li
Re: implement a distribution without shuffle like RDD.coalesce for DataSource V2 write
Mich Talebzadeh
Re: implement a distribution without shuffle like RDD.coalesce for DataSource V2 write
Mich Talebzadeh
TAC Applications for Community Over Code North America and Asia now open
Gavin McDonald
Fwd: iceberg queries
Gaurav Agarwal
Spark using iceberg
Gaurav Agarwal
Re: Spark using iceberg
Gaurav Agarwal
Announcing the Community Over Code 2023 Streaming Track
James Hughes
Apache Spark not reading UTC timestamp from MongoDB correctly
karan alang
Re: Apache Spark not reading UTC timestamp from MongoDB correctly
Sean Owen
Re: Apache Spark not reading UTC timestamp from MongoDB correctly
Enrico Minack
Getting SparkRuntimeException: Unexpected value for length in function slice: length must be greater than or equal to 0
Bariudin, Daniel
[Feature Request] create *permanent* Spark View from DataFrame via PySpark
keen
Re: [Feature Request] create *permanent* Spark View from DataFrame via PySpark
keen
Re: [Feature Request] create *permanent* Spark View from DataFrame via PySpark
Mich Talebzadeh
Re: [Feature Request] create *permanent* Spark View from DataFrame via PySpark
Wenchen Fan
Comparison of Trino, Spark, and Hive-MR3
Sungwoo Park
Viewing UI for spark jobs running on K8s
Nikhil Goyal
Re: Viewing UI for spark jobs running on K8s
Qian Sun
ChatGPT and prediction of Spark future
Mich Talebzadeh
Re: ChatGPT and prediction of Spark future
Winston Lai
Re: ChatGPT and prediction of Spark future
Mich Talebzadeh
Structured streaming append mode picture question
Hill Liu
JDK version support information
Poorna Murali
Re: JDK version support information
Aironman DirtDiver
Re: JDK version support information
Poorna Murali
Re: JDK version support information
Sean Owen
Re: maven with Spark 3.4.0 fails compilation
Bjørn Jørgensen
Re: maven with Spark 3.4.0 fails compilation
Mich Talebzadeh
Re: Re: maven with Spark 3.4.0 fails compilation
Lingzhe Sun
Re: Re: maven with Spark 3.4.0 fails compilation
Bjørn Jørgensen
Re: Re: maven with Spark 3.4.0 fails compilation
Mich Talebzadeh
Re: Re: maven with Spark 3.4.0 fails compilation
Bjørn Jørgensen
[Spark Structured Streaming]: Dynamic Scaling of Executors
Aishwarya Panicker
Re: [Spark Structured Streaming]: Dynamic Scaling of Executors
Mich Talebzadeh
Re: [Spark Structured Streaming]: Dynamic Scaling of Executors
Aishwarya Panicker
Re: [MLlib] how-to find implementation of Decision Tree Regressor fit function
Sean Owen
Dynamic value as the offset of lag() function
Nipuna Shantha
Incremental Value dependents on another column of Data frame Spark
Nipuna Shantha
Re: Incremental Value dependents on another column of Data frame Spark
Raghavendra Ganesh
Re: Incremental Value dependents on another column of Data frame Spark
Enrico Minack
cannot load model using pyspark
[email protected]
Data Stream Processing applications testing
Alexandre Strapacao Guedes Vianna
Understanding Spark S3 Read Performance
Shashank Rao
RE: Understanding Spark S3 Read Performance
info
Spark shuffle and inevitability of writing to Disk
Mich Talebzadeh
Re: Spark shuffle and inevitability of writing to Disk
Mich Talebzadeh
[spark-core] Can executors recover/reuse shuffle files upon failure?
Faiz Halde
Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
Mich Talebzadeh
Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
vaquar khan
RE: Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
Maksym M
Re: Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
Mich Talebzadeh
Re: Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
Mich Talebzadeh
Pyspark cluster mode on standalone deployment
خالد القحطاني
Shuffle with Window().partitionBy(<column>)
[email protected]
Re: Shuffle with Window().partitionBy(<column>)
Rauf Khan
Re: Shuffle with Window().partitionBy(<column>)
[email protected]
Error while merge in delta table
Karthick Nk
Re: Error while merge in delta table
Jacek Laskowski
Re: Error while merge in delta table
Farhan Misarwala
Re: Error while merge in delta table
Karthick Nk
Re: Error while merge in delta table
Farhan Misarwala
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Winston Lai
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Nitin Siwach
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Mich Talebzadeh
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Nitin Siwach
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Mich Talebzadeh
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Nitin Siwach
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Mich Talebzadeh
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Nitin Siwach
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Mich Talebzadeh
Does spark read the same file twice, if two stages are using the same DataFrame?
Vijay B
Can Spark SQL (not DataFrame or Dataset) aggregate array into map of element of count?
Yong Zhang
Re: Can Spark SQL (not DataFrame or Dataset) aggregate array into map of element of count?
Mich Talebzadeh
Re: Can Spark SQL (not DataFrame or Dataset) aggregate array into map of element of count?
Yong Zhang
Re: Can Spark SQL (not DataFrame or Dataset) aggregate array into map of element of count?
Yong Zhang
RE: Can Spark SQL (not DataFrame or Dataset) aggregate array into map of element of count?
Vijay B
Write DataFrame with Partition and choose Filename in PySpark
Marco Costantini
Re: Write DataFrame with Partition and choose Filename in PySpark
Mich Talebzadeh
Re: Write DataFrame with Partition and choose Filename in PySpark
Marco Costantini
Re: Write DataFrame with Partition and choose Filename in PySpark
Mich Talebzadeh
Re: Write DataFrame with Partition and choose Filename in PySpark
Marco Costantini
Re: Write DataFrame with Partition and choose Filename in PySpark
Mich Talebzadeh
Write custom JSON from DataFrame in PySpark
Marco Costantini
Re: Write custom JSON from DataFrame in PySpark
Enrico Minack
Re: Write custom JSON from DataFrame in PySpark
Marco Costantini
CVE-2023-32007: Apache Spark: Shell command injection via Spark UI
Arnout Engelen
Change column values using several when conditions
marc nicole
Re: Change column values using several when conditions
Bjørn Jørgensen
How to change column values using several when conditions ?
marc nicole
Any experience with K8s Remote Shuffling Service at scale?
Andrey Gourine
How to read text files with GBK encoding in the spark core
[email protected]
Tensorflow on Spark CPU
[email protected]
Re: Tensorflow on Spark CPU
Sean Owen
Re: Tensorflow on Spark CPU
[email protected]
Re: Tensorflow on Spark CPU
Sean Owen
driver and executors shared same Kubernetes PVC
[email protected]
***pyspark.sql.functions.monotonically_increasing_id()***
Karthick Nk
Re: ***pyspark.sql.functions.monotonically_increasing_id()***
Winston Lai
config: minOffsetsPerTrigger not working
Abhishek Singla
Re: config: minOffsetsPerTrigger not working
Mich Talebzadeh
Earlier messages
Later messages