user

Messages by Thread

- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community [email protected]
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Parsian, Mahmoud
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Mich Talebzadeh
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Hyukjin Kwon
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Code Tutelage
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Deepak Sharma
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Bjørn Jørgensen
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Mich Talebzadeh
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Reynold Xin
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Mich Talebzadeh
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Joris Billen
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Mich Talebzadeh
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Varun Shah
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Farshid Ashouri
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Kiran Kumar Dusi
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Jay Han
- Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community Winston Lai
[GraphX]: Prevent recomputation of DAG Marek Berith
- Re: [GraphX]: Prevent recomputation of DAG Mich Talebzadeh
Python library that generates fake data using Faker Mich Talebzadeh
Requesting further assistance with Spark Scala code coverage 里昂
pyspark - Where are Dataframes created from Python objects stored? Sreyan Chakravarty
- Re: pyspark - Where are Dataframes created from Python objects stored? Mich Talebzadeh
- Re: pyspark - Where are Dataframes created from Python objects stored? Sreyan Chakravarty
- Re: pyspark - Where are Dataframes created from Python objects stored? Mich Talebzadeh
- Re: pyspark - Where are Dataframes created from Python objects stored? Sreyan Chakravarty
- Re: pyspark - Where are Dataframes created from Python objects stored? Varun Shah
Data ingestion into elastic failing using pyspark Karthick Nk
Bug in How to Monitor Streaming Queries in PySpark Mich Talebzadeh
- Re: Bug in How to Monitor Streaming Queries in PySpark 刘唯
- Re: Bug in How to Monitor Streaming Queries in PySpark 刘唯
- Re: Bug in How to Monitor Streaming Queries in PySpark Mich Talebzadeh
- Re: Bug in How to Monitor Streaming Queries in PySpark 刘唯
- Re: Bug in How to Monitor Streaming Queries in PySpark Mich Talebzadeh
Spark on Kubenets, execute dataset.show raise exceptions BODY NO
Spark-UI stages and other tabs not accessible in standalone mode when reverse-proxy is enabled sharad mishra
- Spark-UI stages and other tabs not accessible in standalone mode when reverse-proxy is enabled sharad mishra
Creating remote tables using PySpark Tom Barber
- Re: Creating remote tables using PySpark Tom Barber
- Re: Creating remote tables using PySpark Tom Barber
- Re: Creating remote tables using PySpark Mich Talebzadeh
Dark mode logo Mike Drob
S3 committer for dynamic partitioning Nikhil Goyal
It seems --py-files only takes the first two arguments. Can someone please confirm? Pedro, Chuck
- Re: It seems --py-files only takes the first two arguments. Can someone please confirm? Mich Talebzadeh
- Re: It seems --py-files only takes the first two arguments. Can someone please confirm? Mich Talebzadeh
Working with a text file that is both compressed by bz2 followed by zip in PySpark Mich Talebzadeh
pyspark dataframe join with two different data type Karthick Nk
- Re: pyspark dataframe join with two different data type Mich Talebzadeh
- Re: pyspark dataframe join with two different data type Karthick Nk
- Re: pyspark dataframe join with two different data type Damien Hawes
- Re: pyspark dataframe join with two different data type Karthick Nk
- Re: pyspark dataframe join with two different data type Mich Talebzadeh
- Re: pyspark dataframe join with two different data type Karthick Nk
- Re: pyspark dataframe join with two different data type Karthick Nk
[ANNOUNCE] Apache Spark 3.5.1 released Jungtaek Lim
- Re:[ANNOUNCE] Apache Spark 3.5.1 released beliefer
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Dongjoon Hyun
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Xinrong Meng
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Prem Sahoo
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Peter Toth
- Re: [ANNOUNCE] Apache Spark 3.5.1 released John Zhuge
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Dongjoon Hyun
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Jungtaek Lim
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Jungtaek Lim
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Hyukjin Kwon
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Jungtaek Lim
- Re: [ANNOUNCE] Apache Spark 3.5.1 released yangjie01
- 答复: [ANNOUNCE] Apache Spark 3.5.1 released Pan,Bingkun
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Jungtaek Lim
- 答复: [ANNOUNCE] Apache Spark 3.5.1 released Pan,Bingkun
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Jungtaek Lim
- 答复: [ANNOUNCE] Apache Spark 3.5.1 released Pan,Bingkun
- Re: [ANNOUNCE] Apache Spark 3.5.1 released Jungtaek Lim
- 答复: [ANNOUNCE] Apache Spark 3.5.1 released Pan,Bingkun
[Spark Core] Potential bug in JavaRDD#countByValue Stuart Fehr
- Re: [Spark Core] Potential bug in JavaRDD#countByValue Mich Talebzadeh
Bugs with joins and SQL in Structured Streaming Andrzej Zera
- Re: Bugs with joins and SQL in Structured Streaming Mich Talebzadeh
- Re: Bugs with joins and SQL in Structured Streaming Andrzej Zera
- Re: Bugs with joins and SQL in Structured Streaming Andrzej Zera
- Re: Bugs with joins and SQL in Structured Streaming Jungtaek Lim
- Re: Bugs with joins and SQL in Structured Streaming Jungtaek Lim
- Re: Bugs with joins and SQL in Structured Streaming Jungtaek Lim
- Re: Bugs with joins and SQL in Structured Streaming Andrzej Zera
Re: Bintray replacement for spark-packages.org Richard Eggert
Issue of spark with antlr version Chawla, Parul
- RE: Issue of spark with antlr version Sahni, Ashima
- Re: Issue of spark with antlr version Mich Talebzadeh
- Re: Issue of spark with antlr version Bjørn Jørgensen
- Re: [External] Re: Issue of spark with antlr version Chawla, Parul
- Re: [External] Re: Issue of spark with antlr version Bjørn Jørgensen
- Re: [External] Re: Issue of spark with antlr version Chawla, Parul
- Re: [External] Re: Issue of spark with antlr version Bjørn Jørgensen
Re: AQE coalesce 60G shuffle data into a single partition Enrico Minack
[Beginner Debug]: Executor OutOfMemoryError Shawn Ligocki
- Re: [Beginner Debug]: Executor OutOfMemoryError Mich Talebzadeh
Kafka-based Spark Streaming and Vertex AI for Sentiment Analysis Mich Talebzadeh
[ANNOUNCE] Apache Kyuubi 1.8.1 is available Cheng Pan
Re: Spark 3.3 Query Analyzer Bug Report Sharma, Anup
Spark 4.0 Query Analyzer Bug Report Sharma, Anup
- Re: Spark 4.0 Query Analyzer Bug Report Holden Karau
- Re: Spark 4.0 Query Analyzer Bug Report Mich Talebzadeh
Community Over Code Asia 2024 Travel Assistance Applications now open! Gavin McDonald
[Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures Sri Potluri
- Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures Mich Talebzadeh
- Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures Mich Talebzadeh
- Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures Sri Potluri
- Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures Mich Talebzadeh
- Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures Cheng Pan
Regarding Spark on Kubernetes(EKS) Jagannath Majhi
- Re: Regarding Spark on Kubernetes(EKS) Richard Smith
- Re: Regarding Spark on Kubernetes(EKS) Mich Talebzadeh
- Re: Regarding Spark on Kubernetes(EKS) Jagannath Majhi
- Re: Regarding Spark on Kubernetes(EKS) Mich Talebzadeh
- Re: Regarding Spark on Kubernetes(EKS) Mich Talebzadeh
- Re: Regarding Spark on Kubernetes(EKS) Jagannath Majhi
Re: Re-create SparkContext of SparkSession inside long-lived Spark app Adam Binford
- Re: Re-create SparkContext of SparkSession inside long-lived Spark app Jörn Franke
- Re: Re-create SparkContext of SparkSession inside long-lived Spark app Mich Talebzadeh
- Re: Re-create SparkContext of SparkSession inside long-lived Spark app Saha, Daniel
- Re: Re-create SparkContext of SparkSession inside long-lived Spark app Mich Talebzadeh
Re: job uuid not unique Mich Talebzadeh
- Re: job uuid not unique Xin Zhang
Effectively append the dataset to avro directory Rushikesh Kavar
Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow Chao Sun
- Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow Holden Karau
- Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow Yufei Gu
- Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow John Zhuge
- Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow Chao Sun
- Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow praveen sinha
- Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow Chao Sun
- Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow Mich Talebzadeh
- Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow Mich Talebzadeh
- Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow Chao Sun
- Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow Mich Talebzadeh
- Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow Manoj Kumar
Null pointer exception while replying WAL nayan sharma
- Re: Null pointer exception while replying WAL Mich Talebzadeh
- Re: Null pointer exception while replying WAL nayan sharma
- Re: Null pointer exception while replying WAL Mich Talebzadeh
Building an Event-Driven Real-Time Data Processor with Spark Structured Streaming and API Integration Mich Talebzadeh
- Re: Building an Event-Driven Real-Time Data Processor with Spark Structured Streaming and API Integration Mich Talebzadeh
performance of union vs insert into Manish Mehra
[ANNOUNCE] Apache Celeborn(incubating) 0.4.0 available Fu Chen
Community over Code EU 2024 Travel Assistance Applications now open! Gavin McDonald
[no subject] Gavin McDonald
deploy spark as cluster ali sharifi
Create Custom Logs PRASHANT L
randomsplit has issue? [email protected]
Issue in Creating Temp_view in databricks and using spark.sql(). Karthick Nk
- Re: Issue in Creating Temp_view in databricks and using spark.sql(). Jungtaek Lim
- Re: Issue in Creating Temp_view in databricks and using spark.sql(). Mich Talebzadeh
- Re: Issue in Creating Temp_view in databricks and using spark.sql(). Mich Talebzadeh
[Spark SQL]: Crash when attempting to select PostgreSQL bpchar without length specifier in Spark 3.5.0 Lily Hahn
startTimestamp doesn't work when using rate-micro-batch format Perfect Stranger
- Re: startTimestamp doesn't work when using rate-micro-batch format Mich Talebzadeh
- Re: startTimestamp doesn't work when using rate-micro-batch format Perfect Stranger
- Re: startTimestamp doesn't work when using rate-micro-batch format Mich Talebzadeh
Some optimization questions about our beloved engine Spark Aissam Chia
Facing Error org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any valid local directory for s3ablock-0001- Abhishek Singla
- Re: Facing Error org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any valid local directory for s3ablock-0001- Abhishek Singla
- Re: Facing Error org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any valid local directory for s3ablock-0001- Bjørn Jørgensen
- Re: Facing Error org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any valid local directory for s3ablock-0001- Mich Talebzadeh
[spark.local.dir] comma separated list does not work Andrew Petersen
- Re: [spark.local.dir] comma separated list does not work Koert Kuipers
- Re: [spark.local.dir] comma separated list does not work Andrew Petersen
- Re: [spark.local.dir] comma separated list does not work Andrew Petersen
- Unsubscribe Andrew Redd
[GraphFrames Spark Package]: Why is there not a distribution for Spark 3.3? Boileau, Brad
- Re: [GraphFrames Spark Package]: Why is there not a distribution for Spark 3.3? Russell Jurney
- Re: [External] Re: [GraphFrames Spark Package]: Why is there not a distribution for Spark 3.3? Ofir Manor
Best option to process single kafka stream in parallel: PySpark Vs Dask lab22
Structured Streaming Process Each Records Individually PRASHANT L
- Re: Structured Streaming Process Each Records Individually Khalid Mammadov
- Re: Structured Streaming Process Each Records Individually Ant Kutschera
- Re: Structured Streaming Process Each Records Individually Mich Talebzadeh
- Re: Structured Streaming Process Each Records Individually Mich Talebzadeh
[Structured Streaming] Avoid one microbatch delay with multiple stateful operations Andrzej Zera
- Re: [Structured Streaming] Avoid one microbatch delay with multiple stateful operations Ant Kutschera
- Re: [Structured Streaming] Avoid one microbatch delay with multiple stateful operations Jungtaek Lim
- Re: [Structured Streaming] Avoid one microbatch delay with multiple stateful operations Andrzej Zera
[apache-spark] documentation on File Metadata _metadata struct Jason Horner
Spark Structured Streaming and Flask REST API for Real-Time Data Ingestion and Analytics. Mich Talebzadeh
- Re: Spark Structured Streaming and Flask REST API for Real-Time Data Ingestion and Analytics. Mich Talebzadeh
- Re: Spark Structured Streaming and Flask REST API for Real-Time Data Ingestion and Analytics. [email protected]
- Re: Spark Structured Streaming and Flask REST API for Real-Time Data Ingestion and Analytics. Mich Talebzadeh
[ANNOUNCE] Apache Celeborn(incubating) 0.3.2 available Nicholas Jiang
[Structured Streaming] Keeping checkpointing cost under control Andrzej Zera
- Re: [Structured Streaming] Keeping checkpointing cost under control Mich Talebzadeh
- Re: [Structured Streaming] Keeping checkpointing cost under control Andrzej Zera
- Re: [Structured Streaming] Keeping checkpointing cost under control Mich Talebzadeh
- Re: [Structured Streaming] Keeping checkpointing cost under control Andrzej Zera
- Re: [Structured Streaming] Keeping checkpointing cost under control Mich Talebzadeh
- Re: [Structured Streaming] Keeping checkpointing cost under control Andrzej Zera
- Re: [Structured Streaming] Keeping checkpointing cost under control Mich Talebzadeh
- Re: [Structured Streaming] Keeping checkpointing cost under control Andrzej Zera
- Re: [Structured Streaming] Keeping checkpointing cost under control Mich Talebzadeh
- Re: [Structured Streaming] Keeping checkpointing cost under control Jungtaek Lim
Issue with Spark Session Initialization in Kubernetes Deployment Atul Patil