user
Messages by Thread
[Spark SQL][How-To] Remove builtin function support from Spark
Matthew McMillian
should OutputCommitCoordinator fail stages for authorized committer failures when using s3a optimized committers?
Dylan McClelland
[Spark SQL] xxhash64 default seed of 42 confusion
Igor Calabria
auto create event log directory if not exist
second_co...@yahoo.com.INVALID
Spark streaming job for kafka transaction does not consume read_committed messages correctly.
Kidong Lee
Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly.
Mich Talebzadeh
Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly.
Kidong Lee
Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly.
Kidong Lee
Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly.
Mich Talebzadeh
Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly.
Kidong Lee
Spark column headings, camelCase or snake case?
Mich Talebzadeh
[Spark SQL]: Source code for PartitionedFile
Ashley McManamon
Re: [Spark SQL]: Source code for PartitionedFile
Mich Talebzadeh
Re: [Spark SQL]: Source code for PartitionedFile
Ashley McManamon
How to get db related metrics when use spark jdbc to read db table?
casel.chen
Re: How to get db related metrics when use spark jdbc to read db table?
Mich Talebzadeh
Re: How to get db related metrics when use spark jdbc to read db table?
Femi Anthony
Spark UDAF in examples fail with not serializable error
Owen Bell
Idiomatic way to rate-limit streaming sources to avoid OutOfMemoryError?
Baran, Mert
Re: Idiomatic way to rate-limit streaming sources to avoid OutOfMemoryError?
Mich Talebzadeh
Example UDAF fails with "not serializable" exception
Owen Bell
External Spark shuffle service for k8s
Mich Talebzadeh
Re: External Spark shuffle service for k8s
Bjørn Jørgensen
Re: External Spark shuffle service for k8s
Mich Talebzadeh
Re: External Spark shuffle service for k8s
Vakaris Baškirov
Re: External Spark shuffle service for k8s
Mich Talebzadeh
Re: External Spark shuffle service for k8s
roryqi
Re: External Spark shuffle service for k8s
Vakaris Baškirov
Re: External Spark shuffle service for k8s
Mich Talebzadeh
Re: External Spark shuffle service for k8s
Arun Ravi
Re: External Spark shuffle service for k8s
Bjørn Jørgensen
Re: External Spark shuffle service for k8s
Bjørn Jørgensen
Re: External Spark shuffle service for k8s
Cheng Pan
Re: External Spark shuffle service for k8s
Mich Talebzadeh
Re: External Spark shuffle service for k8s
Enrico Minack
Clarification on what "[id=#]" refers to in Physical Plan Exchange hashpartitioning
Tahj Anderson
Clarification on what "[id=#]" refers to in Physical Plan Exchange hashpartitioning
Tahj Anderson
Participate in the ASF 25th Anniversary Campaign
Brian Proffitt
[Spark]: Spark / Iceberg / hadoop-aws compatibility matrix
Oxlade, Dan
Re: [Spark]: Spark / Iceberg / hadoop-aws compatibility matrix
Aaron Grubb
Re: [EXTERNAL] Re: [Spark]: Spark / Iceberg / hadoop-aws compatibility matrix
Oxlade, Dan
Re: [EXTERNAL] Re: [Spark]: Spark / Iceberg / hadoop-aws compatibility matrix
Oxlade, Dan
Re: [EXTERNAL] Re: [Spark]: Spark / Iceberg / hadoop-aws compatibility matrix
Oxlade, Dan
[Spark SQL] How can I use .sql() in conjunction with watermarks?
Chloe He
Re: [Spark SQL] How can I use .sql() in conjunction with watermarks?
Mich Talebzadeh
RE: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks?
Chloe He
Re: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks?
Mich Talebzadeh
Re: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks?
刘唯
Re: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks?
刘唯
Re: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks?
Mich Talebzadeh
Apache Spark integration with Spring Boot 3.0.0+
Szymon Kasperkiewicz
Community Over Code NA 2024 Travel Assistance Applications now open!
Gavin McDonald
[DISCUSS] MySQL version support policy
Cheng Pan
Re: [DISCUSS] MySQL version support policy
Dongjoon Hyun
Is one Spark partition mapped to one and only Spark Task ?
Sreyan Chakravarty
Feature article: Leveraging Generative AI with Apache Spark: Transforming Data Engineering
Mich Talebzadeh
Re: Feature article: Leveraging Generative AI with Apache Spark: Transforming Data Engineering
Mich Talebzadeh
Bug in org.apache.spark.util.sketch.BloomFilter
Nathan Conroy
[no subject]
Рамик И
Re:
Mich Talebzadeh
Announcing the Community Over Code 2024 Streaming Track
James Hughes
[ANNOUNCE] Apache Kyuubi released 1.9.0
Binjie Yang
pyspark - Use Spark to generate a large dataset on the fly
Sreyan Chakravarty
pyspark - Use Spark to generate a large dataset on the fly
Sreyan Chakravarty
A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Mich Talebzadeh
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
ashok34...@yahoo.com.INVALID
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Parsian, Mahmoud
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Mich Talebzadeh
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Hyukjin Kwon
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Code Tutelage
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Deepak Sharma
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Bjørn Jørgensen
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Mich Talebzadeh
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Reynold Xin
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Mich Talebzadeh
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Joris Billen
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Mich Talebzadeh
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Varun Shah
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Farshid Ashouri
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Kiran Kumar Dusi
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Jay Han
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Winston Lai
[GraphX]: Prevent recomputation of DAG
Marek Berith
Re: [GraphX]: Prevent recomputation of DAG
Mich Talebzadeh
Python library that generates fake data using Faker
Mich Talebzadeh
Requesting further assistance with Spark Scala code coverage
里昂
pyspark - Where are Dataframes created from Python objects stored?
Sreyan Chakravarty
Re: pyspark - Where are Dataframes created from Python objects stored?
Mich Talebzadeh
Re: pyspark - Where are Dataframes created from Python objects stored?
Sreyan Chakravarty
Re: pyspark - Where are Dataframes created from Python objects stored?
Mich Talebzadeh
Re: pyspark - Where are Dataframes created from Python objects stored?
Sreyan Chakravarty
Re: pyspark - Where are Dataframes created from Python objects stored?
Varun Shah
Data ingestion into elastic failing using pyspark
Karthick Nk
Bug in How to Monitor Streaming Queries in PySpark
Mich Talebzadeh
Re: Bug in How to Monitor Streaming Queries in PySpark
刘唯
Re: Bug in How to Monitor Streaming Queries in PySpark
刘唯
Re: Bug in How to Monitor Streaming Queries in PySpark
Mich Talebzadeh
Re: Bug in How to Monitor Streaming Queries in PySpark
刘唯
Re: Bug in How to Monitor Streaming Queries in PySpark
Mich Talebzadeh
Spark on Kubernetes, execute dataset.show raise exceptions
BODY NO
Spark-UI stages and other tabs not accessible in standalone mode when reverse-proxy is enabled
sharad mishra
Spark-UI stages and other tabs not accessible in standalone mode when reverse-proxy is enabled
sharad mishra
Creating remote tables using PySpark
Tom Barber
Re: Creating remote tables using PySpark
Tom Barber
Re: Creating remote tables using PySpark
Tom Barber
Re: Creating remote tables using PySpark
Mich Talebzadeh
Dark mode logo
Mike Drob
S3 committer for dynamic partitioning
Nikhil Goyal
It seems --py-files only takes the first two arguments. Can someone please confirm?
Pedro, Chuck
Re: It seems --py-files only takes the first two arguments. Can someone please confirm?
Mich Talebzadeh
Re: It seems --py-files only takes the first two arguments. Can someone please confirm?
Mich Talebzadeh
Working with a text file that is both compressed by bz2 followed by zip in PySpark
Mich Talebzadeh
pyspark dataframe join with two different data type
Karthick Nk
Re: pyspark dataframe join with two different data type
Mich Talebzadeh
Re: pyspark dataframe join with two different data type
Karthick Nk
Re: pyspark dataframe join with two different data type
Damien Hawes
Re: pyspark dataframe join with two different data type
Karthick Nk
Re: pyspark dataframe join with two different data type
Mich Talebzadeh
Re: pyspark dataframe join with two different data type
Karthick Nk
Re: pyspark dataframe join with two different data type
Karthick Nk
[ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
Re:[ANNOUNCE] Apache Spark 3.5.1 released
beliefer
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Dongjoon Hyun
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Xinrong Meng
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Prem Sahoo
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Peter Toth
Re: [ANNOUNCE] Apache Spark 3.5.1 released
John Zhuge
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Dongjoon Hyun
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Hyukjin Kwon
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
Re: [ANNOUNCE] Apache Spark 3.5.1 released
yangjie01
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Pan,Bingkun
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Pan,Bingkun
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Pan,Bingkun
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Pan,Bingkun
[Spark Core] Potential bug in JavaRDD#countByValue
Stuart Fehr
Re: [Spark Core] Potential bug in JavaRDD#countByValue
Mich Talebzadeh
Bugs with joins and SQL in Structured Streaming
Andrzej Zera
Re: Bugs with joins and SQL in Structured Streaming
Mich Talebzadeh
Re: Bugs with joins and SQL in Structured Streaming
Andrzej Zera
Re: Bugs with joins and SQL in Structured Streaming
Andrzej Zera
Re: Bugs with joins and SQL in Structured Streaming
Jungtaek Lim
Re: Bugs with joins and SQL in Structured Streaming
Jungtaek Lim
Re: Bugs with joins and SQL in Structured Streaming
Jungtaek Lim
Re: Bugs with joins and SQL in Structured Streaming
Andrzej Zera
Re: Bintray replacement for spark-packages.org
Richard Eggert
Issue of spark with antlr version
Chawla, Parul
RE: Issue of spark with antlr version
Sahni, Ashima
Re: Issue of spark with antlr version
Mich Talebzadeh
Re: Issue of spark with antlr version
Bjørn Jørgensen
Re: [External] Re: Issue of spark with antlr version
Chawla, Parul
Re: [External] Re: Issue of spark with antlr version
Bjørn Jørgensen
Re: [External] Re: Issue of spark with antlr version
Chawla, Parul
Re: [External] Re: Issue of spark with antlr version
Bjørn Jørgensen
Re: AQE coalesce 60G shuffle data into a single partition
Enrico Minack
[Beginner Debug]: Executor OutOfMemoryError
Shawn Ligocki
Re: [Beginner Debug]: Executor OutOfMemoryError
Mich Talebzadeh
Kafka-based Spark Streaming and Vertex AI for Sentiment Analysis
Mich Talebzadeh
[ANNOUNCE] Apache Kyuubi 1.8.1 is available
Cheng Pan
Re: Spark 3.3 Query Analyzer Bug Report
Sharma, Anup
Spark 4.0 Query Analyzer Bug Report
Sharma, Anup
Re: Spark 4.0 Query Analyzer Bug Report
Holden Karau
Re: Spark 4.0 Query Analyzer Bug Report
Mich Talebzadeh
Community Over Code Asia 2024 Travel Assistance Applications now open!
Gavin McDonald
[Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures
Sri Potluri
Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures
Mich Talebzadeh
Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures
Mich Talebzadeh
Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures
Sri Potluri
Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures
Mich Talebzadeh
Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures
Cheng Pan
Regarding Spark on Kubernetes(EKS)
Jagannath Majhi
Re: Regarding Spark on Kubernetes(EKS)
Richard Smith
Re: Regarding Spark on Kubernetes(EKS)
Mich Talebzadeh
Re: Regarding Spark on Kubernetes(EKS)
Jagannath Majhi
Re: Regarding Spark on Kubernetes(EKS)
Mich Talebzadeh
Re: Regarding Spark on Kubernetes(EKS)
Mich Talebzadeh
Re: Regarding Spark on Kubernetes(EKS)
Jagannath Majhi
Re: Re-create SparkContext of SparkSession inside long-lived Spark app
Adam Binford
Re: Re-create SparkContext of SparkSession inside long-lived Spark app
Jörn Franke
Re: Re-create SparkContext of SparkSession inside long-lived Spark app
Mich Talebzadeh
Re: Re-create SparkContext of SparkSession inside long-lived Spark app
Saha, Daniel
Re: Re-create SparkContext of SparkSession inside long-lived Spark app
Mich Talebzadeh
Re: job uuid not unique
Mich Talebzadeh
Re: job uuid not unique
Xin Zhang
Effectively append the dataset to avro directory
Rushikesh Kavar
Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Chao Sun
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Holden Karau
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Yufei Gu
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
John Zhuge
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Chao Sun
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
praveen sinha
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Chao Sun
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Mich Talebzadeh
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Mich Talebzadeh
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Chao Sun