Messages by Thread
-
Clarification on what "[id=#]" refers to in Physical Plan Exchange hashpartitioning
Tahj Anderson
-
Participate in the ASF 25th Anniversary Campaign
Brian Proffitt
-
[Spark]: Spark / Iceberg / hadoop-aws compatibility matrix
Oxlade, Dan
-
[Spark SQL] How can I use .sql() in conjunction with watermarks?
Chloe He
-
Apache Spark integration with Spring Boot 3.0.0+
Szymon Kasperkiewicz
-
Community Over Code NA 2024 Travel Assistance Applications now open!
Gavin McDonald
-
[DISCUSS] MySQL version support policy
Cheng Pan
-
Is one Spark partition mapped to one and only Spark Task ?
Sreyan Chakravarty
-
Feature article: Leveraging Generative AI with Apache Spark: Transforming Data Engineering
Mich Talebzadeh
-
Bug in org.apache.spark.util.sketch.BloomFilter
Nathan Conroy
-
[no subject]
Рамик И
-
Announcing the Community Over Code 2024 Streaming Track
James Hughes
-
[ANNOUNCE] Apache Kyuubi released 1.9.0
Binjie Yang
-
pyspark - Use Spark to generate a large dataset on the fly
Sreyan Chakravarty
-
A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Mich Talebzadeh
-
[GraphX]: Prevent recomputation of DAG
Marek Berith
-
Python library that generates fake data using Faker
Mich Talebzadeh
-
Requesting further assistance with Spark Scala code coverage
里昂
-
pyspark - Where are Dataframes created from Python objects stored?
Sreyan Chakravarty
-
Data ingestion into elastic failing using pyspark
Karthick Nk
-
Bug in How to Monitor Streaming Queries in PySpark
Mich Talebzadeh
-
Spark on Kubernetes, execute dataset.show raise exceptions
BODY NO
-
Spark-UI stages and other tabs not accessible in standalone mode when reverse-proxy is enabled
sharad mishra
-
Creating remote tables using PySpark
Tom Barber
-
Dark mode logo
Mike Drob
-
S3 committer for dynamic partitioning
Nikhil Goyal
-
It seems --py-files only takes the first two arguments. Can someone please confirm?
Pedro, Chuck
-
Working with a text file that is both compressed by bz2 followed by zip in PySpark
Mich Talebzadeh
-
pyspark dataframe join with two different data type
Karthick Nk
-
[ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
-
[Spark Core] Potential bug in JavaRDD#countByValue
Stuart Fehr
-
Bugs with joins and SQL in Structured Streaming
Andrzej Zera
-
Re: Bintray replacement for spark-packages.org
Richard Eggert
-
Issue of spark with antlr version
Chawla, Parul
-
Re: AQE coalesce 60G shuffle data into a single partition
Enrico Minack
-
[Beginner Debug]: Executor OutOfMemoryError
Shawn Ligocki
-
Kafka-based Spark Streaming and Vertex AI for Sentiment Analysis
Mich Talebzadeh
-
[ANNOUNCE] Apache Kyuubi 1.8.1 is available
Cheng Pan
-
Re: Spark 3.3 Query Analyzer Bug Report
Sharma, Anup
-
Spark 4.0 Query Analyzer Bug Report
Sharma, Anup
-
Community Over Code Asia 2024 Travel Assistance Applications now open!
Gavin McDonald
-
[Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures
Sri Potluri
-
Regarding Spark on Kubernetes(EKS)
Jagannath Majhi
-
Re: Re-create SparkContext of SparkSession inside long-lived Spark app
Adam Binford
-
Re: job uuid not unique
Mich Talebzadeh
-
Effectively append the dataset to avro directory
Rushikesh Kavar
-
Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Chao Sun
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Holden Karau
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Yufei Gu
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
John Zhuge
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Chao Sun
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
praveen sinha
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Chao Sun
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Mich Talebzadeh
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Mich Talebzadeh
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Chao Sun
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Mich Talebzadeh
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Manoj Kumar
-
Null pointer exception while replaying WAL
nayan sharma
-
Building an Event-Driven Real-Time Data Processor with Spark Structured Streaming and API Integration
Mich Talebzadeh
-
performance of union vs insert into
Manish Mehra
-
[ANNOUNCE] Apache Celeborn(incubating) 0.4.0 available
Fu Chen
-
Community over Code EU 2024 Travel Assistance Applications now open!
Gavin McDonald
-
[no subject]
Gavin McDonald
-
deploy spark as cluster
ali sharifi
-
Create Custom Logs
PRASHANT L
-
randomsplit has issue?
second_co...@yahoo.com.INVALID
-
Issue in Creating Temp_view in databricks and using spark.sql().
Karthick Nk
-
[Spark SQL]: Crash when attempting to select PostgreSQL bpchar without length specifier in Spark 3.5.0
Lily Hahn
-
startTimestamp doesn't work when using rate-micro-batch format
Perfect Stranger