user
Thread
Date
Earlier messages
Messages by Thread
The use of Python ParamSpec in PySpark
Rafał Wojdyła
Spark Streaming Dataset with Multiple S3 Sources is too Slow
Jevon Cowell
Is "SORTED BY (col DESC)" Supported for Bucketed Table?
Joe Lee
kubernetes spark connect iceberg SparkWrite$WriterFactory not found
Razvan Mihai
High count of Active Jobs
nayan sharma
Re: High count of Active Jobs
nayan sharma
Announcing the Community Over Code 2025 Streaming Track
James Hughes
Kubeflow Spark-Operator
Hamish Whittal
Correctness Issue: UNIX_SECONDS() mismatch with TO_UTC_TIMESTAMP() result in Spark 3.5.1
Miguel Leite
Executors not getting released dynamically once task is over
Shivang Modi
Re: Executors not getting released dynamically once task is over
Soumasish
Java coding with spark API
tim wade
Re: Java coding with spark API
Jevon Cowell
Re: Java coding with spark API
tim wade
Re: Java coding with spark API
Ángel Álvarez Pascua
Re: Java coding with spark API
Sonal Goyal
Re: Java coding with spark API
Ángel Álvarez Pascua
Re: Java coding with spark API
Stephen Coy
Re: Java coding with spark API
Jules Damji
Spark 3.3 job jar assembly with JDK 17 and JRE 11 runtime (java target/source = 8)
Kristopher Kane
Request for Support and Resources for Apache Spark User Groups in Bogotá and Mexico
Juan Diaz
Inquiry in regards to a New onQuery Method for StreamingQueryListener
Jevon Cowell
Re: Inquiry in regards to a New onQuery Method for StreamingQueryListener
Jungtaek Lim
Re: Inquiry in regards to a New onQuery Method for StreamingQueryListener
Jevon Cowell
Re: Inquiry in regards to a New onQuery Method for StreamingQueryListener
Jevon Cowell
performance issue Spark 3.5.2 on kubernetes
Prem Sahoo
Spark Shuffle - in kubeflow spark operator installation on k8s
karan alang
Re: Spark Shuffle - in kubeflow spark operator installation on k8s
karan alang
Re: Spark Shuffle - in kubeflow spark operator installation on k8s
karan alang
Re: Spark Shuffle - in kubeflow spark operator installation on k8s
Mich Talebzadeh
Re: Spark Shuffle - in kubeflow spark operator installation on k8s
megh vidani
Re: Spark Shuffle - in kubeflow spark operator installation on k8s
karan alang
Re: Spark Shuffle - in kubeflow spark operator installation on k8s
karan alang
Motif finding tutorial
Russell Jurney
Re: Spark 3.5.2 and Hadoop 3.4.1 slow performance
Prem Gmail
Re: Spark 3.5.2 and Hadoop 3.4.1 slow performance
Ángel Álvarez Pascua
Re: Spark 3.5.2 and Hadoop 3.4.1 slow performance
Prem Sahoo
Re: Spark 3.5.2 and Hadoop 3.4.1 slow performance
Prem Gmail
Re: Spark 3.5.2 and Hadoop 3.4.1 slow performance
Prem Sahoo
High/Critical CVEs in jackson-mapper-asl (spark 3.5.5)
Mohammad, Ejas Ali
Re: High/Critical CVEs in jackson-mapper-asl (spark 3.5.5)
Ángel Álvarez Pascua
Spark Kubernetes Operator | Release Date
Dheeraj Panangat
[ANNOUNCE] Apache Sedona 1.7.1 released
Jia Yu
Multiple CVE issues in apache/spark-py:3.4.0 + Pyspark 3.4.0
Mohammad, Ejas Ali
Re: Multiple CVE issues in apache/spark-py:3.4.0 + Pyspark 3.4.0
Soumasish
[ANNOUNCE] Apache Celeborn 0.5.4 available
Nicholas
4.1.0 release timeline
Martin Bielik
[ANNOUNCE] Version 2.0.0-beta1 of hnswlib spark released
jelmer
[CONNECT] Question on Spark Connect in Cluster Deply Mode
Yasukazu Nagatomi
Apply pivot only on some columns in pyspark
Dhruv Singla
Re: Apply pivot only on some columns in pyspark
Mich Talebzadeh
Re: Apply pivot only on some columns in pyspark
Dhruv Singla
Re: Apply pivot only on some columns in pyspark
Mich Talebzadeh
Re: Apply pivot only on some columns in pyspark
Dhruv Singla
Re: Apply pivot only on some columns in pyspark
Mich Talebzadeh
Re: Apply pivot only on some columns in pyspark
Bjørn Jørgensen
[ANNOUNCE] Apache Spark 3.5.5 released
Dongjoon Hyun
Optimizing file size of an iceberg table
Pathum Wijethunge
Re: Apache - GSOC'25 projects / Contributions
Mich Talebzadeh
Kafka Connector: producer throttling
Abhishek Singla
Re: Kafka Connector: producer throttling
daniel williams
Re: Kafka Connector: producer throttling
Abhishek Singla
Re: Kafka Connector: producer throttling
Rommel Yuan
Re: Kafka Connector: producer throttling
Jungtaek Lim
Re: Kafka Connector: producer throttling
daniel williams
GraphFrames Hackathon - NOW :)
Russell Jurney
Using storage decommissioning on K8S cluster
Enrico Minack
Spark connect: Table caching for global use?
Tim Robertson
Re: Spark connect: Table caching for global use?
Tim Robertson
Re: Spark connect: Table caching for global use?
Mich Talebzadeh
Re: Spark connect: Table caching for global use?
Tim Robertson
Re: Spark connect: Table caching for global use?
Mich Talebzadeh
Re: Spark connect: Table caching for global use?
Subhasis Mukherjee
Re: Spark connect: Table caching for global use?
Ángel
END OF LIFE DETERMINATION
Izhar Mohammed
Doubt regarding year formatting
Dhruv Singla
Spark Website Styling Issues Partially Resolved
Gengliang Wang
Re: Spark Website Styling Issues Partially Resolved
Reynold Xin
Website Down
Will Dumas
Re: Website Down
walt
Is SSL configuration being used for RPC communication?
Pablo Fernández
Re: Is SSL configuration being used for RPC communication?
Aironman DirtDiver
Re: Is SSL configuration being used for RPC communication?
Pablo Fernández
GraphFrames Hackathon on Friday, February 21
Russell Jurney
Re: GraphFrames Hackathon on Friday, February 21
Russell Jurney
Re: GraphFrames Hackathon on Friday, February 21
Holden Karau
Re: GraphFrames Hackathon on Friday, February 21
Russell Jurney
Drop Python 2 support from GraphFrames?
Russell Jurney
Re: Drop Python 2 support from GraphFrames?
Holden Karau
Re: Drop Python 2 support from GraphFrames?
Jules Damji
Re: Drop Python 2 support from GraphFrames?
Russell Jurney
Re: Drop Python 2 support from GraphFrames?
Mich Talebzadeh
Re: Drop Python 2 support from GraphFrames?
Ángel
Re: Drop Python 2 support from GraphFrames?
Russell Jurney
[Spark SQL]: Are SQL User-Defined Functions on the Roadmap?
Frank Bertsch
Re: [Spark SQL]: Are SQL User-Defined Functions on the Roadmap?
Mich Talebzadeh
Re: [Spark SQL]: Are SQL User-Defined Functions on the Roadmap?
Frank Bertsch
Re: [Spark SQL]: Are SQL User-Defined Functions on the Roadmap?
Soumasish
Re: [Spark SQL]: Are SQL User-Defined Functions on the Roadmap?
Reynold Xin
Re: [Spark SQL]: Are SQL User-Defined Functions on the Roadmap?
Allison Wang
Re: [Spark SQL]: Are SQL User-Defined Functions on the Roadmap?
Frank Bertsch
Re: [Spark Stream]: Batch processing time reduce over time causing Kafka Lag
Mich Talebzadeh
Re: [Spark Stream]: Batch processing time reduce over time causing Kafka Lag
Saurabh Agrawal
Re: [Spark Stream]: Batch processing time reduce over time causing Kafka Lag
Mich Talebzadeh
Re: Feature store in bigquery
Mich Talebzadeh
Re: Feature store in bigquery
Gunjan Kumar
Need a solution
aishwarya talluri
[start-connect-server.sh] connecting with org.apache.spark.deploy.worker.Worker
Andrew Petersen
Re: [start-connect-server.sh] connecting with org.apache.spark.deploy.worker.Worker
Mich Talebzadeh
Re: [start-connect-server.sh] connecting with org.apache.spark.deploy.worker.Worker
Mich Talebzadeh
Re: Re: Increasing Shading & Relocating for 4.0
Mich Talebzadeh
Re: Re: Increasing Shading & Relocating for 4.0
Ángel
[Spark Core][BlockManager] Spark job fails if blockmgr dirs are cleaned up
Olga Averianova
Re: [Spark Core][BlockManager] Spark job fails if blockmgr dirs are cleaned up
Mich Talebzadeh
Help choose a GraphFrames logo
Russell Jurney
Re: Help choose a GraphFrames logo
Denny Lee
Re: Help choose a GraphFrames logo
Matei Zaharia
Re: Help choose a GraphFrames logo
Mich Talebzadeh
GraphFrames' ConnectedComponentSuite test 'two components and two dangling vertices' fails with OutOfMemoryError: Java heap space
Russell Jurney
Re: GraphFrames' ConnectedComponentSuite test 'two components and two dangling vertices' fails with OutOfMemoryError: Java heap space
Ángel
Re: GraphFrames' ConnectedComponentSuite test 'two components and two dangling vertices' fails with OutOfMemoryError: Java heap space
Bjørn Jørgensen
Re: GraphFrames' ConnectedComponentSuite test 'two components and two dangling vertices' fails with OutOfMemoryError: Java heap space
Russell Jurney
Storing a JDBC-based table in a catalog for direct use in Spark SQL
Aaron Grubb
Re: Storing a JDBC-based table in a catalog for direct use in Spark SQL
Aaron Grubb
Incorrect Results and SIGSEGV on Read with Iceberg + PySpark + Nessie
Aaron Grubb
Re: Incorrect Results and SIGSEGV on Read with Iceberg + PySpark + Nessie
Aaron Grubb
Re: GraphFrames' ConnectedComponentSuite test 'two components and two dangling vertices' fails with OutOfMemoryError: Java heap space
Ángel
Re: GraphFrames' ConnectedComponentSuite test 'two components and two dangling vertices' fails with OutOfMemoryError: Java heap space
Russell Jurney
Spark 3.5.4 gpg validation
Jack Buggins
Re: Spark 3.5.4 gpg validation
Rozov, Vlad
Re: Spark 3.5.4 gpg validation
Rozov, Vlad
Spark catalog api bug when working with non-hms based catalog
Sunny Malik
Re: Spark catalog api bug when working with non-hms based catalog
Sunny Malik
[SPARK spark-submit] Is there a way to use master URL with multiple masters ips in spark-submit ??
Darshan Shah
LLM based data pre-processing
Mayur Dattatray Bhosale
Re: LLM based data pre-processing
Russell Jurney
Re: LLM based data pre-processing
Gurunandan
Re: LLM based data pre-processing
Russell Jurney
Re: LLM based data pre-processing
Holden Karau
Re: LLM based data pre-processing
Russell Jurney
Re: LLM based data pre-processing
Holden Karau
Re: LLM based data pre-processing
Mich Talebzadeh
Re: LLM based data pre-processing
Mich Talebzadeh
Re: LLM based data pre-processing
Mich Talebzadeh
AWS Glue PySpark Job
Perez
Re: AWS Glue PySpark Job
Perez
spark k8s submit
jilani shaik
Re: spark k8s submit
Mat Schaffer
CVE-2024-23945: Apache Hive and Spark: CookieSigner exposes the correct signature when message verification fails
Stamatis Zampetakis
S3 Metrics when reading/writing using Spark
Asaf Mesika
Re: S3 Metrics when reading/writing using Spark
Soumasish
[ANNOUNCE] Apache Spark 3.5.4 released
杨杰
[QUESTION] Issue with "column -1 out of bounds" exception using sqlite JDBC
Sil
[Spark Structured Streaming] Is it possible to enable AQE in some case?
bluzy
Spark 2.4 to Spark 3.5 migration - waiting for HMS
Matteo Moci
Re: Spark 2.4 to Spark 3.5 migration - waiting for HMS
Mich Talebzadeh
Re: Spark 2.4 to Spark 3.5 migration - waiting for HMS
Matteo Moci
Re: Spark 2.4 to Spark 3.5 migration - waiting for HMS
Matteo Moci
Re: Spark 2.4 to Spark 3.5 migration - waiting for HMS
Matteo Moci
Re: Spark 2.4 to Spark 3.5 migration - waiting for HMS
Mich Talebzadeh
Re: Spark 2.4 to Spark 3.5 migration - waiting for HMS
Matteo Moci
[Spark Structured Streaming] How to delete old data that was created by Spark Structured Streaming?
Дубинкин Егор
Re: [Spark Structured Streaming] How to delete old data that was created by Spark Structured Streaming?
Yuri Oleynikov (יורי אולייניקוב)
Re: [Spark Structured Streaming] How to delete old data that was created by Spark Structured Streaming?
Mich Talebzadeh
Re: [Spark Structured Streaming] How to delete old data that was created by Spark Structured Streaming?
Andrei L
RE: [Spark Structured Streaming] How to delete old data that was created by Spark Structured Streaming?
Дубинкин Егор
[ANNOUNCE] Apache Sedona 1.7.0 released
Jia Yu
repartition before writing to table with bucketed partitioning
Henryk Česnolovič
Re: repartition before writing to table with bucketed partitioning
Soumasish
Re: repartition before writing to table with bucketed partitioning
Henryk Česnolovič
Re: repartition before writing to table with bucketed partitioning
Soumasish
Why is spark not doing filter just after the scan to reduce shuffle at later stage.
Khemchand Nagar
[ANNOUNCE] Apache Celeborn 0.5.2 available
Nicholas Jiang
[Spark]: Is spark goal to remove spark native function extensions?
Nathan DEBUCQUOIS
Can't import confluent_kafka package
Tzahi File
Getting "Cannot broadcast the table that is larger than 8GB error" - Clarification
Lakshminarayana Chari
Which shuffle operations trigger AQE and which don't?
Perfect Stranger
Re: Which shuffle operations trigger AQE and which don't?
Mich Talebzadeh
Re: Which shuffle operations trigger AQE and which don't?
Gurunandan
Job Opportunities in India,UK,Australia,UAE,Singapore or USA
sri hari kali charan Tummala
Re: [Spark SQL] [DISK_ONLY Persistence] getting "this.inMemSorter" is null exception
Gurunandan
Re: [Spark SQL] [DISK_ONLY Persistence] getting "this.inMemSorter" is null exception
Gurunandan
Re: [Spark SQL] [DISK_ONLY Persistence] getting "this.inMemSorter" is null exception
Ashwani Pundir
Observation not working on jdbc write
Smokeriu
Re: [BRAND] I have a question
Mark Thomas
[SPARK]: standalone : cluster mode : spark-submit driverId issue
meivenkatkumar lakshminarayanan
Access binary representation of Dataframe
vincent gromakowski
Re: Access binary representation of Dataframe
Sem
StreamingQueryListener in PySpark lagging behind
Andrzej Zera
ClassCastException in Spark3.5 when submit job to an old yarn cluster using netty 4.1.17
Smokeriu
Re: ClassCastException in Spark3.5 when submit job to an old yarn cluster using netty 4.1.17
Gurunandan
[SPARK CORE] Incompatible configuration used between Spark and HBaseTestingUtility
Evelina Dumitrescu
Re: [SPARK CORE] Incompatible configuration used between Spark and HBaseTestingUtility
Gurunandan
Re: [SPARK CORE] Incompatible configuration used between Spark and HBaseTestingUtility
Evelina Dumitrescu
Re: [SPARK CORE] Incompatible configuration used between Spark and HBaseTestingUtility
Gurunandan
Re:
Gurunandan
[ANNOUNCE] Apache Spark 3.4.4 released
Dongjoon Hyun
Re: [ANNOUNCE] Apache Spark 3.4.4 released
Mich Talebzadeh
Code change requirement
Avi Minsky
[ANNOUNCE] Apache Kyuubi v1.10.0 is available
Bowen Liang
Earlier messages