user
Messages by Thread
Re: Autoscaling in Spark
Mich Talebzadeh
Log file location in Spark on K8s
Agrawal, Sanket
Re: Log file location in Spark on K8s
Prashant Sharma
Clarification with Spark Structured Streaming
ashok34...@yahoo.com.INVALID
Re: Clarification with Spark Structured Streaming
Mich Talebzadeh
Re: Clarification with Spark Structured Streaming
ashok34...@yahoo.com.INVALID
Re: Clarification with Spark Structured Streaming
Mich Talebzadeh
Re: Clarification with Spark Structured Streaming
Danilo Sousa
Spark Compatibility with Spring Boot 3.x
Ahmed Albalawi
Re: Spark Compatibility with Spring Boot 3.x
Sean Owen
Re: Spark Compatibility with Spring Boot 3.x
Angshuman Bhattacharya
RE: Re: Spark Compatibility with Spring Boot 3.x
Guru Panda
Connection pool shut down in Spark Iceberg Streaming Connector
Agrawal, Sanket
Re: Connection pool shut down in Spark Iceberg Streaming Connector
Prashant Sharma
Re: Connection pool shut down in Spark Iceberg Streaming Connector
Igor Calabria
[PySpark Structured Streaming] How to tune .repartition(N) ?
Shao Yang Hong
Re: [PySpark Structured Streaming] How to tune .repartition(N) ?
Raghavendra Ganesh
Re: [PySpark Structured Streaming] How to tune .repartition(N) ?
Shao Yang Hong
Re: [PySpark Structured Streaming] How to tune .repartition(N) ?
Perez
[PySpark Structured Streaming] How to tune .repartition(N) ?
Shao Yang Hong
Re: [PySpark Structured Streaming] How to tune .repartition(N) ?
Mich Talebzadeh
[Spark Core]: Recomputation cost of a job due to executor failures
Faiz Halde
Updating delta file column data
Karthick Nk
Re: Updating delta file column data
Karthick Nk
Re: Updating delta file column data
Mich Talebzadeh
Re: Updating delta file column data
Mich Talebzadeh
using facebook Prophet + pyspark for forecasting - Dataframe has less than 2 non-NaN rows
karan alang
Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jon Rodríguez Aranguren
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jörn Franke
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jayabindu Singh
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Mich Talebzadeh
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jörn Franke
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jon Rodríguez Aranguren
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jörn Franke
Thread dump only shows 10 shuffle clients
Nebi Aydin
Files io threads vs shuffle io threads
Nebi Aydin
Inquiry about Processing Speed
Haseeb Khalid
Re: Inquiry about Processing Speed
Deepak Goel
Re: Inquiry about Processing Speed
Jack Goodson
Reading Glue Catalog Views through Spark.
Agrawal, Sanket
[PySpark][Spark logs] Is it possible to dynamically customize Spark logs?
Ayman Rekik
[ANNOUNCE] Apache Kyuubi released 1.7.3
Zhen Wang
Spark Connect Multi-tenant Support
Kezhi Xiong
Parallel write to different partitions
Shrikant Prasad
Re: Parallel write to different partitions
Shrikant Prasad
Need to split incoming data into PM on time column and find the top 5 by volume of data
ashok34...@yahoo.com.INVALID
Re: Need to split incoming data into PM on time column and find the top 5 by volume of data
Mich Talebzadeh
PySpark 3.5.0 on PyPI
Kezhi Xiong
Re: PySpark 3.5.0 on PyPI
Sean Owen
Re: PySpark 3.5.0 on PyPI
Kezhi Xiong
[Spark 3.5.0] Is the protobuf-java JAR no longer shipped with Spark?
Gijs Hendriksen
Create an external table with DataFrameWriterV2
Christophe Préaud
Spark streaming sourceArchiveDir does not move file to archive directory
Yunus Emre Gürses
Discriptency sample standard deviation pyspark and Excel
Helene Bøe
Re: Discriptency sample standard deviation pyspark and Excel
Sean Owen
Re: Discriptency sample standard deviation pyspark and Excel
Mich Talebzadeh
Re: Discriptency sample standard deviation pyspark and Excel
Sean Owen
Re: Discriptency sample standard deviation pyspark and Excel
Bjørn Jørgensen
Re: Discriptency sample standard deviation pyspark and Excel
Mich Talebzadeh
Urgent: Seeking Guidance on Kafka Slow Consumer and Data Skew Problem
Karthick
Re: Urgent: Seeking Guidance on Kafka Slow Consumer and Data Skew Problem
Gowtham S
Re: Urgent: Seeking Guidance on Kafka Slow Consumer and Data Skew Problem
Karthick
getting emails in different order!
Mich Talebzadeh
Re: getting emails in different order!
Sean Owen
Re: getting emails in different order!
Mich Talebzadeh
[ANNOUNCE] Apache Kyuubi released 1.7.2
Zhen Wang
About Peak Jvm Memory Onheap
Nebi Aydin
Fwd: First Time contribution.
ram manickam
Re: First Time contribution.
Denny Lee
Re: First Time contribution.
Haejoon Lee
[Spark Core]: How does rpc threads influence shuffle?
Nebi Aydin
Re: Filter out 20% of rows
Bjørn Jørgensen
Re: Filter out 20% of rows
Mich Talebzadeh
Re: Filter out 20% of rows
Bjørn Jørgensen
Re: Filter out 20% of rows
Mich Talebzadeh
Re: Filter out 20% of rows
Mich Talebzadeh
Re: Filter out 20% of rows
Bjørn Jørgensen
Re: Filter out 20% of rows
Bjørn Jørgensen
Re: Filter out 20% of rows
ashok34...@yahoo.com.INVALID
Spark stand-alone mode
Ilango
Re: Spark stand-alone mode
Patrick Tucci
Re: Spark stand-alone mode
Sean Owen
Re: Spark stand-alone mode
Mich Talebzadeh
Re: Spark stand-alone mode
Bjørn Jørgensen
Re: Spark stand-alone mode
Ilango
Re: Spark stand-alone mode
Patrick Tucci
Re: Spark stand-alone mode
Ilango
Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2
Craig Alfieri
Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2
Jerry Peng
Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2
russell . spitzer
Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2
Craig Alfieri
Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2
Jerry Peng
APACHE Spark adoption/growth chart
Andrew Petersen
Write Spark Connection client application in Go
bo yang
Re: Write Spark Connection client application in Go
Holden Karau
Re: Write Spark Connection client application in Go
Martin Grund
Re: Write Spark Connection client application in Go
bo yang
Feedback on Testing Guidelines for Data Stream Processing Applications
Alexandre Strapacao Guedes Vianna
Re: IDEA compile fail but sbt test succeed
Pasha Finkelshteyn
About /mnt/hdfs/current/BP directories
Nebi Aydin
Re: About /mnt/hdfs/current/BP directories
Jack Wells
Re: [External Email] Re: About /mnt/hdfs/current/BP directories
Nebi Aydin
Re: [External Email] Re: About /mnt/hdfs/current/BP directories
Jack Wells
Re: [External Email] Re: About /mnt/hdfs/current/BP directories
Nebi Aydin
RE: Spark 3.4.1 and Hive 3.1.3
Agrawal, Sanket
Re: Spark 3.4.1 and Hive 3.1.3
Yeachan Park
Re: Spark 3.4.1 and Hive 3.1.3
Chao Sun
RE: Spark 3.4.1 and Hive 3.1.3
Agrawal, Sanket
Re: Spark 3.4.1 and Hive 3.1.3
Nagatomi Yasukazu
RE: Spark 3.4.1 and Hive 3.1.3
Agrawal, Sanket
RE: Spark 3.4.1 and Hive 3.1.3
Agrawal, Sanket
how can i use spark with yarn cluster in java
BCMS
Re: how can i use spark with yarn cluster in java
Mich Talebzadeh
Change default timestamp offset on data load
Jack Goodson
Re: Change default timestamp offset on data load
Mich Talebzadeh
Re: Change default timestamp offset on data load
Jack Goodson
Re: Change default timestamp offset on data load
Mich Talebzadeh
Re: Change default timestamp offset on data load
Jack Goodson
Seeking Professional Advice on Career and Personal Growth in the Apache Spark Community
Varun Shah
Re: Seeking Professional Advice on Career and Personal Growth in the Apache Spark Community
Mich Talebzadeh
Re: Seeking Professional Advice on Career and Personal Growth in the Apache Spark Community
ashok34...@yahoo.com.INVALID
Re: Seeking Professional Advice on Career and Personal Growth in the Apache Spark Community
Mich Talebzadeh
pyspark.ml.recommendation is using the wrong python version
Harry Jamison
Re: pyspark.ml.recommendation is using the wrong python version
Harry Jamison
Re: pyspark.ml.recommendation is using the wrong python version
Mich Talebzadeh
Running Spark Connect Server in Cluster Mode on Kubernetes
Nagatomi Yasukazu
Re: Running Spark Connect Server in Cluster Mode on Kubernetes
Cleyson Barros
Re: Running Spark Connect Server in Cluster Mode on Kubernetes
Nagatomi Yasukazu
Re: Running Spark Connect Server in Cluster Mode on Kubernetes
Mich Talebzadeh
Re: Running Spark Connect Server in Cluster Mode on Kubernetes
Nagatomi Yasukazu
Re: Running Spark Connect Server in Cluster Mode on Kubernetes
Nagatomi Yasukazu
Re: Re: Running Spark Connect Server in Cluster Mode on Kubernetes
eab...@163.com
Re: Re: Running Spark Connect Server in Cluster Mode on Kubernetes
eab...@163.com
Re: Re: Running Spark Connect Server in Cluster Mode on Kubernetes
Nagatomi Yasukazu
[Spark Connect]Running Spark Connect Server in Cluster Mode on Kubernetes
Nagatomi Yasukazu
Reg read json inference schema
Manoj Babu
Okio Vulnerability in Spark 3.4.1
Agrawal, Sanket
Re: Okio Vulnerability in Spark 3.4.1
Sean Owen
RE: Okio Vulnerability in Spark 3.4.1
Agrawal, Sanket
Re: Okio Vulnerability in Spark 3.4.1
Sean Owen
Re: Okio Vulnerability in Spark 3.4.1
Bjørn Jørgensen
Re: Okio Vulnerability in Spark 3.4.1
Bjørn Jørgensen
Re: Okio Vulnerability in Spark 3.4.1
Bjørn Jørgensen
CommunityOverCode(CoC) 2023
Uma Maheswara Rao Gangumalla
Registration open for Community Over Code North America
Rich Bowen
Two new tickets for Spark on K8s
Mich Talebzadeh
Elasticsearch support for Spark 3.x
Dipayan Dev
Re: Elasticsearch support for Spark 3.x
Holden Karau
Re: Elasticsearch support for Spark 3.x
Dipayan Dev
Re: Elasticsearch support for Spark 3.x
Koert Kuipers
Re: Elasticsearch support for Spark 3.x
Dipayan Dev
Re: Elasticsearch support for Spark 3.x
Dipayan Dev
Re: Elasticsearch support for Spark 3.x
Dipayan Dev
Re: Elasticsearch support for Spark 3.x
Dipayan Dev
Re: Elasticsearch support for Spark 3.x
Sean Owen
Re: Elasticsearch support for Spark 3.x
Dipayan Dev
Re: Elasticsearch support for Spark 3.x
Sean Owen
Spark 2.4.7
Harry Jamison
Re: Spark 2.4.7
Varun Shah
Re: Spark 2.4.7
Harry Jamison
Re: Spark 2.4.7
Mich Talebzadeh
Re: Spark 2.4.7
Mich Talebzadeh
mysterious spark.sql.utils.AnalysisException Union in spark 3.3.2, but not seen in 3.4.0+
Srivatsan vn
Re: mysterious spark.sql.utils.AnalysisException Union in spark 3.3.2, but not seen in 3.4.0+
Mich Talebzadeh
Spark Connect: API mismatch in SparkSesession#execute
Stefan Hagedorn
Fwd: 📅 Wednesday: Join 6 Members at "Ofir Press | Complementing Scale: Novel Guidance Methods for Improving LMs"
Mich Talebzadeh
[no subject]
ayan guha
Re:
leibnitz
$SPARK_HOME/sbin/start-worker.sh spark://{main_host}:{cluster_port} failing
Jeremy Brent
Re: $SPARK_HOME/sbin/start-worker.sh spark://{main_host}:{cluster_port} failing
Mich Talebzadeh
Fwd: Recap on current status of "SPIP: Support Customized Kubernetes Schedulers"
Mich Talebzadeh
[ANNOUNCE] Apache Spark 3.3.3 released
Yuming Wang
error trying to save to database (Phoenix)
Kal Stevens
Re: error trying to save to database (Phoenix)
Sean Owen
Re: error trying to save to database (Phoenix)
Kal Stevens
Re: error trying to save to database (Phoenix)
Sean Owen
Re: error trying to save to database (Phoenix)
Kal Stevens
Re: error trying to save to database (Phoenix)
Gera Shegalov
DataFrame cache keeps growing
Varun .N
Spark doesn’t create SUCCESS file when external path is passed
Dipayan Dev
k8s+ YARN Spark
Крюков Виталий Семенович
Re: k8s+ YARN Spark
Mich Talebzadeh
Problem with spark 3.4.1 not finding spark java classes
Kal Stevens
Re: Problem with spark 3.4.1 not finding spark java classes
Kal Stevens
Re: Problem with spark 3.4.1 not finding spark java classes
Mich Talebzadeh
Problem with spark 3.4.1 not finding spark java classes
Kal Stevens
Re: Problem with spark 3.4.1 not finding spark java classes
Bjørn Jørgensen
Probable Spark Bug while inserting into flat GCS bucket?
Dipayan Dev
Re: Probable Spark Bug while inserting into flat GCS bucket?
Mich Talebzadeh
Re: Probable Spark Bug while inserting into flat GCS bucket?
Dipayan Dev
[Spark Core]: What's difference among spark.shuffle.io.threads
Nebi Aydin
Re: [Spark Core]: What's difference among spark.shuffle.io.threads
Mich Talebzadeh
Re: [External Email] Re: [Spark Core]: What's difference among spark.shuffle.io.threads
Nebi Aydin
Re: [External Email] Re: [Spark Core]: What's difference among spark.shuffle.io.threads
Mich Talebzadeh
Re: [External Email] Re: [Spark Core]: What's difference among spark.shuffle.io.threads
Nebi Aydin
[Spark Core]: What's difference among spark.shuffle.io.threads
Nebi Aydin
[no subject]
Dipayan Dev
read dataset from only one node in YARN cluster
marc nicole
Re: read dataset from only one node in YARN cluster
Mich Talebzadeh
Managing python modules in docker for PySpark?
Mich Talebzadeh