user
Messages by Thread
Re: What could be the cause of an execution freeze on Hadoop for small datasets?
Mich Talebzadeh
Re: What could be the cause of an execution freeze on Hadoop for small datasets?
sam smith
Re: What could be the cause of an execution freeze on Hadoop for small datasets?
Mich Talebzadeh
Re: What could be the cause of an execution freeze on Hadoop for small datasets?
sam smith
How to allocate vcores to driver (client mode)
sam smith
org.apache.spark.shuffle.FetchFailedException in dataproc
Gary Liu
Re: org.apache.spark.shuffle.FetchFailedException in dataproc
Mich Talebzadeh
Re: org.apache.spark.shuffle.FetchFailedException in dataproc
Gary Liu
Re: org.apache.spark.shuffle.FetchFailedException in dataproc
Mich Talebzadeh
Re: org.apache.spark.shuffle.FetchFailedException in dataproc
Gary Liu
Spark StructuredStreaming - watermark not working as expected
karan alang
Re: Spark StructuredStreaming - watermark not working as expected
Mich Talebzadeh
Re: Spark StructuredStreaming - watermark not working as expected
karan alang
Re: Spark StructuredStreaming - watermark not working as expected
Mich Talebzadeh
Re: Spark StructuredStreaming - watermark not working as expected
karan alang
Re: Spark StructuredStreaming - watermark not working as expected
karan alang
Re: Spark StructuredStreaming - watermark not working as expected
Mich Talebzadeh
Re: Spark StructuredStreaming - watermark not working as expected
karan alang
How to share a dataset file across nodes
sam smith
Re: How to share a dataset file across nodes
Sean Owen
Re: How to share a dataset file across nodes
Mich Talebzadeh
eqNullSafe breaks Sorted Merge Bucket Join?
Thomas Wang
read a binary file and save in another location
[email protected]
Re: read a binary file and save in another location
Russell Jurney
Re: read a binary file and save in another location
Mich Talebzadeh
Re: read a binary file and save in another location
Russell Jurney
Spark Thrift Server - Autoscaling on K8
Jayabindu Singh
Re: [EXTERNAL] Spark Thrift Server - Autoscaling on K8
Saurabh Gulati
How to use Fair Scheduler Pools
李杰
How to use Fair Scheduler Pools
李杰
[Spark] How to find which type of key is illegal during from_json() function
hueiyuan su
spark-submit: No "driver-" id printed in standalone mode
Travis Athougies
[ANNOUNCE] Apache Kyuubi released 1.7.0
Cheng Pan
Online classes for spark topics
[email protected]
Re: Online classes for spark topics
Mich Talebzadeh
Re: Online classes for spark topics
[email protected]
Re: Online classes for spark topics
Mich Talebzadeh
Re: Online classes for spark topics
karan alang
Re: Online classes for spark topics
asma zgolli
Re: Online classes for spark topics
Winston Lai
Re: Online classes for spark topics
Sofia’s World
Re: Online classes for spark topics
Denny Lee
Re: Online classes for spark topics
Deepak Sharma
Re: Online classes for spark topics
Mich Talebzadeh
Re: [EXTERNAL] Re: Online classes for spark topics
Saurabh Gulati
Re: [EXTERNAL] Re: Online classes for spark topics
Winston Lai
Re: [EXTERNAL] Re: Online classes for spark topics
asma zgolli
Re: Online classes for spark topics
neeraj bhadani
Re: Online classes for spark topics
Denny Lee
Re: Online classes for spark topics
Mich Talebzadeh
Re: Online classes for spark topics
vaquar khan
Reply: Re: Build SPARK from source with SBT failed
ckgppl_yan
Re: Reply: Re: Build SPARK from source with SBT failed
Artemis User
Re: Reply: Re: Build SPARK from source with SBT failed
Sean Owen
Re: Reply: Re: Build SPARK from source with SBT failed
Tufan Rakshit
Re: Build SPARK from source with SBT failed
Sean Owen
Pandas UDFs vs Inbuilt pyspark functions
neha garde
Re: Pandas UDFs vs Inbuilt pyspark functions
Sean Owen
[Spark Structured Streaming] Do spark structured streaming is support sink to AWS Kinesis currently and how to handle if achieve quotas of kinesis?
hueiyuan su
Re: [Spark Structured Streaming] Do spark structured streaming is support sink to AWS Kinesis currently and how to handle if achieve quotas of kinesis?
Mich Talebzadeh
Data duplication and loss occur after executing 'insert overwrite...' in Spark 3.1.1
周锋
How to pass variables across functions in spark structured streaming (PySpark)
Mich Talebzadeh
Re: How to pass variables across functions in spark structured streaming (PySpark)
Sean Owen
Re: How to pass variables across functions in spark structured streaming (PySpark)
Mich Talebzadeh
Re: How to pass variables across functions in spark structured streaming (PySpark)
Sean Owen
Re: How to pass variables across functions in spark structured streaming (PySpark)
Mich Talebzadeh
Re: How to pass variables across functions in spark structured streaming (PySpark)
Mich Talebzadeh
Re: How to pass variables across functions in spark structured streaming (PySpark)
Mich Talebzadeh
[ANNOUNCE] Apache Celeborn(incubating) 0.2.0 available
Ethan Feng
Fwd: [New Project] sparksql-ml : Distributed Machine Learning using SparkSQL.
Chitral Verma
Re: [New Project] sparksql-ml : Distributed Machine Learning using SparkSQL.
Russell Jurney
Fwd: Auto-reply: Re: [DISCUSS] Show Python code examples first in Spark documentation
Mich Talebzadeh
[JDBC] [PySpark] Possible bug when comparing incoming data frame from mssql and empty delta table
lennart
Late arriving updates to fact tables
rajat kumar
Re: SPIP architecture diagrams
Mich Talebzadeh
Re: SPIP architecture diagrams
Mich Talebzadeh
Unable to handle bignumeric datatype in spark/pyspark
nidhi kher
Re: Unable to handle bignumeric datatype in spark/pyspark
Mich Talebzadeh
Re: Unable to handle bignumeric datatype in spark/pyspark
Rajnil Guha
Re: Unable to handle bignumeric datatype in spark/pyspark
Mich Talebzadeh
Re: Unable to handle bignumeric datatype in spark/pyspark
Atheeth SH
Re: Unable to handle bignumeric datatype in spark/pyspark
Atheeth SH
[PySpark SQL] New column with the maximum of multiple terms?
Oliver Ruebenacker
Re: [PySpark SQL] New column with the maximum of multiple terms?
Sean Owen
Re: [PySpark SQL] New column with the maximum of multiple terms?
Oliver Ruebenacker
Re: [PySpark SQL] New column with the maximum of multiple terms?
Russell Jurney
Re: [PySpark SQL] New column with the maximum of multiple terms?
Bjørn Jørgensen
Re: [PySpark SQL] New column with the maximum of multiple terms?
Sean Owen
Re: [PySpark SQL] New column with the maximum of multiple terms?
Oliver Ruebenacker
Re: [PySpark SQL] New column with the maximum of multiple terms?
Russell Jurney
Re: [PySpark SQL] New column with the maximum of multiple terms?
Oliver Ruebenacker
Spark with bigquery : Data type issue
nidhi kher
Re: Spark with bigquery : Data type issue
nidhi kher
Re: Spark with bigquery : Data type issue
Mich Talebzadeh
SPIP: Adding work load identity to Spark on Kubernetes documents (supersedes Secret Management)
Mich Talebzadeh
SPIP: Shutting down spark structured streaming when the streaming process completed current process
Mich Talebzadeh
Re: SPIP: Shutting down spark structured streaming when the streaming process completed current process
Dongjoon Hyun
Re: SPIP: Shutting down spark structured streaming when the streaming process completed current process
Holden Karau
Re: SPIP: Shutting down spark structured streaming when the streaming process completed current process
Mich Talebzadeh
Vote SPIP
Faisal Waris
Update nested struct with null fields
Vikas Kumar
[Spark Structured Streaming] Do spark structured streaming is support sink to AWS Kinesis currently?
hueiyuan su
Re: [Spark Structured Streaming] Do spark structured streaming is support sink to AWS Kinesis currently?
Vikas Kumar
How can I set a value of Location with CustomDataSource ?
Zhuolin Ji
Upgrading from Spark SQL 3.2 to 3.3 faild
lk_spark
Re:Upgrading from Spark SQL 3.2 to 3.3 faild
lk_spark
[Spark Structured Streaming] Could we apply new options of readStream/writeStream without stopping spark application (zero downtime)?
hueiyuan su
Re: [Spark Structured Streaming] Could we apply new options of readStream/writeStream without stopping spark application (zero downtime)?
Jack Goodson
Re: [Spark Structured Streaming] Could we apply new options of readStream/writeStream without stopping spark application (zero downtime)?
Mich Talebzadeh
Re: [Spark Structured Streaming] Could we apply new options of readStream/writeStream without stopping spark application (zero downtime)?
Mich Talebzadeh
Re: [Spark Structured Streaming] Could we apply new options of readStream/writeStream without stopping spark application (zero downtime)?
hueiyuan su
ADLS Gen2 adfs sample yaml configuration
Kondala Ponnaboina (US)
Re: ADLS Gen2 adfs sample yaml configuration
Jayabindu Singh
How to explode array columns of a dataframe having the same length
sam smith
Re: How to explode array columns of a dataframe having the same length
Enrico Minack
Re: How to explode array columns of a dataframe having the same length
Navneet
Re: How to explode array columns of a dataframe having the same length
Bjørn Jørgensen
Re: How to explode array columns of a dataframe having the same length
sam smith
Re: How to explode array columns of a dataframe having the same length
Vikas Kumar
Re: How to explode array columns of a dataframe having the same length
404
Adding OpenSearch as a secondary index provider to SparkSQL
Anirudha Jadhav
Re: Adding OpenSearch as a secondary index provider to SparkSQL
Mich Talebzadeh
Executor tab missing information
Prem Sahoo
Running Spark on Kubernetes (GKE) - failing on spark-submit
karan alang
Re: Running Spark on Kubernetes (GKE) - failing on spark-submit
Khalid Mammadov
Re: Running Spark on Kubernetes (GKE) - failing on spark-submit
Ye Xianjin
Re: Running Spark on Kubernetes (GKE) - failing on spark-submit
karan alang
Re: Running Spark on Kubernetes (GKE) - failing on spark-submit
Mich Talebzadeh
Re: Running Spark on Kubernetes (GKE) - failing on spark-submit
Mich Talebzadeh
Re: Running Spark on Kubernetes (GKE) - failing on spark-submit
karan alang
Re: Running Spark on Kubernetes (GKE) - failing on spark-submit
Mich Talebzadeh
[Spark Core] Spark data loss/data duplication when executors die
Erik Eklund
How to improve efficiency of this piece of code (returning distinct column values)
sam smith
Re: How to improve efficiency of this piece of code (returning distinct column values)
Sean Owen
Re: How to improve efficiency of this piece of code (returning distinct column values)
sam smith
Re: How to improve efficiency of this piece of code (returning distinct column values)
sam smith
Re: How to improve efficiency of this piece of code (returning distinct column values)
Mich Talebzadeh
Re: How to improve efficiency of this piece of code (returning distinct column values)
Sean Owen
Re: How to improve efficiency of this piece of code (returning distinct column values)
Apostolos N. Papadopoulos
Re: How to improve efficiency of this piece of code (returning distinct column values)
sam smith
Re: How to improve efficiency of this piece of code (returning distinct column values)
Enrico Minack
Re: How to improve efficiency of this piece of code (returning distinct column values)
sam smith
Re: How to improve efficiency of this piece of code (returning distinct column values)
Sean Owen
Re: How to improve efficiency of this piece of code (returning distinct column values)
sam smith
Re: How to improve efficiency of this piece of code (returning distinct column values)
Mich Talebzadeh
Re: How to improve efficiency of this piece of code (returning distinct column values)
Enrico Minack
Re: How to improve efficiency of this piece of code (returning distinct column values)
sam smith
Re: How to improve efficiency of this piece of code (returning distinct column values)
Sean Owen
Re: How to improve efficiency of this piece of code (returning distinct column values)
Enrico Minack
Re: How to improve efficiency of this piece of code (returning distinct column values)
sam smith
Re:
Sunil Prabhakara
Executor metrics are missing on prometheus sink
Qian Sun
Re: Executor metrics are missing on Prometheus sink
Qian Sun
Jira Account for Contributions
Jack Goodson
[Spark SQL]: Spark 3.2 generates different results to query when columns name have mixed casing vs when they have same casing
Amit Singh Rathore
Is sparkSession.sql now an action in Spark 3 and later?
Sayeh Roshan
Fwd: Graceful shutdown SPARK Structured Streaming
Mich Talebzadeh
Re: Graceful shutdown SPARK Structured Streaming
Brian Wylie
Re: Graceful shutdown SPARK Structured Streaming
Bjørn Jørgensen
Re: Graceful shutdown SPARK Structured Streaming
Mich Talebzadeh
[Spark SQL] : Delete is only supported on V2 tables.
Jeevan Chhajed
Fwd: [Spark SQL] : Delete is only supported on V2 tables.
Jeevan Chhajed
How to upgrade a spark structure streaming application
Yoel Benharrous
Re: How to upgrade a spark structure streaming application
Mich Talebzadeh
big data products
LinuxGuy
Create table before inserting in SQL
Harut Martirosyan
Re: Create table before inserting in SQL
Mich Talebzadeh
Re: Create table before inserting in SQL
Harut Martirosyan
Re: Create table before inserting in SQL
Harut Martirosyan
Re: Create table before inserting in SQL
Mich Talebzadeh
Re: Create table before inserting in SQL
Harut Martirosyan
Spark Thrift Server issue with external HDFS table
Kalhara Gurugamage
What is DataFilters and while joining why is the filter isnotnull[joinKey] applied twice
Nitin Siwach
[Spark/deeplyR] how come spark is caching tables read through jdbc connection from oracle, even when memory=false is chosen
Joris Billen
Help needed regarding error with 5 node Spark cluster (shuffle error)- Comcast
Jain, Sanchi
Re: Help needed regarding error with 5 node Spark cluster (shuffle error)- Comcast
Mich Talebzadeh
Re: Help needed regarding error with 5 node Spark cluster (shuffle error)- Comcast
Artemis User
Fwd: Spark-submit doesn't load all app classes in the classpath
Soheil Pourbafrani
spark+kafka+dynamic resource allocation
Lingzhe Sun
Re: spark+kafka+dynamic resource allocation
[email protected]
Re: Re: spark+kafka+dynamic resource allocation
Lingzhe Sun
Re: Re: spark+kafka+dynamic resource allocation
Mich Talebzadeh
Re: Re: spark+kafka+dynamic resource allocation
Lingzhe Sun
Re: Re: spark+kafka+dynamic resource allocation
Mich Talebzadeh
Spark SQL question
Kohki Nishio
Re: Spark SQL question
Mich Talebzadeh
Re: Spark SQL question
Bjørn Jørgensen
SQL GROUP BY alias with dots, was: Spark SQL question
Enrico Minack
Question regarding Spark 3.X performance
Athanasios Kordelas
Re: Question regarding Spark 3.X performance
Mich Talebzadeh
Re: Question regarding Spark 3.X performance
Mich Talebzadeh
Re: Question regarding Spark 3.X performance
Mich Talebzadeh
Re: Question regarding Spark 3.X performance
Athanasios Kordelas
Duplicates in Collaborative Filtering Output
Kartik Ohri
Re: Duplicates in Collaborative Filtering Output
Kartik Ohri
Any advantages of using sql.adaptive.autoBroadcastJoinThreshold over sql.autoBroadcastJoinThreshold?
Soumyadeep Mukhopadhyay
Re: Any advantages of using sql.adaptive.autoBroadcastJoinThreshold over sql.autoBroadcastJoinThreshold?
Balakrishnan Ayyappan
Table created with saveAsTable behaves differently than a table created with spark.sql("CREATE TABLE....)
krexos
Re: Table created with saveAsTable behaves differently than a table created with spark.sql("CREATE TABLE....)
Peyman Mohajerian
Re: Table created with saveAsTable behaves differently than a table created with spark.sql("CREATE TABLE....)
krexos