user
Messages by Thread
Re: Efficiently updating running sums only on new data
Artemis User
Re: Efficiently updating running sums only on new data
Igor Calabria
Why the same INSERT OVERWRITE sql , final table file produced by spark sql is larger than hive sql?
Chartist
Re: Why the same INSERT OVERWRITE sql , final table file produced by spark sql is larger than hive sql?
Sadha Chilukoori
Re: Why the same INSERT OVERWRITE sql , final table file produced by spark sql is larger than hive sql?
Chartist
As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3?
Oliver Plohmann
Re: As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3?
Никита Романов
Re: As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3?
Sean Owen
Re: As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3?
Henrik Park
Re: As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3?
Sean Owen
[Spark Core][Release]Can we consider add SPARK-39725 into 3.3.1 or 3.3.2 release?
phoebe chen
Re: [Spark Core][Release]Can we consider add SPARK-39725 into 3.3.1 or 3.3.2 release?
Sean Owen
Re: [Spark Core][Release]Can we consider add SPARK-39725 into 3.3.1 or 3.3.2 release?
Bjørn Jørgensen
Converting None/Null into json in pyspark
Karthick Nk
Re: Converting None/Null into json in pyspark
Yeachan Park
Re: Converting None/Null into json in pyspark
Karthick Nk
Re: Converting None/Null into json in pyspark
Yeachan Park
Reading too many files
Sachit Murarka
Re: Reading too many files
Sid
Re: Reading too many files
Henrik Pang
Re: Reading too many files
Enrico Minack
Re: Reading too many files
Artemis User
Spike on number of tasks - dynamic allocation
murat migdisoglu
Re: Spike on number of tasks - dynamic allocation
Mich Talebzadeh
Re: Spike on number of tasks - dynamic allocation
murat migdisoglu
Re: Spike on number of tasks - dynamic allocation
Mich Talebzadeh
WARN ProcfsMetricsGetter: Exception
Surya Gopisetty
Re: WARN ProcfsMetricsGetter: Exception
Henrik Pang
Spark ML VarianceThresholdSelector Unexpected Results
姜鑫
Re: Spark ML VarianceThresholdSelector Unexpected Results
Sean Owen
Re: Spark ML VarianceThresholdSelector Unexpected Results
姜鑫
Help with Shuffle Read performance
Igor Calabria
Re: Help with Shuffle Read performance
Gourav Sengupta
Re: Help with Shuffle Read performance
Tufan Rakshit
Re: Help with Shuffle Read performance
Vladimir Prus
Re: Help with Shuffle Read performance
Igor Calabria
Re: Help with Shuffle Read performance
Gourav Sengupta
Re: Help with Shuffle Read performance
Leszek Reimus
Re: Help with Shuffle Read performance
Gourav Sengupta
Re: Help with Shuffle Read performance
Sungwoo Park
Re: Help with Shuffle Read performance
Leszek Reimus
Re: Help with Shuffle Read performance
Artemis User
Re: Help with Shuffle Read performance
Igor Calabria
depolying stage-level scheduling for Spark SQL and how to expose RDD code from Spark SQL?
Chenghao Lyu
Does 'Stage cancelled because SparkContext was shut down' is a error
lk_spark
[Spark Kubernetes] Question about Configurability of Labeling Driver Service
Shiqi Sun
Re: [Spark Kubernetes] Question about Configurability of Labeling Driver Service
Shiqi Sun
Kyro Serializer not getting set : Spark3
rajat kumar
Re: Kyro Serializer not getting set : Spark3
Qian SUN
Re: Kyro Serializer not getting set : Spark3
rajat kumar
HELP, Populating an empty pyspark dataframe with auto-generated dates
Jamie Arodi
Query regarding Proleptic Gregorian Calendar Spark3
Sachit Murarka
Re: Query regarding Proleptic Gregorian Calendar Spark3
Sachit Murarka
Error - Spark STREAMING
Akash Vellukai
Re: Error - Spark STREAMING
Anupam Singh
Re: Issue with SparkContext
Bjørn Jørgensen
Re: Issue with SparkContext
javacaoyu
NoClassDefError and SparkSession should only be created and accessed on the driver.
rajat kumar
Re: NoClassDefError and SparkSession should only be created and accessed on the driver.
Xiao, Alton
Re: NoClassDefError and SparkSession should only be created and accessed on the driver.
rajat kumar
Re: NoClassDefError and SparkSession should only be created and accessed on the driver.
Paul Rogalinski
Spark Structured Streaming - stderr getting filled up
karan alang
Re: Spark Structured Streaming - stderr getting filled up
karan alang
Re: Spark Structured Streaming - stderr getting filled up
karan alang
[how to]RDD using JDBC data source in PySpark
javaca...@163.com
Re: [how to]RDD using JDBC data source in PySpark
Xiao, Alton
Re: Re: [how to]RDD using JDBC data source in PySpark
javaca...@163.com
Re: Re: [how to]RDD using JDBC data source in PySpark
Bjørn Jørgensen
Re: Re: [how to]RDD using JDBC data source in PySpark
javaca...@163.com
Re: Re: [how to]RDD using JDBC data source in PySpark
Bjørn Jørgensen
Re: Re: [how to]RDD using JDBC data source in PySpark
Sean Owen
Driver throws exception every few hours
Kiran Biswal
[Spark Core] Joining Same DataFrame Multiple Times Results in Column not getting dropped
Shahban Riaz
[Spark Internals]: Is sort order preserved after partitioned write?
Swetha Baskaran
Re: [Spark Internals]: Is sort order preserved after partitioned write?
Enrico Minack
Re: [Spark Internals]: Is sort order preserved after partitioned write?
Swetha Baskaran
Re: [Spark Internals]: Is sort order preserved after partitioned write?
Enrico Minack
Re: [Spark Internals]: Is sort order preserved after partitioned write?
Swetha Baskaran
Big Data Contract Roles ?
sri hari kali charan Tummala
Splittable or not?
Sid
Re: Splittable or not?
Amit Joshi
Re: Splittable or not?
Sid
Re: Splittable or not?
Enrico Minack
Re: Splittable or not?
Sid
Re: Splittable or not?
Jack Goodson
Network time out property is not getting set in Spark
Sachit Murarka
Re: EXT: Network time out property is not getting set in Spark
Vibhor Gupta
Re: EXT: Network time out property is not getting set in Spark
Sachit Murarka
Long running task in spark
rajat kumar
Re: Long running task in spark
Sid
[SPARK STRUCTURED STREAMING] : Rocks DB uses off-heap usage
akshit marwah
Re: [SPARK STRUCTURED STREAMING] : Rocks DB uses off-heap usage
Artemis User
Re: [SPARK STRUCTURED STREAMING] : Rocks DB uses off-heap usage
Adam Binford
Dynamic shuffle partitions in a single job
Vibhor Gupta
Re: Dynamic shuffle partitions in a single job
Anupam Singh
RE: [EXTERNAL] Re: Dynamic shuffle partitions in a single job
Kapil Kumar Singh
Spark SQL
Mayur Benodekar
Re: Spark SQL
Gourav Sengupta
Re: Spark SQL
Mayur Benodekar
Re: Spark SQL
Gourav Sengupta
Re: EXT: Re: Spark SQL
Vibhor Gupta
Pipelined execution in Spark (???)
Sungwoo Park
Re: Pipelined execution in Spark (???)
Russell Jurney
Re: Pipelined execution in Spark (???)
Sungwoo Park
Re: Pipelined execution in Spark (???)
Sean Owen
Re: Pipelined execution in Spark (???)
Sungwoo Park
Re: Pipelined execution in Spark (???)
Russell Jurney
Re: Pipelined execution in Spark (???)
Gourav Sengupta
Re: Pipelined execution in Spark (???)
Russell Jurney
Re: Pipelined execution in Spark (???)
Russell Jurney
Spark equivalent to hdfs groups
phiroc
Re: Spark equivalent to hdfs groups
Sean Owen
Re: Spark equivalent to hdfs groups
phiroc
Re: Spark equivalent to hdfs groups
Sean Owen
Re: Spark equivalent to hdfs groups
phiroc
Spark Structured Streaming - unable to change max.poll.records (showing as 1)
karan alang
[ANNOUNCE] Apache Kyuubi (Incubating) released 1.6.0-incubating
Nicholas Jiang
Error in Spark in Jupyter Notebook
Mamata Shee
Re: Error in Spark in Jupyter Notebook
Sean Owen
Apache Spark - How to concert DataFrame json string to structured element and using schema_of_json
M Singh
Jupyter notebook on Dataproc versus GKE
Mich Talebzadeh
Re: Jupyter notebook on Dataproc versus GKE
Holden Karau
Re: Jupyter notebook on Dataproc versus GKE
Mich Talebzadeh
Re: Jupyter notebook on Dataproc versus GKE
Holden Karau
Re: Jupyter notebook on Dataproc versus GKE
Bjørn Jørgensen
Re: Jupyter notebook on Dataproc versus GKE
Mich Talebzadeh
Re: Jupyter notebook on Dataproc versus GKE
Bjørn Jørgensen
Re: Jupyter notebook on Dataproc versus GKE
Mich Talebzadeh
Re: Jupyter notebook on Dataproc versus GKE
Holden Karau
Re: Jupyter notebook on Dataproc versus GKE
Bjørn Jørgensen
Spark Issue with Istio in Distributed Mode
Deepak Sharma
Re: Spark Issue with Istio in Distributed Mode
Deepak Sharma
Re: Spark Issue with Istio in Distributed Mode
Deepak Sharma
Data Type Issue while upgrading to Spark3
rajat kumar
Creating Custom Broadcast Join
Murali S
ERROR MicroBatchExecution
Ravi Chandran
running pyspark on kubernetes - no space left on device
Manoj GEORGE
Re: running pyspark on kubernetes - no space left on device
Matt Proetsch
Re: running pyspark on kubernetes - no space left on device
Qian SUN
Spark 3.3.0/3.2.2: java.io.IOException: can not read class org.apache.parquet.format.PageHeader: don't know what type: 15
FengYu Cao
Re: Spark 3.3.0/3.2.2: java.io.IOException: can not read class org.apache.parquet.format.PageHeader: don't know what type: 15
Chao Sun
Re: Spark 3.3.0/3.2.2: java.io.IOException: can not read class org.apache.parquet.format.PageHeader: don't know what type: 15
FengYu Cao
Moving to Spark 3x from Spark2
rajat kumar
Re: Moving to Spark 3x from Spark2
Khalid Mammadov
Re: Moving to Spark 3x from Spark2
Martin Andersson
deciding Spark tasks & optimization resource
rajat kumar
Re: deciding Spark tasks & optimization resource
Gibson
Spark 3.3.0 with Structure Streaming from Kafka Issue on commons-pools2
Raymond Tang
Spark SQL Predict Pushdown for Hive Bucketed Table
Raymond Tang
Structured Streaming - data not being read (offsets not getting committed ?)
karan alang
Re: Re: Spark got incorrect scala version while using spark 3.2.1 and spark 3.2.2
ckgppl_yan
Spark got incorrect scala version while using spark 3.2.1 and spark 3.2.2
ckgppl_yan
Re: Spark got incorrect scala version while using spark 3.2.1 and spark 3.2.2
Sean Owen
Re: Spark got incorrect scala version while using spark 3.2.1 and spark 3.2.2
pengyh
Profiling PySpark Pandas UDF
Subash Prabanantham
Re: Profiling PySpark Pandas UDF
Gourav Sengupta
Re: Profiling PySpark Pandas UDF
Andrew Melo
Re: Profiling PySpark Pandas UDF
Sean Owen
Re: Profiling PySpark Pandas UDF
Russell Jurney
Re: Profiling PySpark Pandas UDF
Takuya UESHIN
Re: Profiling PySpark Pandas UDF
Sean Owen
Re: Profiling PySpark Pandas UDF
Russell Jurney
Re: Profiling PySpark Pandas UDF
Subash Prabanantham
Re: Profiling PySpark Pandas UDF
Abdeali Kothari
RE: Profiling PySpark Pandas UDF
Luca Canali
Re: Profiling PySpark Pandas UDF
Abdeali Kothari
RE: Profiling PySpark Pandas UDF
Luca Canali
Re: Profiling PySpark Pandas UDF
Gourav Sengupta
spark-3.2.2-bin-without-hadoop : NoClassDefFoundError: org/apache/log4j/spi/Filter when starting the master
FLORANCE Grégory
Re: spark-3.2.2-bin-without-hadoop : NoClassDefFoundError: org/apache/log4j/spi/Filter when starting the master
Sean Owen
Question regarding checkpointing with kafka structured streaming
Martin Andersson
[Spark SQL]: Does Spark preserve the order in a nested ORDER BY?
Vinay Londhe
Filtering by job group in the Spark UI / API
Yeachan Park
Spark streaming
Prajith Vellukkai
Re: Spark streaming
ミユナ (alice)
Spark streaming
sandra sukumaran
Re: Spark streaming
Ajit Kumar Amit
Re: [EXTERNAL] Re: Spark streaming
Saurabh Gulati
Re: [EXTERNAL] Re: Spark streaming
sandra sukumaran
Re: Spark streaming
Gourav Sengupta
Data ingestion
Akash Vellukai
Re: Data ingestion
Pasha Finkelshtein
Re: Data ingestion
Yuri Oleynikov (יורי אולייניקוב)
Re: Data ingestion
pengyh
Re: Data ingestion
Pasha Finkelshtein
Spark streaming - Data Ingestion
Akash Vellukai
Re: Spark streaming - Data Ingestion
Gibson
Re: [EXTERNAL] Re: Spark streaming - Data Ingestion
Saurabh Gulati
Re: [EXTERNAL] Re: Spark streaming - Data Ingestion
Akash Vellukai
Re: [EXTERNAL] Re: Spark streaming - Data Ingestion
Gibson
Supported Hadoop versions for Spark 3.3
Håkan Nordgren
Re: Supported Hadoop versions for Spark 3.3
pengyh
PySpark schema sanitization
Shay Elbaz
Unsubscribe
Peter Kovgan
Spark with GPU
rajat kumar
Re: Spark with GPU
Sean Owen
Re: Spark with GPU
rajat kumar
Re: Spark with GPU
Sean Owen
Re: Spark with GPU
Alessandro Bellina
Re: Spark with GPU
Gourav Sengupta