user
Messages by Thread
Re: Spark with GPU
Alessandro Bellina
[no subject]
GAURAV GUPTA
pyspark not starting
Kelum Perera
Joins internally
Sid
Memory leak while caching in foreachBatch block
kineret M
[Spark SQL] Omit Create Table Statement in Spark Sql
阿强
Re: [Spark SQL] Omit Create Table Statement in Spark Sql
pengyh
Spark program not receiving messages from Cloud Pubsub
Pramod Biligiri
Re: Spark program not receiving messages from Cloud Pubsub
Pramod Biligiri
High number of tasks when ran on a hybrid cluster
murat migdisoglu
Spark Scala API still not updated for 2.13 or it's a mistake?
Roman I
Re: Spark Scala API still not updated for 2.13 or it's a mistake?
Sean Owen
Re: Spark Scala API still not updated for 2.13 or it's a mistake?
Roman I
Re: Spark Scala API still not updated for 2.13 or it's a mistake?
Sean Owen
Re: Spark Scala API still not updated for 2.13 or it's a mistake?
pengyh
log transferring into hadoop/spark
pengyh
Re: log transferring into hadoop/spark
ayan guha
Re: log transferring into hadoop/spark
Gourav Sengupta
[pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty
Kumba Janga
Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty
Sean Owen
Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty
Kumba Janga
Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty
ayan guha
Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty
Sean Owen
Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty
Stelios Philippou
Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty
Sean Owen
WARN: netlib.BLAS
陈刚
Re: WARN: netlib.BLAS
Sean Owen
Use case idea
Gioele Sal. Perri
Re: Use case idea
pengyh
Re: Use case idea
Gourav Sengupta
Re: Use case idea
pengyh
Re: Use case idea
Gourav Sengupta
Re: Use case idea
pengyh
Salting technique doubt
Sid
Re: Salting technique doubt
Amit Joshi
Re: Salting technique doubt
Sid
Re: Salting technique doubt
Amit Joshi
Re: Salting technique doubt
Jacob Lynn
Re: Salting technique doubt
ayan guha
Re: Salting technique doubt
Vinod KC
Re: Salting technique doubt
Sid
[no subject]
Milin Korath
PySpark cores
Andrew Melo
Re: PySpark cores
Jacob Lynn
Re: PySpark cores
Gourav Sengupta
spark can't connect to kafka via sasl_ssl
wilson
Re: spark can't connect to kafka via sasl_ssl
wilson
RE: Spark Avro Java 17 Compatibility
Shivaraj Sivasankaran
Re: Spark Avro Java 17 Compatibility
Sean Owen
[Spark thread pool configurations]: I would like to configure all ThreadPoolExecutor parameters for each thread pool started in Spark
Alex Peelman
Spark SQL Query filter behavior with special characters
prashanth reddy
Partial data with ADLS Gen2
kineret M
Re: [EXTERNAL] Partial data with ADLS Gen2
Shay Elbaz
Re: [EXTERNAL] Partial data with ADLS Gen2
Tufan Rakshit
Re: [EXTERNAL] Partial data with ADLS Gen2
hwl17801341688
Updating Broadcast Variable in Spark Streaming 2.4.4
Dipl.-Inf. Rico Bergmann
Re: Updating Broadcast Variable in Spark Streaming 2.4.4
Sean Owen
Updating Broadcast Variable in Spark Streaming 2.4.4
Dipl.-Inf. Rico Bergmann
Re: Updating Broadcast Variable in Spark Streaming 2.4.4
Sean Owen
Spark Structured Streaming -- Cannot consume next messages
KhajaAsmath Mohammed
Re: Spark Structured Streaming -- Cannot consume next messages
Artemis User
Re: Spark Structured Streaming -- Cannot consume next messages
KhajaAsmath Mohammed
external table with parquet files: problem querying in sparksql since data is stored as integer while hive schema expects a timestamp
Joris Billen
Re: external table with parquet files: problem querying in sparksql since data is stored as integer while hive schema expects a timestamp
Gourav Sengupta
Pyspark and multiprocessing
Bjørn Jørgensen
Fwd: Pyspark and multiprocessing
Bjørn Jørgensen
Re: Pyspark and multiprocessing
Khalid Mammadov
Re: Pyspark and multiprocessing
Bjørn Jørgensen
Re: Pyspark and multiprocessing
Khalid Mammadov
[MLlib] Differences after version upgrade
Roger Wechsler
Re: [MLlib] Differences after version upgrade
Sean Owen
Dependencies issue in spark
rajat kumar
Re: Dependencies issue in spark
rajat kumar
Building a ML pipeline with no training
Edgar H
Re: Building a ML pipeline with no training
Sean Owen
spark.executor.pyspark.memory not added to the executor resource request on Kubernetes
Shay Elbaz
Re: spark.executor.pyspark.memory not added to the executor resource request on Kubernetes
Shay Elbaz
very simple UI on webpage to display x/y plots+histogram of data stored in hive
Joris Billen
Re: very simple UI on webpage to display x/y plots+histogram of data stored in hive
Sean Owen
Re: very simple UI on webpage to display x/y plots+histogram of data stored in hive
Joris Billen
Re: very simple UI on webpage to display x/y plots+histogram of data stored in hive
ayan guha
Issue while building spark project
rajat kumar
Re: Issue while building spark project
Sean Owen
Re: Issue while building spark project
rajat kumar
CVE-2022-33891: Apache Spark shell command injection vulnerability via Spark UI
Sean Owen
[ANNOUNCE] Apache Spark 3.2.2 released
Dongjoon Hyun
Question regarding how to make Spark Scala evenly divide the Spark job between executors
Orkhan Dadashov
Re: Question regarding how to make Spark Scala evenly divide the Spark job between executors
Tufan Rakshit
spark re-use shuffle files not happening
Koert Kuipers
Re: [EXTERNAL] spark re-use shuffle files not happening
Shay Elbaz
Re: [EXTERNAL] spark re-use shuffle files not happening
Koert Kuipers
Spark Convert Column to String
Gibson
[Building] Building with JDK11
Szymon Kuryło
Re: [Building] Building with JDK11
Sean Owen
Re: [Building] Building with JDK11
Tufan Rakshit
Re: [Building] Building with JDK11
Stephen Coy
Re: [Building] Building with JDK11
Sergey B.
Re: [Building] Building with JDK11
Stephen Coy
Re: [Building] Building with JDK11
Szymon Kuryło
Re: [Building] Building with JDK11
Gera Shegalov
Re: [Building] Building with JDK11
Sean Owen
Spark (K8S) IPv6 support
Valer
Re: Spark (K8S) IPv6 support
Sean Owen
[Spark Structured Continuous Processing] Plans for future left join support.
Mikołaj Błaszczyk
How use pattern matching in spark
Sid
Re: How use pattern matching in spark
Bjørn Jørgensen
Spark streaming pending microbatches queue max length
Anil Dasari
Re: Spark streaming pending microbatches queue max length
Anil Dasari
[Spark][Core] Resource Allocation
Amin Borjian
Re: [Spark][Core] Resource Allocation
Sungwoo Park
about cpu cores
Yong Walt
Re: about cpu cores
Sean Owen
Re: about cpu cores
Tufan Rakshit
Re: about cpu cores
Yong Walt
Re: about cpu cores
Tufan Rakshit
Re: about cpu cores
Gourav Sengupta
reading each JSON file from dataframe...
Muthu Jayakumar
Re: reading each JSON file from dataframe...
Enrico Minack
Re: reading each JSON file from dataframe...
Muthu Jayakumar
Re: reading each JSON file from dataframe...
Enrico Minack
Re: reading each JSON file from dataframe...
ayan guha
Re: reading each JSON file from dataframe...
Muthu Jayakumar
Re: reading each JSON file from dataframe...
Gourav Sengupta
RDD.pipe() for binary data
Yuhao Zhang
Re: [EXTERNAL] RDD.pipe() for binary data
Shay Elbaz
Re: [EXTERNAL] RDD.pipe() for binary data
Yuhao Zhang
Re: [EXTERNAL] RDD.pipe() for binary data
Sean Owen
Re: [EXTERNAL] RDD.pipe() for binary data
Sebastian Piu
Re: [EXTERNAL] RDD.pipe() for binary data
Andrew Melo
Migration from Spark 2.4.0 to Spark 3.1.1 caused SortMergeJoin to change to BroadcastHashJoin
igor cabral uchoa
Re: Migration from Spark 2.4.0 to Spark 3.1.1 caused SortMergeJoin to change to BroadcastHashJoin
Tufan Rakshit
Re: Migration from Spark 2.4.0 to Spark 3.1.1 caused SortMergeJoin to change to BroadcastHashJoin
igor cabral uchoa
Reading parquet strips non-nullability from schema
Greg Kopff
Reading snappy/lz4 compressed csv/json files
Yeachan Park
Spark with Hive (Standalone) Metastore
Ankur Khanna
Re: Spark with Hive (Standalone) Metastore
Qian SUN
How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sid
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sean Owen
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sean Owen
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sid
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sean Owen
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sid
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Gourav Sengupta
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Apostolos N. Papadopoulos
Spark Group How to Ask
Zehra Günindi
Re: Spark Group How to Ask
Sean Owen
DatasourceV2 with Custom JDBC Source
Arsh Bhardwaj
Sources/V2 DatasourceV2 in Spark 3.*
Bigg Ben
Understanding about joins in spark
Sid
[FINAL CALL] - Travel Assistance to ApacheCon New Orleans 2022
Gavin McDonald
Glue is serverless? how?
Sid
Re: Glue is serverless? how?
Bjørn Jørgensen
Re: Glue is serverless? how?
finkel
Re: Glue is serverless? how?
Sid
Follow up on Jira Issue 39549
Chenyang Zhang
Re: Follow up on Jira Issue 39549
Sean Owen
Re: Follow up on Jira Issue 39549
Chenyang Zhang
Re: Follow up on Jira Issue 39549
Sean Owen
Need help with the configuration for AWS glue jobs
Sid
Re: Need help with the configuration for AWS glue jobs
Gourav Sengupta
Re: Need help with the configuration for AWS glue jobs
Sid
[Java 17] --add-exports required?
Greg Kopff
Re: [Java 17] --add-exports required?
Yang,Jie(INF)
Re: [Java 17] --add-exports required?
Greg Kopff
Re: [Java 17] --add-exports required?
Yang,Jie(INF)
Re: [Java 17] --add-exports required?
Greg Kopff
StructuredStreaming - read from Kafka, writing data into Mongo every 10 minutes
karan alang
repartition(n) should be deprecated/alerted
Igor Berman
Re: repartition(n) should be deprecated/alerted
Sean Owen
Re: repartition(n) should be deprecated/alerted
Igor Berman
[Spark Dataframe] How to load compressed file? (lz4, snappy)
HelloWorld
Will it lead to OOM error?
Sid
Re: Will it lead to OOM error?
Deepak Sharma
Re: Will it lead to OOM error?
Enrico Minack
Re: Will it lead to OOM error?
Sid
Re: Will it lead to OOM error?
Enrico Minack
Re: Will it lead to OOM error?
Yong Walt
Re: Will it lead to OOM error?
Sid
Spark Doubts
Sid
Re: Spark Doubts
Apostolos N. Papadopoulos
Re: Spark Doubts
Yong Walt
Re: Spark Doubts
Sid
Spark Doubts
Sid
Re: Spark Doubts
Tufan Rakshit
Re: Spark Doubts
Sid
Re: Spark Doubts
russell . spitzer
spark-submit on kubernetes
Michaela Bogiages
Spark Summit Europe
Gowran, Declan
Re: Spark Summit Europe
Sean Owen
How to guarantee dataset is split over unique partitions (partitioned by a column value)
DESCOTTE Loic - externe
Re: How to guarantee dataset is split over unique partitions (partitioned by a column value)
Sean Owen
How reading works?
Sid
Re: How reading works?
Sid