user
Messages by Date
2022/07/30
Salting technique doubt
Sid
2022/07/29
Re: PySpark cores
Gourav Sengupta
2022/07/29
[no subject]
Milin Korath
2022/07/29
Re: PySpark cores
Jacob Lynn
2022/07/28
PySpark cores
Andrew Melo
2022/07/28
Unsubscribe
Ashish
2022/07/28
Unsubscribe
Karthik Jayaraman
2022/07/28
Re: spark can't connect to kafka via sasl_ssl
wilson
2022/07/27
spark can't connect to kafka via sasl_ssl
wilson
2022/07/27
Re: Spark Avro Java 17 Compatibility
Sean Owen
2022/07/27
RE: Spark Avro Java 17 Compatibility
Shivaraj Sivasankaran
2022/07/27
[Spark thread pool configurations]: I would like to configure all ThreadPoolExecutor parameters for each thread pool started in Spark
Alex Peelman
2022/07/26
Re: [EXTERNAL] Partial data with ADLS Gen2
hwl17801341688
2022/07/25
Spark SQL Query filter behavior with special characters
prashanth reddy
2022/07/24
Re: [EXTERNAL] Partial data with ADLS Gen2
Tufan Rakshit
2022/07/24
Re: [EXTERNAL] Partial data with ADLS Gen2
Shay Elbaz
2022/07/24
Partial data with ADLS Gen2
kineret M
2022/07/24
Re: external table with parquet files: problem querying in sparksql since data is stored as integer while hive schema expects a timestamp
Gourav Sengupta
2022/07/22
Re: Updating Broadcast Variable in Spark Streaming 2.4.4
Sean Owen
2022/07/22
Updating Broadcast Variable in Spark Streaming 2.4.4
Dipl.-Inf. Rico Bergmann
2022/07/21
Re: Spark Structured Streaming -- Cannot consume next messages
KhajaAsmath Mohammed
2022/07/21
Re: Pyspark and multiprocessing
Khalid Mammadov
2022/07/21
Re: Spark Structured Streaming -- Cannot consume next messages
Artemis User
2022/07/21
Re: Pyspark and multiprocessing
Bjørn Jørgensen
2022/07/21
Spark Structured Streaming -- Cannot consume next messages
KhajaAsmath Mohammed
2022/07/21
Re: Pyspark and multiprocessing
Khalid Mammadov
2022/07/20
external table with parquet files: problem querying in sparksql since data is stored as integer while hive schema expects a timestamp
Joris Billen
2022/07/20
Fwd: Pyspark and multiprocessing
Bjørn Jørgensen
2022/07/20
Pyspark and multiprocessing
Bjørn Jørgensen
2022/07/20
Re: Dependencies issue in spark
rajat kumar
2022/07/20
Re: [MLlib] Differences after version upgrade
Sean Owen
2022/07/20
[MLlib] Differences after version upgrade
Roger Wechsler
2022/07/20
Re: Building a ML pipeline with no training
Sean Owen
2022/07/20
Dependencies issue in spark
rajat kumar
2022/07/20
Building a ML pipeline with no training
Edgar H
2022/07/19
Re: Issue while building spark project
rajat kumar
2022/07/19
Re: spark.executor.pyspark.memory not added to the executor resource request on Kubernetes
Shay Elbaz
2022/07/19
spark.executor.pyspark.memory not added to the executor resource request on Kubernetes
Shay Elbaz
2022/07/18
Re: [Building] Building with JDK11
Sean Owen
2022/07/18
Re: [Building] Building with JDK11
Gera Shegalov
2022/07/18
Re: [Building] Building with JDK11
Szymon Kuryło
2022/07/18
Re: very simple UI on webpage to display x/y plots+histogram of data stored in hive
ayan guha
2022/07/18
Re: very simple UI on webpage to display x/y plots+histogram of data stored in hive
Joris Billen
2022/07/18
Re: very simple UI on webpage to display x/y plots+histogram of data stored in hive
Sean Owen
2022/07/18
very simple UI on webpage to display x/y plots+histogram of data stored in hive
Joris Billen
2022/07/18
Re: Issue while building spark project
Sean Owen
2022/07/18
Issue while building spark project
rajat kumar
2022/07/18
Re: [Building] Building with JDK11
Stephen Coy
2022/07/17
Re: [Building] Building with JDK11
Sergey B.
2022/07/17
Re: [Building] Building with JDK11
Stephen Coy
2022/07/17
CVE-2022-33891: Apache Spark shell command injection vulnerability via Spark UI
Sean Owen
2022/07/17
Re: Question regarding how to make spar Scala to evenly divide the spark job between executors
Tufan Rakshit
2022/07/17
[ANNOUNCE] Apache Spark 3.2.2 released
Dongjoon Hyun
2022/07/16
Question regarding how to make spar Scala to evenly divide the spark job between executors
Orkhan Dadashov
2022/07/16
Re: [EXTERNAL] RDD.pipe() for binary data
Andrew Melo
2022/07/16
Re: [EXTERNAL] RDD.pipe() for binary data
Sebastian Piu
2022/07/16
Re: [EXTERNAL] RDD.pipe() for binary data
Sean Owen
2022/07/16
Re: [EXTERNAL] RDD.pipe() for binary data
Yuhao Zhang
2022/07/16
Re: [EXTERNAL] spark re-use shuffle files not happening
Koert Kuipers
2022/07/16
Re: [EXTERNAL] spark re-use shuffle files not happening
Shay Elbaz
2022/07/16
spark re-use shuffle files not happening
Koert Kuipers
2022/07/16
Spark Convert Column to String
Gibson
2022/07/15
Re: [Building] Building with JDK11
Tufan Rakshit
2022/07/15
Re: [Building] Building with JDK11
Sean Owen
2022/07/15
[Building] Building with JDK11
Szymon Kuryło
2022/07/15
Re: [Spark][Core] Resource Allocation
Sungwoo Park
2022/07/14
unsubscribe
randy clinton
2022/07/14
Re: How use pattern matching in spark
Bjørn Jørgensen
2022/07/14
Re: Spark (K8S) IPv6 support
Sean Owen
2022/07/14
Spark (K8S) IPv6 support
Valer
2022/07/13
Re: Spark streaming pending mircobatches queue max length
Anil Dasari
2022/07/13
[Spark Structured Continous Processing] Plans for future left join support.
Mikołaj Błaszczyk
2022/07/13
Re: reading each JSON file from dataframe...
Gourav Sengupta
2022/07/12
How use pattern matching in spark
Sid
2022/07/12
Re: How reading works?
Sid
2022/07/12
Re: reading each JSON file from dataframe...
Muthu Jayakumar
2022/07/12
Spark streaming pending mircobatches queue max length
Anil Dasari
2022/07/12
Re: reading each JSON file from dataframe...
ayan guha
2022/07/12
Re: reading each JSON file from dataframe...
Enrico Minack
2022/07/12
Re: reading each JSON file from dataframe...
Muthu Jayakumar
2022/07/12
[Spark][Core] Resource Allocation
Amin Borjian
2022/07/11
Re: reading each JSON file from dataframe...
Enrico Minack
2022/07/11
Re: about cpu cores
Gourav Sengupta
2022/07/11
Re: about cpu cores
Tufan Rakshit
2022/07/11
Re: about cpu cores
Yong Walt
2022/07/10
Re: about cpu cores
Tufan Rakshit
2022/07/10
Re: about cpu cores
Sean Owen
2022/07/10
Re: [EXTERNAL] RDD.pipe() for binary data
Shay Elbaz
2022/07/10
about cpu cores
Yong Walt
2022/07/10
reading each JSON file from dataframe...
Muthu Jayakumar
2022/07/08
RDD.pipe() for binary data
Yuhao Zhang
2022/07/06
Re: Migration from Spark 2.4.0 to Spark 3.1.1 caused SortMergeJoin to change to BroadcastHashJoin
igor cabral uchoa
2022/07/06
Re: Migration from Spark 2.4.0 to Spark 3.1.1 caused SortMergeJoin to change to BroadcastHashJoin
Tufan Rakshit
2022/07/06
Migration from Spark 2.4.0 to Spark 3.1.1 caused SortMergeJoin to change to BroadcastHashJoin
igor cabral uchoa
2022/07/05
Reading parquet strips non-nullability from schema
Greg Kopff
2022/07/05
Re: How reading works?
Bjørn Jørgensen
2022/07/05
Re: How reading works?
Bjørn Jørgensen
2022/07/05
Re: How reading works?
Sid
2022/07/05
Reading snappy/lz4 compressed csv/json files
Yeachan Park
2022/07/05
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Apostolos N. Papadopoulos
2022/07/05
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Gourav Sengupta
2022/07/04
Re: Spark with Hive (Standalone) Metastore
Qian SUN
2022/07/04
Spark with Hive (Standalone) Metastore
Ankur Khanna
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sid
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sean Owen
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sid
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sean Owen
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sean Owen
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sid
2022/07/02
How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
2022/07/01
Re: Spark Group How to Ask
Sean Owen
2022/07/01
Spark Group How to Ask
Zehra Günindi
2022/06/30
DatasourceV2 with Custom JDBC Source
Arsh Bhardwaj
2022/06/28
Sources/V2 DatasourceV2 in Spark 3.*
Bigg Ben
2022/06/28
Re: Glue is serverless? how?
Sid
2022/06/28
Re: Glue is serverless? how?
finkel
2022/06/27
Understanding about joins in spark
Sid
2022/06/27
[FINAL CALL] - Travel Assistance to ApacheCon New Orleans 2022
Gavin McDonald
2022/06/26
Re: Glue is serverless? how?
Bjørn Jørgensen
2022/06/26
Glue is serverless? how?
Sid
2022/06/25
Re: Spark Doubts
russell . spitzer
2022/06/25
Re: Spark Doubts
Sid
2022/06/25
Re: Spark Doubts
Tufan Rakshit
2022/06/24
Spark Doubts
Sid
2022/06/24
Re: Follow up on Jira Issue 39549
Sean Owen
2022/06/24
Re: Follow up on Jira Issue 39549
Chenyang Zhang
2022/06/24
Re: Follow up on Jira Issue 39549
Sean Owen
2022/06/24
Follow up on Jira Issue 39549
Chenyang Zhang
2022/06/23
Re: Need help with the configuration for AWS glue jobs
Sid
2022/06/23
Re: Need help with the configuration for AWS glue jobs
Gourav Sengupta
2022/06/23
Re: [Java 17] --add-exports required?
Greg Kopff
2022/06/23
Re: [Java 17] --add-exports required?
Yang,Jie(INF)
2022/06/22
Re: [Java 17] --add-exports required?
Greg Kopff
2022/06/22
Re: [Java 17] --add-exports required?
Yang,Jie(INF)
2022/06/22
Need help with the configuration for AWS glue jobs
Sid
2022/06/22
Re: Will it lead to OOM error?
Sid
2022/06/22
[Java 17] --add-exports required?
Greg Kopff
2022/06/22
Re: Will it lead to OOM error?
Yong Walt
2022/06/22
StructuredStreaming - read from Kafka, writing data into Mongo every 10 minutes
karan alang
2022/06/22
Re: repartition(n) should be deprecated/alerted
Igor Berman
2022/06/22
Re: Will it lead to OOM error?
Enrico Minack
2022/06/22
Re: Will it lead to OOM error?
Sid
2022/06/22
Re: repartition(n) should be deprecated/alerted
Sean Owen
2022/06/22
repartition(n) should be deprecated/alerted
Igor Berman
2022/06/22
[Spark Dataframe] How to load compressed file? (lz4, snappy)
HelloWorld
2022/06/22
Re: Will it lead to OOM error?
Enrico Minack
2022/06/22
Re: Will it lead to OOM error?
Deepak Sharma
2022/06/22
Will it lead to OOM error?
Sid
2022/06/21
Re: Spark Doubts
Sid
2022/06/21
Re: Spark Doubts
Yong Walt
2022/06/21
Re: Spark Doubts
Apostolos N. Papadopoulos
2022/06/21
Spark Doubts
Sid
2022/06/21
Re: Spark Summit Europe
Sean Owen
2022/06/21
spark-submit on kubernetes
Michaela Bogiages
2022/06/21
Spark Summit Europe
Gowran, Declan
2022/06/20
Re: How to guarantee dataset is split over unique partitions (partitioned by a column value)
Sean Owen
2022/06/20
How to guarantee dataset is split over unique partitions (partitioned by a column value)
DESCOTTE Loic - externe
2022/06/20
Re: How reading works?
Sid
2022/06/19
Re: input file size
Markus Breuer
2022/06/19
How reading works?
Sid
2022/06/19
Re: input file size
marc nicole
2022/06/19
Re: input file size
Enrico Minack
2022/06/19
Re: input file size
Gourav Sengupta
2022/06/18
Re: input file size
Enrico Minack
2022/06/18
Re: input file size
Yong Walt
2022/06/18
Re: input file size
marc nicole
2022/06/18
input file size
mbreuer
2022/06/17
Re: how to properly filter a dataset by dates ?
marc nicole
2022/06/17
Re: how to properly filter a dataset by dates ?
marc nicole
2022/06/17
Re: how to properly filter a dataset by dates ?
marc nicole
2022/06/17
Re: how to properly filter a dataset by dates ?
Stelios Philippou
2022/06/17
Re: how to properly filter a dataset by dates ?
marc nicole
2022/06/17
Re: how to properly filter a dataset by dates ?
Stelios Philippou
2022/06/17
Re: how to properly filter a dataset by dates ?
marc nicole
2022/06/17
Re: how to properly filter a dataset by dates ?
Sean Owen
2022/06/17
Re: how to properly filter a dataset by dates ?
marc nicole
2022/06/17
Re: how to properly filter a dataset by dates ?
Sean Owen
2022/06/17
how to properly filter a dataset by dates ?
marc nicole
2022/06/16
How to update TaskMetrics from Python?
Shay Elbaz
2022/06/15
Spark Structured streaming(batch mode) - running dependent jobs concurrently
karan alang
2022/06/15
Re: How to recognize and get the min of a date/string column in Java?
marc nicole
2022/06/14
Re: Stickers and Swag
Qian Sun
2022/06/14
Re: Stickers and Swag
Reynold Xin
2022/06/14
Re: Stickers and Swag
Gengliang Wang
2022/06/14
Re: Stickers and Swag
Hyukjin Kwon
2022/06/14
Re: How to recognize and get the min of a date/string column in Java?
marc nicole
2022/06/14
Re: How to recognize and get the min of a date/string column in Java?
Sean Owen
2022/06/14
Re: How to recognize and get the min of a date/string column in Java?
marc nicole
2022/06/14
Re: How to recognize and get the min of a date/string column in Java?
marc nicole
2022/06/14
Re: How to recognize and get the min of a date/string column in Java?
Sean Owen
2022/06/14
How to recognize and get the min of a date/string column in Java?
marc nicole
2022/06/13
Stickers and Swag
Xiao Li
2022/06/13
Re: API Problem
Enrico Minack
2022/06/13
Re: Redesign approach for hitting the APIs using PySpark
Sid