user
Thread
Date
Earlier messages
Later messages
Messages by Thread
Re: Issue while building spark project
Sean Owen
Re: Issue while building spark project
rajat kumar
CVE-2022-33891: Apache Spark shell command injection vulnerability via Spark UI
Sean Owen
[ANNOUNCE] Apache Spark 3.2.2 released
Dongjoon Hyun
Question regarding how to make spar Scala to evenly divide the spark job between executors
Orkhan Dadashov
Re: Question regarding how to make spar Scala to evenly divide the spark job between executors
Tufan Rakshit
spark re-use shuffle files not happening
Koert Kuipers
Re: [EXTERNAL] spark re-use shuffle files not happening
Shay Elbaz
Re: [EXTERNAL] spark re-use shuffle files not happening
Koert Kuipers
Spark Convert Column to String
Gibson
[Building] Building with JDK11
Szymon Kuryło
Re: [Building] Building with JDK11
Sean Owen
Re: [Building] Building with JDK11
Tufan Rakshit
Re: [Building] Building with JDK11
Stephen Coy
Re: [Building] Building with JDK11
Sergey B.
Re: [Building] Building with JDK11
Stephen Coy
Re: [Building] Building with JDK11
Szymon Kuryło
Re: [Building] Building with JDK11
Gera Shegalov
Re: [Building] Building with JDK11
Sean Owen
Spark (K8S) IPv6 support
Valer
Re: Spark (K8S) IPv6 support
Sean Owen
[Spark Structured Continous Processing] Plans for future left join support.
Mikołaj Błaszczyk
How use pattern matching in spark
Sid
Re: How use pattern matching in spark
Bjørn Jørgensen
Spark streaming pending mircobatches queue max length
Anil Dasari
Re: Spark streaming pending mircobatches queue max length
Anil Dasari
[Spark][Core] Resource Allocation
Amin Borjian
Re: [Spark][Core] Resource Allocation
Sungwoo Park
about cpu cores
Yong Walt
Re: about cpu cores
Sean Owen
Re: about cpu cores
Tufan Rakshit
Re: about cpu cores
Yong Walt
Re: about cpu cores
Tufan Rakshit
Re: about cpu cores
Gourav Sengupta
reading each JSON file from dataframe...
Muthu Jayakumar
Re: reading each JSON file from dataframe...
Enrico Minack
Re: reading each JSON file from dataframe...
Muthu Jayakumar
Re: reading each JSON file from dataframe...
Enrico Minack
Re: reading each JSON file from dataframe...
ayan guha
Re: reading each JSON file from dataframe...
Muthu Jayakumar
Re: reading each JSON file from dataframe...
Gourav Sengupta
RDD.pipe() for binary data
Yuhao Zhang
Re: [EXTERNAL] RDD.pipe() for binary data
Shay Elbaz
Re: [EXTERNAL] RDD.pipe() for binary data
Yuhao Zhang
Re: [EXTERNAL] RDD.pipe() for binary data
Sean Owen
Re: [EXTERNAL] RDD.pipe() for binary data
Sebastian Piu
Re: [EXTERNAL] RDD.pipe() for binary data
Andrew Melo
Migration from Spark 2.4.0 to Spark 3.1.1 caused SortMergeJoin to change to BroadcastHashJoin
igor cabral uchoa
Re: Migration from Spark 2.4.0 to Spark 3.1.1 caused SortMergeJoin to change to BroadcastHashJoin
Tufan Rakshit
Re: Migration from Spark 2.4.0 to Spark 3.1.1 caused SortMergeJoin to change to BroadcastHashJoin
igor cabral uchoa
Reading parquet strips non-nullability from schema
Greg Kopff
Reading snappy/lz4 compressed csv/json files
Yeachan Park
Spark with Hive (Standalone) Metastore
Ankur Khanna
Re: Spark with Hive (Standalone) Metastore
Qian SUN
How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sid
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sean Owen
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sean Owen
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sid
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sean Owen
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sid
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Gourav Sengupta
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Apostolos N. Papadopoulos
Spark Group How to Ask
Zehra Günindi
Re: Spark Group How to Ask
Sean Owen
DatasourceV2 with Custom JDBC Source
Arsh Bhardwaj
Sources/V2 DatasourceV2 in Spark 3.*
Bigg Ben
Understanding about joins in spark
Sid
[FINAL CALL] - Travel Assistance to ApacheCon New Orleans 2022
Gavin McDonald
Glue is serverless? how?
Sid
Re: Glue is serverless? how?
Bjørn Jørgensen
Re: Glue is serverless? how?
finkel
Re: Glue is serverless? how?
Sid
Follow up on Jira Issue 39549
Chenyang Zhang
Re: Follow up on Jira Issue 39549
Sean Owen
Re: Follow up on Jira Issue 39549
Chenyang Zhang
Re: Follow up on Jira Issue 39549
Sean Owen
Need help with the configuration for AWS glue jobs
Sid
Re: Need help with the configuration for AWS glue jobs
Gourav Sengupta
Re: Need help with the configuration for AWS glue jobs
Sid
[Java 17] --add-exports required?
Greg Kopff
Re: [Java 17] --add-exports required?
Yang,Jie(INF)
Re: [Java 17] --add-exports required?
Greg Kopff
Re: [Java 17] --add-exports required?
Yang,Jie(INF)
Re: [Java 17] --add-exports required?
Greg Kopff
StructuredStreaming - read from Kafka, writing data into Mongo every 10 minutes
karan alang
repartition(n) should be deprecated/alerted
Igor Berman
Re: repartition(n) should be deprecated/alerted
Sean Owen
Re: repartition(n) should be deprecated/alerted
Igor Berman
[Spark Dataframe] How to load compressed file? (lz4, snappy)
HelloWorld
Will it lead to OOM error?
Sid
Re: Will it lead to OOM error?
Deepak Sharma
Re: Will it lead to OOM error?
Enrico Minack
Re: Will it lead to OOM error?
Sid
Re: Will it lead to OOM error?
Enrico Minack
Re: Will it lead to OOM error?
Yong Walt
Re: Will it lead to OOM error?
Sid
Spark Doubts
Sid
Re: Spark Doubts
Apostolos N. Papadopoulos
Re: Spark Doubts
Yong Walt
Re: Spark Doubts
Sid
Spark Doubts
Sid
Re: Spark Doubts
Tufan Rakshit
Re: Spark Doubts
Sid
Re: Spark Doubts
russell . spitzer
spark-submit on kubernetes
Michaela Bogiages
Spark Summit Europe
Gowran, Declan
Re: Spark Summit Europe
Sean Owen
How to guarantee dataset is split over unique partitions (partitioned by a column value)
DESCOTTE Loic - externe
Re: How to guarantee dataset is split over unique partitions (partitioned by a column value)
Sean Owen
How reading works?
Sid
Re: How reading works?
Sid
Re: How reading works?
Sid
Re: How reading works?
Bjørn Jørgensen
Re: How reading works?
Bjørn Jørgensen
Re: How reading works?
Sid
input file size
mbreuer
Re: input file size
marc nicole
Re: input file size
Yong Walt
Re: input file size
Enrico Minack
Re: input file size
Gourav Sengupta
Re: input file size
Enrico Minack
Re: input file size
marc nicole
Re: input file size
Markus Breuer
how to properly filter a dataset by dates ?
marc nicole
Re: how to properly filter a dataset by dates ?
Sean Owen
Re: how to properly filter a dataset by dates ?
marc nicole
Re: how to properly filter a dataset by dates ?
Sean Owen
Re: how to properly filter a dataset by dates ?
marc nicole
Re: how to properly filter a dataset by dates ?
Stelios Philippou
Re: how to properly filter a dataset by dates ?
marc nicole
Re: how to properly filter a dataset by dates ?
Stelios Philippou
Re: how to properly filter a dataset by dates ?
marc nicole
Re: how to properly filter a dataset by dates ?
marc nicole
Re: how to properly filter a dataset by dates ?
marc nicole
How to update TaskMetrics from Python?
Shay Elbaz
Spark Structured streaming(batch mode) - running dependent jobs concurrently
karan alang
How to recognize and get the min of a date/string column in Java?
marc nicole
Re: How to recognize and get the min of a date/string column in Java?
Sean Owen
Re: How to recognize and get the min of a date/string column in Java?
marc nicole
Re: How to recognize and get the min of a date/string column in Java?
marc nicole
Re: How to recognize and get the min of a date/string column in Java?
Sean Owen
Re: How to recognize and get the min of a date/string column in Java?
marc nicole
Re: How to recognize and get the min of a date/string column in Java?
marc nicole
Stickers and Swag
Xiao Li
Re: Stickers and Swag
Hyukjin Kwon
Re: Stickers and Swag
Gengliang Wang
Re: Stickers and Swag
Reynold Xin
Re: Stickers and Swag
Qian Sun
Redesign approach for hitting the APIs using PySpark
Sid
Re: Redesign approach for hitting the APIs using PySpark
Gourav Sengupta
Re: Redesign approach for hitting the APIs using PySpark
Sid
Re: Redesign approach for hitting the APIs using PySpark
Gourav Sengupta
Re: Redesign approach for hitting the APIs using PySpark
Sid
Re: Redesign approach for hitting the APIs using PySpark
Gourav Sengupta
Re: Redesign approach for hitting the APIs using PySpark
Sid
[no subject]
Rodrigo
Re:
Aironman DirtDiver
Spark streaming / confluent Kafka- messages are empty
KhajaAsmath Mohammed
API Problem
Sid
Re: API Problem
Stelios Philippou
Re: API Problem
Sean Owen
Re: API Problem
Sid
Re: API Problem
Stelios Philippou
Re: API Problem
Sid
Re: API Problem
Enrico Minack
Re: API Problem
Enrico Minack
Re: API Problem
Sid
Re: API Problem
Enrico Minack
Re: API Problem
Sid
Re: API Problem
Enrico Minack
Retrieve the count of spark nodes
Poorna Murali
Re: Retrieve the count of spark nodes
Stephen Coy
Re: Retrieve the count of spark nodes
Poorna Murali
to find Difference of locations in Spark Dataframe rows
Chetan Khatri
Re: to find Difference of locations in Spark Dataframe rows
Bjørn Jørgensen
How the data is distributed
Sid
Re: How the data is distributed
Peyman Mohajerian
Re: How the data is distributed
Sean Owen
Re: How the data is distributed
Sid
Structured streaming with protobuf proto3 schema registry
Kiran Biswal
partitionBy creating lot of small files
Nikhil Goyal
Re: partitionBy creating lot of small files
Enrico Minack
How to convert a Dataset<Row> to a Dataset<String>?
marc nicole
Re: How to convert a Dataset<Row> to a Dataset<String>?
Sean Owen
Re: How to convert a Dataset<Row> to a Dataset<String>?
marc nicole
Re: How to convert a Dataset<Row> to a Dataset<String>?
Sean Owen
Re: How to convert a Dataset<Row> to a Dataset<String>?
marc nicole
Re: How to convert a Dataset<Row> to a Dataset<String>?
Enrico Minack
Re: How to convert a Dataset<Row> to a Dataset<String>?
marc nicole
Re: How to convert a Dataset<Row> to a Dataset<String>?
Enrico Minack
Re: How to convert a Dataset<Row> to a Dataset<String>?
marc nicole
Re: How to convert a Dataset<Row> to a Dataset<String>?
Christophe Préaud
Re: How to convert a Dataset<Row> to a Dataset<String>?
Stelios Philippou
Earlier messages
Later messages