user
Messages by Date
2022/07/12
How use pattern matching in spark
Sid
2022/07/12
Re: How reading works?
Sid
2022/07/12
Re: reading each JSON file from dataframe...
Muthu Jayakumar
2022/07/12
Spark streaming pending mircobatches queue max length
Anil Dasari
2022/07/12
Re: reading each JSON file from dataframe...
ayan guha
2022/07/12
Re: reading each JSON file from dataframe...
Enrico Minack
2022/07/12
Re: reading each JSON file from dataframe...
Muthu Jayakumar
2022/07/12
[Spark][Core] Resource Allocation
Amin Borjian
2022/07/11
Re: reading each JSON file from dataframe...
Enrico Minack
2022/07/11
Re: about cpu cores
Gourav Sengupta
2022/07/11
Re: about cpu cores
Tufan Rakshit
2022/07/11
Re: about cpu cores
Yong Walt
2022/07/10
Re: about cpu cores
Tufan Rakshit
2022/07/10
Re: about cpu cores
Sean Owen
2022/07/10
Re: [EXTERNAL] RDD.pipe() for binary data
Shay Elbaz
2022/07/10
about cpu cores
Yong Walt
2022/07/10
reading each JSON file from dataframe...
Muthu Jayakumar
2022/07/08
RDD.pipe() for binary data
Yuhao Zhang
2022/07/06
Re: Migration from Spark 2.4.0 to Spark 3.1.1 caused SortMergeJoin to change to BroadcastHashJoin
igor cabral uchoa
2022/07/06
Re: Migration from Spark 2.4.0 to Spark 3.1.1 caused SortMergeJoin to change to BroadcastHashJoin
Tufan Rakshit
2022/07/06
Migration from Spark 2.4.0 to Spark 3.1.1 caused SortMergeJoin to change to BroadcastHashJoin
igor cabral uchoa
2022/07/05
Reading parquet strips non-nullability from schema
Greg Kopff
2022/07/05
Re: How reading works?
Bjørn Jørgensen
2022/07/05
Re: How reading works?
Bjørn Jørgensen
2022/07/05
Re: How reading works?
Sid
2022/07/05
Reading snappy/lz4 compressed csv/json files
Yeachan Park
2022/07/05
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Apostolos N. Papadopoulos
2022/07/05
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Gourav Sengupta
2022/07/04
Re: Spark with Hive (Standalone) Metastore
Qian SUN
2022/07/04
Spark with Hive (Standalone) Metastore
Ankur Khanna
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sid
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sean Owen
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sid
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sean Owen
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sean Owen
2022/07/02
Re: How is Spark a memory based solution if it writes data to disk before shuffles?
Sid
2022/07/02
How is Spark a memory based solution if it writes data to disk before shuffles?
krexos
2022/07/01
Re: Spark Group How to Ask
Sean Owen
2022/07/01
Spark Group How to Ask
Zehra Günindi
2022/06/30
DatasourceV2 with Custom JDBC Source
Arsh Bhardwaj
2022/06/28
Sources/V2 DatasourceV2 in Spark 3.*
Bigg Ben
2022/06/28
Re: Glue is serverless? how?
Sid
2022/06/28
Re: Glue is serverless? how?
finkel
2022/06/27
Understanding about joins in spark
Sid
2022/06/27
[FINAL CALL] - Travel Assistance to ApacheCon New Orleans 2022
Gavin McDonald
2022/06/26
Re: Glue is serverless? how?
Bjørn Jørgensen
2022/06/26
Glue is serverless? how?
Sid
2022/06/25
Re: Spark Doubts
russell . spitzer
2022/06/25
Re: Spark Doubts
Sid
2022/06/25
Re: Spark Doubts
Tufan Rakshit
2022/06/24
Spark Doubts
Sid
2022/06/24
Re: Follow up on Jira Issue 39549
Sean Owen
2022/06/24
Re: Follow up on Jira Issue 39549
Chenyang Zhang
2022/06/24
Re: Follow up on Jira Issue 39549
Sean Owen
2022/06/24
Follow up on Jira Issue 39549
Chenyang Zhang
2022/06/23
Re: Need help with the configuration for AWS glue jobs
Sid
2022/06/23
Re: Need help with the configuration for AWS glue jobs
Gourav Sengupta
2022/06/23
Re: [Java 17] --add-exports required?
Greg Kopff
2022/06/23
Re: [Java 17] --add-exports required?
Yang,Jie(INF)
2022/06/22
Re: [Java 17] --add-exports required?
Greg Kopff
2022/06/22
Re: [Java 17] --add-exports required?
Yang,Jie(INF)
2022/06/22
Need help with the configuration for AWS glue jobs
Sid
2022/06/22
Re: Will it lead to OOM error?
Sid
2022/06/22
[Java 17] --add-exports required?
Greg Kopff
2022/06/22
Re: Will it lead to OOM error?
Yong Walt
2022/06/22
StructuredStreaming - read from Kafka, writing data into Mongo every 10 minutes
karan alang
2022/06/22
Re: repartition(n) should be deprecated/alerted
Igor Berman
2022/06/22
Re: Will it lead to OOM error?
Enrico Minack
2022/06/22
Re: Will it lead to OOM error?
Sid
2022/06/22
Re: repartition(n) should be deprecated/alerted
Sean Owen
2022/06/22
repartition(n) should be deprecated/alerted
Igor Berman
2022/06/22
[Spark Dataframe] How to load compressed file? (lz4, snappy)
HelloWorld
2022/06/22
Re: Will it lead to OOM error?
Enrico Minack
2022/06/22
Re: Will it lead to OOM error?
Deepak Sharma
2022/06/22
Will it lead to OOM error?
Sid
2022/06/21
Re: Spark Doubts
Sid
2022/06/21
Re: Spark Doubts
Yong Walt
2022/06/21
Re: Spark Doubts
Apostolos N. Papadopoulos
2022/06/21
Spark Doubts
Sid
2022/06/21
Re: Spark Summit Europe
Sean Owen
2022/06/21
spark-submit on kubernetes
Michaela Bogiages
2022/06/21
Spark Summit Europe
Gowran, Declan
2022/06/20
Re: How to guarantee dataset is split over unique partitions (partitioned by a column value)
Sean Owen
2022/06/20
How to guarantee dataset is split over unique partitions (partitioned by a column value)
DESCOTTE Loic - externe
2022/06/20
Re: How reading works?
Sid
2022/06/19
Re: input file size
Markus Breuer
2022/06/19
How reading works?
Sid
2022/06/19
Re: input file size
marc nicole
2022/06/19
Re: input file size
Enrico Minack
2022/06/19
Re: input file size
Gourav Sengupta
2022/06/18
Re: input file size
Enrico Minack
2022/06/18
Re: input file size
Yong Walt
2022/06/18
Re: input file size
marc nicole
2022/06/18
input file size
mbreuer
2022/06/17
Re: how to properly filter a dataset by dates ?
marc nicole
2022/06/17
Re: how to properly filter a dataset by dates ?
marc nicole
2022/06/17
Re: how to properly filter a dataset by dates ?
marc nicole
2022/06/17
Re: how to properly filter a dataset by dates ?
Stelios Philippou
2022/06/17
Re: how to properly filter a dataset by dates ?
marc nicole
2022/06/17
Re: how to properly filter a dataset by dates ?
Stelios Philippou
2022/06/17
Re: how to properly filter a dataset by dates ?
marc nicole
2022/06/17
Re: how to properly filter a dataset by dates ?
Sean Owen
2022/06/17
Re: how to properly filter a dataset by dates ?
marc nicole
2022/06/17
Re: how to properly filter a dataset by dates ?
Sean Owen
2022/06/17
how to properly filter a dataset by dates ?
marc nicole
2022/06/16
How to update TaskMetrics from Python?
Shay Elbaz
2022/06/15
Spark Structured streaming(batch mode) - running dependent jobs concurrently
karan alang
2022/06/15
Re: How to recognize and get the min of a date/string column in Java?
marc nicole
2022/06/14
Re: Stickers and Swag
Qian Sun
2022/06/14
Re: Stickers and Swag
Reynold Xin
2022/06/14
Re: Stickers and Swag
Gengliang Wang
2022/06/14
Re: Stickers and Swag
Hyukjin Kwon
2022/06/14
Re: How to recognize and get the min of a date/string column in Java?
marc nicole
2022/06/14
Re: How to recognize and get the min of a date/string column in Java?
Sean Owen
2022/06/14
Re: How to recognize and get the min of a date/string column in Java?
marc nicole
2022/06/14
Re: How to recognize and get the min of a date/string column in Java?
marc nicole
2022/06/14
Re: How to recognize and get the min of a date/string column in Java?
Sean Owen
2022/06/14
How to recognize and get the min of a date/string column in Java?
marc nicole
2022/06/13
Stickers and Swag
Xiao Li
2022/06/13
Re: API Problem
Enrico Minack
2022/06/13
Re: Redesign approach for hitting the APIs using PySpark
Sid
2022/06/13
Re: Redesign approach for hitting the APIs using PySpark
Gourav Sengupta
2022/06/13
Re: Redesign approach for hitting the APIs using PySpark
Sid
2022/06/13
Re: Redesign approach for hitting the APIs using PySpark
Gourav Sengupta
2022/06/13
Re: Redesign approach for hitting the APIs using PySpark
Sid
2022/06/13
Re: Redesign approach for hitting the APIs using PySpark
Gourav Sengupta
2022/06/13
Redesign approach for hitting the APIs using PySpark
Sid
2022/06/11
Re: API Problem
Sid
2022/06/10
Re: API Problem
Enrico Minack
2022/06/10
Re: API Problem
Sid
2022/06/10
Re: API Problem
Enrico Minack
2022/06/10
Re:
Aironman DirtDiver
2022/06/10
Re: API Problem
Enrico Minack
2022/06/10
Re: API Problem
Sid
2022/06/10
[no subject]
Rodrigo
2022/06/10
Re: API Problem
Stelios Philippou
2022/06/10
Re: API Problem
Sid
2022/06/09
Re: API Problem
Sean Owen
2022/06/09
Spark streaming / confluent Kafka- messages are empty
KhajaAsmath Mohammed
2022/06/09
Re: API Problem
Stelios Philippou
2022/06/09
API Problem
Sid
2022/06/09
Re: to find Difference of locations in Spark Dataframe rows
Bjørn Jørgensen
2022/06/09
Re: Retrieve the count of spark nodes
Poorna Murali
2022/06/08
Re: Retrieve the count of spark nodes
Stephen Coy
2022/06/08
Retrieve the count of spark nodes
Poorna Murali
2022/06/07
to find Difference of locations in Spark Dataframe rows
Chetan Khatri
2022/06/07
Re: How the data is distributed
Sid
2022/06/06
Re: How the data is distributed
Sean Owen
2022/06/06
Re: How the data is distributed
Peyman Mohajerian
2022/06/06
How the data is distributed
Sid
2022/06/06
Structured streaming with protobuf proto3 schema registry
Kiran Biswal
2022/06/06
Re: How to convert a Dataset<Row> to a Dataset<String>?
Stelios Philippou
2022/06/06
Re: How to convert a Dataset<Row> to a Dataset<String>?
Christophe Préaud
2022/06/04
Re: partitionBy creating lot of small files
Enrico Minack
2022/06/04
Re: How to convert a Dataset<Row> to a Dataset<String>?
marc nicole
2022/06/04
Re: How to convert a Dataset<Row> to a Dataset<String>?
Enrico Minack
2022/06/04
Re: How to convert a Dataset<Row> to a Dataset<String>?
marc nicole
2022/06/04
partitionBy creating lot of small files
Nikhil Goyal
2022/06/04
Re: How to convert a Dataset<Row> to a Dataset<String>?
Enrico Minack
2022/06/04
Re: How to convert a Dataset<Row> to a Dataset<String>?
marc nicole
2022/06/04
Re: How to convert a Dataset<Row> to a Dataset<String>?
Sean Owen
2022/06/04
Re: How to convert a Dataset<Row> to a Dataset<String>?
marc nicole
2022/06/04
Re: How to convert a Dataset<Row> to a Dataset<String>?
Sean Owen
2022/06/04
How to convert a Dataset<Row> to a Dataset<String>?
marc nicole
2022/06/03
Re: PartitionBy and SortWithinPartitions
Nikhil Goyal
2022/06/03
Re: PartitionBy and SortWithinPartitions
Enrico Minack
2022/06/03
PartitionBy and SortWithinPartitions
Nikhil Goyal
2022/06/02
approx_count_distinct in spark always return 1
marc nicole
2022/06/02
Does adaptive auto broadcast respect spark.sql.autoBroadcastJoinThreshold
Henry Quan
2022/06/01
What's the expected Spark 3.1.4 release date ?
Sandeep Vinayak
2022/05/31
Re: Job migrated from EMR to Dataproc takes 20 hours instead of 90 minutes
Gourav Sengupta
2022/05/31
Kotlin API for Apache Spark feedback
finkel
2022/05/31
Unsubscribe
Daan Stroep
2022/05/30
Re: Job migrated from EMR to Dataproc takes 20 hours instead of 90 minutes
Ranadip Chatterjee
2022/05/30
Re: protobuf data as input to spark streaming
Kiran Biswal
2022/05/30
Re: Unable to format timestamp values in pyspark
Sid
2022/05/30
Re: Unable to format timestamp values in pyspark
Stelios Philippou
2022/05/30
Unable to format timestamp values in pyspark
Sid
2022/05/30
Re: Job migrated from EMR to Dataproc takes 20 hours instead of 90 minutes
Ori Popowski
2022/05/29
Re: Unable to convert double values
marc nicole
2022/05/29
Re: Unable to convert double values
marc nicole
2022/05/29
Re: Unable to convert double values
Stelios Philippou
2022/05/29
Unable to convert double values
Sid
2022/05/28
k-anonymity with Spark in Java
marc nicole
2022/05/28
Re: Spark Push-Based Shuffle causing multiple stage failures
Ye Zhou
2022/05/27
Re: Job migrated from EMR to Dataproc takes 20 hours instead of 90 minutes
Aniket Mokashi
2022/05/26
Re: Issues getting Apache Spark
Apostolos N. Papadopoulos
2022/05/26
Issues getting Apache Spark
Martin, Michael
2022/05/26
Re: Complexity with the data
Sid
2022/05/26
Re: Complexity with the data
Gourav Sengupta
2022/05/26
java.lang.NoSuchMethodError: org.apache.hadoop.hive.common.FileUtils.mkdir --> Spark to Hive
Prasanth M Sasidharan
2022/05/26
Re: Complexity with the data
Bjørn Jørgensen
2022/05/26
Re: Complexity with the data
Sid
2022/05/26
Fwd: java.lang.NoSuchMethodError: org.apache.hadoop.hive.common.FileUtils.mkdir --> Spark to Hive
Prasanth M Sasidharan