user

Messages by Date

2022/09/19 Re: Splittable or not? Sid
2022/09/19 Driver throws exception every few hours Kiran Biswal
2022/09/17 Re: [Spark Internals]: Is sort order preserved after partitioned write? Enrico Minack
2022/09/17 Re: Splittable or not? Enrico Minack
2022/09/16 Re: [Spark Internals]: Is sort order preserved after partitioned write? Swetha Baskaran
2022/09/16 [Spark Core] Joining Same DataFrame Multiple Times Results in Column not getting dropped Shahban Riaz
2022/09/15 Re: [Spark Internals]: Is sort order preserved after partitioned write? Enrico Minack
2022/09/15 Re: EXT: Re: Spark SQL Vibhor Gupta
2022/09/15 [Spark Internals]: Is sort order preserved after partitioned write? Swetha Baskaran
2022/09/15 Re: Spark SQL Gourav Sengupta
2022/09/15 Re: Spark SQL Mayur Benodekar
2022/09/14 Re: Spark SQL Gourav Sengupta
2022/09/14 Big Data Contract Roles ? sri hari kali charan Tummala
2022/09/14 Re: Splittable or not? Sid
2022/09/14 Re: Jupyter notebook on Dataproc versus GKE Bjørn Jørgensen
2022/09/14 Re: Splittable or not? Amit Joshi
2022/09/14 Re: Long running task in spark Sid
2022/09/14 Splittable or not? Sid
2022/09/13 Unsubscribe Raghunadh Madamanchi
2022/09/13 Unsubscribe Hari Kunapareddy
2022/09/13 Re: EXT: Network time out property is not getting set in Spark Sachit Murarka
2022/09/13 Re: EXT: Network time out property is not getting set in Spark Vibhor Gupta
2022/09/13 Network time out property is not getting set in Spark Sachit Murarka
2022/09/12 Re: [SPARK STRUCTURED STREAMING] : Rocks DB uses off-heap usage Artemis User
2022/09/11 Re: Spark Issue with Istio in Distributed Mode Deepak Sharma
2022/09/11 RE: [EXTERNAL] Re: Dynamic shuffle partitions in a single job Kapil Kumar Singh
2022/09/11 Long running task in spark rajat kumar
2022/09/11 [SPARK STRUCTURED STREAMING] : Rocks DB uses off-heap usage akshit marwah
2022/09/11 Re: Pipelined execution in Spark (???) Gourav Sengupta
2022/09/10 Re: Dynamic shuffle partitions in a single job Anupam Singh
2022/09/08 Dynamic shuffle partitions in a single job Vibhor Gupta
2022/09/07 Re: Pipelined execution in Spark (???) Russell Jurney
2022/09/07 Re: Pipelined execution in Spark (???) Sungwoo Park
2022/09/07 Re: Pipelined execution in Spark (???) Russell Jurney
2022/09/07 Re: Pipelined execution in Spark (???) Russell Jurney
2022/09/07 Re: Pipelined execution in Spark (???) Sean Owen
2022/09/07 Re: Pipelined execution in Spark (???) Sungwoo Park
2022/09/07 Spark SQL Mayur Benodekar
2022/09/07 Re: Pipelined execution in Spark (???) Russell Jurney
2022/09/07 Re: Spark equivalent to hdfs groups phiroc
2022/09/07 Re: Spark equivalent to hdfs groups Sean Owen
2022/09/07 Re: Spark equivalent to hdfs groups phiroc
2022/09/07 Pipelined execution in Spark (???) Sungwoo Park
2022/09/07 Re: Spark equivalent to hdfs groups Sean Owen
2022/09/07 Spark equivalent to hdfs groups phiroc
2022/09/06 Spark Structured Streaming - unable to change max.poll.records (showing as 1) karan alang
2022/09/06 Re: Jupyter notebook on Dataproc versus GKE Holden Karau
2022/09/06 Re: Jupyter notebook on Dataproc versus GKE Mich Talebzadeh
2022/09/06 [ANNOUNCE] Apache Kyuubi (Incubating) released 1.6.0-incubating Nicholas Jiang
2022/09/06 Re: Jupyter notebook on Dataproc versus GKE Bjørn Jørgensen
2022/09/06 Re: Error in Spark in Jupyter Notebook Sean Owen
2022/09/06 Error in Spark in Jupyter Notebook Mamata Shee
2022/09/05 Re: Jupyter notebook on Dataproc versus GKE Mich Talebzadeh
2022/09/05 Re: Jupyter notebook on Dataproc versus GKE Bjørn Jørgensen
2022/09/05 Re: Jupyter notebook on Dataproc versus GKE Holden Karau
2022/09/05 Apache Spark - How to concert DataFrame json string to structured element and using schema_of_json M Singh
2022/09/05 Re: Jupyter notebook on Dataproc versus GKE Mich Talebzadeh
2022/09/05 Re: Jupyter notebook on Dataproc versus GKE Holden Karau
2022/09/05 Jupyter notebook on Dataproc versus GKE Mich Talebzadeh
2022/09/03 Re: Spark Issue with Istio in Distributed Mode Deepak Sharma
2022/09/02 Spark Issue with Istio in Distributed Mode Deepak Sharma
2022/09/02 Data Type Issue while upgrading to Spark3 rajat kumar
2022/09/01 Creating Custom Broadcast Join Murali S
2022/09/01 Re: Spark 3.3.0/3.2.2: java.io.IOException: can not read class org.apache.parquet.format.PageHeader: don't know what type: 15 FengYu Cao
2022/09/01 Re: running pyspark on kubernetes - no space left on device Qian SUN
2022/09/01 Re: Spark 3.3.0/3.2.2: java.io.IOException: can not read class org.apache.parquet.format.PageHeader: don't know what type: 15 Chao Sun
2022/09/01 Re: running pyspark on kubernetes - no space left on device Matt Proetsch
2022/09/01 running pyspark on kubernetes - no space left on device Manoj GEORGE
2022/09/01 Re: Moving to Spark 3x from Spark2 Martin Andersson
2022/09/01 Re: Moving to Spark 3x from Spark2 Khalid Mammadov
2022/09/01 Spark 3.3.0/3.2.2: java.io.IOException: can not read class org.apache.parquet.format.PageHeader: don't know what type: 15 FengYu Cao
2022/09/01 Moving to Spark 3x from Spark2 rajat kumar
2022/08/29 Re: deciding Spark tasks & optimization resource Gibson
2022/08/29 Re: Profiling PySpark Pandas UDF Gourav Sengupta
2022/08/29 deciding Spark tasks & optimization resource rajat kumar
2022/08/29 RE: Profiling PySpark Pandas UDF Luca Canali
2022/08/26 Spark 3.3.0 with Structure Streaming from Kafka Issue on commons-pools2 Raymond Tang
2022/08/26 Spark SQL Predict Pushdown for Hive Bucketed Table Raymond Tang
2022/08/26 Structured Streaming - data not being read (offsets not getting committed ?) karan alang
2022/08/26 Re: Profiling PySpark Pandas UDF Abdeali Kothari
2022/08/26 Reply: Re: Spark got incorrect scala version while using spark 3.2.1 and spark 3.2.2 ckgppl_yan
2022/08/26 Re: Spark got incorrect scala version while using spark 3.2.1 and spark 3.2.2 pengyh
2022/08/26 Re: Spark got incorrect scala version while using spark 3.2.1 and spark 3.2.2 Sean Owen
2022/08/26 Spark got incorrect scala version while using spark 3.2.1 and spark 3.2.2 ckgppl_yan
2022/08/26 RE: Profiling PySpark Pandas UDF Luca Canali
2022/08/25 Re: Profiling PySpark Pandas UDF Abdeali Kothari
2022/08/25 Re: Profiling PySpark Pandas UDF Subash Prabanantham
2022/08/25 Re: Profiling PySpark Pandas UDF Russell Jurney
2022/08/25 Re: Profiling PySpark Pandas UDF Sean Owen
2022/08/25 Re: Profiling PySpark Pandas UDF Takuya UESHIN
2022/08/25 Re: Profiling PySpark Pandas UDF Russell Jurney
2022/08/25 Re: Profiling PySpark Pandas UDF Sean Owen
2022/08/25 Re: Profiling PySpark Pandas UDF Andrew Melo
2022/08/25 Re: Profiling PySpark Pandas UDF Gourav Sengupta
2022/08/25 Profiling PySpark Pandas UDF Subash Prabanantham
2022/08/24 Re: spark-3.2.2-bin-without-hadoop : NoClassDefFoundError: org/apache/log4j/spi/Filter when starting the master Sean Owen
2022/08/24 spark-3.2.2-bin-without-hadoop : NoClassDefFoundError: org/apache/log4j/spi/Filter when starting the master FLORANCE Grégory
2022/08/22 Question regarding checkpointing with kafka structured streaming Martin Andersson
2022/08/20 Re: Spark streaming Gourav Sengupta
2022/08/20 Re: [EXTERNAL] Re: Spark streaming sandra sukumaran
2022/08/19 [Spark SQL]: Does Spark preserve the order in a nested ORDER BY? Vinay Londhe
2022/08/19 Re: [EXTERNAL] Re: Spark streaming Saurabh Gulati
2022/08/19 Re: Spark streaming Ajit Kumar Amit
2022/08/19 Spark streaming sandra sukumaran
2022/08/18 Filtering by job group in the Spark UI / API Yeachan Park
2022/08/17 Re: Spark streaming ミユナ (alice)
2022/08/17 Re: Data ingestion Pasha Finkelshtein
2022/08/17 Spark streaming Prajith Vellukkai
2022/08/17 Re: Data ingestion pengyh
2022/08/17 Re: Data ingestion Yuri Oleynikov (‫יורי אולייניקוב‬‎)
2022/08/17 Re: Data ingestion Pasha Finkelshtein
2022/08/17 Data ingestion Akash Vellukai
2022/08/17 Re: [EXTERNAL] Re: Spark streaming - Data Ingestion Akash Vellukai
2022/08/17 Re: [EXTERNAL] Re: Spark streaming - Data Ingestion Gibson
2022/08/17 Re: [EXTERNAL] Re: Spark streaming - Data Ingestion Saurabh Gulati
2022/08/17 Re: Spark streaming - Data Ingestion Gibson
2022/08/17 Spark streaming - Data Ingestion Akash Vellukai
2022/08/15 Unsubscribe Mohd Shukri Hasan
2022/08/15 Unsubscribe Nadeem Lalani
2022/08/15 Re: Supported Hadoop versions for Spark 3.3 pengyh
2022/08/15 Supported Hadoop versions for Spark 3.3 Håkan Nordgren
2022/08/13 PySpark schema sanitization Shay Elbaz
2022/08/13 Re: Spark with GPU Gourav Sengupta
2022/08/13 Re: Spark with GPU Alessandro Bellina
2022/08/13 Re: Spark with GPU Sean Owen
2022/08/13 Re: Spark with GPU rajat kumar
2022/08/13 Re: Spark with GPU Sean Owen
2022/08/13 Spark with GPU rajat kumar
2022/08/12 Unsubscribe Pascal Taddei
2022/08/12 [no subject] GAURAV GUPTA
2022/08/12 unsubscribe Sivakumar Ganesan
2022/08/12 unsubscribe Alexey Milogradov
2022/08/11 pyspark not starting Kelum Perera
2022/08/11 Joins internally Sid
2022/08/10 Re: Unsubscribe pengyh
2022/08/10 Unsubscribe Shrikar archak
2022/08/10 Memory leak while caching in foreachBatch block kineret M
2022/08/09 High number of tasks when ran on a hybrid cluster murat migdisoglu
2022/08/09 Re: Spark program not receiving messages from Cloud Pubsub Pramod Biligiri
2022/08/08 Re: [Spark SQL] Omit Create Table Statement in Spark Sql pengyh
2022/08/08 [Spark SQL] Omit Create Table Statement in Spark Sql 阿强
2022/08/06 Spark program not receiving messages from Cloud Pubsub Pramod Biligiri
2022/08/03 Re: Salting technique doubt Sid
2022/08/02 Re: Spark Scala API still not updated for 2.13 or it's a mistake? pengyh
2022/08/02 Re: Spark Scala API still not updated for 2.13 or it's a mistake? Sean Owen
2022/08/02 Re: Spark Scala API still not updated for 2.13 or it's a mistake? Roman I
2022/08/02 Re: Spark Scala API still not updated for 2.13 or it's a mistake? Sean Owen
2022/08/02 Spark Scala API still not updated for 2.13 or it's a mistake? Roman I
2022/08/02 Re: log transfering into hadoop/spark Gourav Sengupta
2022/08/02 Re: log transfering into hadoop/spark ayan guha
2022/08/02 Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty Sean Owen
2022/08/02 Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty Stelios Philippou
2022/08/02 Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty Sean Owen
2022/08/02 Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty ayan guha
2022/08/01 log transfering into hadoop/spark pengyh
2022/08/01 Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty Kumba Janga
2022/08/01 Re: [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty Sean Owen
2022/08/01 [pyspark delta] [delta][Spark SQL]: Getting an Analysis Exception. The associated location (path) is not empty Kumba Janga
2022/08/01 Re: WARN: netlib.BLAS Sean Owen
2022/08/01 WARN: netlib.BLAS 陈刚
2022/08/01 Re: Use case idea pengyh
2022/08/01 Re: Use case idea Gourav Sengupta
2022/08/01 Re: unsubscribe pengyh
2022/08/01 unsubscribe Martin Soch
2022/07/31 Re: Use case idea pengyh
2022/07/31 Re: Use case idea Gourav Sengupta
2022/07/31 Re: Use case idea pengyh
2022/07/31 Use case idea Gioele Sal. Perri
2022/07/31 Re: Salting technique doubt Vinod KC
2022/07/31 Re: Salting technique doubt ayan guha
2022/07/31 Re: Salting technique doubt Jacob Lynn
2022/07/31 Re: Salting technique doubt Amit Joshi
2022/07/31 Re: Salting technique doubt Sid
2022/07/30 Re: Salting technique doubt Amit Joshi
2022/07/30 Salting technique doubt Sid
2022/07/29 Re: PySpark cores Gourav Sengupta
2022/07/29 [no subject] Milin Korath
2022/07/29 Re: PySpark cores Jacob Lynn
2022/07/28 PySpark cores Andrew Melo
2022/07/28 Unsubscribe Ashish
2022/07/28 Unsubscribe Karthik Jayaraman
2022/07/28 Re: spark can't connect to kafka via sasl_ssl wilson
2022/07/27 spark can't connect to kafka via sasl_ssl wilson
2022/07/27 Re: Spark Avro Java 17 Compatibility Sean Owen
2022/07/27 RE: Spark Avro Java 17 Compatibility Shivaraj Sivasankaran
2022/07/27 [Spark thread pool configurations]: I would like to configure all ThreadPoolExecutor parameters for each thread pool started in Spark Alex Peelman
2022/07/26 Re: [EXTERNAL] Partial data with ADLS Gen2 hwl17801341688
2022/07/25 Spark SQL Query filter behavior with special characters prashanth reddy
2022/07/24 Re: [EXTERNAL] Partial data with ADLS Gen2 Tufan Rakshit
2022/07/24 Re: [EXTERNAL] Partial data with ADLS Gen2 Shay Elbaz
2022/07/24 Partial data with ADLS Gen2 kineret M
2022/07/24 Re: external table with parquet files: problem querying in sparksql since data is stored as integer while hive schema expects a timestamp Gourav Sengupta
2022/07/22 Re: Updating Broadcast Variable in Spark Streaming 2.4.4 Sean Owen
2022/07/22 Updating Broadcast Variable in Spark Streaming 2.4.4 Dipl.-Inf. Rico Bergmann
2022/07/21 Re: Spark Structured Streaming -- Cannot consume next messages KhajaAsmath Mohammed
2022/07/21 Re: Pyspark and multiprocessing Khalid Mammadov
2022/07/21 Re: Spark Structured Streaming -- Cannot consume next messages Artemis User
2022/07/21 Re: Pyspark and multiprocessing Bjørn Jørgensen
2022/07/21 Spark Structured Streaming -- Cannot consume next messages KhajaAsmath Mohammed
2022/07/21 Re: Pyspark and multiprocessing Khalid Mammadov