Messages by Date
-
2023/07/08
Unable to populate spark metrics using custom metrics API
Surya Soma
-
2023/07/08
Unsubscribe
yixu2...@163.com
-
2023/07/07
Re: PySpark error java.lang.IllegalArgumentException
Brian Huynh
-
2023/07/07
Re: PySpark error java.lang.IllegalArgumentException
Khalid Mammadov
-
2023/07/07
Re: Unsubscribe
Atheeth SH
-
2023/07/06
Unsubscribe
Mihai Musat
-
2023/07/06
Spark UI - Bug Executors tab when using proxy port
Bruno Pistone
-
2023/07/04
Re: PySpark error java.lang.IllegalArgumentException
elango vaidyanathan
-
2023/07/04
Performance Issue with Column Addition in Spark 3.4.x: Time Doubling with Increased Columns
KO Dukhyun
-
2023/07/04
Re: Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
-
2023/07/04
Re: Filtering JSON records when there isn't an exact schema match in Spark
Hill Liu
-
2023/07/04
Re: Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
-
2023/07/03
Re: Filtering JSON records when there isn't an exact schema match in Spark
Vikas Kumar
-
2023/07/03
Re: Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Gavin Ray
-
2023/07/03
Re: Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Hyukjin Kwon
-
2023/07/03
Re: Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Farshid Ashouri
-
2023/07/03
Introducing English SDK for Apache Spark - Seeking Your Feedback and Contributions
Gengliang Wang
-
2023/07/03
Re: Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
-
2023/07/03
Re: [Spark SQL] Data objects from query history
Jack Wells
-
2023/07/03
Filtering JSON records when there isn't an exact schema match in Spark
Shashank Rao
-
2023/07/03
CFP for the 2nd Performance Engineering track at Community over Code NA 2023
Brebner, Paul
-
2023/07/03
PySpark error java.lang.IllegalArgumentException
elango vaidyanathan
-
2023/06/30
[Spark SQL] Data objects from query history
Ruben Mennes
-
2023/06/29
checkpoint file deletion
Lingzhe Sun
-
2023/06/29
Unsubscribe
lee
-
2023/06/28
Unsubscribe
Ghazi Naceur
-
2023/06/28
Re: subscribe
mojianan2015
-
2023/06/28
subscribe
mojianan2015
-
2023/06/27
[PySpark] Intermittent Spark session initialization error on M1 Mac
BeoumSuk Kim
-
2023/06/26
[k8s] Fail to expose custom port on executor container specified in my executor pod template
James Yu
-
2023/06/26
[Spark-SQL] Dataframe write saveAsTable failed
Anil Dasari
-
2023/06/26
Re: Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Mich Talebzadeh
-
2023/06/26
Re: Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Patrick Tucci
-
2023/06/26
Unable to populate spark metrics using custom metrics API
Surya Soma
-
2023/06/26
Re: Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Mich Talebzadeh
-
2023/06/26
Unsubscribe
Ghazi Naceur
-
2023/06/26
Spark-Sql - Slow Performance With CTAS and Large Gzipped File
Patrick Tucci
-
2023/06/26
Re: [Spark streaming]: Microbatch id in logs
Mich Talebzadeh
-
2023/06/25
[Spark streaming]: Microbatch id in logs
Anil Dasari
-
2023/06/24
Re: [ANNOUNCE] Apache Spark 3.4.1 released
yangjie01
-
2023/06/24
Re: [ANNOUNCE] Apache Spark 3.4.1 released
beliefer
-
2023/06/24
Apache Spark with watermark - processing data different LogTypes in same kafka topic
karan alang
-
2023/06/23
Re: [ANNOUNCE] Apache Spark 3.4.1 released
L. C. Hsieh
-
2023/06/23
Re: [ANNOUNCE] Apache Spark 3.4.1 released
Hyukjin Kwon
-
2023/06/23
Re: [ANNOUNCE] Apache Spark 3.4.1 released
Mridul Muralidharan
-
2023/06/23
[ANNOUNCE] Apache Spark 3.4.1 released
Dongjoon Hyun
-
2023/06/21
Re: Rename columns without manually setting them all
Bjørn Jørgensen
-
2023/06/21
Re: Rename columns without manually setting them all
Farshid Ashouri
-
2023/06/21
Rename columns without manually setting them all
John Paul Jayme
-
2023/06/20
Re: How to read excel file in PySpark
Mich Talebzadeh
-
2023/06/20
Unsubscribe
Bhargava Sukkala
-
2023/06/20
Re: How to read excel file in PySpark
Bjørn Jørgensen
-
2023/06/20
Re: How to read excel file in PySpark
Mich Talebzadeh
-
2023/06/20
Re: Shuffle data on pods which get decomissioned
Mich Talebzadeh
-
2023/06/20
Re: How to read excel file in PySpark
Sean Owen
-
2023/06/20
Re: How to read excel file in PySpark
Mich Talebzadeh
-
2023/06/20
Re: How to read excel file in PySpark
Bjørn Jørgensen
-
2023/06/20
Shuffle data on pods which get decomissioned
Nikhil Goyal
-
2023/06/20
Re: How to read excel file in PySpark
Mich Talebzadeh
-
2023/06/20
Re: How to read excel file in PySpark
Bjørn Jørgensen
-
2023/06/20
Re: How to read excel file in PySpark
Sean Owen
-
2023/06/20
How to read excel file in PySpark
John Paul Jayme
-
2023/06/18
Re: implement a distribution without shuffle like RDD.coalesce for DataSource V2 write
Mich Talebzadeh
-
2023/06/18
Re: implement a distribution without shuffle like RDD.coalesce for DataSource V2 write
Mich Talebzadeh
-
2023/06/18
implement a distribution without shuffle like RDD.coalesce for DataSource V2 write
Pengfei Li
-
2023/06/16
TAC Applications for Community Over Code North America and Asia now open
Gavin McDonald
-
2023/06/15
Fwd: iceberg queries
Gaurav Agarwal
-
2023/06/15
Re: Spark using iceberg
Gaurav Agarwal
-
2023/06/15
Spark using iceberg
Gaurav Agarwal
-
2023/06/11
Unsubscribe
Yu voidy
-
2023/06/09
Re: [Feature Request] create *permanent* Spark View from DataFrame via PySpark
Wenchen Fan
-
2023/06/09
Announcing the Community Over Code 2023 Streaming Track
James Hughes
-
2023/06/08
Re: Apache Spark not reading UTC timestamp from MongoDB correctly
Enrico Minack
-
2023/06/08
Re: Apache Spark not reading UTC timestamp from MongoDB correctly
Sean Owen
-
2023/06/08
Apache Spark not reading UTC timestamp from MongoDB correctly
karan alang
-
2023/06/06
Getting SparkRuntimeException: Unexpected value for length in function slice: length must be greater than or equal to 0
Bariudin, Daniel
-
2023/06/04
Re: [Feature Request] create *permanent* Spark View from DataFrame via PySpark
Mich Talebzadeh
-
2023/06/04
Re: [Feature Request] create *permanent* Spark View from DataFrame via PySpark
keen
-
2023/06/01
[Feature Request] create *permanent* Spark View from DataFrame via PySpark
keen
-
2023/06/01
Re: ChatGPT and prediction of Spark future
Mich Talebzadeh
-
2023/05/31
Comparison of Trino, Spark, and Hive-MR3
Sungwoo Park
-
2023/05/31
Re: Viewing UI for spark jobs running on K8s
Qian Sun
-
2023/05/31
Re: ChatGPT and prediction of Spark future
Winston Lai
-
2023/05/31
Viewing UI for spark jobs running on K8s
Nikhil Goyal
-
2023/05/31
ChatGPT and prediction of Spark future
Mich Talebzadeh
-
2023/05/31
Structured streaming append mode picture question
Hill Liu
-
2023/05/29
Re: [Spark Structured Streaming]: Dynamic Scaling of Executors
Aishwarya Panicker
-
2023/05/29
Re: Re: maven with Spark 3.4.0 fails compilation
Bjørn Jørgensen
-
2023/05/29
Re: Re: maven with Spark 3.4.0 fails compilation
Mich Talebzadeh
-
2023/05/29
Re: Re: maven with Spark 3.4.0 fails compilation
Bjørn Jørgensen
-
2023/05/29
Re: JDK version support information
Sean Owen
-
2023/05/29
Re: JDK version support information
Poorna Murali
-
2023/05/29
Re: JDK version support information
Aironman DirtDiver
-
2023/05/29
JDK version support information
Poorna Murali
-
2023/05/29
Re: Re: maven with Spark 3.4.0 fails compilation
Lingzhe Sun
-
2023/05/29
Re: maven with Spark 3.4.0 fails compilation
Mich Talebzadeh
-
2023/05/28
Re: maven with Spark 3.4.0 fails compilation
Bjørn Jørgensen
-
2023/05/25
Re: [Spark Structured Streaming]: Dynamic Scaling of Executors
Mich Talebzadeh
-
2023/05/25
[Spark Structured Streaming]: Dynamic Scaling of Executors
Aishwarya Panicker
-
2023/05/25
Re: [MLlib] how-to find implementation of Decision Tree Regressor fit function
Sean Owen
-
2023/05/24
Re: Incremental Value dependents on another column of Data frame Spark
Enrico Minack
-
2023/05/23
Dynamic value as the offset of lag() function
Nipuna Shantha
-
2023/05/23
Re: Incremental Value dependents on another column of Data frame Spark
Raghavendra Ganesh
-
2023/05/23
Incremental Value dependents on another column of Data frame Spark
Nipuna Shantha
-
2023/05/23
Re: Shuffle with Window().partitionBy(<column>)
ashok34...@yahoo.com.INVALID
-
2023/05/23
Re: Shuffle with Window().partitionBy(<column>)
Rauf Khan
-
2023/05/22
cannot load model using pyspark
second_co...@yahoo.com.INVALID
-
2023/05/22
Data Stream Processing applications testing
Alexandre Strapacao Guedes Vianna
-
2023/05/22
Re: Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
Mich Talebzadeh
-
2023/05/22
Re: Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
Mich Talebzadeh
-
2023/05/22
RE: Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
Maksym M
-
2023/05/17
Re: Spark shuffle and inevitability of writing to Disk
Mich Talebzadeh
-
2023/05/17
Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
vaquar khan
-
2023/05/16
RE: Understanding Spark S3 Read Performance
info
-
2023/05/16
Understanding Spark S3 Read Performance
Shashank Rao
-
2023/05/16
Spark shuffle and inevitability of writing to Disk
Mich Talebzadeh
-
2023/05/15
Re: [spark-core] Can executors recover/reuse shuffle files upon failure?
Mich Talebzadeh
-
2023/05/15
[spark-core] Can executors recover/reuse shuffle files upon failure?
Faiz Halde
-
2023/05/14
Pyspark cluster mode on standalone deployment
خالد القحطاني
-
2023/05/12
Re: Error while merge in delta table
Farhan Misarwala
-
2023/05/12
Shuffle with Window().partitionBy(<column>)
ashok34...@yahoo.com.INVALID
-
2023/05/12
Re: Error while merge in delta table
Karthick Nk
-
2023/05/11
Does spark read the same file twice, if two stages are using the same DataFrame?
Vijay B
-
2023/05/11
Re: Error while merge in delta table
Farhan Misarwala
-
2023/05/11
Re: Error while merge in delta table
Jacek Laskowski
-
2023/05/10
Error while merge in delta table
Karthick Nk
-
2023/05/10
RE: Can Spark SQL (not DataFrame or Dataset) aggregate array into map of element of count?
Vijay B
-
2023/05/09
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Mich Talebzadeh
-
2023/05/09
Re: Can Spark SQL (not DataFrame or Dataset) aggregate array into map of element of count?
Yong Zhang
-
2023/05/09
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Nitin Siwach
-
2023/05/09
Re: Can Spark SQL (not DataFrame or Dataset) aggregate array into map of element of count?
Yong Zhang
-
2023/05/09
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Mich Talebzadeh
-
2023/05/09
unsubscribe
Balakumar iyer S
-
2023/05/07
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Nitin Siwach
-
2023/05/07
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Mich Talebzadeh
-
2023/05/07
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Nitin Siwach
-
2023/05/07
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Mich Talebzadeh
-
2023/05/07
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Nitin Siwach
-
2023/05/07
unsubscribe
Utkarsh Jain
-
2023/05/06
Re: Does spark read the same file twice, if two stages are using the same DataFrame?
Winston Lai
-
2023/05/06
Re: Can Spark SQL (not DataFrame or Dataset) aggregate array into map of element of count?
Mich Talebzadeh
-
2023/05/06
Re: Write DataFrame with Partition and choose Filename in PySpark
Mich Talebzadeh
-
2023/05/05
Can Spark SQL (not DataFrame or Dataset) aggregate array into map of element of count?
Yong Zhang
-
2023/05/05
Re: Write DataFrame with Partition and choose Filename in PySpark
Marco Costantini
-
2023/05/04
Re: Write DataFrame with Partition and choose Filename in PySpark
Mich Talebzadeh
-
2023/05/04
Re: Write DataFrame with Partition and choose Filename in PySpark
Marco Costantini
-
2023/05/04
Re: Write DataFrame with Partition and choose Filename in PySpark
Mich Talebzadeh
-
2023/05/04
Write DataFrame with Partition and choose Filename in PySpark
Marco Costantini
-
2023/05/04
Re: Write custom JSON from DataFrame in PySpark
Marco Costantini
-
2023/05/04
Re: Write custom JSON from DataFrame in PySpark
Enrico Minack
-
2023/05/03
Write custom JSON from DataFrame in PySpark
Marco Costantini
-
2023/05/03
How to create spark udf use functioncatalog?
tzxxh
-
2023/05/03
unsubscribe
Kang
-
2023/05/02
Re: How to determine the function of tasks on each stage in an Apache Spark application?
Trường Trần Phan An
-
2023/05/02
CVE-2023-32007: Apache Spark: Shell command injection via Spark UI
Arnout Engelen
-
2023/05/01
Unsubscribe
rau-jannik
-
2023/05/01
Unsubscribe
peng
-
2023/05/01
Unsubscribe
sandeep vura
-
2023/05/01
Re: Change column values using several when conditions
Bjørn Jørgensen
-
2023/05/01
Change column values using several when conditions
marc nicole
-
2023/04/30
How to change column values using several when conditions ?
marc nicole
-
2023/04/30
Any experience with K8s Remote Shuffling Service at scale?
Andrey Gourine
-
2023/04/30
Re: Tensorflow on Spark CPU
Sean Owen
-
2023/04/30
How to read text files with GBK encoding in the spark core
lianyou1...@126.com
-
2023/04/29
Re: Tensorflow on Spark CPU
second_co...@yahoo.com.INVALID
-
2023/04/29
Re: Tensorflow on Spark CPU
Sean Owen
-
2023/04/29
Tensorflow on Spark CPU
second_co...@yahoo.com.INVALID
-
2023/04/28
driver and executors shared same Kubernetes PVC
second_co...@yahoo.com.INVALID
-
2023/04/28
Re: ***pyspark.sql.functions.monotonically_increasing_id()***
Winston Lai
-
2023/04/28
***pyspark.sql.functions.monotonically_increasing_id()***
Karthick Nk
-
2023/04/27
Re: config: minOffsetsPerTrigger not working
Abhishek Singla
-
2023/04/27
Re: config: minOffsetsPerTrigger not working
Mich Talebzadeh
-
2023/04/27
config: minOffsetsPerTrigger not working
Abhishek Singla
-
2023/04/27
Re: What is the best way to organize a join within a foreach?
Amit Joshi
-
2023/04/26
RE: Spark Kubernetes Operator
Aldo Culquicondor
-
2023/04/26
Re: What is the best way to organize a join within a foreach?
Mich Talebzadeh
-
2023/04/26
Re: What is the best way to organize a join within a foreach?
Marco Costantini
-
2023/04/26
Re: What is the best way to organize a join within a foreach?
Mich Talebzadeh
-
2023/04/26
Re: What is the best way to organize a join within a foreach?
ayan guha
-
2023/04/26
Re: What is the best way to organize a join within a foreach?
Mich Talebzadeh
-
2023/04/25
Re: What is the best way to organize a join within a foreach?
Marco Costantini
-
2023/04/25
Re: unsubscribe
santhosh Gandhe
-
2023/04/25
Re: What is the best way to organize a join within a foreach?
Mich Talebzadeh
-
2023/04/25
Re: What is the best way to organize a join within a foreach?
Marco Costantini
-
2023/04/25
Re: What is the best way to organize a join within a foreach?
Mich Talebzadeh
-
2023/04/25
Re: What is the best way to organize a join within a foreach?
Marco Costantini
-
2023/04/25
Re: What is the best way to organize a join within a foreach?
Mich Talebzadeh
-
2023/04/24
unsubscribe
yxj1141
-
2023/04/24
What is the best way to organize a join within a foreach?
Marco Costantini
-
2023/04/24
What is the best way to organize a join within a foreach?
Marco Costantini
-
2023/04/24
Unsubcribing
phiroc
-
2023/04/24
Reg: create spark using virtual machine through chef
sunkara akhil sai teja
-
2023/04/24
Re: Use Spark Aggregator in PySpark
Enrico Minack
-
2023/04/23
Use Spark Aggregator in PySpark
Thomas Wang
-
2023/04/23
Re: Spark Aggregator with ARRAY<BOOLEAN> input and ARRAY<LONG> output
Thomas Wang
-
2023/04/23
Re: Spark Aggregator with ARRAY<BOOLEAN> input and ARRAY<LONG> output
Raghavendra Ganesh
-
2023/04/23
Spark Aggregator with ARRAY<BOOLEAN> input and ARRAY<LONG> output
Thomas Wang
-
2023/04/23
State of GraphX and GraphFrames
g
-
2023/04/20
Dependency injection for spark executors
Deepak Patankar
-
2023/04/20
Re: Partition by on dataframe causing a Sort
Nikhil Goyal