How to calculate the spark.kryoserializer.buffer.max?

2023-03-26 Thread Arthur Li
during the runtime? Appreciate your any suggestions. BR. Arthur Li - To unsubscribe e-mail: user-unsubscr...@spark.apache.org

How to estimate the executor memory size according by the data

2021-12-23 Thread Arthur Li
memory size by the data size. Any options I appreciated. Arthur Li - To unsubscribe e-mail: user-unsubscr...@spark.apache.org

Question about relationship between number of files and initial tasks(partitions)

2019-04-03 Thread Arthur Li
Hi Sparkers, I noticed that in my spark application, the number of tasks in the first stage is equal to the number of files read by the application(at least for Avro) if the number of cpu cores is less than the number of files. Though If cpu cores are more than number of files, it's usually equal