Re: [DISCUSS] Deprecate GraphX OR Find new maintainers interested in GraphX OR leave it as is?

2024-10-07 Thread Erik Eklund
Hello, We rely on GraphX for an important component of our product. And we really want it to stay a typed interface. Please keep GraphX. Erik From: Holden Karau Date: Sunday, October 6, 2024 at 06:22 To: Ángel Cc: Russell Jurney , Mich Talebzadeh , Spark dev list , user @spark Subject: Re:

[Spark Core] Spark data loss/data duplication when executors die

2023-02-13 Thread Erik Eklund
Hi, We are facing this issue when we convert RDD -> Dataset followed by repartition + write. We are using spot instances on k8s which means they can die at any moment. And when they do during this phase, we very often see data duplication happening. Pseudo job code: val rdd = data.map(…) val