Hello,
We rely on GraphX for an important component of our product. And we really want
it to stay a typed interface. Please keep GraphX.
Erik
From: Holden Karau
Date: Sunday, October 6, 2024 at 06:22
To: Ángel
Cc: Russell Jurney , Mich Talebzadeh
, Spark dev list , user
@spark
Subject: Re:
Hi,
We are facing this issue when we convert RDD -> Dataset followed by repartition
+ write. We are using spot instances on k8s which means they can die at any
moment. And when they do during this phase, we very often see data duplication
happening.
Pseudo job code:
val rdd = data.map(…)
val