Hi Spark developers,
I want to hint Spark to use a particular list of hosts to execute tasks on. I
see that getBlockLocations is used to get the list of hosts from HDFS.
https://github.com/apache/spark/blob/7955b3962ac46b89564e0613db7bea98a1478bf2/sql/core/src/main/scala/org/apache/spark/sql/ex
Sending on behalf of a colleague whose mail isn’t reaching the dev list for
some reason 😊
===
Hi Spark developers,
If I want to hint spark to use p
+1 from me on taking a look at, and participating in, the continuous shuffle
work while push-based shuffle is being added. To be honest, I feel it might be
hard to get a firm commitment from people on this, as it depends on the progress
of another SPIP, and the timeline for discussion and work could be several
months out.
Tha