Re: [DISCUSS][SPARK-25299] SPIP: Shuffle storage API

2019-06-14 Thread Matt Cheah
@spark.apache.org" , "fel...@uber.com" , "f...@linkedin.com" , "tgraves...@gmail.com" , "yez...@linkedin.com" , Yue Li Subject: Re: [DISCUSS][SPARK-25299] SPIP: Shuffle storage API I think maybe we could start a vote on this SPIP. This has been

Re: [DISCUSS][SPARK-25299] SPIP: Shuffle storage API

2019-06-12 Thread Saisai Shao
Muralidharan >> *Cc: *Bo Yang , Ilan Filonenko , Imran >> Rashid , Justin Uang , Liang >> Tang , Marcelo Vanzin , Matei >> Zaharia , Matt Cheah , Min >> Shen , Reynold Xin , Ryan Blue < >> rb...@netflix.com>, Vinoo Ganesh , Will Manning < >> wmann...@p

Re: [DISCUSS][SPARK-25299] SPIP: Shuffle storage API

2019-06-10 Thread Imran Rashid
tt Cheah , Min > Shen , Reynold Xin , Ryan Blue < > rb...@netflix.com>, Vinoo Ganesh , Will Manning < > wmann...@palantir.com>, "b...@fb.com" , "dev@spark.apache.org" > , "fel...@uber.com" , " > f...@linkedin.com" , "tgraves...@g

Re: [DISCUSS][SPARK-25299] SPIP: Shuffle storage API

2019-06-10 Thread Saisai Shao
; > wmann...@palantir.com>, "b...@fb.com" , "dev@spark.apache.org" > , "fel...@uber.com" , " > f...@linkedin.com" , "tgraves...@gmail.com" < > tgraves...@gmail.com>, "yez...@linkedin.com" , " > yue...@memve

Re: [DISCUSS][SPARK-25299] SPIP: Shuffle storage API

2019-06-05 Thread Matt Cheah
ei Zaharia , Matt Cheah , Min Shen , Reynold Xin , Ryan Blue , Vinoo Ganesh , Will Manning , "b...@fb.com" , "dev@spark.apache.org" , "fel...@uber.com" , "f...@linkedin.com" , "tgraves...@gmail.com" , "yez...@linkedin.com" , "yue...@m

Re: [DISCUSS][SPARK-25299] SPIP: Shuffle storage API

2019-05-13 Thread Yifei Huang (PD)
Ryan Blue , Vinoo Ganesh , Will Manning , "b...@fb.com" , "dev@spark.apache.org" , "fel...@uber.com" , "f...@linkedin.com" , "tgraves...@gmail.com" , "yez...@linkedin.com" , "yue...@memverge.com" Subject: Re: [DISCUSS][SPAR

Re: [DISCUSS][SPARK-25299] SPIP: Shuffle storage API

2019-05-08 Thread Mridul Muralidharan
Unfortunately I do not have bandwidth to do a detailed review, but a few things come to mind after a quick read: - While it might be tactically beneficial to align with existing implementation, a clean design which does not tie into existing shuffle implementation would be preferable (if it can be

[DISCUSS][SPARK-25299] SPIP: Shuffle storage API

2019-05-08 Thread Yifei Huang (PD)
Hi everyone, For the past several months, we have been working on an API for pluggable storage of shuffle data. In this SPIP, we describe the proposed API, its implications, and how it fits into other work being done in the Spark shuffle space. If you're interested in Spark shuffle, and especia