Re: [VOTE][SPARK-27396] SPIP: Public APIs for extended Columnar Processing Support

2019-05-27 Thread Felix Cheung
+1 I’d prefer to see more of the end goal and how that could be achieved (such as ETL or SPARK-24579). However given the rounds and months of discussions we have come down to just the public API. If the community thinks a new set of public API is maintainable, I don’t see any problem with that

Re: dynamic allocation manager in SS

2019-05-27 Thread Stavros Kontopoulos
Sure im not talking about k8s here. The discussion is about the heuristics and their drawbacks. Στις Δευ, 27 Μαΐ 2019, 2:04 μ.μ. ο χρήστης Gabor Somogyi < gabor.g.somo...@gmail.com> έγραψε: > K8s is a different story, please take a look at the doc "Future Work" part. > > On Fri, May 24, 2019 at 9

Re: Custom datasource: when acquire and release a lock?

2019-05-27 Thread Abhishek Somani
Hey Jörn, Thanks a lot for replying. My Data Source extends BaseRelation and PrunedFilteredScan. The buildScan() returns my custom RDD. I want to take a lock before any executors start reading the data, and release it after all executors are done, and so I tried to acquire lock in MyRDD.getPartit

Re: dynamic allocation manager in SS

2019-05-27 Thread Gabor Somogyi
K8s is a different story, please take a look at the doc "Future Work" part. On Fri, May 24, 2019 at 9:40 PM Stavros Kontopoulos < stavros.kontopou...@lightbend.com> wrote: > Btw the heuristics for batch mode ( > https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/Exec