Re: [VOTE] Apache Spark 2.1.1 (RC3)

2017-04-24 Thread Eric Liang
-1 (non-binding) I also agree with using NEVER_INFER for 2.1.1. The migration cost is unexpected for a point release. On Mon, Apr 24, 2017 at 11:08 AM Holden Karau wrote: > Whoops, sorry finger slipped on that last message. > It sounds like whatever we do is going to break some existing users >

Re: Scaling partitioned Hive table support

2016-08-08 Thread Eric Liang
I like the former approach -- it seems more generally applicable to other catalogs and IIUC would let you defer pruning until execution time. Pruning is work that should be done by the catalog anyways, as is the case when querying over an (unconverted) hive table. You might also want to look at ht

Re: Spark performance regression test suite

2016-07-08 Thread Eric Liang
Something like speed.pypy.org or the Chrome performance dashboards would be very useful. On Fri, Jul 8, 2016 at 9:50 AM Holden Karau wrote: > There are also the spark-pe