Re: Vectorized group-by on strings is super slow in hive 0.13

2015-04-02 Thread Lefty Leverenz
Thanks for the tip, Gopal. I documented hive.limit.pushdown.memory.usage in the Configuration Properties wiki but had a couple of questions about the description (se

Re: Vectorized group-by on strings is super slow in hive 0.13

2015-03-29 Thread Gopal Vijayaraghavan
Hi,

> Been experimenting a little with vectorized execution in hive 0.13 and
> found that group-by is super slow on string columns. This simple query is
> 13x slower when vectorization is enabled (c_customer_id is string). Don't
> see this problem with int types.

I think the performance issue is due