Re: Anyone knows the problem I found in VectorizedLogicBench.IfExprLongColumnLongColumnBench?

2017-11-15 Thread Gopal Vijayaraghavan
> My guess is that the complex expression used in > VectorizedLogicBench.IfExprLongColumnLongColumnBench actually uses more CPU > than other expression. if you have a -XX:+PrintAssembly dump (or run jmh with -prof perfasm), then we could see if the JDK is autovectorizing that loop or not. Th

Re: Review Request 63711: HIVE-17528 Add more q-tests for Hive-on-Spark with Parquet vectorized reader

2017-11-15 Thread cheng xu
> On Nov. 16, 2017, 4:44 a.m., Vihang Karajgaonkar wrote: > > ql/src/test/results/clientpositive/llap/sysdb.q.out > > Lines 2236 (patched) > > > > > > not sure why this file is changing. Do you know? Actually this

[jira] [Created] (HIVE-18080) Performance degradation on VectorizedLogicBench#IfExprLongColumnLongColumnBench when AVX512 is enabled

2017-11-15 Thread liyunzhang (JIRA)
liyunzhang created HIVE-18080: - Summary: Performance degradation on VectorizedLogicBench#IfExprLongColumnLongColumnBench when AVX512 is enabled Key: HIVE-18080 URL: https://issues.apache.org/jira/browse/HIVE-18080

[jira] [Created] (HIVE-18079) Statistics: Allow HyperLogLog to be merged to the lowest-common-denominator bit-size

2017-11-15 Thread Gopal V (JIRA)
Gopal V created HIVE-18079: -- Summary: Statistics: Allow HyperLogLog to be merged to the lowest-common-denominator bit-size Key: HIVE-18079 URL: https://issues.apache.org/jira/browse/HIVE-18079 Project: Hive

Re: Review Request 63806: HIVE-16756 : Vectorization: LongColModuloLongColumn throws java.lang.ArithmeticException: / by zero

2017-11-15 Thread Vihang Karajgaonkar via Review Board
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/63806/ --- (Updated Nov. 16, 2017, 5:21 a.m.) Review request for hive, Aihua Xu and Matt M

RE: Anyone knows the problem I found in VectorizedLogicBench.IfExprLongColumnLongColumnBench?

2017-11-15 Thread Zhang, Liyun
Hi Gopal: Really thanks for your reply! You mean that if I limit only 1 cpu to run VectorizedLogicBench.IfExprLongColumnLongColumnBench, the variation will be small, is my understanding right? If yes, the variation became smaller than before after using taskset -cp 1 $pid. But I am confused all

Review Request 63864: HIVE-18072 WM - fix various bugs based on cluster testing - part 2

2017-11-15 Thread Sergey Shelukhin
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/63864/ --- Review request for hive and Prasanth_J. Repository: hive-git Description

[jira] [Created] (HIVE-18078) WM getSession needs some retry logic

2017-11-15 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HIVE-18078: --- Summary: WM getSession needs some retry logic Key: HIVE-18078 URL: https://issues.apache.org/jira/browse/HIVE-18078 Project: Hive Issue Type: Sub-task

[jira] [Created] (HIVE-18077) Vectorization: Add string conversion case for UDFToDouble

2017-11-15 Thread Matt McCline (JIRA)
Matt McCline created HIVE-18077: --- Summary: Vectorization: Add string conversion case for UDFToDouble Key: HIVE-18077 URL: https://issues.apache.org/jira/browse/HIVE-18077 Project: Hive Issue Ty

Review Request 63855: HIVE-17717

2017-11-15 Thread Jesús Camacho Rodríguez
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/63855/ --- Review request for hive and Ashutosh Chauhan. Bugs: HIVE-17717 https://issu

Review Request 63854: CachedStore: Have a whitelist/blacklist config to allow selective caching of tables/partitions and allow read while prewarming

2017-11-15 Thread Vaibhav Gumashta
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/63854/ --- Review request for hive, Daniel Dai, Sergey Shelukhin, and Thejas Nair. Bugs: H

[ANNOUNCE] Apache Hive 2.3.2 Released

2017-11-15 Thread Sahil Takiar
The Apache Hive team is proud to announce the release of Apache Hive version 2.3.2. The Apache Hive (TM) data warehouse software facilitates querying and managing large datasets residing in distributed storage. Built on top of Apache Hadoop (TM), it provides, among others: * Tools to enable easy

Re: Review Request 63806: HIVE-16756 : Vectorization: LongColModuloLongColumn throws java.lang.ArithmeticException: / by zero

2017-11-15 Thread Vihang Karajgaonkar via Review Board
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/63806/ --- (Updated Nov. 15, 2017, 10:50 p.m.) Review request for hive, Aihua Xu and Matt

[jira] [Created] (HIVE-18076) killquery doesn't actually work for non-trigger WM kills, or the error message is not propagated

2017-11-15 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HIVE-18076: --- Summary: killquery doesn't actually work for non-trigger WM kills, or the error message is not propagated Key: HIVE-18076 URL: https://issues.apache.org/jira/browse/HIVE-180

[jira] [Created] (HIVE-18075) verify commands on a cluster

2017-11-15 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HIVE-18075: --- Summary: verify commands on a cluster Key: HIVE-18075 URL: https://issues.apache.org/jira/browse/HIVE-18075 Project: Hive Issue Type: Sub-task

[jira] [Created] (HIVE-18074) do not show rejected tasks as killed in query UI

2017-11-15 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HIVE-18074: --- Summary: do not show rejected tasks as killed in query UI Key: HIVE-18074 URL: https://issues.apache.org/jira/browse/HIVE-18074 Project: Hive Issue Typ

[jira] [Created] (HIVE-18073) AM may assert when duck count for it is reduced

2017-11-15 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HIVE-18073: --- Summary: AM may assert when duck count for it is reduced Key: HIVE-18073 URL: https://issues.apache.org/jira/browse/HIVE-18073 Project: Hive Issue Type

[jira] [Created] (HIVE-18072) WM - fix various bugs based on cluster testing - part 2

2017-11-15 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HIVE-18072: --- Summary: WM - fix various bugs based on cluster testing - part 2 Key: HIVE-18072 URL: https://issues.apache.org/jira/browse/HIVE-18072 Project: Hive Is

[jira] [Created] (HIVE-18071) add HS2 jmx information about pools and current resource plan

2017-11-15 Thread Sergey Shelukhin (JIRA)
Sergey Shelukhin created HIVE-18071: --- Summary: add HS2 jmx information about pools and current resource plan Key: HIVE-18071 URL: https://issues.apache.org/jira/browse/HIVE-18071 Project: Hive

[jira] [Created] (HIVE-18070) Merge partitions NDV estimators in batches

2017-11-15 Thread Jesus Camacho Rodriguez (JIRA)
Jesus Camacho Rodriguez created HIVE-18070: -- Summary: Merge partitions NDV estimators in batches Key: HIVE-18070 URL: https://issues.apache.org/jira/browse/HIVE-18070 Project: Hive I

Re: Anyone knows the problem I found in VectorizedLogicBench.IfExprLongColumnLongColumnBench?

2017-11-15 Thread Gopal Vijayaraghavan
Hi, > You see that there is a great float for > IfExprLongColumnLongColumnBench.bench, the float is 583775 and the average > value is 1621602. In my tests, the single core tests tended to have huge variations on Intel with Turbo boost. CPU operations which are fast when stressing CPU in s

RE: Anyone knows the problem I found in VectorizedLogicBench.IfExprLongColumnLongColumnBench?

2017-11-15 Thread Zhang, Liyun
Hi all: Now I am using hive micro bench(HIVE-10189) to test the performance improvement of AVX2 and AVX512. When I test the VectorizedLogicBench.IfExprLongColumnLongColumnBench

Anyone knows the problem I found in VectorizedLogicBench.IfExprLongColumnLongColumnBench?

2017-11-15 Thread Zhang, Liyun
Hi all: Now I am using hive micro bench(HIVE-10189) to test the performance improvement of AVX2 and AVX512. When I test the VectorizedLogicBench.IfExprLongColumnLongColumnBench

[jira] [Created] (HIVE-18069) MetaStoreDirectSql to get tables has misplaced comma

2017-11-15 Thread Jesus Camacho Rodriguez (JIRA)
Jesus Camacho Rodriguez created HIVE-18069: -- Summary: MetaStoreDirectSql to get tables has misplaced comma Key: HIVE-18069 URL: https://issues.apache.org/jira/browse/HIVE-18069 Project: Hive

Re: Review Request 63711: HIVE-17528 Add more q-tests for Hive-on-Spark with Parquet vectorized reader

2017-11-15 Thread Vihang Karajgaonkar via Review Board
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/63711/#review191092 --- Thanks for the patch Ferdinand. Just one comment below, rest of th

[jira] [Created] (HIVE-18068) Replace LocalInterval by Interval in Druid storage handler

2017-11-15 Thread Jesus Camacho Rodriguez (JIRA)
Jesus Camacho Rodriguez created HIVE-18068: -- Summary: Replace LocalInterval by Interval in Druid storage handler Key: HIVE-18068 URL: https://issues.apache.org/jira/browse/HIVE-18068 Project:

Re: Review Request 63845: HIVE-15018

2017-11-15 Thread Jesús Camacho Rodríguez
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/63845/ --- (Updated Nov. 15, 2017, 5:52 p.m.) Review request for hive and Ashutosh Chauhan

Review Request 63845: HIVE-15018

2017-11-15 Thread Jesús Camacho Rodríguez
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/63845/ --- Review request for hive and Ashutosh Chauhan. Bugs: HIVE-15018 https://issu

Like udf function optimizaton

2017-11-15 Thread 万昆
I want to optimize the performance of the like function in a particular scenario. Could someone help me to review the code? The lira : https://issues.apache.org/jira/browse/HIVE-18055 Thanks