hive wiki edit permission

2016-06-01 Thread Zoltan Haindrich
Hi, I would like to update the documentation and add the new alternate way to use IDEs with Hive (HIVE-13490). My wiki login is: kirk. Regards, Zoltan

Re: Hive generating different DAGs from the same query

2018-07-13 Thread Zoltan Haindrich
Hello Sungwoo! I think it's possible that reoptimization is kicking in, because the first execution has bumped into an exception. I think the plans should not be changing permanently, unless "hive.query.reexecution.stats.persist.scope" is set to a wider scope than query. To check that indeed
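A minimal sketch of how the setting named in the message could be inspected from a Hive session. It assumes a Hive version that supports query re-execution; only the property name and the value "query" come from the message.

```sql
-- Print the current value of the property named in the message.
SET hive.query.reexecution.stats.persist.scope;

-- Assumption: limit persisted runtime statistics to the current query only,
-- so a failed first execution cannot influence the plans of later queries.
SET hive.query.reexecution.stats.persist.scope=query;
```

Running EXPLAIN on the query before and after changing the scope is one way to see whether the plan still differs between executions.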

Re: Hook is not getting called on running query from beeline

2018-08-22 Thread Zoltan Haindrich
Hello, I was wondering whether the property is not getting picked up from beeline, but it looks ok to me: $ beeline -u 'jdbc:hive2://...' --hiveconf hive.exec.post.hooks=asd -e 'select 1' Connected to: Apache Hive (version 1.2.1000.2.6.1.0-129) Driver: Hive JDBC (version 1.2.1000.2.6.1.0-129

Re: Hook is not getting called on running query from beeline

2018-08-23 Thread Zoltan Haindrich
hetan -Original Message- From: Zoltan Haindrich [mailto:k...@rxd.hu] Sent: Wednesday, August 22, 2018 8:51 PM To: user@hive.apache.org; Chetan Kothari Subject: Re: Hook is not getting called on running query from beeline Hello I was wondering about that the property is not getting picked u

Re: issue for the "select count(*) from table"

2018-09-05 Thread Zoltan Haindrich
Hello, I think you might have loaded data by using an external tool into the table location; you should run: analyze table table1 compute statistics ; or analyze table table1 compute statistics for columns; And/or disable hive.optimize.metadataonly - but having bad statistics is not good at al
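The advice above can be sketched as a short HiveQL session. The table name table1 and the property name are taken from the message; the final query is only an illustrative check.

```sql
-- Recompute basic table statistics (row count, size) after data was
-- written into the table location by an external tool.
ANALYZE TABLE table1 COMPUTE STATISTICS;

-- Optionally recompute column-level statistics as well.
ANALYZE TABLE table1 COMPUTE STATISTICS FOR COLUMNS;

-- Alternative named in the message: disable the metadata-only optimization.
-- Stale statistics remain a problem for the optimizer, so this is a workaround.
SET hive.optimize.metadataonly=false;

-- Illustrative check: the count should now reflect the files in the table location.
SELECT COUNT(*) FROM table1;
```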

Re: How to implement partitioned external table.

2019-04-11 Thread Zoltan Haindrich
On 4/4/19 10:22 AM, PengHui Li wrote: Hi guys, I am integrating hive and pulsar(http://pulsar.apache.org ) by HiveStorageHandler and HiveMetaHook, I want to add a feature can divide the data into several parts(pulsar topics) when use hive `PARTITIONED BY`. But  don

Re: Regarding hive.exec.parallel

2019-05-08 Thread Zoltan Haindrich
Hey Varun! I'm not sure about your actual query, but hive.exec.parallel enables executing stages in parallel. The full tez dag is usually "one stage" of the execution (but you should take a look at the explain). If you would be using mr engine there might have been some speedup; but in case of te

Re: If Hive Metastore is compatibility with MariaDB version 10.x.?

2020-01-20 Thread Zoltan Haindrich
Hello, Locally I use MariaDB 10.4.8 when I validate metastore schema/etc changes. So far, I've not uncovered any issues with it... I'm planning to integrate some kind of smoke tests against all the supported DBs to help uncover metastore related issues earlier. To evaluate that we have everyth

Write access to the Hive wiki

2020-01-22 Thread Zoltan Haindrich
Hello, I was using a different login for the wiki, which seems to have stopped working; right now my ldap login (kgyrtkirk) logs me in, but I'm not allowed to edit the wiki pages. Could someone please help me get edit permissions? Thank you, Zoltan

Re: Is there any way to find Hive query to Datanucleus queries mapping

2020-02-10 Thread Zoltan Haindrich
Hey Chinna! I don't think a mapping like that is easy to get...I would rather try to narrow down to a single call which consumes most of the time. There is a log message which can help you get to the most relevant metastore call: https://github.com/apache/hive/blob/0d9deba3c15038df4c64ea9b8494d

Re: Issues with aggregating on map values

2020-02-12 Thread Zoltan Haindrich
Hey Nakul! It's not clear which version you are using; I've checked this issue on apache/master and the 3.1.2 release - and both of them returned accurate results. You could execute 'select version()', or run 'hive --version' on the command line. cheers, Zoltan On 2/11/20 11:38 AM, Nakul Khanna

Re: Measuring the execution time of Hive queries through Ambari

2020-06-30 Thread Zoltan Haindrich
Hey Mich! I don't know which version you use (HDP-3+?) - but you might want to see if "Data Analytics Studio" is available for that version; it could give similar insights as TezUI had. cheers, Zoltan On 6/22/20 2:23 PM, Mich Talebzadeh wrote: Hi  Julien. It is as I see is standard Ambari.

Re: Help needed for count(*) query error on Hive 3.1.2 & Tez 0.9.2

2020-07-01 Thread Zoltan Haindrich
Hey Eugene! I don't see any hints in these outputs what could be the issue...have you checked the tez container logs? cheers, Zoltan On 7/1/20 9:58 AM, Eugene Chung wrote: Hi, I want to know how to investigate the count(*) query error on Hive 3.1.2 & Tez 0.9.2, which is 'being failed for t

Re: Execption In SELECT Query

2020-07-07 Thread Zoltan Haindrich
Hey Harshita! I was not able to reproduce your issue with the below versions. From the error I would guess you may have incompatible guava versions loaded... I think you may also try Hadoop 3.1.2 for Hive 3.1.2 cheers, Zoltan On 7/6/20 7:53 PM, harshita vishwakarma wrote: Hi All, Facing belo

Re: Any plan for new hive 3 or 4 release?

2021-02-22 Thread Zoltan Haindrich
Hey Michel! Yes it was a long time ago we had a release; we have quite a few new features in master. I think we are scaring people for some time now that we will be dropping MR support...I think we should do that. I would really like to see a new Hive release in the near future as well - ther

Hive meetup on March 17

2021-03-11 Thread Zoltan Haindrich
Hey All! I would like to invite you to our (first?) online Hive meetup! It will be held on March 17 at 17:00 UTC. I'll send out a zoom url before the event starts! The planned topics are accessible here: https://docs.google.com/document/d/12jaWa7e6jvVjUaxoMWNJcjvTjnNoqwdCAMyswY1OiUg/edit?usp=shari

Re: Hive meetup on March 17

2021-03-16 Thread Zoltan Haindrich
Hey All! Our meetup is also available as a meetup.com event: https://www.meetup.com/Hive-User-Group-Meeting/events/276886707/ In case you want to add it to the calendar or something... :) cheers, Zoltan On 3/11/21 3:00 PM, Zoltan Haindrich wrote: Hey All! I would like to invite you to our

Re: Hive meetup on March 17

2021-03-17 Thread Zoltan Haindrich
adding further comments...or just vote on the question... not sure how well it will work - might be worth a try! I've set it up on my own fork for now: https://github.com/kgyrtkirk/hive/discussions The meetup url is here: https://www.meetup.com/Hive-User-Group-Meeting/events/276886707 Meet you

Re: Hive variable behaviour

2021-11-30 Thread Zoltan Haindrich
Hey Elango! Variables are not evaluated at assignment time - instead they are evaluated at the time of usage. If you want to save the current time somehow I think you could: * put the current_timestamp into a temporary table: create temporary table start_time as select current_timestamp() as
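A minimal sketch of the behaviour described above, assuming a Hive session with variable substitution enabled. The variable name now, the column alias ts and the table some_table are hypothetical; only the temporary table start_time appears in the message.

```sql
-- The variable stores the text "current_timestamp()", not a timestamp value.
SET hivevar:now=current_timestamp();

-- The text is substituted when the statement is compiled, so the function
-- is evaluated again each time a statement uses the variable.
SELECT ${hivevar:now};

-- To pin one point in time, materialize it once in a temporary table ...
CREATE TEMPORARY TABLE start_time AS SELECT current_timestamp() AS ts;

-- ... and read the stored value in later statements (some_table is hypothetical).
SELECT t.*, s.ts AS job_start
FROM some_table t
CROSS JOIN start_time s;
```

The temporary table lives only for the session, so the stored timestamp disappears when the session ends.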

Recent log4j vulnerabilities

2021-12-15 Thread Zoltan Haindrich
Hello all! In the past week 2 new log4j vulnerabilities were discovered (CVE-2021-45046, CVE-2021-44228) - and since we use log4j in Hive, existing installations might be affected as well. Doing a new Hive release on any existing line would probably need a longer timeframe - and doing

Re: CBO doc version & log parse issue

2021-12-15 Thread Zoltan Haindrich
Hey, by using an UDTF Hive should also be prepared that it will return more rows from that single row - which is not true in this case: because 1 log entry will be 1 result row. Note that the CBO does not seem to support UDTFs (I've tried apache/master with a lateral view right now - and it fa

Re: Time to Remove Hive-on-Spark

2022-02-10 Thread Zoltan Haindrich
Hey, I think there is no real interest in this feature; we don't have users/contributors backing it - the last development was around October 2018, and there have been ~2 bugfix commits since then...we should stop carrying dead weight...another 2 weeks went by since Stamatis has reminded us that after