Re: Optimize Hive Query

2016-06-22 Thread Gopal Vijayaraghavan
> Long running query : Are you running this on MapReduce or Tez? Please post the output of explain - if you are seeing > 1 shuffle edge in your query while having only one window for OVER(), that might be the reason. OVER ( PARTITION BY m_d_key , sb_gu_key ORDER BY t_ev_st_dt) The multipl

Re: Show Redudant database name in Beeline -Hive 2.0

2016-06-22 Thread karthi keyan
Small clarification , Here i have started One MetaStore in Node 1 and started Hiveserver2 in the same node. In Node 2 i have started another HiveServer2 which pointing the metaStore of Node 1 .. On Thu, Jun 23, 2016 at 11:00 AM, Mich Talebzadeh wrote: > Sounds like it is picking up results from

Re: Show Redudant database name in Beeline -Hive 2.0

2016-06-22 Thread Mich Talebzadeh
Sounds like it is picking up results from both metastores! May be the cluster is not set up correctly. it should always pickup from the active node (just one) Dr Mich Talebzadeh LinkedIn * https://www.linkedin.com/profile/view?id=AAEWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw

Re: Show Redudant database name in Beeline -Hive 2.0

2016-06-22 Thread karthi keyan
Hi Mich, Here is have used derby as JDBC metastore. jdbc:derby://:1527/metastore_db;create=true Let me explain the config: Actually in a cluster pointing the same MetaStore from 2 Hiverserver Running in two different node. 1- Starting Derby server - (networkserver) 2- Starting metaStore servic

Re: Hive Table Creation failure on Postgres

2016-06-22 Thread Siddhi Mehta
Ping to see if there is a jira filed around the same or if there is a config driven way to make metastore know what is the underlying schema created by different persistent stores On Fri, Jun 10, 2016 at 11:33 AM, Siddhi Mehta wrote: > Right so mysql and oracle both set the column to 256 bytes.

Optimize Hive Query

2016-06-22 Thread @Sanjiv Singh
Hi All, I am running performance issue with below query. Its took 2-3 hours to complete in hive. Try tried to partition and bucketing changes on this tables, but without luck. Please help me in optimizing this query. what schema level changes can be done ? other parameters recommendations ? *

Re: loading in ORC from big compressed file

2016-06-22 Thread @Sanjiv Singh
Thanks Marcin, I worked I uncompressed file and then loaded file in hive table. Now its been quick, few mins. Regards Sanjiv Singh Mob : +091 9990-447-339 On Wed, Jun 22, 2016 at 7:44 AM, Jörn Franke wrote: > > > Marcin is correct : either split up the gzip files in smaller files of at

Re: Show Redudant database name in Beeline -Hive 2.0

2016-06-22 Thread Mich Talebzadeh
Hi Karthi, Those database names are picked up from the metadata of Hive/ Do you know the type of RDBMS that holds your Hive database. Check hive-site.xml for javax.jdo.option.ConnectionURL HTH Dr Mich Talebzadeh LinkedIn * https://www.linkedin.com/profile/view?id=AAEWh2gBxianrbJd6zP6A

NPE in hive-server2 Web UI

2016-06-22 Thread Andrew Johnson
Hey folks, I recently upgraded to Apache Hive 2.1.0, and I'm experiencing a periodic NPE in the hive-server2 web UI with the following stack trace: https://gist.github.com/ajsquared/911aeaa64791e94117afb14c0ace5a40. After this happens the UI is unavailable until the hive-server2 process is restar

LDAPS jdbc connection string

2016-06-22 Thread Jose Rozanec
Hi, We set up a Hive cluster with LDAP and we are able to authenticate and use if from beeline without issues: beeline> !connect jdbc:hive2://localhost:1/default Connecting to jdbc:hive2://localhost:1/default Enter username for jdbc:hive2://localhost:1/default: uid=,ou=People

Re: Spark support for update/delete operations on Hive ORC transactional tables

2016-06-22 Thread Mich Talebzadeh
Hi Ajay, I am afraid for now transaction heart beat do not work through Spark, so I have no other solution. This is interesting point as with Hive running on Spark engine there is no issue with this as Hive handles the transactions, I gather in simplest form Hive has to deal with its metadata fo

DBVisualizer configuration with hive in kerberos env

2016-06-22 Thread Kumar Jayapal
Hi All, Did any one integrate DBVisualizer with hive in a KRB cluster. I followed the steps given at the site still not able to make a connection to hive. https://community.hortonworks.com/articles/32586/integrating-dbvisualizer-with-kerberized-hive.html I am using hive 0.13.1+cdh5.3.3+350 2

Re: Show Redudant database name in Beeline -Hive 2.0

2016-06-22 Thread karthi keyan
Hi Mich, Some times am facing this kind of issue with database "DEFAULT".. Connected to: Apache Hive (version 2.0.1) Driver: Hive JDBC (version 2.0.1) Transaction isolation: TRANSACTION_REPEATABLE_READ 0: jdbc:hive2://host:1/default> show databases; ++--+ | database_name | +

Re: loading in ORC from big compressed file

2016-06-22 Thread Jörn Franke
Marcin is correct : either split up the gzip files in smaller files of at least on HDFS block or use bzip2 with block compression. What is the original format of the table? > On 22 Jun 2016, at 01:50, Marcin Tustin wrote: > > This is because a GZ file is not splittable at all. Basically, try

Hive Partition Restatement ..

2016-06-22 Thread Gautam
Hello, I'v trying to solve an ETL problem using Hive wherein a partition in a Hive table needs to be restated on account of delayed data. This means a new version of an already existing partition needs to be introduced to the table. I need to do this while serving queries on that table

Re: loading in ORC from big compressed file

2016-06-22 Thread Mich Talebzadeh
Hi Are you using map-reduce as execution engine? what version of Hive are you on? HTH Dr Mich Talebzadeh LinkedIn * https://www.linkedin.com/profile/view?id=AAEWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw * http://