Map side join failed when setting hive.optimize.cp to false

2014-08-05 Thread Shangzhong zhu
Hive version 0.12.0. To enable map-side join, we set: set hive.auto.convert.join=true; set hive.auto.convert.join.noconditionaltask = true; set hive.auto.convert.join.noconditionaltask.size = 12800; However, when we also set hive.optimize.cp=false, the map-side join fails with the following

Need Help : Hiveserver2 JDBC/ODBC tools Connected as default user

2014-08-05 Thread sai chaitanya tirumerla
Hi, I am trying to connect to hiveserver2 with my username and password from JDBC/ODBC tools (SQL Workbench, SQuirreL, etc.), but the connection is always made as the default user who starts hiveserver2, not as the actual user who is connecting. I could see it is inheriting all the pro

Re: Can't Start metastore server on secure hadoop cluster

2014-08-05 Thread Thejas Nair
Hi Atkins, I see this with Hadoop 2.5. I have created https://issues.apache.org/jira/browse/HIVE-7620 to address this. It should not be seen with Apache Hadoop 2.4, as the related change in Hadoop happened in 2.5; see HADOOP-10451. On Sun, Jul 13, 2014 at 1:15 AM, Shih-liang, Chang wrote: > Hi,

Re: Tuning Triangle Joins on Hive

2014-08-05 Thread Firas Abuzaid
Thanks, that's very helpful! On Sat, Aug 2, 2014 at 12:47 PM, Lefty Leverenz wrote: >> How does indexes work in hive? > See the Indexes design doc in the Hive wiki, although it hasn't been updated. > -- Lefty > On Sat, Au

Re: ORC File IndexOutOfBoundsException error when PPD enabled

2014-08-05 Thread Prasanth Jayachandran
The likely cause is HIVE-6320 (https://issues.apache.org/jira/browse/HIVE-6320), which is fixed in Hive 0.13 and trunk. If you are using an older version, you probably want to backport this small fix. Let me know if it helps. Thanks, Prasanth Jayachandran On Aug 5, 2014, at 12:21 PM, Shangzhong zh

Re: Difference between Hive and HCat table?

2014-08-05 Thread Lefty Leverenz
Perhaps this documentation will help: HCatalog CLI -- Hive CLI. Also note the section that follows it, which begins "HCatalog supports all Hive Data Definition Language except those operations that require runnin

ORC File IndexOutOfBoundsException error when PPD enabled

2014-08-05 Thread Shangzhong zhu
I found an issue when querying a partitioned ORC file table. We started to build this table in ORC file format with the default settings, and everything looked fine. Later, we tried some optimizations. One of the optimizations is to sort some string columns to speed up query performance to take advantage of

Re: Difference between Hive and HCat table?

2014-08-05 Thread Andrew Mains
André, To my knowledge, your understanding is correct--given that both Hive and HCatalog are pointing to the same metastore instance, all HCatalog table operations should be reflected in Hive, and vice versa. You should be able to use the Hive CLI and hcat interchangeably to execute your DDL.

Re: Using apache commons-vfs to read RC files from hdfs

2014-08-05 Thread Richards Peter
Hi, any thoughts you would like to share about this topic? The difficulty I face is with using the RC file reader and writer with the java.io package. Thanks, Richards.

Difference between Hive and HCat table?

2014-08-05 Thread André Hacker
Hi, a very simple question: Is there a difference between a table in Hive and a table in HCat? In other words: Can I create a table in Hive that is invisible in HCat, or vice versa? (Assuming that Hive and HCat point to the same metastore.) From my understanding, HCat is just a wrapper around the