Parquet support in HCatalog

2015-01-05 Thread Jianshi Huang
Hi, I'm interested in using HCatalog as the unified interface for reading/managing my dataset. The dataset is stored in Parquet format. So my question is: how well is Parquet supported in HCatalog? Are all features supported (e.g. filter pushdown)? Or should I just use the Hive metastore? Thanks,

Re: orc ppd bug report

2015-01-05 Thread wzc
@Prasanth would you help me look into this problem? Thanks. On Mon Jan 05 2015 at 12:03:42 AM wzc wrote: > Recently we find a bug with orc ppd, here is the testcase: > > use test; > create table if not exists test_orc_src (a int, b int, c int) > stored as orc; > create table if not exists test_

Hue doesn't return data of Hive table

2015-01-05 Thread Nayak, Nagaraja
I have a strange situation. I have a couple of Hive tables. SELECT * FROM doesn't return data in HUE. The same query returns the data in the CLI or an ODBC query tool. What could be the issue? I appreciate your help in advance.

Re: Hive 14 performance and scalability?

2015-01-05 Thread Alan Gates
Mike, I've tested it with inserts, updates, and deletes in the 100k rows per partition range, and with multiple but not hundreds of users. The thing to keep in mind is the transactional changes don't affect tables that you don't explicitly mark as transactional, so this shouldn't affect exis

Re: Hive 0.14 SQL Inserts Complex Data Types

2015-01-05 Thread Alan Gates
No, because insert...values writes the values to a local file, builds a temp table over that local file, then rewrites the query to "insert into X select from tmptable". The expressions aren't evaluated before being written to the file. More work would need to be done to write the "pre-expres

is there way to set map and reducer number for hive on tez manually?

2015-01-05 Thread SkaterXu
Is there a way to set the number of mappers and reducers for Hive on Tez manually? I want Hive on Tez to use as many resources as possible.

Re: spark worker nodes getting disassociated while running hive on spark

2015-01-05 Thread Xuefu Zhang
Hi Somnath, The error seems to have nothing to do with Hive. I haven't seen this problem, but I'm wondering if your cluster has any configuration issue, especially the timeout values for network communications. The default values worked fine for us. If the problem persists, please provide detailed i

Re: CREATE FUNCTION: How to automatically load extra jar file?

2015-01-05 Thread Jason Dere
During query compilation Hive needs to instantiate the UDF class and so the JAR needs to be resolvable by the class loader, thus the JAR is copied locally to a temp location for use. During map/reduce jobs the local jar (like all jars added with the ADD JAR command) should then be added to the d

Re: Newbie to Hive- Errors / Messages while starting Hive

2015-01-05 Thread Krish Donald
But in the Hortonworks sandbox, only hive works... so why not when setting up using Apache Hadoop? On Sun, Jan 4, 2015 at 10:58 PM, Mohan Krishna wrote: > "hive shell" is the command to enter in to Hive shell > > > > On Mon, Jan 5, 2015 at 12:26 PM, Krish Donald > wrote: > >> What is the differ

Exception with the window function

2015-01-05 Thread Aihua Xu
Hi folks, I’m working on HIVE-9228 (https://issues.apache.org/jira/browse/HIVE-9228), where the following window query throws an ArrayIndexOutOfBoundsException. select st_fips_cd, zip_cd_5, hh_surr_key from ( select st_fips_cd, zip_cd_5, hh_surr_key, count( case when advtg_len_rsdnc_cd = '1' then 1 e