Hive being slow

2014-12-28 Thread Abhishek kumar
Hi, I am using Hive 0.14, which runs over HBase (holding ~10 GB of data). Queries against HBase are slow. My query looks like the following: select * from table1 where id > ''; (id is the row key) As per the Hive code, id > 'zzz', is getting pushed to HBase scann

Re: Number of mappers is always 1 for external Parquet tables.

2014-12-28 Thread Navis류승우
Try with "set hive.input.format=org.apache.hadoop.hive.ql.io.HiveInputFormat" Thanks, Navis 2014-12-24 18:27 GMT+09:00 村下瑛 : > Hi, all > > I am trying to load Pig output from Hive as an external table, > and am currently stuck because Hive always sets the number of mappers to 1, > though it has mo

Re: Row Counts From Multiple Tables

2014-12-28 Thread Sarma Tangirala
Oh, I understand how this is done, but is there a way to avoid the union all syntax? Nothing similar to MySQL? Thanks! On 28 December 2014 at 02:12, Seungbum Lee wrote: > Hello Sarma, > > this will work. > > select sum(a.t1_cnt) as t1_cnt, sum(a.t2_cnt) as t2_cnt > from (select count(first_co

Re: Row Counts From Multiple Tables

2014-12-28 Thread Seungbum Lee
Hello Sarma, this will work. select sum(a.t1_cnt) as t1_cnt, sum(a.t2_cnt) as t2_cnt from (select count(first_column) as t1_cnt, 0 as t2_cnt from t1 union all select 0 as t1_cnt, count(first_column) as t2_cnt from t2 ) a go
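
For readers who want to try the pattern without a cluster, here is a minimal sketch that runs the union all query from this message against an in-memory SQLite database through Python's sqlite3 module. SQLite is only a stand-in for Hive, and the table contents (3 rows in t1, 5 rows in t2) are made up for illustration. The cross join variant is an assumption added here as one possible way to avoid union all; the thread does not confirm that it behaves the same on Hive 0.14. The last query shows what a plain comma join counts in SQLite, which may or may not be the failure seen on Hive.

```python
import sqlite3

# In-memory SQLite database as a stand-in for Hive (assumption: the SQL used
# here is plain enough to behave the same way; not verified against Hive).
conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Hypothetical sample data: 3 rows in t1, 5 rows in t2.
cur.execute("CREATE TABLE t1 (first_column INTEGER)")
cur.execute("CREATE TABLE t2 (first_column INTEGER)")
cur.executemany("INSERT INTO t1 VALUES (?)", [(i,) for i in range(3)])
cur.executemany("INSERT INTO t2 VALUES (?)", [(i,) for i in range(5)])

# The union all pattern from the thread: each branch counts one table and
# pads the other column with 0, then the outer query sums both columns.
union_all_query = """
SELECT SUM(a.t1_cnt) AS t1_cnt, SUM(a.t2_cnt) AS t2_cnt
FROM (
    SELECT COUNT(first_column) AS t1_cnt, 0 AS t2_cnt FROM t1
    UNION ALL
    SELECT 0 AS t1_cnt, COUNT(first_column) AS t2_cnt FROM t2
) a
"""

# Assumed alternative (not from the thread): cross join two one-row subqueries.
cross_join_query = """
SELECT a.t1_cnt, b.t2_cnt
FROM (SELECT COUNT(first_column) AS t1_cnt FROM t1) a
CROSS JOIN (SELECT COUNT(first_column) AS t2_cnt FROM t2) b
"""

# The shape of the original question: a comma join pairs every t1 row with
# every t2 row, so both counts become the size of the cross product.
naive_query = "SELECT COUNT(t1.first_column), COUNT(t2.first_column) FROM t1, t2"

union_all_counts = cur.execute(union_all_query).fetchone()
cross_join_counts = cur.execute(cross_join_query).fetchone()
naive_counts = cur.execute(naive_query).fetchone()

print("union all :", union_all_counts)
print("cross join:", cross_join_counts)
print("comma join:", naive_counts)
```

With this sample data, the first two queries return one row holding each table's own row count, while the comma join returns the product of the two counts in both columns.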

Row Counts From Multiple Tables

2014-12-28 Thread Sarma Tangirala
Hello, I tried searching for this in the mailing list archive but could not find an answer. I want to get the row counts from multiple tables, something along the lines of select count(t1.first_column), count(t2.first_column) from t1, t2; But this query as is does not work. Apologies if thi