Sqoop potgres table to hive in parquet format problem.

2016-12-13 Thread ws
Hive: 2.1.0Sqoop: 1.4.6 ###hive> select * from dimemployee;OKFailed with exception java.io.IOException:java.lang.RuntimeException: hdfs://ip-172-31-38-171.ec2.internal:8020/user/hive/warehouse/db2.db/dimemployee/.metadata/schemas/1.avsc is not a Parquet file. expected magic number at tail [80, 6

Hive UDF

2016-04-26 Thread ws
Is it possible to run a Hive UDF in Spark DataFrame?

Hive UDF's in Spark DataFrame

2016-04-23 Thread ws
Is it possible to call a Hive registered UDF in Spark dataframe? Thanks,w

Re: Hive on Spark performance

2016-03-14 Thread ws
Hive 1.2.1.2.3.4.0-3485Spark 1.5.2Oracle Database 11g Enterprise Edition Release 11.2.0.4.0 - 64bit Production ### SELECT  f.description, f.item_number, sum(f.df_a * (select count(1) from e.mv_A_h_a where hb_h_name = r.h_id)) as df_aFROM e.eng_fac_atl_sc_bf_qty f, wv_ATL_2_qty_df_rates rwhere f.

Query question.

2016-03-12 Thread ws
I need to re-write something like this to Hive: ###select  x.A, sum(0.5 * (select (count(*) from Table_X tx where tx.blah = 0.75)))from Table_X x### Thanks in advance for any pointer to it.

Re: hive table clustering - question

2014-04-26 Thread krish ws
Hi All, Can someone provide has any idea on my above question? Appreciate the help On Thu, Apr 24, 2014 at 7:15 PM, krish ws wrote: > Hi, > I have a question related to hive table *bucketing* based on > multiple columns(*Clustered by* on a common set of columns). >

hive table clustering - question

2014-04-24 Thread krish ws
Hi, I have a question related to hive table *bucketing* based on multiple columns(*Clustered by* on a common set of columns). How would be the join performance if I am joining this table to itself based on few columns that I have specified in *clustered by *condition(not all)? Will the has