Hive 1.2.1.2.3.4.0-3485Spark 1.5.2Oracle Database 11g Enterprise Edition Release 11.2.0.4.0 - 64bit Production ### SELECT f.description, f.item_number, sum(f.df_a * (select count(1) from e.mv_A_h_a where hb_h_name = r.h_id)) as df_aFROM e.eng_fac_atl_sc_bf_qty f, wv_ATL_2_qty_df_rates rwhere f.item_number NOT LIKE 'HR%' AND f.item_number NOT LIKE 'UG%' AND f.item_number NOT LIKE 'DEV%'group by f.description, f.item_number### This query works fine in oracle but not Hive or Spark.So the problem is: "sum(f.df_a * (select count(1) from e.mv_A_h_a where hb_h_name = r.h_id)) as df_a" field.
Thanks,Wlodek-- On Sunday, March 13, 2016 7:36 PM, Mich Talebzadeh <mich.talebza...@gmail.com> wrote: Depending on the version of Hive on Spark engine. As far as I am aware the latest version of Hive that I am using (Hive 2) has improvements compared to the previous versions of Hive (0.14,1.2.1) on Spark engine. As of today I have managed to use Hive 2.0 on Spark version 1.3.1. So it is not the latest Spark but it is pretty good. What specific concerns do you have in mind? HTH Dr Mich Talebzadeh LinkedIn https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw http://talebzadehmich.wordpress.com On 13 March 2016 at 23:27, sjayatheertha <sjayatheer...@gmail.com> wrote: Just curious if you could share your experience on the performance of spark in your company? How much data do you process? And what's the latency you are getting with spark engine? Vidya