Re: Hive 2 performance

2016-02-25 Thread Alan Gates
HPLSQL is part of Hive, but it is not fully integrated into Hive itself yet. It is still an external module that handles the control flow while passing Hive SQL into Hive via JDBC. We’d like to integrate it fully with Hive’s parser but we’re not there yet. Alan. > On Feb 25, 2016, at 14:26,

Re: Hive 2 performance

2016-02-25 Thread Mich Talebzadeh
Hi Gopal, Is HPLSQL is integrated into Hive 2 as part of its SQL? Thanks, Mich On 25/02/2016 10:38, Mich Talebzadeh wrote: > Apologies the job on Spark using Functional programming was run on a bigger > table. > > The correct timing is 42 seconds for Spark > > On 25/02/2016 10:15,

Re: Hive 2 performance

2016-02-25 Thread Mich Talebzadeh
Apologies the job on Spark using Functional programming was run on a bigger table. The correct timing is 42 seconds for Spark On 25/02/2016 10:15, Mich Talebzadeh wrote: > hanks Gopal I made the following observation so far: > > Using the old MR you get this message now which is fine >

Re: Hive 2 performance

2016-02-25 Thread Mich Talebzadeh
hanks Gopal I made the following observation so far: Using the old MR you get this message now which is fine Hive-on-MR is deprecated in Hive 2 and may not be available in the future versions. Consider using a different execution engine (i.e. tez, spark) or using Hive 1.X releases. use ora

Re: Hive 2 performance

2016-02-25 Thread Gopal Vijayaraghavan
> Correct hence the question as I have done some preliminary tests on Hive >2. > I want to share insights with other people who have performed the same If you have feedback on Hive-2.0, I'm all ears. I'm building up 2.1 features & fixes, so now would be a good time to bring stuff up. Speed most

Re: Hive 2 performance

2016-02-24 Thread Mich Talebzadeh
Correct hence the question as I have done some preliminary tests on Hive 2. I want to share insights with other people who have performed the same HTH On 24/02/2016 17:33, Jörn Franke wrote: > This highly depends on data, optimization and queries and you have to always > do some own tes

Re: Hive 2 performance

2016-02-24 Thread Jörn Franke
This highly depends on data, optimization and queries and you have to always do some own tests. You can of course use the public hive benchmark tools, but in the end you have to fit it to your situation. > On 24 Feb 2016, at 18:31, Mich Talebzadeh > wrote: > > well I meant how fast it returns

Re: Hive 2 performance

2016-02-24 Thread grimaldi.vince...@gmail.com
Well, he asked for performances... nobody asked for implications. Is it comparable to a MPP dbms or still slow because for the map teduce / tez limits? On 24 Feb 2016 17:25, "Jörn Franke" wrote: > I am not sure what you are looking for. Performance has many influence > factors... > > On 24 Feb 20

Re: Hive 2 performance

2016-02-24 Thread Mich Talebzadeh
well I meant how fast it returns the results in this case compare to 1.2.1 etc thanks On 24/02/2016 17:25, Jörn Franke wrote: > I am not sure what you are looking for. Performance has many influence > factors... > > On 24 Feb 2016, at 18:23, Mich Talebzadeh > wrote: > >> Hi, >> >>

Re: Hive 2 performance

2016-02-24 Thread Jörn Franke
I am not sure what you are looking for. Performance has many influence factors... > On 24 Feb 2016, at 18:23, Mich Talebzadeh > wrote: > > Hi, > > > > Has anyone got some performance matrix for Hive 2 from user perspective? > > It looks very impressive on ORC tables. > > thanks > > --