Re: Hive 2.x usage

2016-09-14 Thread Mich Talebzadeh
Yep I agree with what Stephen said. I use Hive 2.0.1 and do not see an issue so far. We also use Hive on Spark engine and of course we can switch to MR at one command within the script. I do not subscribe to use open source and run for cover if things don't work. If you are knots and bolts type,

Re: Hive 2.x usage

2016-09-14 Thread Stephen Sprague
> * Are you using Hive-2.x at your org and at what scale? yes. we're using 2.1.0. 1.5PB. 30 node cluster. ~1000 jobs a day.And yeah hive 2.1.0 has some issues and can require some finesse wrt the hive-site.xml settings. > * Is the release stable enough? Did you notice any correctness issue

Re: Hive 2.x usage

2016-09-14 Thread Jörn Franke
If you are using a distribution (which you should if you go to production - Apache releases should not be used due to the maintainability, complexity and interaction with other components, such as Hadoop etc) then wait until a distribution with 2.x is out. As far as i am aware there is currently