Re: Hive 2.x usage

2016-09-14 Thread Mich Talebzadeh
Yep, I agree with what Stephen said. I use Hive 2.0.1 and have not seen an issue so far. We also use the Hive on Spark engine, and of course we can switch to MR with one command within the script. I do not subscribe to the idea of using open source and running for cover if things don't work. If you are a nuts and bolts type,

Re: Hive 2.x usage

2016-09-14 Thread Stephen Sprague
> * Are you using Hive-2.x at your org and at what scale?
Yes, we're using 2.1.0: 1.5PB, 30-node cluster, ~1000 jobs a day. And yeah, Hive 2.1.0 has some issues and can require some finesse wrt the hive-site.xml settings.
> * Is the release stable enough? Did you notice any correctness issue

Re: Hive 2.x usage

2016-09-14 Thread Jörn Franke
If you are using a distribution (which you should if you go to production - Apache releases should not be used there because of maintainability, complexity and interaction with other components, such as Hadoop etc.) then wait until a distribution with 2.x is out. As far as I am aware there is currently

Hive 2.x usage

2016-09-14 Thread RD
Hi Folks, We (at my org) are currently planning our move to Hive-2.x. As part of this, I wanted to get a sense of how stable the Hive-2.x release is. I thought it would be good to conduct a brief survey on this. I've added a few questions below. It would really be a ton of help if folks could pr