Re: Automatic Update statistics on ORC tables in Hive

2016-03-28 Thread Andrew Sears
It would be useful to have a script that could be scheduled as part of a low priority background job, to update stats at least where none are available, and a report in the Hive GUI on stats per table. Encountered a Tez oo memory issue due to the lack of auto updated stats recently. Cheers, An

Re: Automatic Update statistics on ORC tables in Hive

2016-03-28 Thread Mich Talebzadeh
Hi Alan, Thanks for the clarification. I gather you are referring to the following notes in Jira "Given the work that's going on in HIVE-11160 and HIVE-12763 I don't think it makes sense to conti

Re: Automatic Update statistics on ORC tables in Hive

2016-03-28 Thread Alan Gates
I resolved that as Won’t Fix. See the last comment on the JIRA for my rationale. Alan. > On Mar 28, 2016, at 03:53, Mich Talebzadeh wrote: > > Thanks. This does not seem to be implemented although the Jira says resolved. > It also mentions the timestamp of the last update stats. I do not see

Re: Automatic Update statistics on ORC tables in Hive

2016-03-28 Thread Mich Talebzadeh
Thanks. This does not seem to be implemented although the Jira says resolved. It also mentions the timestamp of the last update stats. I do not see it yet. Regards, Mich Dr Mich Talebzadeh LinkedIn * https://www.linkedin.com/profile/view?id=AAEWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw

Re: Automatic Update statistics on ORC tables in Hive

2016-03-27 Thread Gopal Vijayaraghavan
> This might be a bit far fetched but is there any plan for background >ANALYZE STATISTICS to be performed on ORC tables https://issues.apache.org/jira/browse/HIVE-12669 Cheers, Gopal